-
Duke University
- Durham, NC
- yueqianlin.com
- @YueqianL
- in/yueqian-lin
Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Starred repositories
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Semi-automated research assistant for academic research and software development. Supports Claude Code, OpenCode, and Codex CLI across ideation, coding, experiments, writing, and publication.
A high-throughput and memory-efficient inference and serving engine for LLMs
Lightweight coding agent that runs in your terminal
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Tensors and Dynamic neural networks in Python with strong GPU acceleration
SGLang is a high-performance serving framework for large language models and multimodal models.
A Datacenter Scale Distributed Inference Serving Framework
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models
An app that brings language models directly to your phone.
Composable building blocks to build LLM Apps
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
A framework for efficient model inference with omni-modality models
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.
FlashInfer: Kernel Library for LLM Serving
The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Build, evaluate, and integrate long-term memory for self-evolving agents.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A general fine-tuning kit geared toward image/video/audio diffusion models.
Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.
Turn Claude Code from a chat assistant into an autonomous coding system