-
BSc, THU -> PhD, HKUST
- https://scholar.google.com/citations?user=5U4P54wAAAAJ&hl=zh-CN
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Collection of Summer 2026 tech internships!
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
A generative world for general-purpose robotics & embodied AI learning.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Fully open reproduction of DeepSeek-R1
Fast and memory-efficient exact attention
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
verl: Volcano Engine Reinforcement Learning for LLMs