- Alibaba
- China
- csyhhu.github.io
Stars
From Pipeline to Council: A lightweight agentic framework for search & recommendation. Five agents restructure the search, advertising, and recommendation pipeline.
This repo contains the Hugging Face Deep Reinforcement Learning Course.
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
FlashMLA: Efficient Multi-head Latent Attention Kernels
Fully open reproduction of DeepSeek-R1
Awesome LLM compression research papers and tools.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
A modular graph-based Retrieval-Augmented Generation (RAG) system
PyTorch library for fast transformer implementations
Structured state space sequence models
A curated list of awesome projects and resources related to autonomous AI agents.
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Open-Sora: Democratizing Efficient Video Production for All
Open-source super AI assistant & Agent Harness. Plans tasks, runs tools and skills, self-evolves with memory and knowledge. Multi-model, multi-channel. Lightweight, extensible, one-line install. (f…
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI …
The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
Humanoid Agents: Platform for Simulating Human-like Generative Agents
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
The hub for EleutherAI's work on interpretability and learning dynamics
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
A framework for few-shot evaluation of language models.
Transformer-related optimization, including BERT and GPT
An Efficient Pipelined Data Parallel Approach for Training Large Models
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…