-
MEGVII Research
- Beijing, China
Highlights
- Pro
Stars
A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.
Fast, Sharp & Reliable Agentic Intelligence
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
Must-read papers on Repository-level Code Generation & Issue Resolution 🔥
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
[CVPR 2026] ViStoryBench: AI Story Visualization Benchmark
A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithms with minimal intrusion.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Official Repo for Open-Reasoner-Zero
Summarize existing representative LLMs text datasets.
Collective communications library with various primitives for multi-machine training.
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
A quick guide (especially) for trending instruction finetuning datasets
Repository for organizing datasets and papers used in Open LLM.
📝 An Awesome Collection of Chinese Legal Dataset and Relevant Resources. 致力于收集全面的中文法律数据源
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Awesome-LLM: a curated list of Large Language Model
[CVPR 2025] DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation & [ICLR 2024] DFormer & [NeuriPS 2025] OmniSegmentor
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
🩹Editing large language models within 10 seconds⚡
Aligning pretrained language models with instruction data generated by themselves.
A research project for natural language generation, containing the official implementations by MSRA NLC team.
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX