SCSE, BUAA
Stars
A list of awesome papers and resources of recommender system on large language model (LLM).
An open-source AI agent that brings the power of Gemini directly into your terminal.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
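This repository contains complete research and analysis materials from reverse engineering Claude Code v1.0.33. It includes in-depth technical analysis of the obfuscated source code, system architecture documentation, and an implementation blueprint for reconstructing the Claude Code agent system. Key findings include a real-time Steering mechanism, a multi-agent architecture, intelligent context management, and a tool execution pipeline. The project serves as a technical reference for understanding the design and implementation of modern AI agent systems.
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)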
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors
[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Train transformer language models with reinforcement learning.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Accessible large language models via k-bit quantization for PyTorch.
[ECCV 2024] Asynchronous Large Language Model Enhanced Planner for Autonomous Driving
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model with performance approaching GPT-4o
[ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting
A tool for tracking points and object masks with multiple clicks
[ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"
[ACM MM 2024] Official Code for "HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting"
Stable Diffusion web UI
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
LimSim & LimSim++: Integrated traffic and autonomous driving simulators with (M)LLM support