- Chongqing University
- Chongqing, China
- error666.top
Stars
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
An Open Source implementation of Notebook LM with more flexibility and features
Awesome paper list and repos of the paper "A comprehensive survey of embodied world models".
Ultra-high-performance, secure, all-in-one acceleration engine for developer resources
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)
Inspect: A framework for large language model evaluations
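Updated December 2025: a roundup of Docker registry mirrors currently usable in mainland China, with a list of DockerHub mirror accelerators for China. 🚀 DockerHub mirror accelerator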
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
Recipes to scale inference-time compute of open models
[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
Paper List of Inference/Test Time Scaling/Computing
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…
No fortress, purely open ground. OpenManus is Coming.
Optimizing inference proxy for LLMs