Skip to content
View shizhediao's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report shizhediao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python 566 125 Updated Dec 18, 2025

ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.

Python 422 53 Updated Dec 23, 2025

An interface library for RL post training with environments.

Python 859 138 Updated Dec 24, 2025

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 11,282 1,253 Updated Oct 10, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

71,348 8,166 Updated Dec 22, 2025

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,493 836 Updated Dec 16, 2025
Python 980 284 Updated Dec 5, 2025

A character-level language diffusion model trained on Tiny Shakespeare

Python 614 54 Updated Nov 15, 2025

PhD/MBA-level human-annotated rubrics dataset across Physics, Chemistry, Finance and Consulting

Python 26 1 Updated Oct 30, 2025

Lab webpage.

TypeScript 2 Updated Dec 14, 2025

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 607 52 Updated Oct 29, 2025

Contexts Optical Compression

Python 21,567 1,929 Updated Oct 25, 2025

The best ChatGPT that $100 can buy.

Python 3 Updated Oct 23, 2025

The best ChatGPT that $100 can buy.

Python 39,235 4,970 Updated Dec 23, 2025

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,660 187 Updated Oct 2, 2025

Humanity's Last Exam

Python 1,281 81 Updated Oct 7, 2025

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 26,555 3,777 Updated Dec 24, 2025

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 537 57 Updated Sep 11, 2025
Python 102 12 Updated Dec 8, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 18,236 1,883 Updated Sep 8, 2025

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 517 34 Updated Nov 26, 2025

OpenAI Frontier Evals

Python 967 114 Updated Dec 6, 2025

High accuracy RAG for answering questions from scientific documents with citations

Python 7,932 809 Updated Dec 23, 2025
Python 68 3 Updated Jun 10, 2025

H-Net: Hierarchical Network with Dynamic Chunking

Python 797 90 Updated Nov 20, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,760 709 Updated Nov 7, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,284 106 Updated Dec 15, 2025

A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.

Python 59 5 Updated Jul 6, 2025
Next