shizhediao

Follow

🎯

Focusing

shizhediao shizhediao

🎯

Focusing

Follow

Researcher at NVIDIA

320 followers · 158 following

NVIDIA
California
https://shizhediao.github.io/

Achievements

Achievements

Lists (3)

Sort

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars

longvideoagent / LongVideoAgent

50 3 Updated Dec 24, 2025

sierra-research / tau2-bench

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python 566 125 Updated Dec 18, 2025

NVlabs / ToolOrchestra

ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.

Python 422 53 Updated Dec 23, 2025

meta-pytorch / OpenEnv

An interface library for RL post training with environments.

Python 859 138 Updated Dec 24, 2025

Lordog / dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 11,282 1,253 Updated Oct 10, 2025

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

71,348 8,166 Updated Dec 22, 2025

OptimalScale / LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,493 836 Updated Dec 16, 2025

ZTE-AICloud / Co-Sight

Python 980 284 Updated Dec 5, 2025

nathan-barry / tiny-diffusion

A character-level language diffusion model trained on Tiny Shakespeare

Python 614 54 Updated Nov 15, 2025

NVlabs / ProfBench

PhD/MBA-level human-annotated rubrics dataset across Physics, Chemistry, Finance and Consulting

Python 26 1 Updated Oct 30, 2025

ScaleML / scaleml.github.io

Lab webpage.

TypeScript 2 Updated Dec 14, 2025

NVlabs / OmniVinci

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 607 52 Updated Oct 29, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 21,567 1,929 Updated Oct 25, 2025

shizhediao / nanochat

Forked from karpathy/nanochat

The best ChatGPT that $100 can buy.

Python 3 Updated Oct 23, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 39,235 4,970 Updated Dec 23, 2025

evalplus / evalplus

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,660 187 Updated Oct 2, 2025

centerforaisafety / hle

Humanity's Last Exam

Python 1,281 81 Updated Oct 7, 2025

HKUDS / LightRAG

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 26,555 3,777 Updated Dec 24, 2025

WooooDyy / AgentGym-RL

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 537 57 Updated Sep 11, 2025

allenai / agent-baselines

Python 102 12 Updated Dec 8, 2025

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 18,236 1,883 Updated Sep 8, 2025

NVIDIA-AI-Blueprints / aiq-research-assistant

Python 239 76 Updated Oct 27, 2025

inclusionAI / ASearcher

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 517 34 Updated Nov 26, 2025

openai / frontier-evals

OpenAI Frontier Evals

Python 967 114 Updated Dec 6, 2025

Future-House / paper-qa

High accuracy RAG for answering questions from scientific documents with citations

Python 7,932 809 Updated Dec 23, 2025

test-time-interaction / TTI

Python 68 3 Updated Jun 10, 2025

goombalab / hnet

H-Net: Hierarchical Network with Dynamic Chunking

Python 797 90 Updated Nov 20, 2025

MoonshotAI / Kimi-K2

Kimi K2 is the large language model series developed by Moonshot AI team

9,760 709 Updated Nov 7, 2025

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,284 106 Updated Dec 15, 2025

bminixhofer / tokenkit

A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.

Python 59 5 Updated Jul 6, 2025