Skip to content
View ZihanWang314's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Block or report ZihanWang314

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Trajectory optimization methods for improving LLM agents via weak-to-strong learning.

Python 3 1 Updated Aug 2, 2025
Python 38 2 Updated Aug 15, 2025
Python 90 3 Updated Oct 2, 2025

Nano vLLM

Python 7,008 891 Updated Aug 31, 2025

A paper list for spatial reasoning

143 4 Updated Jun 11, 2025

LLM/VLM gaming agents and model evaluation through games.

Python 775 84 Updated Sep 12, 2025

TStar is a unified temporal search framework for long-form video question answering

Python 68 1 Updated Sep 2, 2025
Python 221 30 Updated Oct 1, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,925 115 Updated Apr 3, 2025

Code release for DynamicTanh (DyT)

Python 1,017 84 Updated Mar 30, 2025

Autoregressive Entity Retrieval

Python 794 104 Updated Jul 6, 2023

Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models

Python 220 27 Updated Sep 14, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,798 907 Updated Sep 30, 2025

Official Repo for Open-Reasoner-Zero

Python 2,045 117 Updated Jun 2, 2025

Kodu is an autonomous coding agent that lives in your IDE. It is a VSCode extension that can help you build your dream project step by step by leveraging the latest technologies in automated coding…

TypeScript 5,074 198 Updated Apr 30, 2025
TeX 3 Updated Feb 14, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,335 185 Updated Oct 8, 2025

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,733 335 Updated Jul 15, 2024

Awesome-LLM: a curated list of Large Language Model

25,224 2,129 Updated Jul 31, 2025

Sokoban environment for OpenAI Gym

Python 384 87 Updated Nov 8, 2023

A system that tries to resolve all issues on a github repo with OpenHands.

Python 113 23 Updated Nov 18, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 818 43 Updated Jul 29, 2025

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,628 195 Updated Jun 9, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,565 2,240 Updated Feb 1, 2025
Python 302 18 Updated Apr 23, 2025

Code for the manim-generated scenes used in 3blue1brown videos

Python 9,857 1,977 Updated May 5, 2025

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Cuda 338 37 Updated Jul 10, 2025

Expert Specialized Fine-Tuning

Python 705 259 Updated May 22, 2025
Next