Peking University
https://jpthu17.github.io/
Stars
Helios: Real Real-Time Long Video Generation Model
[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling
Elevate your AI research writing, no more tedious polishing ✨
Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
Official repository for the UAE paper, unified-GRPO, and unified-Bench
Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-bench lite and 46.2% tasks (pass@1) in SWE-bench verified with…
Official implementation of Browse-Master, a tool-augmented web-search agent.
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent (ACL 2026 Main)
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Kimi K2 is the large language model series developed by Moonshot AI team
Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin".
This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".
Prompts for deep research (OpenAI, Gemini, Qwen)
Tongyi Deep Research, the Leading Open-source Deep Research Agent
The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.
[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.
LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
Toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning