Skip to content
View jpthu17's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@PKU-YuanGroup

Block or report jpthu17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Helios: Real Real-Time Long Video Generation Model

Python 1,743 132 Updated Apr 16, 2026

[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling

82 3 Updated Mar 2, 2026

Elevate your AI research writing, no more tedious polishing ✨

20,180 1,614 Updated Mar 25, 2026

iFSQ & LlamaGen-REPA

Python 101 10 Updated Jan 27, 2026

Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Python 73 2 Updated Mar 26, 2026

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,495 1,005 Updated Apr 28, 2026

Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback

Python 267 10 Updated Jan 24, 2026

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,867 80 Updated Feb 25, 2026

Official repository for the UAE paper, unified-GRPO, and unified-Bench

Python 163 6 Updated Sep 12, 2025

Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"

Python 44 3 Updated Sep 12, 2025

A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-bench lite and 46.2% tasks (pass@1) in SWE-bench verified with…

Python 3,071 329 Updated Apr 24, 2025

Official implementation of Browse-Master, a tool-augmented web-search agent.

Python 29 3 Updated Aug 22, 2025

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent (ACL 2026 Main)

Python 256 43 Updated Dec 11, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,050 2,068 Updated Mar 27, 2026

Kimi K2 is the large language model series developed by Moonshot AI team

10,700 822 Updated Jan 21, 2026

Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".

Python 36 Updated Jul 10, 2025

This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".

Python 91 4 Updated Jul 10, 2025

Prompts for deep research (openai, gemini,qwen)

116 14 Updated May 17, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,413 1,323 Updated Jul 9, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,765 1,446 Updated Feb 27, 2026

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

Python 7,703 1,448 Updated Jan 4, 2026

[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.

Python 113 11 Updated Dec 20, 2024

LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry

Python 51 3 Updated Oct 9, 2025

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 876 29 Updated Dec 23, 2025

toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts

Python 31 2 Updated Sep 1, 2024

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,269 115 Updated Aug 16, 2025

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 738 50 Updated Oct 15, 2025

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,379 83 Updated May 16, 2025
Next