79 stars written in Python (starred by MPX0222)

[COLM 2025] LIMO: Less is More for Reasoning

Python 1,043 51 Updated Jul 30, 2025

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 888 65 Updated Sep 26, 2025

OpenFE: automated feature generation with expert-level performance

Python 827 108 Updated May 27, 2024

The official code of ARPO & AEPO

Python 761 36 Updated Nov 5, 2025

A version of verl to support diverse tool use

Python 668 50 Updated Nov 5, 2025

Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)

Python 667 65 Updated Oct 30, 2025

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Python 596 43 Updated Oct 29, 2024

[🚀ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!

Python 594 15 Updated May 1, 2025

[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"

Python 435 17 Updated Sep 28, 2025

Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).

Python 385 50 Updated Apr 12, 2024

[VLDB'25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.

Python 375 40 Updated Sep 8, 2025

A Gym for Agentic LLMs

Python 348 20 Updated Oct 30, 2025

Semantic Evaluation for Text-to-SQL with Distilled Test Suites

Python 304 70 Updated Jun 5, 2024

Python 299 13 Updated May 29, 2025

ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)

Python 241 25 Updated Nov 8, 2025

[NeurIPS 2025] Thinkless: LLM Learns When to Think

Python 241 18 Updated Sep 26, 2025

Build deep learning applications in a new and easy way.

Python 241 21 Updated Dec 3, 2024

Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"

Python 229 11 Updated Oct 29, 2025

🔥[NeurIPS'25] DeepFund: Pilot for Your Next Fund Investment

Python 214 35 Updated Oct 30, 2025

MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, realistic, and adaptive scene generation for applications in…

Python 193 7 Updated May 5, 2025

This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.

Python 175 7 Updated Jul 7, 2025

MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

Python 161 13 Updated Nov 1, 2025

🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”

Python 139 17 Updated Oct 2, 2025

🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"

Python 116 12 Updated Oct 23, 2025

Plotly dataset-visualization pairs, feature extraction scripts, and model training code for VizML (CHI 2019)

Python 110 30 Updated May 20, 2021

[NeurIPS 2025] OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis

Python 107 6 Updated Sep 22, 2025

[NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"

Python 105 12 Updated Nov 6, 2025

[NeurIPS 2024] The implementation of the paper "On Softmax Direct Preference Optimization for Recommendation"

Python 87 4 Updated Nov 29, 2024

Data derived from the Linked Births and Deaths Data (LBIDD); simulated pairs of treatment assignment and outcomes; scoring code

Python 84 13 Updated May 23, 2018

This is the official implementation for "AUTOPR: LET'S AUTOMATE YOUR ACADEMIC PROMOTION!".

Python 80 4 Updated Oct 16, 2025