-
Meituan / ECNU / UC San Diego
- San Diego, CA
-
06:16
(UTC -08:00) - https://wjn1996.github.io/
- https://wjn1996.blog.csdn.net/
Stars
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
LongCat Audio Tokenizer and Detokenizer
The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"
R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
Latest Advances on System-2 Reasoning
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"
[EMNLP2025 Main] Code, Result and Files for paper[Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?]
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Technical report of Kimina-Prover Preview.
[ACL 2025] An inference-time decoding strategy with adaptive foresight sampling
A series of technical report on Slow Thinking with LLM
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
A unified evaluation framework for large language models
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
800,000 step-level correctness labels on LLM solutions to MATH problems
The related works and background techniques about Openai o1