-
Chinese Academy of Sciences
- Beijing, China
- https://me.meirtz.com/about
Stars
Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.
CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning
query-only test-time-training for long-context language modeling
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Bridge Megatron-Core to Hugging Face/Reinforcement Learning
MemGen: Weaving Generative Latent Memory for Self-Evolving Agents
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
A unified architecture deep learning framework designed specifically for ultra-large-scale sparse models.
Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.
LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild
Dynamic Context Selection for Efficient Long-Context LLMs
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…
slime is an LLM post-training framework for RL Scaling.
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent
Open source code for ICLR 2026 Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions
Production-ready platform for agentic workflow development.
User Profile-Based Long-Term Memory for AI Chatbot Applications.
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
The missing star history graph of GitHub repos - https://star-history.com
Latest Advances on Long Chain-of-Thought Reasoning