Lingrui Mei Meirtz

🫠

I may be slow to respond.

PhD Student at Chinese Academy of Sciences. Intern at Tencent HY. Computational Linguistics / Reinforcement Learning / Racing / Gaming

171 followers · 61 following

Chinese Academy of Sciences
Beijing, China
https://me.meirtz.com/about

Stars

snarktank / ralph

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

TypeScript · 10,523 stars · 1,199 forks · Updated Feb 2, 2026

wuhang03 / CamReasoner

CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning

Python · 23 stars · 1 fork · Updated Feb 11, 2026

RachitBansal / qTTT

query-only test-time-training for long-context language modeling

Python · 3 stars · Updated Oct 7, 2025

NVIDIA / kvpress

LLM KV cache compression made easy

Python · 917 stars · 109 forks · Updated Feb 16, 2026

FasterDecoding / SnapKV

Python · 303 stars · 27 forks · Updated Jul 10, 2025

mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python · 7,185 stars · 394 forks · Updated Jul 11, 2024

ISEEKYAN / mbridge

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python · 193 stars · 55 forks · Updated Feb 11, 2026

KANABOON1 / MemGen

MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

Python · 307 stars · 24 forks · Updated Feb 3, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python · 5,108 stars · 501 forks · Updated Feb 17, 2026

Mini-o3 / Mini-o3

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python · 403 stars · 15 forks · Updated Jan 29, 2026

alibaba / RecIS

A unified architecture deep learning framework designed specifically for ultra-large-scale sparse models.

Python · 317 stars · 22 forks · Updated Feb 9, 2026

WooooDyy / AgentGym-RL

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python · 594 stars · 61 forks · Updated Feb 15, 2026

simonsshoot / Coconut_Demo

temp trival for coconut

Python · 2 stars · Updated Sep 7, 2025

yeyimilk / LLMGeo

LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild

Python · 16 stars · 2 forks · Updated Oct 31, 2024

Weiyun1025 / verl-internvl

Python · 46 stars · 6 forks · Updated Oct 20, 2025

antgroup / OmniKV

Dynamic Context Selection for Efficient Long-Context LLMs

Python · 56 stars · 4 forks · Updated May 20, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python · 7,362 stars · 428 forks · Updated Feb 10, 2026

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python · 2,834 stars · 218 forks · Updated Feb 17, 2026

snowflakedb / ArcticTraining

ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)

Python · 276 stars · 36 forks · Updated Feb 10, 2026

SwanHubX / SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python · 3,599 stars · 186 forks · Updated Feb 12, 2026