Skip to content
View Meirtz's full-sized avatar
🫠
I may be slow to respond.
🫠
I may be slow to respond.

Organizations

@Sa1varmy @OuO-language @LUNAD3v @DLFC

Block or report Meirtz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

TypeScript 10,523 1,199 Updated Feb 2, 2026

CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning

Python 23 1 Updated Feb 11, 2026

query-only test-time-training for long-context language modeling

Python 3 Updated Oct 7, 2025

LLM KV cache compression made easy

Python 917 109 Updated Feb 16, 2026
Python 303 27 Updated Jul 10, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,185 394 Updated Jul 11, 2024

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 193 55 Updated Feb 11, 2026

MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

Python 307 24 Updated Feb 3, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,108 501 Updated Feb 17, 2026

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 403 15 Updated Jan 29, 2026

A unified architecture deep learning framework designed specifically for ultra-large-scale sparse models.

Python 317 22 Updated Feb 9, 2026

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 594 61 Updated Feb 15, 2026

temp trival for coconut

Python 2 Updated Sep 7, 2025

LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild

Python 16 2 Updated Oct 31, 2024
Python 46 6 Updated Oct 20, 2025

Dynamic Context Selection for Efficient Long-Context LLMs

Python 56 4 Updated May 20, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,362 428 Updated Feb 10, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,834 218 Updated Feb 17, 2026

ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)

Python 276 36 Updated Feb 10, 2026

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,599 186 Updated Feb 12, 2026

slime is an LLM post-training framework for RL Scaling.

Python 4,222 546 Updated Feb 14, 2026

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,452 121 Updated Nov 13, 2025

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Python 178 27 Updated Dec 11, 2025

Open source code for ICLR 2026 Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Python 225 36 Updated Jan 27, 2026

Production-ready platform for agentic workflow development.

TypeScript 129,735 20,189 Updated Feb 17, 2026

User Profile-Based Long-Term Memory for AI Chatbot Applications.

Python 2,565 196 Updated Jan 11, 2026

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Python 112 6 Updated Feb 2, 2026

The missing star history graph of GitHub repos - https://star-history.com

TypeScript 8,487 317 Updated Feb 17, 2026

Latest Advances on Long Chain-of-Thought Reasoning

609 27 Updated Jul 18, 2025
Next