Skip to content
View lxtGH's full-sized avatar
💬
At home
💬
At home

Highlights

  • Pro

Block or report lxtGH

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,711 225 Updated Apr 14, 2026

📚 A curated list of Awesome Efficient dLLMs Papers with Codes

12 Updated Apr 24, 2026

Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.

2,127 83 Updated Jun 11, 2026

A framework for few-shot evaluation of language models.

Python 13,038 3,354 Updated Jun 22, 2026

Official Repo For PerceptionDLM Codebase

Python 38 1 Updated Jun 22, 2026

Use agent to learn agent - A skeleton course on how to design, build, and operate production AI agents

JavaScript 425 61 Updated Jun 22, 2026

DreamX-World: A General-Purpose Interactive World Model

Python 550 34 Updated Jun 22, 2026

[ICML 2026] The official implementation of paper "Unified Multimodal Autoregressive Modeling with Shared Context—Visual Tokenizer is Key to Unification"

Python 33 Updated Jun 23, 2026

Bernini is a unified framework for video generation and editing that combines an MLLM-based semantic planner with a DiT-based renderer.

Python 933 74 Updated Jun 22, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,705 969 Updated Jun 23, 2026

Code release for "i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models"

Python 163 10 Updated Jun 11, 2026

The codebase of Cola DLM

Python 237 13 Updated Jun 11, 2026
34 Updated Jun 11, 2026

HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing

Python 294 13 Updated Mar 18, 2026
Jupyter Notebook 548 42 Updated Jun 10, 2026

**Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.

Python 399 17 Updated Nov 3, 2025

[survey] Watch, Remember, Reason: Human-View Video Understanding with MLLMs

24 9 Updated Jun 13, 2026

Official open-source code for the paper "Towards One-to-Many Temporal Grounding".

Python 2 Updated Jun 20, 2026

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 6,469 765 Updated Mar 23, 2025

Official implementation of LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing

Python 65 2 Updated Jun 6, 2026

Multimodal RL training framework for diffusion & omni models

Python 407 58 Updated Jun 23, 2026

JoyAI-Echo: Pushing the Frontier of Long Audio-Visual Generation

Python 1,655 149 Updated Jun 16, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 380,065 79,589 Updated Jun 23, 2026

Toolkit for linearizing PDFs for LLM datasets/training

Python 17,404 1,400 Updated Mar 25, 2026

AllenAI's post-training codebase

Python 3,764 549 Updated Jun 22, 2026

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

164 6 Updated Feb 6, 2026

SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

Python 533 219 Updated Jun 23, 2026

Writing AI Conference Papers: A Handbook for Beginners

3,840 135 Updated Jul 16, 2025
Next