-
Bytedance (Tiktok)
- Singapore
- https://lxtgh.github.io/
- @xtl994
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
📚 A curated list of Awesome Efficient dLLMs Papers with Codes
Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.
A framework for few-shot evaluation of language models.
Official Repo For PerceptionDLM Codebase
Use agent to learn agent - A skeleton course on how to design, build, and operate production AI agents
DreamX-World: A General-Purpose Interactive World Model
[ICML 2026] The official implementation of paper "Unified Multimodal Autoregressive Modeling with Shared Context—Visual Tokenizer is Key to Unification"
Bernini is a unified framework for video generation and editing that combines an MLLM-based semantic planner with a DiT-based renderer.
slime is an LLM post-training framework for RL Scaling.
Code release for "i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models"
HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing
**Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.
[survey] Watch, Remember, Reason: Human-View Video Understanding with MLLMs
Official open-source code for the paper "Towards One-to-Many Temporal Grounding".
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
Official implementation of LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing
Multimodal RL training framework for diffusion & omni models
JoyAI-Echo: Pushing the Frontier of Long Audio-Visual Generation
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Toolkit for linearizing PDFs for LLM datasets/training
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision
SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
Writing AI Conference Papers: A Handbook for Beginners