lkdhy

Starred repositories

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

767 41 Updated Oct 10, 2025

Ongoing research training transformer models at scale

Python 14,672 3,404 Updated Dec 23, 2025

Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana) state-of-the-art image generation and editing model. Explore AI generated visuals created with…

JavaScript 8,170 834 Updated Sep 8, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,908 3,836 Updated Dec 23, 2025

We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform stron…

Python 46 2 Updated Dec 17, 2025

This is a collection of recent papers on reasoning in video generation models.

86 1 Updated Dec 15, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,941 356 Updated Dec 23, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,721 128 Updated Dec 22, 2025

Enjoy the magic of Diffusion models!

Python 11,195 1,057 Updated Dec 20, 2025

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

Python 194 7 Updated Oct 12, 2025

We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that Sora-2 surpasses GPT5 by 10% on eyeballing puzzles and reache…

Python 226 5 Updated Dec 22, 2025

Contexts Optical Compression

Python 21,535 1,926 Updated Oct 25, 2025
Python 8 Updated Dec 14, 2025

LLM/VLM gaming agents and model evaluation through games.

Python 832 88 Updated Nov 16, 2025

GitHub Copilot CLI brings the power of Copilot coding agent directly to your terminal.

Shell 6,085 741 Updated Dec 19, 2025

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python 3,817 452 Updated Oct 16, 2025

A sleek dataset viewer built entirely by AI Agent. Supports streaming large files from WebDAV, S3, SSH, Local or Hugging Face.

TypeScript 609 41 Updated Oct 21, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,190 120 Updated Nov 9, 2025

The absolute trainer to light up AI agents.

Python 9,808 792 Updated Dec 22, 2025
Python 27 2 Updated Oct 22, 2024

EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in challenging tasks.

Python 429 12 Updated Sep 24, 2025

A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical guides on defining and collecting rewards to build more inte…

51 Updated Sep 1, 2025

Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.

Python 610 44 Updated Nov 25, 2025

We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a re…

Python 1,163 99 Updated Dec 8, 2025
Python 63 4 Updated Oct 22, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,696 1,355 Updated Dec 17, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,709 2,868 Updated Dec 22, 2025

Train transformer language models with reinforcement learning.

Python 16,738 2,372 Updated Dec 22, 2025
Python 345 20 Updated Jul 29, 2025