Tony Li lkdhy

A Chinese ACMer who won a silver medal at the EC-Final

37 followers · 73 following

Fudan University
Shanghai
09:12 (UTC +08:00)
https://blog.csdn.net/weixin_50011798/article/details/135598566
https://leetcode.com/u/lkdhy/

Lists (2)

LLM

6 repositories

streamlit

1 repository

Starred repositories

showlab / Awesome-Unified-Multimodal-Models

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

767 41 Updated Oct 10, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 14,672 3,404 Updated Dec 23, 2025

JimmyLv / awesome-nano-banana

Forked from jamez-bondos/awesome-gpt4o-images

A curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana), a state-of-the-art image generation and editing model. Explore AI-generated visuals created with…

JavaScript 8,170 834 Updated Sep 8, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 21,908 3,836 Updated Dec 23, 2025

FoundationAgents / VR-Bench

We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform stron…

Python 46 2 Updated Dec 17, 2025

Video-Reason / Awesome-Video-Reasoning

This is a collection of recent papers on reasoning in video generation models.

86 1 Updated Dec 15, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,941 356 Updated Dec 23, 2025

aigc-apps / VideoX-Fun

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,721 128 Updated Dec 22, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 11,195 1,057 Updated Dec 20, 2025

thuml / MiniVeo3-Reasoner

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

Python 194 7 Updated Oct 12, 2025

tongjingqi / Thinking-with-Video

We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that Sora-2 surpasses GPT5 by 10% on eyeballing puzzles and reache…

Python 226 5 Updated Dec 22, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 21,535 1,926 Updated Oct 25, 2025

betmma / VLMPuzzle

Python 8 Updated Dec 14, 2025

lmgame-org / GamingAgent

LLM/VLM gaming agents and model evaluation through games.

Python 832 88 Updated Nov 16, 2025

github / copilot-cli

GitHub Copilot CLI brings the power of Copilot coding agent directly to your terminal.

Shell 6,085 741 Updated Dec 19, 2025

HKUDS / AI-Researcher

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python 3,817 452 Updated Oct 16, 2025

stardustai / dataset-viewer

A sleek dataset viewer built entirely by AI Agent. Supports streaming large files from WebDAV, S3, SSH, Local or Hugging Face.

TypeScript 609 41 Updated Oct 21, 2025

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,190 120 Updated Nov 9, 2025

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 9,808 792 Updated Dec 22, 2025

chengtan9907 / ReviewMT

Python 27 2 Updated Oct 22, 2024

ByteDance-Seed / EvaLearn

EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in challenging tasks.

Python 429 12 Updated Sep 24, 2025

tongjingqi / Awesome-Agent-RL

A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical guides on defining and collecting rewards to build more inte…

51 Updated Sep 1, 2025

LightChen233 / Awesome-AI4Research

204 15 Updated Aug 5, 2025

SkyworkAI / Matrix-3D

Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.

Python 610 44 Updated Nov 25, 2025

Francis-Rings / StableAvatar

We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a re…

Python 1,163 99 Updated Dec 8, 2025