The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,533 319 Updated May 26, 2026

SandAI-org / MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Python 3,706 238 Updated Jun 17, 2025

facebookresearch / omnilingual-asr

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,838 254 Updated Dec 30, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,958 18,090 Updated Jun 15, 2026

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5,014 373 Updated Apr 6, 2026

umnooob / signvip

Python 23 7 Updated Nov 26, 2025

meituan-longcat / LongCat-Flash-Omni

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 492 32 Updated May 9, 2026

facebookresearch / large_concept_model

Large Concept Models: Language modeling in a sentence representation space

Python 2,363 210 Updated Jan 29, 2025

zcgzcgzcg1 / MRC_book

《机器阅读理解：算法与实践》代码

Python 157 59 Updated Jul 25, 2024

huggingface / speech-to-speech

Build local voice agents with open-source models

Python 4,884 582 Updated Jun 15, 2026

KlingAIResearch / HumanAesExpert

Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment"

Python 118 2 Updated Apr 15, 2025

XueZeyue / DanceGRPO

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,608 83 Updated Oct 16, 2025

facebookresearch / cwm

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 883 71 Updated Jun 11, 2026

facebookresearch / perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,299 157 Updated Apr 13, 2026

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,761 273 Updated Jul 18, 2025

2U1 / Qwen-VL-Series-Finetune

An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,914 217 Updated May 26, 2026

Fantasy-AMAP / fantasy-talking

[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Python 1,621 127 Updated Jan 26, 2026

Omni-Avatar / OmniAvatar

Python 1,831 167 Updated Aug 6, 2025

huggingface / finetrainers

Scalable and memory-optimized training of diffusion models

Python 1,360 140 Updated May 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rongjiehuang

Achievements

Achievements

Organizations

Block or report Rongjiehuang

Stars

yihedeng9 / rlhf-summary-notes

OpenGVLab / ScaleCUA

Gen-Verse / OpenClaw-RL

Tongyi-MAI / MAI-UI

verl-project / verl

ServiceNow / GroundCUA

meituan / EvoCUA

Gitlawb / openclaude

ultraworkers / claw-code

meituan-longcat / LongCat-Flash-Thinking-2601

srvk / how2-dataset

facebookresearch / sam-audio