Stars
Enjoy the magic of Diffusion models!
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets
MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.
This repository contains the official implementation of "ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body".
Official Code of CVPR 2025 paper "SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters"
Align Anything: Training All-modality Model with Feedback
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
TrackerLab is a cutting-edge modular framework for humanoid motion retargeting, trajectory tracking, and skill-level control, built on top of IsaacLab.
The official repo for the paper "LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods".
Train transformer language models with reinforcement learning.
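"A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese users on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment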
LLaMA 2 implemented from scratch in PyTorch
Mobile-Agent: The Powerful GUI Agent Family
RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space (ICML2026)
Sharp Monocular View Synthesis in Less Than a Second
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
Official implementation of CVPR24 highlight paper "Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance"
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Being-H is BeingBeyond's family of human-centric embodied foundation models.
An aggregation of human motion understanding research.
[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans
This repository contains the official implementation of "The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion".
(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
Momentum Human Rig is an anatomically-inspired parametric full-body digital human model developed at Meta. It includes: A parametric body skeletal model; A realistic 3D mesh skinned to the skeleton…
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.