Lists (18)
Sort Name ascending (A-Z)
Starred repositories
Pytorch PI-zero and PI-zero-fast. Adapted from LeRobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
LeRobot sim2real code. Train in fast simulation and deploy visual policies zero shot to the real world
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020)
Official implementation of Diffusion Policy Policy Optimization, arxiv 2024
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurIPS'24]
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
Links to publications that focus on the interpretation and analysis of in-context learning
Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"
Here is the resources and code for the LotteryCodec.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
リアルタイムボイスチェンジャー Realtime Voice Changer