Stars
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
A comprehensive survey on Reinforcement Learning for Vision-Language-Action models, including research papers, technical blogs, and automated monitoring tools
1st place solution of 2025 BEHAVIOR Challenge
WinstonWmj / RLinf
Forked from RLinf/RLinfRLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Making a mini version of the BDX droid. https://discord.gg/UtJZsgfQGe
[Lumina具身智能社区] 具身智能技术指南 Embodied-AI-Guide
Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 BibTeX 来增强 Overleaf。
WinstonWmj / MP5
Forked from IranQin/MP5This repository is a reproduction of the MP5 repository, for the convenience of others to reproduce this work.
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
通过APP控制的可实现手柄控制、重力控制、循线功能、自主探索等功能的多功能智能小车
The world's simplest facial recognition api for Python and the command line
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Step into the unknown darkness, converse with the hidden secrets. Behind every baffling scenario, uncover the shocking truth. Let's explore the world of Situation Puzzles together, finding the stor…
Common used path planning algorithms with animations.
THUCNews中文文本分类数据集,该数据集包含84万篇新闻文档,总计14类;在该模型的基础上测试多个版本bert分类效果。
KDD21 Attentive Heterogeneous Graph Embedding for Job Mobility Prediction
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
PyTorch – SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models.
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
AI Roadmap:机器学习(Machine Learning)、深度学习(Deep Learning)、对抗神经网络(GAN),图神经网络(GNN),NLP,大数据相关的发展路书(roadmap), 并附海量源码(python,pytorch)带大家消化基本知识点,突破面试,完成从新手到合格工程师的跨越,其中深度学习相关论文附有tensorflow caffe官方源码,应用部分含推荐算法…