-
-
stable-baselines3 Public
Forked from DLR-RM/stable-baselines3PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Python MIT License UpdatedDec 8, 2025 -
Book-Mathematical-Foundation-of-Reinforcement-Learning Public
Forked from MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-LearningThis is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
MATLAB UpdatedJun 24, 2025 -
RLAIF-V Public
Forked from RLHF-V/RLAIF-V[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
Python UpdatedMay 14, 2025 -
openvla Public
Forked from openvla/openvlaOpenVLA: An open-source vision-language-action model for robotic manipulation.
Python MIT License UpdatedMar 23, 2025 -
open-pi-zero Public
Forked from allenzren/open-pi-zeroRe-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
Python MIT License UpdatedJan 31, 2025 -
LLaVA Public
Forked from haotian-liu/LLaVA[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Python Apache License 2.0 UpdatedAug 12, 2024 -
Video-LLaMA Public
Forked from DAMO-NLP-SG/Video-LLaMA[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Python BSD 3-Clause "New" or "Revised" License UpdatedJun 4, 2024 -
RL-VLM-F Public
Forked from yufeiwang63/RL-VLM-FCode for Reinforcement Learning from Vision Language Foundation Model Feedback
C++ UpdatedMay 22, 2024 -
BPref Public
Forked from rll-research/BPrefOfficial codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.
Python MIT License UpdatedNov 3, 2021 -
tensorflow-tutorial-samples Public
Forked from geektutu/tensorflow-tutorial-samplesTensorFlow2教程 TensorFlow 2.0 Tutorial 入门教程实战案例
Python UpdatedNov 22, 2020