Stars
Low ReSource Reinforcement Learning with CPU Offloading Training Support
[NeurIPS 2025] One Token per Highly Selective Frame: Towards Extreme Compression for Long Video Understanding
MENTOR is a highly efficient visual RL algorithm that excels in both simulation and real-world complex robotic learning tasks.
[CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models
A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability
Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
This is the repo of NeurIPS 2022 paper: "Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning"
This is the repo of "RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization"
⏰ AI conference deadline countdowns
repository for Unbiased Gradient Boosting Decision Tree with Unbiased Feature Importance
[IJCAI 2023] The official repo of paper 'Automatic Truss Design with Reinforcement Learning'
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
ZheyuAqaZhang / transformers
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
AutoX is an efficient automl tool, which is mainly aimed at data mining tasks with tabular data.