Highlights
Stars
This is a repository for listing papers on scene graph generation and application.
AssetOpsBench - Industry 4.0: A unified benchmark and framework for building, orchestrating, and evaluating domain-specific AI agents for Industry 4.0 asset operations and maintenance, with 460+ sc…
Collection of resources from prior speaking engagements and guidance on booking Chinasa for speaking engagements.
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
a fork of https://jonbarron.info/ for use in jekyll builds with markdown page updates
Reinforcement Learning for Neural Machine Translation
Reinforcement learning for machine translation use FairSeq toolkit.
Reduce the size of pretrained Hugging Face models via vocabulary trimming.
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning
TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted from TED talks www.ted.com for 109 world languages.
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
A modular RL library to fine-tune language models to human preferences
Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method