Stars
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
High accuracy RAG for answering questions from scientific documents with citations
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
A library to generate LaTeX expressions from Python code.
Home of StarCoder: fine-tuning & inference!
A Collection of Variational Autoencoders (VAE) in PyTorch.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Community maintained fork of pdfminer - we fathom PDF
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
Model interpretability and understanding for PyTorch
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Aligning pretrained language models with instruction data generated by themselves.
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
General technology for enabling AI capabilities w/ LLMs and MLLMs
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models"
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
An unofficial PyTorch implementation of the audio LM VALL-E
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)