- SRI International
- Princeton
- @AnirudhSom
Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
All Algorithms implemented in Python
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Robust Speech Recognition via Large-Scale Weak Supervision
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Making large AI models cheaper, faster and more accessible
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Industry leading face manipulation platform
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
State-of-the-Art Text Embeddings
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Train transformer language models with reinforcement learning.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"