-
SRI International
- Princeton
- @AnirudhSom
Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
All Algorithms implemented in Python
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Robust Speech Recognition via Large-Scale Weak Supervision
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Making large AI models cheaper, faster and more accessible
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Industry leading face manipulation platform
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
State-of-the-Art Text Embeddings
A collection of simple python mini projects to enhance your python skills
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Train transformer language models with reinforcement learning.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.