Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A high-throughput and memory-efficient inference and serving engine for LLMs
scikit-learn: machine learning in Python
No fortress, purely open ground. OpenManus is Coming.
Portable file server with accelerated resumable uploads, dedup, WebDAV, FTP, TFTP, zeroconf, media indexer, thumbnails++ all in one file, no deps
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Data validation using Python type hints
🤗 smolagents: a barebones library for agents that think in code.
Code for the paper "Language Models are Unsupervised Multitask Learners"
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Faster Whisper transcription with CTranslate2
Datasets, Transforms and Models specific to Computer Vision
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Simple, unified interface to multiple Generative AI providers
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Accessible large language models via k-bit quantization for PyTorch.
🔥 2D and 3D Face alignment library build using pytorch
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
The official PyTorch implementation of Google's Gemma models
Video+code lecture on building nanoGPT from scratch
A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and displaying Matplotlib images easier with OpenCV and Python.
Everything about the SmolLM and SmolVLM family of models
Whisper realtime streaming for long speech-to-text transcription and translation
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
ControlNet++: All-in-one ControlNet for image generations and editing!