Stars
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
a subset of coco dataset for faster experimentation
PyTorch code and models for VJEPA2 self-supervised learning from video.
[AAAI 2023] Contrastive Masked Autoencoders for Self-Supervised Video Hashing
Reference PyTorch implementation and models for DINOv3
Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
ObjectClear: Complete Object Removal via Object-Effect Attention
Migrate a project from Poetry/Pipenv/pip-tools/pip to uv package manager
👤 | Face re-identification using FAISS, ArcFace & SCRFD | ONNX Runtime Inference
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU.
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
A framework for few-shot evaluation of language models.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Official inference framework for 1-bit LLMs
Highly Performant, Modular, Memory Safe and Production-ready Inference, Ingestion and Indexing built in Rust 🦀
A game theoretic approach to explain the output of any machine learning model.
A guidance language for controlling large language models.
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models
[NeurIPS 2025] Official Implementation of DINO-Foresight: Looking into the Future with DINO