Stars
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
🐬 A comprehensive tutorial on getting started with Docker!
Simultaneous speech-to-text models
Website for viewing and exporting full transcripts of Apple Podcasts
Examples for using ONNX Runtime for machine learning inferencing.
PyEER is a python package for biometric systems performance evaluation. Includes ROC, DET, FNMR, FMR and CMC curves plotting, scores distribution plotting, EER and operating points estimation. It c…
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
The simplest, fastest repository for training/finetuning small-sized VLMs.
Workshop on development of a Local Retrieval Augmented Generation (RAG) system
Official repository of the paper: "ID-Booth: Identity-consistent Face Generation with Diffusion Models"
Clean C++ project for you to use. Features: Modern CMake, CPack, Doxygen, PlantUML, Catch Unit testing, static analysis
A vision language model for gigapixel whole slide images in histopathology
APFS module for linux, with experimental write support
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Interactively explore unstructured datasets from your dataframe.