Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 38,590 3,689 Updated Jul 9, 2025

s0md3v / roop

one-click face swap

Python 30,348 6,900 Updated Aug 19, 2024

python-telegram-bot / python-telegram-bot

We have made you a wrapper you can't refuse

Python 28,373 5,910 Updated Nov 8, 2025

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 26,966 5,812 Updated Sep 27, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 24,367 1,889 Updated Jun 3, 2025

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,819 2,669 Updated Jul 3, 2025

wshobson / agents

Intelligent automation and multi-agent orchestration for Claude Code

Python 20,128 2,246 Updated Nov 1, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,643 1,978 Updated Oct 21, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,060 3,182 Updated Nov 8, 2025

emcie-co / parlant

LLM agents built for control. Designed for real-world use. Deployed in minutes.

Python 15,952 1,319 Updated Nov 8, 2025

lukas-blecher / LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 15,926 1,269 Updated Jan 18, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,549 1,990 Updated Nov 3, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,371 1,354 Updated Oct 1, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,493 1,282 Updated Oct 12, 2025

google-deepmind / sonnet

TensorFlow-based neural network library

Python 9,891 1,309 Updated Aug 4, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,571 2,343 Updated Nov 5, 2025

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,726 1,382 Updated Dec 6, 2023

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python 7,589 562 Updated Sep 15, 2025

numenta / nupic-legacy

Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.

Python 6,352 1,550 Updated Dec 3, 2024

thtrieu / darkflow

Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices

Python 6,154 2,049 Updated Oct 23, 2023

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,042 631 Updated Aug 10, 2024

oarriaga / face_classification

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

Python 5,701 1,612 Updated Mar 8, 2024

rednote-hilab / dots.ocr

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,616 564 Updated Oct 31, 2025

quantumlib / Cirq

Python framework for creating, editing, and invoking Noisy Intermediate-Scale Quantum (NISQ) circuits.

Python 4,763 1,148 Updated Nov 7, 2025

google-deepmind / alphageometry

Python 4,685 549 Updated Jun 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Thinh P. Tran ThinhPTran

Achievements