Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 38,587 3,687 Updated Jul 9, 2025

s0md3v / roop

one-click face swap

Python 30,346 6,900 Updated Aug 19, 2024

python-telegram-bot / python-telegram-bot

We have made you a wrapper you can't refuse

Python 28,370 5,906 Updated Nov 3, 2025

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 26,964 5,813 Updated Sep 27, 2025

facebookresearch / Detectron

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

Python 26,391 5,442 Updated Nov 20, 2023

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 24,362 1,888 Updated Jun 3, 2025

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,818 2,665 Updated Jul 3, 2025

wshobson / agents

Intelligent automation and multi-agent orchestration for Claude Code

Python 20,048 2,236 Updated Nov 1, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,621 1,975 Updated Oct 21, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,056 3,181 Updated Nov 6, 2025

emcie-co / parlant

LLM agents built for control. Designed for real-world use. Deployed in minutes.

Python 15,921 1,316 Updated Nov 6, 2025

lukas-blecher / LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 15,920 1,268 Updated Jan 18, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,544 1,988 Updated Nov 3, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,363 1,354 Updated Oct 1, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,470 1,278 Updated Oct 12, 2025

chiphuyen / stanford-tensorflow-tutorials

This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.

Python 10,373 4,285 Updated Dec 22, 2020

google-deepmind / sonnet

TensorFlow-based neural network library

Python 9,890 1,309 Updated Aug 4, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,568 2,343 Updated Nov 5, 2025

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,727 1,382 Updated Dec 6, 2023

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python 7,579 560 Updated Sep 15, 2025

numenta / nupic-legacy

Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.

Python 6,352 1,550 Updated Dec 3, 2024

thtrieu / darkflow

Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices

Python 6,154 2,049 Updated Oct 23, 2023

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,038 629 Updated Aug 10, 2024

oarriaga / face_classification

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

Python 5,701 1,612 Updated Mar 8, 2024

rednote-hilab / dots.ocr

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,607 563 Updated Oct 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Thinh P. Tran ThinhPTran

Achievements