Skip to content
View ThinhPTran's full-sized avatar
  • Ho chi minh city, Vietnam

Block or report ThinhPTran

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
327 results for source starred repositories
Clear filter
Python 1,170 156 Updated Oct 28, 2025

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 15,912 1,268 Updated Jan 18, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 48,974 8,201 Updated Dec 9, 2024

Face Editor for Stable Diffusion

Python 1,068 89 Updated Sep 15, 2024

one-click face swap

Python 30,335 6,900 Updated Aug 19, 2024

The easiest way to make yourself the hero of video memes.

Python 26 6 Updated Mar 20, 2025

DiffFace: Diffusion-based Face Swapping with Facial Guidance

Python 305 23 Updated Feb 16, 2025

[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Python 1,324 175 Updated Mar 13, 2025

State-of-the-art 2D and 3D Face Analysis Project

Python 26,948 5,814 Updated Sep 27, 2025

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,583 562 Updated Oct 31, 2025

UniSpeech - Large Scale Self-Supervised Learning for Speech

Python 472 74 Updated Apr 5, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,811 2,663 Updated Jul 3, 2025

Speaker detection using a lip movement based RNN detector

Python 76 20 Updated Jun 22, 2018

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,043 3,177 Updated Nov 5, 2025
Python 2 1 Updated Sep 2, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,341 1,352 Updated Oct 1, 2025

LLM agents built for control. Designed for real-world use. Deployed in minutes.

Python 15,842 1,306 Updated Nov 5, 2025

Nhận dạng giọng nói Tiếng Việt sử dụng model Quartznet (Nvidia) + flask demo

Python 2 2 Updated Apr 28, 2021

convert phoneme to grapheme vietnames

Python 5 5 Updated Jul 7, 2020

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,869 7,481 Updated Nov 5, 2025

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

2,065 252 Updated Jun 6, 2024
Python 1,165 111 Updated Oct 9, 2025

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

TypeScript 18,332 2,547 Updated Nov 5, 2025

Official inference framework for 1-bit LLMs

Python 24,352 1,886 Updated Jun 3, 2025

Rapid Product Development with n8n, published by Packt

HTML 42 23 Updated Jan 30, 2023

Implementation of F5-TTS in MLX

Python 593 60 Updated Mar 19, 2025
Next