A Lightweight LLM Inference Performance Simulator

Python 67 18 Updated Mar 18, 2026

Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko,…

Python 395 64 Updated Oct 12, 2025

Tuned OpenCL BLAS

C++ 1,170 212 Updated Apr 2, 2026

A tiny C++ Reflection implementation project

C++ 2 Updated Oct 25, 2022

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 9,941 769 Updated Sep 22, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,342 5,130 Updated Apr 2, 2026

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs

Python 12,905 2,793 Updated Jun 22, 2025

PandA-bambu public repository

C++ 318 64 Updated Feb 10, 2026

Circuit IR Compilers and Tools

C++ 2,076 446 Updated Apr 2, 2026

Development repository for the Triton language and compiler

MLIR 18,828 2,723 Updated Apr 2, 2026

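MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train chatGPT. The MNBVC dataset covers not only mainstream culture but also niche cultures and even Martian script (火星文). It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.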

4,157 288 Updated Mar 22, 2026

The official Meta Llama 3 GitHub site

Python 29,294 3,530 Updated Jan 26, 2025

Triton Compiler related materials.

42 6 Updated Mar 16, 2026

The Modular Platform (includes MAX & Mojo)

Mojo 25,822 2,793 Updated Apr 2, 2026

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,580 1,956 Updated Apr 2, 2026

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 21,111 4,913 Updated Jan 29, 2026

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 5,025 617 Updated Jul 2, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

Python 7,956 779 Updated Feb 11, 2024

One minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)

Python 56,321 6,154 Updated Feb 9, 2026

LLM inference in C/C++

C++ 100,742 16,203 Updated Apr 2, 2026

Fay is an agent framework that helps digital humans (2.5D, 3D, mobile, PC, web) or large language models (OpenAI-compatible, DeepSeek) connect to business systems.

Python 12,610 2,258 Updated Apr 2, 2026

DUSt3R: Geometric 3D Vision Made Easy

Python 7,047 745 Updated Sep 24, 2025

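This project implements Wav2lip video lip synthesis based on SadTalkers. It generates lip shapes driven by speech from a video file, and applies a configurable enhancement to the facial region to sharpen the synthesized lip (face) area and improve the clarity of the generated lips. It uses the DAIN frame-interpolation deep learning algorithm to add frames to the generated video, filling in the motion transitions of the synthesized lips between frames so that they look smoother, more realistic, and more natural.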

Python 2,012 351 Updated Jun 4, 2023

A latent text-to-image diffusion model

Jupyter Notebook 72,801 10,619 Updated Jun 18, 2024

How to optimize some algorithms in CUDA.

Cuda 2,905 266 Updated Apr 1, 2026