[CVPR 2026] Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking

Python 300 21 Updated Jun 12, 2026

Official implementation of the paper "Vocoder is not all you need".

Python 14 Updated Jun 5, 2026

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 2,303 195 Updated Apr 19, 2026

Interactive World Model papers organized by core research challenges.

Python 241 8 Updated Jun 19, 2026
Python 459 34 Updated May 26, 2026

Dataflow-Oriented Reinforcement Learning for (Multi-)Agentic LLMs

Python 90 15 Updated Jun 21, 2026

Fine-tune Gemma 4 and 3n with audio, images and text on Apple Silicon, using PyTorch and Metal Performance Shaders.

Python 1,475 104 Updated May 12, 2026

End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.

C 1,297 126 Updated Jun 22, 2026

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,742 61 Updated Jun 18, 2026

MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run direc…

Python 3,646 463 Updated Jun 2, 2026

🌋LavaSR: Fast Speech restoration and enhancement

Python 553 49 Updated Jun 19, 2026

Minimalist ML framework for Rust

Rust 20,528 1,612 Updated Jun 22, 2026

A TurboQuant inference server

Rust 457 43 Updated Apr 24, 2026

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python 5,080 635 Updated Jun 22, 2026

High-Quality Voice Cloning TTS for 600+ Languages

Python 7,663 1,199 Updated Jun 11, 2026

Expose Antigravity as OpenAI & Anthropic compatible API (base_url + key)

TypeScript 192 21 Updated Mar 21, 2026

A plug-and-play compiler that delivers free-lunch optimizations for both inference and training.

Python 314 23 Updated May 31, 2026

Triton kernel fusion for Qwen3-TTS 1.7B inference acceleration — RMSNorm, SwiGLU, M-RoPE, Norm+Residual

Python 88 6 Updated Jun 21, 2026

Claw-Eval is a harness for evaluating LLMs as agents. All tasks are verified by humans.

Python 679 59 Updated May 17, 2026

AI agents running research on single-GPU nanochat training automatically

Python 88,059 12,751 Updated Mar 26, 2026

Plug-and-play streaming semantic VAD for real-time full-duplex spoken dialogue systems.

Python 251 26 Updated Mar 20, 2026

Sparse Transition Matrix-Accelerated Trie Index for Constrained Decoding (https://arxiv.org/abs/2602.22647)

Python 221 27 Updated Mar 29, 2026

Open source voice dictation technology

TypeScript 967 110 Updated Jun 5, 2026

LiteRT-LM is Google's production-ready, high-performance, open-source inference framework for deploying Large Language Models on edge devices.

C++ 5,666 589 Updated Jun 22, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,918 79,536 Updated Jun 22, 2026

Secure, Fast, and Extensible Sandbox runtime for AI agents.

Python 11,609 963 Updated Jun 22, 2026
Python 88 9 Updated Feb 24, 2026

onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime

C++ 469 134 Updated Jun 22, 2026

A framework for efficient model inference with omni-modality models

Python 5,236 1,155 Updated Jun 22, 2026