LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,493 164 Updated Sep 24, 2024

ezelikman / quiet-star

Code for Quiet-STaR

Python 624 86 Updated Aug 21, 2024

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 50,067 7,158 Updated Oct 30, 2024

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python 773 65 Updated Sep 23, 2024

trotsky1997 / MathBlackBox

Python 548 66 Updated Oct 28, 2024

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,264 414 Updated Oct 21, 2024

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 4,629 403 Updated Oct 7, 2024

jopetty / sfirah

Python 3 1 Updated May 30, 2024

jopetty / word-problem

Experiments on the impact of depth in transformers and SSMs.

Python 14 3 Updated Oct 27, 2024

arcee-ai / DistillKit

An Open Source Toolkit For LLM Distillation

Python 340 36 Updated Sep 17, 2024

ostris / ai-toolkit

Various AI scripts. Mostly Stable Diffusion stuff.

Python 3,187 324 Updated Oct 29, 2024

MollySophia / rwkv-qualcomm

Inference rwkv5 or rwkv6 with Qualcomm AI Engine Direct SDK

C++ 35 3 Updated Oct 29, 2024

teorth / pfr

Repository for formalization of the Polynomial Freiman Ruzsa conjecture (and related results)

Lean 135 33 Updated Oct 29, 2024

yynil / RWKVinLLAMA

Python 13 Updated Oct 29, 2024

athms / mad-lab

A MAD laboratory to improve AI architecture designs 🧪

Python 92 6 Updated May 2, 2024

SizheAn / PanoHead

Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"

Python 1,914 237 Updated Feb 5, 2024

Dan-wanna-M / formatron

Formatron empowers everyone to control the format of language models' output with minimal overhead.

Python 149 6 Updated Oct 29, 2024

Picovoice / porcupine

On-device wake word detection powered by deep learning

Python 3,752 498 Updated Oct 28, 2024

AGENDD / RWKV-ASR

This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the idea of SLAM_ASR and used the RWKV language model as the LLM…

Python 31 3 Updated Oct 27, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 3,225 298 Updated Oct 18, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 5,934 635 Updated Oct 22, 2024

AIIRWKV / RWKV-RAG

RAG SYSTEM FOR RWKV

Python 34 2 Updated Oct 29, 2024

xiaol

Starred repositories

pyannote / pyannote-audio

wenet-e2e / wespeaker

wenet-e2e / wesep

johanwind / wind-rwkv7

genmoai / models

DAGWorks-Inc / burr

TorchRWKV / flash-linear-attention

kyutai-labs / moshi

ictnlp / LLaMA-Omni