Starred repositories

A PyTorch native platform for training generative AI models

Python 4,871 650 Updated Dec 24, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,545 979 Updated Dec 13, 2025

Nano vLLM

Python 10,082 1,262 Updated Nov 3, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,706 698 Updated Dec 10, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,824 2,035 Updated Dec 21, 2025

The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)

Python 47 1 Updated Aug 15, 2025

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,865 344 Updated Jan 4, 2024

SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"

Python 225 13 Updated May 18, 2025

Tina: Tiny Reasoning Models via LoRA

Python 310 39 Updated Sep 23, 2025

Efficient Triton Kernels for LLM Training

Python 5,975 454 Updated Dec 23, 2025

A next generation Python CMake adaptor and Python API for plugins

Python 427 79 Updated Dec 23, 2025

A curated list of awesome header-only C++ libraries

4,008 264 Updated Nov 6, 2025

TypeScript 27,836 2,211 Updated Aug 7, 2025

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 1,032 102 Updated Dec 30, 2024

LLM finetuned for medical question answering

Python 2 1 Updated May 15, 2023

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,247 1,190 Updated Dec 24, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,515 1,536 Updated Apr 24, 2025

Simple RL training for reasoning

Python 3,812 281 Updated Dec 23, 2025

Solana arbitrage bot across multiple spot DEXs

Rust 793 251 Updated Apr 10, 2023

JavaScript 1 Updated Oct 23, 2024

Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference, and more.

Python 216 15 Updated Jun 3, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,873 371 Updated Dec 17, 2025

[ACL 2023] Reasoning with Language Model Prompting: A Survey

991 70 Updated May 21, 2025

Must-read Papers on LLM Agents.

2,825 165 Updated Nov 19, 2025

Overview and tutorial of the LangChain Library

Jupyter Notebook 7,347 2,047 Updated Aug 5, 2024

A list of AI autonomous agents

24,782 2,087 Updated Feb 26, 2025

Building blocks for foundation models.

586 28 Updated Jan 3, 2024

Distribute and run LLMs with a single file.

C 23,551 1,254 Updated Dec 19, 2025

[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filli…

Python 1,169 73 Updated Sep 30, 2025