Starred repositories
A high-performance weight-synchronization framework for RL workflows, designed to propagate parameter updates from training to inference within seconds
Financial data platform for analysts, quants and AI agents.
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models.
A series of GPU optimization topics covering how to optimize CUDA kernels in detail, including several basic kernel optimizations: elementwise, reduce, s…
Efficient platform for inference and serving of local LLMs, including an OpenAI-compatible API server.
[🔥updating ...] AI-powered automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
IPTV live-source scraper: automatically aggregates hao趣网, TVBox, and other online live sources, selects the streams with the best resolution and speed, and updates periodically.
[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention
Code base and slides for ECE408: Applied Parallel Programming on GPU.
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
Large Language Model Text Generation Inference
Lightning fast C++/CUDA neural network framework
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
FlashInfer: Kernel Library for LLM Serving
Full and up-to-date source code of the chapters of the "SFML Game Development" book
How to learn PyTorch and OneFlow