🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,734 1,370 Updated Jun 19, 2026

huangjunheng / recommendation_model

Practice reimplementing classic recommender-system models in PyTorch, such as MF, FM, DeepConn, MMOE, PLE, DeepFM, NFM, DCN, AFM, AutoInt, ONN, FiBiNET, DCN-v2, AFN, DCAP, etc.

Python 685 131 Updated Mar 14, 2022

whutbd / cuda-learn-note

Forked from xlite-dev/LeetCUDA

🎉 CUDA notes / frequently asked interview questions / C++ notes; personal notes, updated occasionally: sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Cuda 47 3 Updated Jan 25, 2024

garrytan / gstack

Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

TypeScript 112,508 16,717 Updated Jun 21, 2026

StarCycle / Awesome-Embodied-AI-Job

Lumina Robotics Talent Call | Lumina community embodied-AI talent board | A list for Embodied AI / Robotics Jobs (PhD, RA, intern, etc

1,432 28 Updated Feb 25, 2026

modelscope / twinkle

Twinkle✨: Training workbench to make your model glow.

Python 241 33 Updated Jun 22, 2026

timercrack / trader

Automated futures trading

C 8,281 1,902 Updated Feb 28, 2026

nanocoai / nanoclaw

A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps, has memory, scheduled jobs, and runs dir…

TypeScript 29,945 12,896 Updated Jun 21, 2026

jaywcjlove / docker-tutorial

🐳 Docker beginner study notes

1,782 300 Updated Apr 21, 2026

anthropics / skills

Public repository for Agent Skills

Python 153,671 18,122 Updated Jun 9, 2026

Tencent / hpc-ops

High Performance LLM Inference Operator Library

C++ 955 97 Updated Jun 11, 2026

google / perfetto

Production-grade client-side tracing, profiling, and analysis for complex software systems.

C++ 6,123 810 Updated Jun 21, 2026

HKUDS / AI-Trader

"AI-Trader: 100% Fully-Automated Agent-Native Trading"

Python 19,946 3,050 Updated Jun 11, 2026

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 9,380 788 Updated Jun 19, 2026

xpzouying / xiaohongshu-mcp

MCP for xiaohongshu.com

Go 14,280 2,138 Updated Jun 17, 2026

DeepLink-org / DLSlime

Composable and Embeddable Communication Runtime for Distributed AI Services

C++ 102 9 Updated Jun 5, 2026

Tencent / WeKnora

Open-source LLM knowledge platform: turn raw documents into a queryable RAG, an autonomous reasoning agent, and a self-maintaining Wiki.

Go 16,614 2,138 Updated Jun 22, 2026

Softcatala / whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Python 1,321 125 Updated Feb 14, 2026

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 5,454 868 Updated Jun 22, 2026

InternLM / xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,150 425 Updated Jun 22, 2026

microsoft / tokenweave

Accepted to MLSys 2026

Python 88 7 Updated Apr 19, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 6,644 957 Updated Jun 21, 2026

QuentinFuxa / WhisperLiveKit

Simultaneous speech-to-text models

Python 10,469 1,083 Updated Jun 12, 2026

kangshantong / ps-dnn

A distributed deep learning training and prediction framework based on the PS-Lite parameter server. This is a model training and prediction framework. 1) It includes a complete set of processes such as sample generation, feature extraction, model…

C++ 30 16 Updated May 31, 2022

MLNLP-World / LLMs-from-scratch-CN

Chinese translation of the LLMs-from-scratch project

Jupyter Notebook 2,690 444 Updated Apr 19, 2026
