Skip to content
View wonderisland's full-sized avatar

Block or report wonderisland

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,002 881 Updated Dec 4, 2025

中文文生图stable diffsion模型集合

392 23 Updated Dec 16, 2025

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,297 180 Updated Dec 17, 2025

Distributed rate limiter, using Redis counters

Go 5 2 Updated Apr 22, 2020

Distributed rate limit library based on Redis

Go 67 9 Updated Mar 25, 2025

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 563 120 Updated Dec 18, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 88,187 10,119 Updated Dec 21, 2025

[CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents

Python 32 1 Updated Jun 3, 2025

DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coo…

JavaScript 3,001 403 Updated Sep 29, 2025

Question and Answer based on Anything.

Python 13,800 1,326 Updated Mar 24, 2025

整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-processing, and convert the models to ONNX.

Python 904 77 Updated Aug 3, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,985 1,651 Updated Nov 19, 2025

A live stream development of RL tunning for LLM agents

Python 3,685 514 Updated Oct 8, 2025

Run MCP stdio servers over SSE and SSE over stdio. AI gateway.

TypeScript 2,316 196 Updated Oct 9, 2025

Model Context Protocol Servers

TypeScript 74,785 9,080 Updated Dec 19, 2025

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 20,037 1,908 Updated Dec 15, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,666 750 Updated Dec 21, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,963 2,216 Updated Dec 15, 2025

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

Python 1,946 244 Updated Jul 2, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,444 330 Updated Dec 19, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,928 919 Updated Dec 15, 2025

A curated list of Diffusion Model in RL resources (continually updated)

1,443 71 Updated Dec 15, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 1 Updated Jan 18, 2025

NVIDIA Linux open GPU kernel module source

C 16,508 1,545 Updated Dec 18, 2025

cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,算法链路全流程,算力租赁平台,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU虚拟化,边缘计算,标注平台,自动化标注,deepseek等大模型sft微调/奖励模型/强化学习训练,vllm/ollama/mindie大模型多机推理,私有知识库,AI模型市场,支持国…

Python 1,866 144 Updated Nov 11, 2025

Efficient Triton Kernels for LLM Training

Python 5,963 452 Updated Dec 21, 2025

Whisper realtime streaming for long speech-to-text transcription and translation

Python 59 2 Updated Apr 9, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,031 1,095 Updated Dec 12, 2025

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 3,107 284 Updated Jun 4, 2024
Next