Skip to content
View pepesi's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report pepesi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Harness Engineering 学习指南 — 从概念理解到独立实践的深度学习档案

Shell 3,969 363 Updated Jun 19, 2026

Alipay DeepLink + JSBridge Security Research - 17 Verified Vulnerabilities | 支付宝DeepLink安全研究 | Full Report: innora.ai/zfb

HTML 202 162 Updated Apr 6, 2026

Training neural networks on Apple Neural Engine via reverse-engineered private APIs

Objective-C 6,873 947 Updated Mar 10, 2026

An agentic skills framework & software development methodology that works.

Shell 235,276 20,886 Updated Jun 22, 2026

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 319 90 Updated Jun 22, 2026

🐹 Clean, uninstall, analyze, optimize, and monitor your Mac from the terminal.

Shell 56,734 1,992 Updated Jun 22, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,536 265 Updated Jun 17, 2026

Kubernetes-native AI serving platform for scalable model serving.

Go 379 138 Updated Jun 22, 2026

Run Slurm on Kubernetes. A Slinky project.

Go 316 89 Updated Jun 16, 2026

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 1,075 120 Updated Jun 12, 2026

A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.

C++ 1,349 233 Updated Jun 22, 2026

The missing reverse proxy for ssh scp

Go 1,268 163 Updated Jun 19, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,573 1,487 Updated Jun 22, 2026

Python wheels that work on any linux (almost)

Shell 1,759 244 Updated Jun 21, 2026

一个持续更新的中文敏感词库,帮助开发者和内容审核者快速识别并过滤不当文本,即将迎来重大更新。

3,731 405 Updated Jun 15, 2026

Ultra-high-performance, secure, all-in-one acceleration engine for developer resources

JavaScript 8,145 1,277 Updated Jun 20, 2026

Next Generation Agentic Proxy for AI Agents and MCP servers

Rust 3,414 566 Updated Jun 19, 2026

Declaratively deploy your Kubernetes manifests, Kustomize configs, and Charts as Helm releases. Generate all-in-one manifests for use with ArgoCD.

Go 5,143 349 Updated Jun 22, 2026

llm-d helm charts and deployment examples

Go Template 58 57 Updated May 1, 2026

Gateway API Inference Extension

Jupyter Notebook 695 293 Updated Jun 17, 2026

The Cloud-Native API Gateway and AI Gateway

Go 5,574 770 Updated Jun 18, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,414 539 Updated Jun 22, 2026

所有小初高、大学PDF教材。

Roff 74,436 16,665 Updated Oct 18, 2025

The Web framework for perfectionists with deadlines.

Python 87,934 33,870 Updated Jun 19, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 58,925 6,440 Updated Jun 20, 2026

Agent2Agent (A2A) is an open protocol enabling communication and interoperability between opaque agentic applications.

Shell 24,381 2,470 Updated Jun 12, 2026

Go client to download AI-Models from Cozy Hub, Hugging Face Hub, and Civitai.

Go 5 2 Updated Dec 18, 2024

Generators for kube-like API types

Go 1,834 441 Updated Jun 19, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,624 867 Updated Jun 22, 2026
Next