Skip to content
View roywei's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@apache

Block or report roywei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Contexts Optical Compression

Python 21,534 1,926 Updated Oct 25, 2025

🚀MCP server for accessing RedNote(XiaoHongShu, xhs).

TypeScript 928 145 Updated May 11, 2025

MCP for xiaohongshu.com

Go 7,598 1,191 Updated Dec 21, 2025

Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.

Python 4,619 713 Updated Dec 1, 2025

🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-cl…

TypeScript 69,326 14,285 Updated Dec 22, 2025
Shell 27 4 Updated Jul 29, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,049 342 Updated Dec 20, 2025

Nano vLLM

Python 9,954 1,250 Updated Nov 3, 2025

An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.

Python 24,566 3,254 Updated Dec 14, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,949 12,124 Updated Dec 22, 2025

Perplexity GPU Kernels

C++ 542 74 Updated Nov 7, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 4,331 613 Updated Dec 22, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,902 3,833 Updated Dec 22, 2025

Materials for learning SGLang

699 51 Updated Dec 15, 2025

Let your Claude able to think

TypeScript 16,620 1,964 Updated Nov 4, 2025

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 239 39 Updated Dec 22, 2025

Material for gpu-mode lectures

Jupyter Notebook 5,443 552 Updated Dec 8, 2025

Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference examples.

Shell 63 33 Updated Dec 2, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,103 2,672 Updated Nov 3, 2025

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 810 104 Updated Feb 3, 2025

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 71,290 8,815 Updated Oct 21, 2025

Composable building blocks to build LLM Apps

Python 8,202 1,227 Updated Dec 22, 2025

A PyTorch native platform for training generative AI models

Python 4,864 648 Updated Dec 22, 2025

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

3,490 354 Updated Jul 25, 2025

🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Gemini, Ollam…

Python 1,475 97 Updated Jul 27, 2025

LLM101n: Let's build a Storyteller

35,924 1,962 Updated Aug 1, 2024

A generative speech model for daily dialogue.

Python 38,380 4,166 Updated Dec 3, 2025

Slides, notes, and materials for the workshop

337 33 Updated Jun 1, 2024

an ops tool for host, cluster

Go 117 19 Updated Dec 22, 2025

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 29,821 3,025 Updated Dec 19, 2025
Next