225 results for source starred repositories

Contexts Optical Compression

Python 21,551 1,926 Updated Oct 25, 2025

🚀MCP server for accessing RedNote(XiaoHongShu, xhs).

TypeScript 928 145 Updated May 11, 2025

MCP for xiaohongshu.com

Go 7,613 1,195 Updated Dec 21, 2025

Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.

Python 4,619 713 Updated Dec 1, 2025

🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-cl…

TypeScript 69,357 14,286 Updated Dec 23, 2025
Shell 27 4 Updated Jul 29, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,050 342 Updated Dec 20, 2025

Nano vLLM

Python 10,018 1,254 Updated Nov 3, 2025

An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.

Python 24,575 3,255 Updated Dec 14, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,000 12,136 Updated Dec 23, 2025

Perplexity GPU Kernels

C++ 542 74 Updated Nov 7, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 4,338 613 Updated Dec 23, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,934 3,842 Updated Dec 23, 2025

Materials for learning SGLang

700 51 Updated Dec 15, 2025

Let your Claude be able to think

TypeScript 16,621 1,964 Updated Nov 4, 2025

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 239 39 Updated Dec 23, 2025

Material for gpu-mode lectures

Jupyter Notebook 5,446 552 Updated Dec 8, 2025

Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference examples.

Shell 63 33 Updated Dec 2, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,104 2,672 Updated Nov 3, 2025

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 810 104 Updated Feb 3, 2025

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 71,293 8,815 Updated Oct 21, 2025

Composable building blocks to build LLM Apps

Python 8,203 1,227 Updated Dec 23, 2025

A PyTorch native platform for training generative AI models

Python 4,866 649 Updated Dec 23, 2025

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

3,492 354 Updated Jul 25, 2025

🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Gemini, Ollam…

Python 1,476 97 Updated Jul 27, 2025

A generative speech model for daily dialogue.

Python 38,384 4,166 Updated Dec 3, 2025

Slides, notes, and materials for the workshop

337 33 Updated Jun 1, 2024

An ops tool for hosts and clusters

Go 117 19 Updated Dec 23, 2025

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 29,851 3,027 Updated Dec 23, 2025

The first library to let you embed a developer agent in your own app!

Python 12,193 1,092 Updated Apr 7, 2024