Skip to content
View roywei's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@apache

Block or report roywei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Perplexity open source garden for inference technology

Rust 584 56 Updated May 27, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,781 79,497 Updated Jun 21, 2026

Symphony turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding agents.

Elixir 25,515 2,583 Updated Jun 9, 2026

Contexts Optical Compression

Python 23,307 2,150 Updated Jan 27, 2026

🚀MCP server for accessing RedNote(XiaoHongShu, xhs).

TypeScript 1,061 172 Updated May 11, 2025

MCP for xiaohongshu.com

Go 14,271 2,137 Updated Jun 17, 2026

Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.

Python 5,078 771 Updated Feb 16, 2026

🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire AI team.

TypeScript 78,929 15,463 Updated Jun 21, 2026
Shell 28 4 Updated Jul 29, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,417 423 Updated Jun 19, 2026

Nano vLLM

Python 14,117 2,234 Updated Apr 26, 2026

An autonomous agent that conducts deep research on any data using any LLM providers

Python 27,821 3,753 Updated May 28, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,475 18,282 Updated Jun 21, 2026

Perplexity GPU Kernels

C++ 584 94 Updated Nov 7, 2025

FlashInfer: Kernel Library for LLM Serving

Python 5,832 1,065 Updated Jun 21, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,491 6,636 Updated Jun 21, 2026

Materials for learning SGLang

846 64 Updated Jan 5, 2026

Let your Claude able to think

TypeScript 17,069 1,979 Updated Apr 7, 2026

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 299 53 Updated Jun 21, 2026

Material for gpu-mode lectures

Jupyter Notebook 6,195 624 Updated Jun 15, 2026

Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference examples.

Shell 66 34 Updated Jun 11, 2026

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,369 2,742 Updated May 19, 2026

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 849 109 Updated Feb 3, 2025

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 73,010 8,986 Updated Jun 20, 2026

Open GenAI Stack

Python 8,413 1,314 Updated Jun 20, 2026

A PyTorch native platform for training generative AI models

Python 5,452 866 Updated Jun 21, 2026

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

4,122 398 Updated Jul 25, 2025

🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Gemini, Ollam…

Python 1,561 106 Updated Jul 27, 2025

LLM101n: Let's build a Storyteller

37,355 2,054 Updated Aug 1, 2024

A generative speech model for daily dialogue.

Python 39,487 4,247 Updated Apr 10, 2026
Next