Skip to content
View lambda7xx's full-sized avatar
  • Shanghai Jiao Tong University
  • Shanghai

Block or report lambda7xx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Python 47 Updated May 23, 2026
Python 6 2 Updated Jun 15, 2026
Python 138 18 Updated Jun 10, 2026
Python 88 11 Updated May 8, 2026
Python 1 Updated Jun 14, 2026

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,462 459 Updated Jun 15, 2026

Personal deep learning study notes and tutorial-style notebooks

Python 467 19 Updated Jun 15, 2026

🍎 One kernel a day keeps high latency away. A hands-on CUDA learning path featuring a rich collection of kernels, from the basics to peak performance, seamlessly integrated as PyTorch C++ extensions.

Cuda 146 8 Updated Jun 13, 2026

Hundreds of agent skills for medical research, including protocol design, data analysis, evidence insights, and academic writing.

Python 1,140 78 Updated Jun 15, 2026
Python 11 Updated Jun 10, 2026

Vortex: Programmable Sparse Attention for Agents as Algorithm Designers

Python 60 7 Updated Jun 8, 2026

An ultra-fast, distributed Safetensors loader

C++ 61 8 Updated May 27, 2026

A straightforward method for training your LLM, from downloading data to generating text.

Python 6,219 844 Updated Jun 15, 2026

An Agentic Compiler for CUDA

9 Updated May 17, 2026

Awesome List for Agentic RL

HTML 1,573 61 Updated May 26, 2026

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 615 32 Updated Jun 15, 2026

Frontier: A Discrete-Event Simulator for Modern LLM Serving

Python 28 4 Updated Jun 14, 2026

A unified framework for building, running, and training general agents at scale.

Python 340 44 Updated Jun 15, 2026

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

Python 42,782 3,486 Updated Jun 10, 2026

tutorials about polyhedral compilation.

Jupyter Notebook 65 10 Updated Jun 6, 2026

Large DNNs training framework for consumer GPUs

Python 88 15 Updated Jun 1, 2026

Go sidecar proxy that eliminates Head-of-Line Blocking in LLM inference via ML-driven SJF scheduling — zero backend modification. Paper in preparation

Python 1 Updated Jun 5, 2026

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 125,401 14,030 Updated Jun 15, 2026

Foundry materializes CUDA graphs along with its execution context to disk to support fast cold start of serving engines.

C++ 36 3 Updated Jun 15, 2026

Virtual Decoupled Cores: Composable Programming Framework and Runtime for Async GPUs

Python 17 5 Updated Jun 10, 2026

BlitzScale Router - Distributed LLM Inference Router (Rust)

Rust 3 1 Updated May 25, 2026

AI拆解论文,人人都能读懂前沿研究

TypeScript 15 Updated Jun 9, 2026

SwiftRDMA -- Exposing RDMA NIC Resources for Software-Defined RDMA Scheduling

C++ 19 Updated Jun 9, 2026

Modern RL Post-training Infrastructure: Optimized for NVIDIA/AMD GPUs with a focus on vLLM and DeepSpeed integration, CUDA/ROCm/Triton kernels, and transparent hardware-aware scaling.

Python 124 22 Updated Jun 15, 2026
Next