NVIDIA
- Santa Clara, CA
- https://zasdfgbnm.github.io/
- https://orcid.org/0000-0001-8510-7373
- in/xiang-gao-289a88b6
Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Developer-first error tracking and performance monitoring
The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Kimi K2 is the large language model series developed by the Moonshot AI team.
A Datacenter Scale Distributed Inference Serving Framework
LlamaIndex is the leading framework for building LLM-powered agents over your data.
An open-source AI agent that brings the power of Gemini directly into your terminal.
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.
Fast and memory-efficient exact attention
Helpful tools and examples for working with flex-attention
SGLang is a fast serving framework for large language models and vision language models.
Open-Source Quantum Chemistry – an electronic structure package in C++ driven by Python
An open protocol enabling communication and interoperability between opaque agentic applications.
Model Context Protocol Servers
A GPU-accelerated cross-platform terminal emulator and multiplexer written by @wez and implemented in Rust
A terminal workspace with batteries included
Read-only mirror of https://gitlab.gnome.org/GNOME/meld
Production-ready platform for agentic workflow development.
🦜🔗 The platform for reliable agents.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
A high-throughput and memory-efficient inference and serving engine for LLMs