anneouyang

Anne Ouyang anneouyang

CS PhD Student @Stanford | prev: MEng, B.S. in CS @mit, cuDNN @NVIDIA

219 followers · 24 following

Achievements

x3 x2

Achievements

x3 x2

Organizations

Stars

NVIDIA / cutile-python

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,605 80 Updated Dec 17, 2025

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,979 1,586 Updated Dec 16, 2025

maawad / hundred-kernels

Shell 1 Updated Apr 13, 2025

bertmaher / simplegemm

Cuda 127 16 Updated Oct 22, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 4,270 600 Updated Dec 17, 2025

singhh5050 / AMD-writeup

C++ 2 Updated Feb 21, 2025

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,704 2,213 Updated Mar 11, 2025

ScalingIntelligence / KernelBench

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 712 102 Updated Dec 16, 2025

floodsung / LLM-with-RL-papers

A collection of LLM with RL papers

278 10 Updated Apr 24, 2024

nerfies / nerfies.github.io

JavaScript 3,811 1,658 Updated Jun 21, 2024

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 3,002 216 Updated Dec 9, 2025

merrymercy / awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,700 323 Updated Oct 19, 2024

google-research / circuit_training

Python 1,549 242 Updated Jul 10, 2025

olcf / cuda-training-series

Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)

Cuda 918 335 Updated Aug 19, 2024

Dao-AILab / causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda 675 147 Updated Oct 20, 2025

state-spaces / mamba

Mamba SSM architecture

Python 16,745 1,539 Updated Nov 11, 2025

hao-ai-lab / LookaheadDecoding

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,309 78 Updated Mar 6, 2025

Hannibal046 / Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

25,802 2,216 Updated Jul 31, 2025

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,416 1,963 Updated Dec 17, 2025

AjayBrahmakshatriya / AjayCXX_stream

Resources repo for Ajay CXX twitch stream: https://twitch.tv/ajaycxx

4 Updated Jan 21, 2024

bkettle / mit-thesis-template

MIT unofficial thesis template from overleaf, updated for 2023

TeX 18 10 Updated May 14, 2023

twitter / the-algorithm-ml

Source code for Twitter's Recommendation Algorithm

Python 10,424 2,237 Updated Jul 10, 2024

chenfei-wu / TaskMatrix

Python 34,331 3,266 Updated Jan 6, 2024

pytorch / kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 904 215 Updated Dec 17, 2025

archinetai / audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

1,911 71 Updated Jan 4, 2024

bqi343 / transformer-sorting

Jupyter Notebook 4 Updated Jan 19, 2023

langchain-ai / langchain

🦜🔗 The platform for reliable agents.

Python 122,107 20,138 Updated Dec 17, 2025

cmuparlay / parlaylib

A Toolkit for Programming Parallel Algorithms on Shared-Memory Multicore Machines

C++ 398 75 Updated Nov 16, 2025

agentcooper / react-pdf-highlighter

Set of React components for PDF annotation

TypeScript 1,354 503 Updated Nov 22, 2024

HazyResearch / manifest

Prompt programming with FMs.

Python 444 45 Updated Jul 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Anne Ouyang anneouyang

Achievements

Achievements

Organizations

Block or report anneouyang

Stars

NVIDIA / cutile-python

NVIDIA / cutlass

maawad / hundred-kernels

bertmaher / simplegemm

flashinfer-ai / flashinfer

singhh5050 / AMD-writeup

openai / swarm

ScalingIntelligence / KernelBench

floodsung / LLM-with-RL-papers

nerfies / nerfies.github.io

HazyResearch / ThunderKittens

merrymercy / awesome-tensor-compilers

google-research / circuit_training

olcf / cuda-training-series

Dao-AILab / causal-conv1d

state-spaces / mamba

hao-ai-lab / LookaheadDecoding

Hannibal046 / Awesome-LLM

NVIDIA / TensorRT-LLM

AjayBrahmakshatriya / AjayCXX_stream

bkettle / mit-thesis-template

twitter / the-algorithm-ml

chenfei-wu / TaskMatrix

pytorch / kineto

archinetai / audio-ai-timeline

bqi343 / transformer-sorting

langchain-ai / langchain

cmuparlay / parlaylib

agentcooper / react-pdf-highlighter

HazyResearch / manifest