albertfgu

Albert Gu albertfgu

Assistant Professor @ CMU Machine Learning Department

655 followers · 1 following

@_albertgu

Achievements

x3 x3

Achievements

x3 x3

Stars

lucidrains / h-net-dynamic-chunking

Implementation of the dynamic chunking mechanism in H-net by Hwang et al. of Carnegie Mellon

Python 79 2 Updated Jun 11, 2026

Dao-AILab / quack

A Quirky Assortment of CuTe Kernels

Python 1,011 136 Updated May 30, 2026

drkameleon / complete-hsk-vocabulary

Complete, HSK 2.0/3.0 (汉语水平考试) Vocabulary Lists in Json

Ruby 248 47 Updated Mar 27, 2026

cartesia-ai / line

Cartesia Line SDK for voice agents.

Python 100 41 Updated Jun 11, 2026

goombalab / hnet

H-Net: Hierarchical Network with Dynamic Chunking

Python 856 101 Updated Nov 20, 2025

genlm / genlm-bytes

Algorithms for byte-level language modelling

Python 24 5 Updated May 14, 2026

NVlabs / GatedDeltaNet

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 598 32 Updated Mar 13, 2026

KellerJordan / Muon

Muon is an optimizer for hidden layers in neural networks

Python 2,657 122 Updated May 24, 2026

Benjamin-Walker / selective-ssms-and-linear-cdes

Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)

Python 16 1 Updated Jan 7, 2025

plk / biblatex

biblatex is a sophisticated bibliography system for LaTeX users. It has considerably more features than traditional bibtex and supports UTF-8

TeX 586 131 Updated Jun 6, 2026

mrwoof / google-chat-takeout-reader

HTML 29 7 Updated Jan 8, 2026

srush / LLM-Training-Puzzles

What would you do with 1000 H100s...

Jupyter Notebook 1,175 73 Updated Jan 10, 2024

yyyujintang / Awesome-Mamba-Papers

Awesome Papers related to Mamba.

1,399 74 Updated Oct 17, 2024

BlinkDL / RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,559 1,008 Updated Jun 8, 2026

Hprairie / Bi-Mamba2

A Triton Kernel for incorporating Bi-Directionality in Mamba2

Python 82 Updated Dec 18, 2024

goombalab / hydra

Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"

Python 172 18 Updated Jan 30, 2025

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 3,426 295 Updated May 27, 2026

ggandor / leap.nvim

Neovim's answer to the mouse 🦘

Fennel 5,029 52 Updated Apr 11, 2026

kashif / pytorch-transformer-ts

Repository of Transformer based PyTorch Time Series Models

Jupyter Notebook 320 47 Updated Nov 8, 2024

srush / annotated-mamba

Annotated version of the Mamba paper

Jupyter Notebook 501 20 Updated Feb 27, 2024

sarulab-speech / UTMOS22

UT-Sarulab MOS prediction system using SSL models

Python 306 15 Updated Apr 11, 2024

calico / borzoi

RNA-seq prediction with deep convolutional neural networks.

Python 245 32 Updated Aug 28, 2025

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,984 622 Updated May 3, 2024

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,726 13,050 Updated Jun 2, 2026

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,879 287 Updated Feb 13, 2025

HazyResearch / zoology

Understand and test language model architectures on synthetic tasks.

Python 274 52 Updated Mar 22, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,755 18,007 Updated Jun 13, 2026

eugeneyan / open-llms

📋 A list of open LLMs available for commercial use.

12,798 976 Updated Feb 13, 2025

jzhang38 / LongMamba

Some preliminary explorations of Mamba's context scaling.

Python 219 10 Updated Feb 8, 2024

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,889 1,905 Updated Jun 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Albert Gu albertfgu

Achievements

Achievements

Block or report albertfgu

Stars

lucidrains / h-net-dynamic-chunking

Dao-AILab / quack

drkameleon / complete-hsk-vocabulary

cartesia-ai / line

goombalab / hnet

genlm / genlm-bytes

NVlabs / GatedDeltaNet

KellerJordan / Muon

Benjamin-Walker / selective-ssms-and-linear-cdes

plk / biblatex

mrwoof / google-chat-takeout-reader

srush / LLM-Training-Puzzles

yyyujintang / Awesome-Mamba-Papers

BlinkDL / RWKV-LM

Hprairie / Bi-Mamba2

goombalab / hydra

HazyResearch / ThunderKittens

ggandor / leap.nvim

kashif / pytorch-transformer-ts

srush / annotated-mamba

sarulab-speech / UTMOS22

calico / borzoi

jzhang38 / TinyLlama

alshedivat / al-folio

hustvl / Vim

HazyResearch / zoology

vllm-project / vllm

eugeneyan / open-llms

jzhang38 / LongMamba

NVIDIA / cutlass