Skip to content
View albertfgu's full-sized avatar

Block or report albertfgu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of the dynamic chunking mechanism in H-net by Hwang et al. of Carnegie Mellon

Python 79 2 Updated Jun 11, 2026

A Quirky Assortment of CuTe Kernels

Python 1,011 136 Updated May 30, 2026

Complete, HSK 2.0/3.0 (汉语水平考试) Vocabulary Lists in Json

Ruby 248 47 Updated Mar 27, 2026

Cartesia Line SDK for voice agents.

Python 100 41 Updated Jun 11, 2026

H-Net: Hierarchical Network with Dynamic Chunking

Python 856 101 Updated Nov 20, 2025

Algorithms for byte-level language modelling

Python 24 5 Updated May 14, 2026

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 598 32 Updated Mar 13, 2026

Muon is an optimizer for hidden layers in neural networks

Python 2,657 122 Updated May 24, 2026

Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)

Python 16 1 Updated Jan 7, 2025

biblatex is a sophisticated bibliography system for LaTeX users. It has considerably more features than traditional bibtex and supports UTF-8

TeX 586 131 Updated Jun 6, 2026

What would you do with 1000 H100s...

Jupyter Notebook 1,175 73 Updated Jan 10, 2024

Awesome Papers related to Mamba.

1,399 74 Updated Oct 17, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,559 1,008 Updated Jun 8, 2026

A Triton Kernel for incorporating Bi-Directionality in Mamba2

Python 82 Updated Dec 18, 2024

Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"

Python 172 18 Updated Jan 30, 2025

Tile primitives for speedy kernels

Cuda 3,426 295 Updated May 27, 2026

Neovim's answer to the mouse 🦘

Fennel 5,029 52 Updated Apr 11, 2026

Repository of Transformer based PyTorch Time Series Models

Jupyter Notebook 320 47 Updated Nov 8, 2024

Annotated version of the Mamba paper

Jupyter Notebook 501 20 Updated Feb 27, 2024

UT-Sarulab MOS prediction system using SSL models

Python 306 15 Updated Apr 11, 2024

RNA-seq prediction with deep convolutional neural networks.

Python 245 32 Updated Aug 28, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,984 622 Updated May 3, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,726 13,050 Updated Jun 2, 2026

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,879 287 Updated Feb 13, 2025

Understand and test language model architectures on synthetic tasks.

Python 274 52 Updated Mar 22, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,755 18,007 Updated Jun 13, 2026

📋 A list of open LLMs available for commercial use.

12,798 976 Updated Feb 13, 2025

Some preliminary explorations of Mamba's context scaling.

Python 219 10 Updated Feb 8, 2024

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,889 1,905 Updated Jun 11, 2026
Next