vasqu

🐢

Anton Vlasjuk vasqu

🐢

Pay Attention to Linear Recurrence

112 followers · 19 following

16:42 (UTC +02:00)
https://codeberg.org/vasqu

Achievements

x4 x3

Achievements

x4 x3

Stars

MiniMax-AI / MSA

Python 265 21 Updated Jun 13, 2026

QwenLM / FlashQLA

high-performance linear attention kernel library built on TileLang

Python 536 47 Updated May 7, 2026

facebookresearch / tensor-layouts

A pure-Python implementation of the Nvidia CuTe layout algebra intended to be approachable and easy to learn.

Python 185 12 Updated May 15, 2026

apache / tvm-ffi

Open ABI and FFI for Machine Learning Systems

C++ 411 80 Updated Jun 13, 2026

NVIDIA / nsight-python

Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools

Python 204 14 Updated Jun 11, 2026

deepseek-ai / Engram

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,455 340 Updated Jan 14, 2026

Dao-AILab / sonic-moe

Accelerating MoE with IO and Tile-aware Optimizations

Python 713 89 Updated Jun 13, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,395 698 Updated May 17, 2026

allenai / OLMo-core

PyTorch building blocks for the OLMo ecosystem

Python 1,292 256 Updated Jun 14, 2026

nari-labs / dia2

TTS model capable of streaming conversational audio in realtime.

Python 1,143 98 Updated Nov 29, 2025

pytorch / helion

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 884 153 Updated Jun 13, 2026

deepseek-ai / DeepSeek-V3.2-Exp

Python 1,603 176 Updated Nov 18, 2025

Dao-AILab / quack

A Quirky Assortment of CuTe Kernels

Python 1,012 136 Updated Jun 14, 2026

microsoft / VibeVoice

Open-Source Frontier Voice AI

Python 49,329 5,488 Updated May 6, 2026

huggingface / kernels

Build compute kernels and load them from the Hub.

Python 692 105 Updated Jun 12, 2026

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,162 2,094 Updated Jun 9, 2026

NVlabs / GSPN

[CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network

Python 111 8 Updated Jul 18, 2025

OpenSparseLLMs / MoM

Python 135 6 Updated Feb 4, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 14,020 2,211 Updated Apr 26, 2026

HanGuo97 / log-linear-attention

Python 282 15 Updated Jun 6, 2025

astral-sh / ty

An extremely fast Python type checker and language server, written in Rust.

Python 18,939 302 Updated Jun 12, 2026

QwenLM / ParScale

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

Python 478 26 Updated May 17, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,299 1,991 Updated Jan 9, 2026

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 5,436 860 Updated Jun 14, 2026

turboderp-org / exllamav3

An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs

Python 943 106 Updated Jun 14, 2026

SesameAILabs / csm

A Conversational Speech Generation Model

Python 14,666 1,485 Updated May 27, 2025

xlite-dev / ffpa-attn

🤖FFPA: Extends FlashAttention-2 via Split-D for large headdims, 1.5x~3×↑🎉 vs SDPA, up to 430T🎉 on H200.

Python 310 20 Updated Jun 12, 2026

lucidrains / titans-pytorch

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,960 208 Updated Jun 6, 2026

MiniMax-AI / MiniMax-01

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 3,433 328 Updated Jul 7, 2025

pqrs-org / Karabiner-Elements

Karabiner-Elements is a powerful tool for customizing keyboards on macOS

C++ 22,312 913 Updated Jun 14, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Anton Vlasjuk vasqu

Achievements

Achievements

Block or report vasqu

Stars

MiniMax-AI / MSA

QwenLM / FlashQLA

facebookresearch / tensor-layouts

apache / tvm-ffi

NVIDIA / nsight-python

deepseek-ai / Engram

Dao-AILab / sonic-moe

sgl-project / mini-sglang

allenai / OLMo-core

nari-labs / dia2

pytorch / helion

deepseek-ai / DeepSeek-V3.2-Exp

Dao-AILab / quack

microsoft / VibeVoice

huggingface / kernels

openai / gpt-oss

NVlabs / GSPN

OpenSparseLLMs / MoM

GeeeekExplorer / nano-vllm

HanGuo97 / log-linear-attention

astral-sh / ty

QwenLM / ParScale

QwenLM / Qwen3

pytorch / torchtitan

turboderp-org / exllamav3

SesameAILabs / csm

xlite-dev / ffpa-attn

lucidrains / titans-pytorch

MiniMax-AI / MiniMax-01

pqrs-org / Karabiner-Elements