tjtanaa

Organizations: @EmbeddedLLM


Stateful API logic for agentic applications using vLLM

Makefile 20 5 Updated Apr 1, 2026

FlyDSL (Flexible LaYout DSL) is the project's Python front end.

Python 139 31 Updated Apr 1, 2026

From Minimal GEMM to Everything

Cuda 192 10 Updated Feb 10, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels)

Python 372 37 Updated Mar 26, 2026

amdgpu example code in hip/asm

C++ 58 29 Updated Mar 18, 2026

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Cuda 1,078 162 Updated Apr 1, 2026

A framework for efficient model inference with omni-modality models

Python 4,076 672 Updated Apr 1, 2026

A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for macOS Apple Silicon and ent…

JavaScript 413 57 Updated Mar 17, 2026

Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.

Python 335 78 Updated Apr 1, 2026

Modular RDMA Interface

C++ 105 30 Updated Apr 1, 2026

Python 23 5 Updated Jul 11, 2025

A "standard library" of Triton kernels.

Python 22 4 Updated Oct 2, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 55,568 9,698 Updated Feb 11, 2026

Code for BLT research paper

Python 2,031 190 Updated Nov 3, 2025

QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.

C++ 38 8 Updated Aug 29, 2025

Submission for the SG Innovation Challenge

JavaScript 3 Updated Feb 25, 2025

LLM KV cache compression made easy

Python 1,004 126 Updated Apr 1, 2026

Accessible large language models via k-bit quantization for PyTorch.

Python 8,092 843 Updated Mar 31, 2026

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 568 59 Updated Sep 13, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,954 81 Updated Mar 30, 2026

Efficient LLM Inference over Long Sequences

Python 392 22 Updated Jun 25, 2025

Master programming by recreating your favorite technologies from scratch.

Markdown 485,370 45,654 Updated Feb 21, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,312 5,108 Updated Apr 1, 2026

Open-source observability for your GenAI or LLM application, based on OpenTelemetry

Python 6,968 912 Updated Apr 1, 2026

LLM Serving Performance Evaluation Harness

Python 84 12 Updated Feb 25, 2025

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

Zig 3,305 128 Updated Apr 1, 2026

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,903 368 Updated Dec 17, 2025

Efficient and easy multi-instance LLM serving

Python 540 47 Updated Mar 12, 2026

Artifact of the OSDI '24 paper, "Llumnix: Dynamic Scheduling for Large Language Model Serving"

Python 64 6 Updated Jun 5, 2024