Stars
A framework for efficient model inference with omni-modality models
A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for macOS Apple Silicon and ent…
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
No fortress, purely open ground. OpenManus is Coming.
QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.
Submission for the SG Innovation Challenge
Accessible large language models via k-bit quantization for PyTorch.
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
Efficient LLM Inference over Long Sequences
Master programming by recreating your favorite technologies from scratch.
SGLang is a fast serving framework for large language models and vision language models.
Open-source observability for your GenAI or LLM application, based on OpenTelemetry
LLM Serving Performance Evaluation Harness
Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Efficient and easy multi-instance LLM serving
Artifact of OSDI '24 paper, "Llumnix: Dynamic Scheduling for Large Language Model Serving"
HabanaAI / vllm-fork
Forked from vllm-project/vllm. A high-throughput and memory-efficient inference and serving engine for LLMs
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
A collection of best resources to learn System Design, Software architecture, and prepare for System Design Interviews
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
A throughput-oriented high-performance serving framework for LLMs
Learn System Design concepts and prepare for interviews using free resources.