Skip to content
View jiaqizhai's full-sized avatar

Sponsoring

@sveltejs

Block or report jiaqizhai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Github mirror of trition-lang/triton repo.

MLIR 172 55 Updated Jun 19, 2026

Ahead-of-time compilation library for Triton kernels

Python 5 Updated Jul 23, 2025

Garnet is a remote cache-store from Microsoft Research that offers strong performance (throughput and latency), scalability, storage, recovery, cluster sharding, key migration, and replication feat…

C# 11,881 667 Updated Jun 19, 2026

HSTU-BLaIR: Lightweight Contrastive Text Embedding for Generative Recommender 🌱

Python 26 1 Updated Jul 4, 2025

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 888 152 Updated Jun 19, 2026

Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

Python 281 71 Updated Jun 17, 2026

TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels

Python 212 26 Updated Jun 10, 2026

Development repository for the Triton language and compiler

MLIR 19,471 2,945 Updated Jun 19, 2026

[VLDB 26, NeurIPS 25] Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.

Python 142 29 Updated Feb 22, 2026

Complete container management platform

Go 25,679 3,208 Updated Jun 18, 2026

Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.

Go 7,657 1,273 Updated Jun 16, 2026

Kubernetes-native Job Queueing

Go 2,573 653 Updated Jun 18, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,295 1,259 Updated Jun 19, 2026
Cuda 132 16 Updated Mar 19, 2026

Lightweight Kubernetes

Go 33,278 2,676 Updated Jun 18, 2026

web development for the rest of us

JavaScript 87,334 4,948 Updated Jun 17, 2026

A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to facilitate metric computation in distributed training and tools…

Python 248 55 Updated May 15, 2026

Retrieval with Learned Similarities

Python 2 Updated Aug 13, 2025

An extremely fast Python linter and code formatter, written in Rust.

Rust 48,085 2,165 Updated Jun 19, 2026

AWS Glue Libraries are additions and enhancements to Spark for ETL operations.

Python 700 303 Updated Apr 24, 2026

The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.

38 3 Updated Jan 7, 2022

HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of HierarchicalKV is to store key-value feature-embeddings on h…

Cuda 207 35 Updated May 22, 2026

Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.

Python 8,798 539 Updated Jun 14, 2026

GPUd automates monitoring, diagnostics, and issue identification for GPUs

Go 484 64 Updated Jun 17, 2026

Retrieval with Learned Similarities (http://arxiv.org/abs/2407.15462, WWW'25 Oral)

Python 51 9 Updated Apr 23, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,929 393 Updated Jun 18, 2026

A toy large model for recommender system based on LLaMA2/SASRec/Meta's generative recommenders. Besides, note and experiments of official implementation for Meta's generative recommenders.

Python 70 7 Updated Apr 25, 2024

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

Python 1,866 312 Updated Apr 6, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,249 3,989 Updated Jul 17, 2024

Inference code for LLaMA models in JAX

Python 120 5 Updated May 21, 2024
Next