jiaqizhai

Sponsoring

@sveltejs


GitHub mirror of the triton-lang/triton repo.

MLIR 171 53 Updated Jun 16, 2026

Ahead-of-time compilation library for Triton kernels

Python 5 Updated Jul 23, 2025

Garnet is a remote cache-store from Microsoft Research that offers strong performance (throughput and latency), scalability, storage, recovery, cluster sharding, key migration, and replication feat…

C# 11,877 669 Updated Jun 16, 2026

HSTU-BLaIR: Lightweight Contrastive Text Embedding for Generative Recommender 🌱

Python 25 1 Updated Jul 4, 2025

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 884 153 Updated Jun 16, 2026

Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

Python 279 71 Updated Jun 15, 2026

TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels

Python 210 26 Updated Jun 10, 2026

Development repository for the Triton language and compiler

MLIR 19,447 2,939 Updated Jun 16, 2026

[VLDB 26, NeurIPS 25] Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.

Python 139 29 Updated Feb 22, 2026

Complete container management platform

Go 25,676 3,206 Updated Jun 16, 2026

Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.

Go 7,654 1,268 Updated Jun 15, 2026

Kubernetes-native Job Queueing

Go 2,568 650 Updated Jun 15, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,262 1,248 Updated Jun 16, 2026
Cuda 132 16 Updated Mar 19, 2026

Lightweight Kubernetes

Go 33,261 2,676 Updated Jun 15, 2026

web development for the rest of us

JavaScript 87,261 4,947 Updated Jun 15, 2026

A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to facilitate metric computation in distributed training and tools…

Python 248 55 Updated May 15, 2026

Retrieval with Learned Similarities

Python 2 Updated Aug 13, 2025

An extremely fast Python linter and code formatter, written in Rust.

Rust 47,990 2,156 Updated Jun 16, 2026

AWS Glue Libraries are additions and enhancements to Spark for ETL operations.

Python 700 303 Updated Apr 24, 2026

The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.

38 3 Updated Jan 7, 2022

HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of HierarchicalKV is to store key-value feature-embeddings on h…

Cuda 206 35 Updated May 22, 2026

Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.

Python 8,790 537 Updated Jun 14, 2026

GPUd automates monitoring, diagnostics, and issue identification for GPUs

Go 483 63 Updated Jun 15, 2026

Retrieval with Learned Similarities (http://arxiv.org/abs/2407.15462, WWW'25 Oral)

Python 51 9 Updated Apr 23, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,923 391 Updated Jun 15, 2026

A toy large model for recommender systems based on LLaMA2, SASRec, and Meta's generative recommenders. Also includes notes and experiments on the official implementation of Meta's generative recommenders.

Python 70 7 Updated Apr 25, 2024

Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.

Python 1,867 312 Updated Apr 6, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,249 3,990 Updated Jul 17, 2024

Inference code for LLaMA models in JAX

Python 120 5 Updated May 21, 2024