hvy
🏃‍♂️
Focusing

Highlights

  • Pro

Organizations

@pfnet @pfnet-research @chainer @cupy @optuna

Starred repositories

AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.

Python 134 35 Updated Feb 13, 2026

Inference server benchmarking tool

Rust 142 26 Updated Oct 2, 2025

Preferred Generation Benchmark

Python 91 16 Updated Oct 28, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 4,187 252 Updated Dec 15, 2025

Nano vLLM

Python 11,704 1,578 Updated Nov 3, 2025
TypeScript 48 Updated May 12, 2025

The code of several works on oimo.io/works

Haxe 1,455 60 Updated Jan 15, 2025

An Intel 8086 Emulator created in Rust.

Rust 425 65 Updated Feb 16, 2024

Collective communications library with various primitives for multi-machine training.

C++ 1,399 345 Updated Feb 12, 2026

Pipeline Parallelism for PyTorch

Python 785 88 Updated Aug 21, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,889 602 Updated May 3, 2024

The registry of the OptunaHub packages

Jupyter Notebook 49 56 Updated Feb 3, 2026

Python library to use packages in OptunaHub

Python 54 14 Updated Nov 20, 2025

DiscoGrad - automatically differentiate across conditional branches in C++ programs

C++ 209 5 Updated Sep 12, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,865 70 Updated Jun 22, 2025

Development repository for the Triton language and compiler

MLIR 18,429 2,578 Updated Feb 16, 2026

A curated list for Efficient Large Language Models

Python 1,951 152 Updated Jun 17, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,251 884 Updated Dec 17, 2024

LLM inference in C/C++

C++ 95,085 14,920 Updated Feb 15, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,887 2,101 Updated Feb 16, 2026

Code release for NeuS

Python 1,754 220 Updated Feb 28, 2024

Google Research

Jupyter Notebook 37,264 8,327 Updated Feb 13, 2026

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 548 69 Updated Feb 12, 2026

Inference code for Llama models

Python 59,142 9,824 Updated Jan 26, 2025

Extended functionalities for Optuna in combination with third-party libraries.

Python 65 41 Updated Jan 22, 2026

A curated list of awesome neural radiance fields papers

TeX 6,761 600 Updated Jan 6, 2025

CPU assembly examples

Assembly 87 6 Updated May 19, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,338 768 Updated Feb 13, 2026

1st place solution for Kaggle "Happywhale - Whale and Dolphin Identification"

Python 51 13 Updated Jan 6, 2024