Skip to content
View hiro-v's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report hiro-v

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Source code for the X Recommendation Algorithm

Scala 67,996 12,649 Updated Sep 8, 2025

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,623 247 Updated Sep 10, 2025

Local realtime voice AI

Python 2,391 137 Updated Nov 26, 2025

Generative AI extensions for onnxruntime

C++ 908 243 Updated Dec 20, 2025

Estimate Your LLM's Token Toll Across Various Platforms and Configurations

Python 38 8 Updated Nov 9, 2025

LLM training in simple, raw C/CUDA

Cuda 28,434 3,333 Updated Jun 26, 2025

Ikigai is an AI-powered Open Assignment System

TypeScript 36 4 Updated Oct 9, 2024

Development repository for the Triton language and compiler

MLIR 17,893 2,462 Updated Dec 21, 2025

A natural language interface for computers

Python 61,151 5,244 Updated Dec 5, 2025

A platform for community discussion. Free, open, simple.

Ruby 45,825 8,752 Updated Dec 21, 2025

Grok open release

Python 50,571 8,372 Updated Aug 30, 2024

A SQLite extension for efficient vector search, based on Faiss!

C++ 1,944 72 Updated May 5, 2024

[DEPRECATED] Moved to ROCm/rocm-libraries repo

C++ 390 191 Updated Dec 19, 2025

Examples using MLX Swift

Swift 2,352 348 Updated Dec 17, 2025

WebAssembly binding for llama.cpp - Enabling on-browser LLM inference

TypeScript 960 64 Updated Dec 17, 2025

Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 216 27 Updated Apr 22, 2025

OpenAI compatible API for TensorRT LLM triton backend

Rust 218 31 Updated Aug 1, 2024

Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.

C++ 42 3 Updated Sep 26, 2024

Scheduling infrastructure for absolutely everyone.

TypeScript 39,358 11,334 Updated Dec 21, 2025

A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. roadmap: https://discuss.logseq.com/t/logseq-product-roadm…

Clojure 39,894 2,394 Updated Dec 21, 2025

:octocat: Browser extension that simplifies the GitHub interface and adds useful features

TypeScript 29,950 1,637 Updated Dec 15, 2025

A curated list of awesome remote jobs and resources. Inspired by https://github.com/vinta/awesome-python

41,431 4,370 Updated Jul 30, 2025

OBS Studio - Free and open source software for live streaming and screen recording

C 69,175 8,913 Updated Dec 19, 2025

Package conda environments for redistribution

Python 561 98 Updated Dec 15, 2025

Stable Diffusion with Core ML on Apple Silicon

Python 17,746 1,040 Updated Jul 3, 2025

Swift Package to implement a transformers-like API in Swift

Swift 1,225 157 Updated Dec 21, 2025

Everything we actually know about the Apple Neural Engine (ANE)

2,342 89 Updated Oct 21, 2025

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 896 50 Updated Sep 30, 2025
Next