Skip to content
View hiro-v's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report hiro-v

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Source code for the X Recommendation Algorithm

Scala 67,691 12,612 Updated Sep 8, 2025

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,617 248 Updated Sep 10, 2025

Local realtime voice AI

Python 2,374 137 Updated Mar 3, 2025

Generative AI extensions for onnxruntime

C++ 873 225 Updated Nov 5, 2025

Estimate Your LLM's Token Toll Across Various Platforms and Configurations

Python 37 7 Updated Jan 30, 2025

LLM training in simple, raw C/CUDA

Cuda 28,068 3,263 Updated Jun 26, 2025

Ikigai is an AI-powered Open Assignment System

TypeScript 35 4 Updated Oct 9, 2024

Development repository for the Triton language and compiler

MLIR 17,468 2,359 Updated Nov 5, 2025

A natural language interface for computers

Python 60,783 5,212 Updated Nov 3, 2025

A platform for community discussion. Free, open, simple.

Ruby 45,490 8,704 Updated Nov 5, 2025

Grok open release

Python 50,553 8,371 Updated Aug 30, 2024

A SQLite extension for efficient vector search, based on Faiss!

C++ 1,922 71 Updated May 5, 2024

[DEPRECATED] Moved to ROCm/rocm-libraries repo

C++ 387 191 Updated Nov 3, 2025

Examples using MLX Swift

Swift 2,278 332 Updated Nov 4, 2025

WebAssembly binding for llama.cpp - Enabling on-browser LLM inference

TypeScript 926 58 Updated Oct 6, 2025

Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 213 27 Updated Apr 22, 2025

OpenAI compatible API for TensorRT LLM triton backend

Rust 216 29 Updated Aug 1, 2024

Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.

C++ 42 3 Updated Sep 26, 2024

Scheduling infrastructure for absolutely everyone.

TypeScript 38,671 10,951 Updated Nov 5, 2025

A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. roadmap: http://trello.com/b/8txSM12G/roadmap

Clojure 39,248 2,353 Updated Nov 5, 2025

:octocat: Browser extension that simplifies the GitHub interface and adds useful features

TypeScript 29,637 1,621 Updated Nov 4, 2025

A curated list of awesome remote jobs and resources. Inspired by https://github.com/vinta/awesome-python

40,713 4,344 Updated Jul 30, 2025

OBS Studio - Free and open source software for live streaming and screen recording

C 67,875 8,763 Updated Nov 4, 2025

Package conda environments for redistribution

Python 557 99 Updated Oct 27, 2025

Stable Diffusion with Core ML on Apple Silicon

Python 17,668 1,025 Updated Jul 3, 2025

Swift Package to implement a transformers-like API in Swift

Swift 1,188 144 Updated Oct 27, 2025

Everything we actually know about the Apple Neural Engine (ANE)

2,308 85 Updated Oct 21, 2025

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 893 49 Updated Sep 30, 2025
Next