TypeScript 21,501 2,475 Updated Jun 13, 2026

Proxy that exposes Antigravity-provided Claude / Gemini models, so we can use them in Claude Code and OpenClaw (Clawdbot)

JavaScript 3,756 508 Updated Jun 8, 2026

High-performance distributed data shuffling (all-to-all) library for MoE training and inference

Python 123 11 Updated Mar 7, 2026

Run the latest vscode-server on RHEL/CentOS 7!

C 408 32 Updated Jun 12, 2026

Upload files to OSS

JavaScript 12 13 Updated May 19, 2026
Python 550 49 Updated Jul 26, 2024

The Open-Source Data Annotation Platform

TypeScript 1,240 127 Updated Feb 19, 2025

Open-source multimodal data annotation platform with AI auto-annotation support.

Python 1,590 182 Updated Jun 8, 2026

WanJuan 1.0 multimodal corpus

573 29 Updated Oct 20, 2023

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 9,718 732 Updated Jan 3, 2025

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

Python 67,513 5,685 Updated Jun 11, 2026

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,978 194 Updated Oct 8, 2025

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 844 65 Updated Mar 6, 2025

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Python 2,492 178 Updated Nov 24, 2025

[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

361 10 Updated Mar 22, 2024

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,867 2,465 Updated Jun 14, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,869 7,684 Updated Jun 14, 2026

Cronicle V2 (Orchestra) community prototype

JavaScript 269 42 Updated May 26, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,831 18,042 Updated Jun 14, 2026

A Tiny Modern C++ Header Brings Unified Interface for Different Languages

C++ 158 19 Updated Jan 12, 2023

GitHub Action to install CUDA

TypeScript 206 70 Updated Mar 29, 2026

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 4,118 333 Updated Jun 14, 2026

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…

Python 9,349 836 Updated Jun 13, 2026

Tensor library for machine learning

C++ 14,807 1,672 Updated Jun 12, 2026

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,897 701 Updated Jun 11, 2026

Download image from the Docker Hub HTTPS API

Python 890 227 Updated May 23, 2024

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python 10,493 2,471 Updated Jun 8, 2025

An unprofessional open-source Chinese font derived from Fontworks' Klee One.

Shell 24,700 626 Updated Jun 9, 2026