Skip to content
View peakji's full-sized avatar
🔜
Making progress
🔜
Making progress

Highlights

  • Pro

Organizations

@Level @hyperonym

Block or report peakji

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The best ChatGPT that $100 can buy.

Python 55,213 7,585 Updated May 5, 2026

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 19,866 2,288 Updated Jun 12, 2026

No fortress, purely open ground. OpenManus is Coming.

Python 56,594 9,843 Updated Feb 11, 2026

how to optimize some algorithm in cuda.

Cuda 3,090 279 Updated Jun 9, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,419 545 Updated Jun 18, 2026

Header-only C++/python library for fast approximate nearest neighbors

C++ 5,255 830 Updated Mar 28, 2026

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,759 273 Updated Jul 18, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 1,027 115 Updated Sep 23, 2025

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

HTML 10,644 917 Updated Jun 18, 2026

Everything we actually know about the Apple Neural Engine (ANE)

2,482 96 Updated Mar 12, 2026

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,342 303 Updated May 11, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,283 8,846 Updated Jun 17, 2026

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,367 2,741 Updated May 19, 2026

Minimalist ML framework for Rust

Rust 20,510 1,610 Updated Jun 18, 2026

Fast, flexible LLM inference

Rust 7,310 626 Updated Jun 18, 2026

A lightweight coding agent for open models like Deepseek, Kimi, and Qwen

Rust 64,042 5,552 Updated Jun 18, 2026

A framework for building realtime voice AI agents 🤖🎙️📹

Python 11,046 3,242 Updated Jun 18, 2026

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 5,043 393 Updated Aug 10, 2024

A generative speech model for daily dialogue.

Python 39,475 4,248 Updated Apr 10, 2026

Minimal container for Chrome's headless shell, useful for automating / driving the web

Shell 647 78 Updated Mar 21, 2026

A collective list of free APIs

Python 442,563 48,492 Updated Jun 13, 2026

📖 100 Go Mistakes and How to Avoid Them

Go 7,910 510 Updated Apr 21, 2026

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,578 1,067 Updated Jul 1, 2024

Fast and accurate AI powered file content types detection

Python 17,153 1,051 Updated Jun 11, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,313 1,990 Updated Jan 9, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 97,365 14,908 Updated Jun 2, 2026

Make images smaller using best-in-class codecs, right in the browser.

TypeScript 25,388 1,939 Updated Jun 10, 2026

leaked prompts of GPTs

31,989 4,375 Updated Sep 27, 2024

A blazing fast inference solution for text embeddings models

Rust 4,877 399 Updated Jun 18, 2026
Next