Skip to content
View peakji's full-sized avatar
🔜
Making progress
🔜
Making progress

Highlights

  • Pro

Organizations

@Level @hyperonym

Block or report peakji

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The best ChatGPT that $100 can buy.

Python 55,059 7,509 Updated May 5, 2026

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 19,845 2,285 Updated Jun 12, 2026

No fortress, purely open ground. OpenManus is Coming.

Python 56,563 9,846 Updated Feb 11, 2026

how to optimize some algorithm in cuda.

Cuda 3,084 279 Updated Jun 9, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,402 544 Updated Jun 12, 2026

Header-only C++/python library for fast approximate nearest neighbors

C++ 5,252 830 Updated Mar 28, 2026

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,761 273 Updated Jul 18, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 1,026 115 Updated Sep 23, 2025

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

HTML 10,609 910 Updated Jun 14, 2026

Everything we actually know about the Apple Neural Engine (ANE)

2,479 96 Updated Mar 12, 2026

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,343 304 Updated May 11, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,177 8,829 Updated Jun 15, 2026

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,354 2,740 Updated May 19, 2026

Minimalist ML framework for Rust

Rust 20,482 1,605 Updated Jun 11, 2026

Fast, flexible LLM inference

Rust 7,288 623 Updated Jun 15, 2026

A lightweight coding agent for open models like Deepseek, Kimi, and Qwen

Python 63,971 5,549 Updated Jun 10, 2026

A framework for building realtime voice AI agents 🤖🎙️📹

Python 10,979 3,227 Updated Jun 15, 2026

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 5,020 392 Updated Aug 10, 2024

A generative speech model for daily dialogue.

Python 39,456 4,244 Updated Apr 10, 2026

Minimal container for Chrome's headless shell, useful for automating / driving the web

Shell 647 78 Updated Mar 21, 2026

A collective list of free APIs

Python 441,696 48,406 Updated Jun 13, 2026

📖 100 Go Mistakes and How to Avoid Them

Go 7,904 509 Updated Apr 21, 2026

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,573 1,064 Updated Jul 1, 2024

Fast and accurate AI powered file content types detection

Python 17,135 1,051 Updated Jun 11, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,302 1,990 Updated Jan 9, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 97,215 14,870 Updated Jun 2, 2026

Make images smaller using best-in-class codecs, right in the browser.

TypeScript 25,364 1,937 Updated Jun 10, 2026

leaked prompts of GPTs

31,989 4,377 Updated Sep 27, 2024

A blazing fast inference solution for text embeddings models

Rust 4,866 399 Updated May 26, 2026
Next