peakji
🔜
Making progress

Highlights

  • Pro

Organizations

@Level @hyperonym


The best ChatGPT that $100 can buy.

Python 43,512 5,670 Updated Feb 16, 2026

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 19,051 2,214 Updated Feb 14, 2026

No fortress, purely open ground. OpenManus is Coming.

Python 54,501 9,537 Updated Feb 11, 2026

How to optimize some algorithms in CUDA.

Cuda 2,821 256 Updated Feb 15, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,753 396 Updated Feb 17, 2026

Header-only C++/Python library for fast approximate nearest neighbors

C++ 5,095 783 Updated Sep 14, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,753 269 Updated Jul 18, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 967 91 Updated Sep 23, 2025

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

HTML 8,971 750 Updated Feb 17, 2026

Everything we actually know about the Apple Neural Engine (ANE)

2,362 90 Updated Oct 21, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,313 296 Updated May 11, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 67,308 8,185 Updated Feb 12, 2026

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 18,200 2,698 Updated Nov 3, 2025

Minimalist ML framework for Rust

Rust 19,403 1,423 Updated Feb 15, 2026

Fast, flexible LLM inference

Rust 6,584 526 Updated Feb 15, 2026

A natural language interface for computers

Python 62,164 5,342 Updated Feb 9, 2026

A framework for building realtime voice AI agents 🤖🎙️📹

Python 9,347 2,833 Updated Feb 17, 2026

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 4,604 359 Updated Aug 10, 2024

A generative speech model for daily dialogue.

Python 38,708 4,207 Updated Jan 18, 2026

Minimal container for Chrome's headless shell, useful for automating / driving the web

Shell 611 75 Updated Dec 19, 2025

A collective list of free APIs

Python 398,731 42,662 Updated Nov 4, 2025

📖 100 Go Mistakes and How to Avoid Them

Go 7,821 502 Updated Sep 24, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,318 1,004 Updated Jul 1, 2024

Fast and accurate AI-powered file content type detection

Python 10,115 489 Updated Feb 16, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,606 1,883 Updated Jan 9, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 85,404 12,922 Updated Feb 9, 2026

Make images smaller using best-in-class codecs, right in the browser.

TypeScript 24,745 1,850 Updated Feb 5, 2026

Leaked prompts of GPTs

31,950 4,426 Updated Sep 27, 2024

A blazing fast inference solution for text embeddings models

Rust 4,501 360 Updated Feb 16, 2026