Skip to content
View 1a1a11a's full-sized avatar

Highlights

  • Pro

Organizations

@cacheMon

Block or report 1a1a11a

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

106 stars written in Python
Clear filter

A collective list of free APIs

Python 415,450 45,019 Updated Mar 18, 2026

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 96,509 8,919 Updated Mar 23, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,151 14,710 Updated Mar 24, 2026

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.

Python 60,419 10,731 Updated Mar 24, 2026

Inference code for Llama models

Python 59,252 9,822 Updated Jan 26, 2025

LlamaIndex is the leading document agent and OCR platform

Python 47,937 7,086 Updated Mar 23, 2026

Making large AI models cheaper, faster and more accessible

Python 41,377 4,523 Updated Mar 16, 2026

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 40,157 6,674 Updated Mar 24, 2026

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,447 4,786 Updated Jun 2, 2025

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 39,056 3,734 Updated Jul 9, 2025

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,436 3,305 Updated Aug 17, 2024

A book-in-progress about the Linux kernel and its insides.

Python 32,255 3,500 Updated Mar 23, 2026

Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

Python 28,161 1,474 Updated Mar 1, 2026

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,710 5,867 Updated Aug 14, 2024

Fast and memory-efficient exact attention

Python 22,945 2,545 Updated Mar 23, 2026

Best Practices on Recommendation Systems

Python 21,542 3,303 Updated Mar 22, 2026

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,220 3,643 Updated Jul 4, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,847 2,228 Updated Mar 24, 2026

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

Python 20,094 4,647 Updated Mar 24, 2026

Ongoing research training transformer models at scale

Python 15,778 3,741 Updated Mar 24, 2026

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,638 1,821 Updated Jun 27, 2024

Proxy server to bypass Cloudflare protection

Python 13,217 1,061 Updated Jan 12, 2026

Freeze (package) Python programs into stand-alone executables

Python 12,931 2,014 Updated Mar 21, 2026

Nano vLLM

Python 12,403 1,773 Updated Nov 3, 2025

Modular visual interface for GDB in Python

Python 12,165 817 Updated Nov 6, 2025

Retrieval and Retrieval-augmented LLMs

Python 11,447 844 Updated Mar 10, 2026

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,802 1,144 Updated Jun 30, 2023

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

Python 10,568 3,445 Updated Mar 18, 2026

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Python 10,021 598 Updated Sep 7, 2024

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

Python 9,690 1,008 Updated Mar 24, 2026
Next