96 stars written in Python

Robust Speech Recognition via Large-Scale Weak Supervision

Python 98,695 12,127 Updated Apr 15, 2026

The Web framework for perfectionists with deadlines.

Python 87,365 33,866 Updated Apr 30, 2026

Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 63,358 5,548 Updated Apr 30, 2026

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python 61,507 11,513 Updated Apr 30, 2026

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.

Python 60,759 10,864 Updated Apr 30, 2026

A simple, yet elegant, HTTP library.

Python 53,944 9,886 Updated Apr 30, 2026

LlamaIndex is the leading document agent and OCR platform

Python 49,064 7,336 Updated Apr 30, 2026

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 45,250 16,974 Updated Apr 30, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,384 7,519 Updated Apr 30, 2026

🎨 Diagram as Code for prototyping cloud system architectures

Python 42,234 2,725 Updated Apr 13, 2026

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 36,308 2,491 Updated Apr 30, 2026

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 32,588 4,096 Updated May 1, 2026

Build resilient language agents as graphs.

Python 30,924 5,282 Updated Apr 30, 2026

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Python 29,236 2,812 Updated Apr 29, 2026

Distributed Task Queue (development branch)

Python 28,421 5,027 Updated Apr 29, 2026

A code-completion engine for Vim

Python 25,979 2,762 Updated Apr 29, 2026

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,814 5,873 Updated Aug 14, 2024

Contexts Optical Compression

Python 22,982 2,126 Updated Jan 27, 2026

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 21,163 5,159 Updated May 1, 2026

Zipline, a Pythonic Algorithmic Trading Library

Python 19,701 4,978 Updated Feb 13, 2024

The Inter font family

Python 19,437 466 Updated Nov 19, 2024

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Python 18,709 2,448 Updated Apr 10, 2026

An extremely fast Python type checker and language server, written in Rust.

Python 18,474 282 Updated Apr 30, 2026

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 18,068 1,445 Updated Mar 27, 2026

historical code from reddit.com

Python 16,942 2,859 Updated Oct 17, 2017

match command-line arguments to their help text

Python 14,039 836 Updated Apr 29, 2026

Redis Python client

Python 13,526 2,674 Updated Apr 30, 2026

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,488 896 Updated Dec 17, 2024

The open-source AIOps and alert management platform

Python 11,768 1,342 Updated Apr 29, 2026

q - Run SQL directly on delimited files and multi-file sqlite databases

Python 10,349 422 Updated Feb 6, 2026