BBC-Esq
63 stars written in Python

Get your documents ready for gen AI

Python 52,169 3,572 Updated Feb 4, 2026

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 19,989 2,134 Updated Jan 27, 2026

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python 15,292 1,062 Updated Feb 3, 2026

This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…

Python 6,846 720 Updated Jan 29, 2026

Tools for merging pretrained large language models.

Python 6,763 662 Updated Jan 26, 2026

Panel: The powerful data exploration & web app framework for Python

Python 5,581 575 Updated Feb 5, 2026

Minimal keyword extraction with BERT

Python 4,100 378 Updated Feb 3, 2026

A nearly-live implementation of OpenAI's Whisper.

Python 3,793 521 Updated Jan 13, 2026

A GUI for Pandas DataFrames

Python 3,266 243 Updated May 30, 2025

Real time transcription with OpenAI Whisper.

Python 2,908 484 Updated Apr 15, 2025

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Python 2,654 179 Updated Feb 3, 2026

A Python wrapper for the tesseract-ocr API

Python 2,150 260 Updated Jan 13, 2026

Convert HTML to Markdown-formatted text.

Python 2,129 292 Updated Oct 28, 2025

Python package for scraping recipes data

Python 2,089 628 Updated Feb 3, 2026

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

Python 1,553 141 Updated Jan 31, 2026

Python3 library for downloading YouTube Videos.

Python 1,453 180 Updated Dec 7, 2025

Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

Python 1,331 120 Updated Oct 31, 2025

A specification that python filesystems should adhere to.

Python 1,277 428 Updated Feb 2, 2026

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Python 1,204 120 Updated Dec 14, 2025

icalendar parser library for Python

Python 1,112 216 Updated Feb 4, 2026

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.

Python 1,081 87 Updated Jul 5, 2025

Correctly generate plurals, ordinals, indefinite articles; convert numbers to words

Python 1,060 109 Updated May 14, 2025

Convert Word documents (.docx files) to HTML

Python 1,050 140 Updated Nov 20, 2025

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 965 366 Updated Jan 23, 2026

Iconic fonts in PyQt and PySide applications

Python 921 122 Updated Jan 23, 2026

LM Studio Apple MLX engine

Python 887 79 Updated Feb 4, 2026
Python 867 101 Updated Jan 22, 2025

A streaming multipart parser for Python.

Python 471 81 Updated Feb 1, 2026

A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.

Python 457 27 Updated Mar 10, 2025

G2P

Python 403 81 Updated Aug 11, 2025