bab2min

Starred repositories

177 stars written in Python

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,614 12,026 Updated Dec 17, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,641 3,966 Updated Apr 19, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 30,409 2,062 Updated Nov 19, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Python 28,576 3,513 Updated Dec 5, 2025

Official inference framework for 1-bit LLMs

Python 24,461 1,914 Updated Jun 3, 2025

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,974 3,621 Updated Jul 28, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,885 2,678 Updated Dec 15, 2025

Magenta: Music and Art Generation with Machine Intelligence

Python 19,756 3,809 Updated Jul 8, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,964 1,651 Updated Nov 19, 2025

Train transformer language models with reinforcement learning.

Python 16,682 2,362 Updated Dec 17, 2025

SciPy library main repository

Python 14,264 5,556 Updated Dec 17, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,223 980 Updated Dec 17, 2025

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 13,160 430 Updated Dec 14, 2025

NumPy aware dynamic Python compiler using LLVM

Python 10,795 1,216 Updated Dec 16, 2025

Large Language Model Text Generation Inference

Python 10,709 1,246 Updated Dec 11, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 7,827 800 Updated Dec 12, 2025

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

Python 7,690 277 Updated Dec 11, 2025

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 7,257 870 Updated Dec 17, 2025

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,721 496 Updated Dec 14, 2025

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4,454 1,312 Updated May 21, 2023

A language for constraint-guided and efficient LLM programming.

Python 4,098 214 Updated May 22, 2025

A tool for extracting plain text from Wikipedia dumps

Python 3,957 1,005 Updated May 23, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,795 288 Updated Dec 17, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,389 284 Updated Jul 17, 2025

GLM (General Language Model)

Python 3,365 336 Updated Nov 3, 2023

Improved file parsing for LLMs

Python 3,142 140 Updated Nov 13, 2024

Top2Vec learns jointly embedded topic, document and word vectors.

Python 3,107 376 Updated Nov 14, 2024

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

Python 2,936 447 Updated Nov 7, 2022

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,922 497 Updated Feb 14, 2023

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,548 286 Updated Dec 17, 2025