Starred repositories
A high-throughput and memory-efficient inference and serving engine for LLMs
Instant voice cloning by MIT and MyShell. Audio foundation model.
Convert PDF to markdown + JSON quickly with high accuracy
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
Official inference framework for 1-bit LLMs
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Magenta: Music and Art Generation with Machine Intelligence
A TTS model capable of generating ultra-realistic dialogue in one pass.
Train transformer language models with reinforcement learning.
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". It combines the best of RNN and transformer.
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
NumPy aware dynamic Python compiler using LLVM
Large Language Model Text Generation Inference
Accessible large language models via k-bit quantization for PyTorch.
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
A concise but complete full-attention transformer with a set of promising experimental features from various papers
A TensorFlow Implementation of the Transformer: Attention Is All You Need
A language for constraint-guided and efficient LLM programming.
A tool for extracting plain text from Wikipedia dumps
LightLLM is a Python-based LLM inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Top2Vec learns jointly embedded topic, document and word vectors.
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
Original PyTorch implementation of Cross-lingual Language Model Pretraining.
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime
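
Several entries above revolve around the same idea: storing weights in a few bits while computing in higher precision (k-bit quantization, AWQ, low-bit compression). As a minimal sketch of what that looks like from PyTorch, assuming the Hugging Face transformers integration with the bitsandbytes library and an available CUDA GPU; the model id below is an arbitrary placeholder, not a recommendation:

```python
# Hypothetical sketch: load a causal LM with 4-bit quantized weights.
# Assumes `transformers` and `bitsandbytes` are installed and a GPU is present.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# 4-bit NF4 weight storage with fp16 compute: weights stay quantized in
# memory, matmuls run in half precision.
quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

model_id = "facebook/opt-350m"  # arbitrary placeholder model
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quant_config,
    device_map="auto",  # place layers on the available GPU(s)
)

inputs = tokenizer("Quantization lets large models", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```

The trade-off sketched here is the common one across these projects: roughly 4x lower weight memory than fp16 at a small accuracy cost, which is what makes serving large models on a single GPU practical.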