Starred repositories
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
A Python package that extends the official PyTorch to deliver improved performance on Intel platforms
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
Enforce the output format (JSON Schema, Regex etc) of a language model
Python package for Korean natural language processing.
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
A Collection of BM25 Algorithms in Python
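For reference, the Okapi BM25 ranking function behind such collections can be sketched in a few lines. This is an illustrative stand-alone implementation of the algorithm family, not any particular package's API; the function name and tokenization are assumptions for the sketch.

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each document in `docs` against `query` with Okapi BM25.

    A minimal sketch: whitespace tokenization, lowercase normalization,
    default k1/b parameters. Returns one score per document.
    """
    tokenized = [d.lower().split() for d in docs]
    N = len(tokenized)
    avgdl = sum(len(d) for d in tokenized) / N  # average document length
    # document frequency: number of docs containing each term
    df = Counter()
    for doc in tokenized:
        df.update(set(doc))
    scores = []
    for doc in tokenized:
        tf = Counter(doc)  # term frequencies within this document
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            # smoothed inverse document frequency
            idf = math.log(1 + (N - df[term] + 0.5) / (df[term] + 0.5))
            # term-frequency saturation with length normalization
            denom = tf[term] + k1 * (1 - b + b * len(doc) / avgdl)
            score += idf * tf[term] * (k1 + 1) / denom
        scores.append(score)
    return scores
```

Documents containing the query terms receive positive scores, weighted down for common terms and long documents; documents with no matching terms score zero.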
End-to-end neural table-text understanding models.
PyTorch Extension Library of Optimized Autograd Sparse Matrix Operations
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question ans…
KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)
A Python library for Korean natural language processing. It provides word extraction, tokenization, part-of-speech tagging, and preprocessing.
MiniLLM is a minimal system for running modern LLMs on consumer-grade GPUs
Official implementation of Half-Quadratic Quantization (HQQ)
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
Long Range Arena for Benchmarking Efficient Transformers
Deep Reinforcement Learning For Sequence to Sequence Models