Skip to content
View thekimk's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report thekimk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Java 10,342 774 Updated Mar 27, 2026

LightweightMMM 🦇 is a lightweight Bayesian Marketing Mix Modeling (MMM) library that allows users to easily train MMMs and obtain channel attribution information.

Python 1,031 232 Updated Jun 17, 2025

A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputatio…

Python 1,976 183 Updated Mar 27, 2026

A natural language interface for computers

Python 62,895 5,425 Updated Feb 9, 2026

Scrape Twitter for Tweets

Python 2,458 571 Updated Oct 5, 2022

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Python 2,995 256 Updated Mar 26, 2026

데이터사이언스 입문 프로젝트

Jupyter Notebook 3 Updated Dec 14, 2020

PyGWalker: Turn your dataframe into an interactive UI for visual analysis

Python 15,703 858 Updated Mar 2, 2026

Scalable and user friendly neural 🧠 forecasting algorithms.

Python 4,018 485 Updated Mar 27, 2026

Various errors for tabular/structured/time-series data

Python 18 4 Updated Mar 9, 2026

대한민국의 공휴일을 계산하는 Python 패키지입니다. 양음력 공휴일 뿐 아니라, 매년 변동되는 공휴일(대체 공휴일, 선거일 등)까지 포함하여 정확한 공휴일 정보를 제공합니다. 금일 혹은 특정 날짜가 공휴일인지 확인하거나, 주어진 연도의 모든 공휴일을 조회할 수 있습니다.

Python 28 12 Updated Jan 4, 2026

Time series easier, faster, more fun. Pytimetk.

Python 956 82 Updated Nov 27, 2025

The Social Investment Data Lab Specification is being developed as a draft data specification for describing social investment.

JavaScript 5 4 Updated Dec 27, 2022

Python API for Kiwi

Python 367 33 Updated Mar 18, 2026

이 레포지토리에서 BERT를 huggingface PyTorch 라이브러리로 빠르고 효율적으로 모델을 fine-tuning하여문장 분류에서 우수한 성능에 근접하는 방법을 보여줍니다.

6 2 Updated Dec 8, 2022

Minimal keyword extraction with BERT

Python 4,140 378 Updated Feb 3, 2026

Implementation TextRank and related utils

Python 85 42 Updated Aug 16, 2021

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 7,487 885 Updated Feb 20, 2026

BERT 기반의 문맥을 반영한 한국어 토픽 모델링 (BERT Contextualized Topic Models)

Jupyter Notebook 41 9 Updated Feb 22, 2022

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

Python 1,266 152 Updated Jul 24, 2025

한글문서추출요약 with HuggingFace BERT

Jupyter Notebook 26 12 Updated Mar 18, 2022

PyTorch와 TorchText를 이용한 한국어 감정 분석 연습

Jupyter Notebook 24 13 Updated Feb 19, 2026

Korean BERT pre-trained cased (KoBERT)

Python 1,407 379 Updated Jun 14, 2025

한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.

Python 985 184 Updated Mar 10, 2026

띄어쓰기 오류 교정 라이브러리입니다. CRF 와 같은 머신러닝 알고리즘이 아닌, 직관적인 접근법으로 띄어쓰기를 교정합니다.

Python 150 34 Updated Sep 26, 2019

비지도학습 방법으로 한국어 텍스트에서 단어/키워드를 자동으로 추출하는 라이브러리입니다

Python 354 55 Updated Mar 26, 2026

김웅곤 - 텐서플로우와 케라스로 구현한 NLP 기초 (2020년 버전)

Jupyter Notebook 177 90 Updated Apr 8, 2021

NER Task with KoBERT (with Naver NLP Challenge dataset)

Python 100 34 Updated Jun 12, 2023

🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean Comments 한국어 댓글로 프리트레이닝한 BERT 모델과 데이터셋

493 45 Updated Nov 7, 2022
Next