Skip to content
View junhewk's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.
  • Yonsei University
  • Seoul, South Korea

Block or report junhewk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Java 17,595 1,569 Updated Apr 16, 2026

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.

Python 426 325 Updated Apr 9, 2026

Finetuning Large Language Models on One Consumer GPU in 2 Bits

Python 733 77 Updated May 25, 2024

R package for 'Efficient Learning of Word Representations and Sentence Classification'

C++ 45 5 Updated Mar 4, 2026

Tidy interface to 'data.table'

R 476 32 Updated Jan 13, 2026

cpp11 helps you to interact with R objects using C++ code.

C++ 224 52 Updated Apr 6, 2026

An implementation of the Language Server Protocol for R

R 652 112 Updated Mar 27, 2026

텐서플로2와 머신러닝으로 시작하는 자연어처리 (로지스틱회귀부터 BERT와 GPT3까지) 실습자료

Jupyter Notebook 271 139 Updated Nov 8, 2022

Manuscript of the book "Supervised Machine Learning for Text Analysis in R" by Emil Hvitfeldt and Julia Silge

TeX 264 105 Updated Mar 3, 2026

한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.

Python 984 183 Updated Mar 10, 2026

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.

C++ 518 39 Updated Oct 24, 2025

Kiwi(지능형 한국어 형태소 분석기)

C++ 706 60 Updated Apr 4, 2026

R Interface to Torch

C++ 564 92 Updated Apr 16, 2026

R binding package Kiwi(Korean Intelligent Word Identifier)

R 33 3 Updated Dec 30, 2025

GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors

Python 517 39 Updated Dec 11, 2019

Analysing World bank Data

14 20 Updated Apr 14, 2019

Open Korean Text Processor - An Open-source Korean Text Processor

Scala 656 97 Updated Mar 12, 2024

국립국어원 표준국어대사전 표제어 DB

75 9 Updated Nov 17, 2018

R package to Embed All the Things! using StarSpace

C++ 103 13 Updated Nov 27, 2025

Classes and functions to create and summarize resampling objects

R 340 68 Updated Apr 14, 2026

Convert statistical analysis objects from R into tidy format

R 1,512 304 Updated Jan 26, 2026

Tidy methods for measuring model performance

R 401 61 Updated Apr 7, 2026

Orion Viewer is pdf, djvu, xps, cbz and tiff file viewer for Android devices based on mupdf and DjVuLibre libraries

Kotlin 281 65 Updated Apr 5, 2026

Learning embeddings for classification, retrieval and ranking.

C++ 3,957 527 Updated Dec 4, 2022

A repo of short "vignettes" illustrating statistical concepts

HTML 319 91 Updated Mar 4, 2024

Labelling Sequential Data in Natural Language Processing with R - using CRFsuite

C 62 11 Updated Nov 27, 2025

국회의원 정치자금 지출내역 데이터 공개(2012~2024)

50 11 Updated Dec 4, 2025

A simple, extensible Markov chain generator.

Python 3,383 349 Updated Apr 30, 2024

Markovify wrapper for R

R 81 6 Updated Jul 19, 2023
Next