CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates

56 12 Updated Jun 15, 2023

Lexicons for the Multilingual UCREL Semantic Analysis System

Python 49 17 Updated Mar 11, 2026

Quick and dirty dump of 25k English words from wordfreq

Python 15 3 Updated Oct 21, 2020

List of Permanent Free LLM API (API Keys)

JavaScript 4,979 476 Updated May 27, 2026

Hierarchical Universal Modular ANotator

TypeScript 12 5 Updated May 9, 2026

Lightweight, open-source AI agent for your tools, chats, and workflows.

Python 44,102 7,804 Updated Jun 12, 2026

Korean Sejong corpus download and simple analysis

Shell 149 24 Updated May 9, 2019

Pre-trained word vectors of 30+ languages

Python 2,232 388 Updated Oct 11, 2018
5 Updated Dec 21, 2022

A Large-Scale Open-Domain Sign Language Translation Dataset (ASL-English)

Python 83 11 Updated Jul 23, 2025

Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text

Python 19 1 Updated Mar 31, 2025

Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)

Python 182 25 Updated May 27, 2024

Spacy NER annotator using ipywidgets

Python 125 24 Updated Mar 25, 2024

The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)

Python 119 10 Updated Oct 8, 2020

Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.

Jupyter Notebook 49 10 Updated Sep 6, 2022

An Ollama-based chatbot. This project uses the Llama 3 8B model via Groq.

Python 7 Updated Jul 4, 2025
TeX 371 189 Updated Aug 23, 2020

LLMs for Constructed Languages

HTML 48 5 Updated Apr 21, 2026

https://sharedtask.duolingo.com

Python 51 15 Updated Mar 19, 2020

Supplementary repo for UD_Korean-KSL

1 Updated Jul 15, 2025

A tool for analyzing ASC usage in English texts

Python 3 Updated Nov 22, 2025

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 5,556 896 Updated May 12, 2025

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Python 19,585 1,519 Updated Jun 12, 2026

A dataset containing human-human knowledge-grounded open-domain conversations.

Python 671 99 Updated Aug 2, 2024

A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.

Python 3,140 3,907 Updated Jun 12, 2026

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings

C 7,220 1,545 Updated Jul 27, 2025

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

Python 968 220 Updated May 21, 2024

An Argument Structure Construction Treebank

Python 7 1 Updated Feb 11, 2026

Tools for checking ACL paper submissions

Python 1,001 61 Updated Dec 6, 2025