Highlights
- Pro
Stars
์๋ํ๊ธ hwp viewer and editor by rust and wasm
Official implementation of โWatch Your Step: A Fine-Grained Evaluation Framework for Multi-hop Knowledge Editing in Large Language Modelsโ (CIKM 2025).
Coding problems used in aider's polyglot benchmark
[ICLR 2026] Learning to Reason without External Rewards
K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models
[EMNLP 2024] Knowledge Graph Enhanced Large Language Model Editing
Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision
Acceptance rates for the major AI conferences
Universal cross-platform tokenizers binding to HF and sentencepiece
๐ A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion
BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora.
๐งฎ MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023
A collection of papers on automated medical coding from free-texts
์๋ฃ์ง์ด ์์ฑํ ๋ฌธ์๋ก ์ฌ์ ํ์ต๋ ํ๊ตญ์ด ์ํ word2vec ๋ชจ๋ธ / Korean word2vec model trained on clinical documents
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.