Stars
An automated data pipeline scaling RL to pretraining levels
✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
Official code and dataset for our ICCV 2025 paper: MultiVerse: A Multi-Turn Conversation Benchmark for Evaluating Large Vision and Language Models
🚀🚀 [LLM] Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL 2025.
Bringing BERT into modernity via both architecture changes and scaling
A curated list of resources for Document Understanding (DU) topic
Creative Preference Optimization
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.
The official repository of the paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models"
Build, enrich, and transform datasets using AI models with no code
Collection of scripts and notebooks for OpenAI's latest GPT OSS models
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
A lightweight, powerful framework for multi-agent workflows
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation
Exposure-slot: Exposure-centric representation learning with Slot-in-Slot Attention for Region-aware Exposure Correction, Computer Vision and Pattern Recognition (CVPR), 2025.
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
A list of Korean speech recognition (STT) APIs, with performance benchmarks for each.