guijinSON

GUIJIN SON guijinSON

Yonsei University UIC Economics Major

89 followers · 19 following

Seoul, Korea
https://scholar.google.com/citations?user=Zf_eLDsAAAAJ&hl=en&oi=ao

Achievements

x2 x2 x2

Achievements

x2 x2 x2

Stars

boradorish / text-to-json

Python 1 Updated May 25, 2026

frenzymath / FATE

The FATE (Formal Algebra Theorem Evaluation) benchmarks.

54 3 Updated Feb 23, 2026

autolabhq / autolab

A benchmark for evaluating AI agents on frontier ultra long-horizon auto research tasks.

Python 142 15 Updated Jun 17, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 6,210 905 Updated Jun 18, 2026

pmxt-dev / pmxt

CCXT for prediction markets. PMXT is a unified API for trading on Polymarket, Kalshi, and more.

TypeScript 1,899 225 Updated Jun 18, 2026

hiyouga / MathRuler

A light-weight tool for evaluating LLMs in rule-based ways.

Python 87 11 Updated Jun 19, 2025

huggingface / yourbench

Forked from sumukshashidhar/yourbench

🤗 Benchmark Large Language Models Reliably On Your Data

HTML 448 41 Updated Apr 2, 2026

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,558 1,483 Updated Jun 18, 2026

prometheus-eval / scaling-evaluation-compute

Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"

12 Updated Mar 25, 2025

joey00072 / nanoGRPO

nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)

Python 144 9 Updated May 8, 2025

HAE-RAE / haerae-evaluation-toolkit

The most modern LLM evaluation toolkit

Python 70 11 Updated Apr 30, 2026

goddoe / RLYX

A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.

Python 38 4 Updated Aug 27, 2025

sail-sg / oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 661 63 Updated Jan 29, 2026

guijinSON / MM-Eval

Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"

Jupyter Notebook 20 4 Updated Oct 26, 2024

blockchain-etl / bitcoin-etl

ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ

Python 458 135 Updated May 2, 2025

daekeun-ml / evaluate-llm-on-korean-dataset

Performs benchmarking on two Korean datasets with minimal time and effort.

Python 45 8 Updated Jan 22, 2026

alvarobartt / investiny

🤏🏻 `investpy` but made tiny

Python 422 43 Updated Feb 28, 2026

arcee-ai / DistillKit

An Open Source Toolkit For LLM Distillation

Python 968 128 Updated May 12, 2026

yash-srivastava19 / arrakis

Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.

Jupyter Notebook 31 4 Updated Apr 14, 2026

ykwon0407 / DataInf

DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)

Jupyter Notebook 81 13 Updated Oct 3, 2024

prometheus-eval / prometheus-eval

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 1,092 71 Updated Apr 25, 2025

yule-BUAA / MergeLM

Codebase for Merging Language Models (ICML 2024)

Python 868 52 Updated May 5, 2024

guijinSON / MTI-Bench

Python 8 2 Updated Aug 16, 2024

segyges / pythia-embedding-analysis

Jupyter Notebook 2 2 Updated Mar 25, 2024

chakki-works / CoARiJ

Corpus of Annual Reports in Japan

Python 94 7 Updated Dec 19, 2020

LLaVA-VL / LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Python 767 58 Updated Feb 1, 2024

uyunho99 / Psatkiller

Jupyter Notebook 3 Updated Jan 31, 2024

arcee-ai / mergekit

Tools for merging pretrained large language models.

Python 7,162 743 Updated Jun 17, 2026

GoogleChromeLabs / chrome-for-testing

JavaScript 1,227 158 Updated Jun 18, 2026

guijinSON / KoLLM-LogBook

Forked from teknium1/LLM-Logbook

Korean Port for teknium1/LLM-Logbook

HTML 6 Updated Oct 31, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GUIJIN SON guijinSON

Achievements

Achievements

Block or report guijinSON

Stars

boradorish / text-to-json

frenzymath / FATE

autolabhq / autolab

THUDM / slime

pmxt-dev / pmxt

hiyouga / MathRuler

huggingface / yourbench

modelscope / ms-swift

prometheus-eval / scaling-evaluation-compute

joey00072 / nanoGRPO

HAE-RAE / haerae-evaluation-toolkit

goddoe / RLYX

sail-sg / oat

guijinSON / MM-Eval

blockchain-etl / bitcoin-etl

daekeun-ml / evaluate-llm-on-korean-dataset

alvarobartt / investiny

arcee-ai / DistillKit

yash-srivastava19 / arrakis

ykwon0407 / DataInf

prometheus-eval / prometheus-eval

yule-BUAA / MergeLM

guijinSON / MTI-Bench

segyges / pythia-embedding-analysis

chakki-works / CoARiJ

LLaVA-VL / LLaVA-Plus-Codebase

uyunho99 / Psatkiller

arcee-ai / mergekit

GoogleChromeLabs / chrome-for-testing

guijinSON / KoLLM-LogBook