Seonghwan Kim nawnoes

72 followers · 19 following

Seoul, Korea

Stars

LG-AI-EXAONE / EXAONE-4.0

Official repository for EXAONE 4.0 built by LG AI Research

105 stars · 9 forks · Updated Aug 4, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python · 1,862 stars · 112 forks · Updated Mar 18, 2025

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python · 1,131 stars · 132 forks · Updated May 26, 2026

LG-AI-EXAONE / EXAONE-3.0

Official repository for EXAONE built by LG AI Research

181 stars · 14 forks · Updated Aug 8, 2024

LG-AI-EXAONE / EXAONE-3.5

Official repository for EXAONE 3.5 built by LG AI Research

208 stars · 23 forks · Updated Dec 16, 2024

WindyLee0822 / Process_Q_Model

Official implementation of the paper "Process Reward Model with Q-value Rankings"

Python · 69 stars · 8 forks · Updated Feb 5, 2025

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python · 21,612 stars · 2,303 forks · Updated Apr 15, 2026

kyegomez / swarms

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

Python · 6,824 stars · 948 forks · Updated Jun 9, 2026

kyegomez / Lets-Verify-Step-by-Step

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

Python · 115 stars · 12 forks · Updated May 19, 2026

alibaba / ChatLearn

A flexible and efficient training framework for large-scale alignment tasks

Python · 452 stars · 39 forks · Updated Oct 23, 2025

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,896 stars · 370 forks · Updated Dec 17, 2025

tianyi-lab / Superfiltering

[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Python · 191 stars · 17 forks · Updated Jun 25, 2025

deepseek-ai / DeepSeek-Prover-V1.5

Python · 577 stars · 239 forks · Updated Aug 16, 2024

google-deepmind / nanodo

Python · 306 stars · 22 forks · Updated Jul 15, 2024

THUDM / ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python · 706 stars · 49 forks · Updated Jan 20, 2025

RLHFlow / RLHF-Reward-Modeling

Recipes for training reward models for RLHF.

Python · 1,533 stars · 110 forks · Updated Apr 24, 2025

siyan-zhao / prepacking

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS 2025]

Jupyter Notebook · 60 stars · 5 forks · Updated Oct 11, 2024

google-deepmind / penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Python · 1,891 stars · 71 forks · Updated Jun 22, 2025

openai / simple-evals

Python · 4,522 stars · 492 forks · Updated Apr 22, 2026

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python · 2,300 stars · 79 forks · Updated May 18, 2026

instructkr / LogicKor

A multi-domain reasoning benchmark for Korean language models

Python · 209 stars · 43 forks · Updated Oct 17, 2024

alexandres / terashuf

terashuf shuffles multi-terabyte text files using limited memory

C++ · 233 stars · 15 forks · Updated Feb 5, 2023

haoliuhl / ringattention

Large Context Attention

Python · 773 stars · 53 forks · Updated Oct 13, 2025

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python · 1,025 stars · 98 forks · Updated Sep 10, 2025

InflectionAI / Inflection-Benchmarks

Public Inflection Benchmarks

67 stars · 2 forks · Updated Mar 6, 2024

openai / transformer-debugger

Python · 4,116 stars · 239 forks · Updated Apr 15, 2026

HeegyuKim / ko-rm-judge

Evaluating language model responses using a reward model

Python · 30 stars · 2 forks · Updated Feb 23, 2024

datadreamer-dev / DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Python · 1,116 stars · 59 forks · Updated Feb 2, 2025

google / gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Python · 5,689 stars · 599 forks · Updated May 30, 2025

huggingface / large_language_model_training_playbook

An open collection of implementation tips, tricks and resources for training large language models

Python · 501 stars · 23 forks · Updated Mar 8, 2023

Lists (9)

🏃‍♂️ Crawl

Dataset

📔 Knowledge

📔 NLP

👣 Preprocess

🐎 Reinforcement&Meta Learning

🕵️‍♀️ Retrieval

🛠 Tool

💄 Transformers
