Skip to content
View nawnoes's full-sized avatar
😀
😀

Block or report nawnoes

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository for EXAONE 4.0 built by LG AI Research

105 9 Updated Aug 4, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,862 112 Updated Mar 18, 2025

Recipes to scale inference-time compute of open models

Python 1,131 132 Updated May 26, 2026

Official repository for EXAONE built by LG AI Research

181 14 Updated Aug 8, 2024

Official repository for EXAONE 3.5 built by LG AI Research

208 23 Updated Dec 16, 2024

official implementation of paper "Process Reward Model with Q-value Rankings"

Python 69 8 Updated Feb 5, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 21,612 2,303 Updated Apr 15, 2026

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

Python 6,824 948 Updated Jun 9, 2026

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

Python 115 12 Updated May 19, 2026

A flexible and efficient training framework for large-scale alignment tasks

Python 452 39 Updated Oct 23, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,896 370 Updated Dec 17, 2025

[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Python 191 17 Updated Jun 25, 2025
Python 306 22 Updated Jul 15, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 706 49 Updated Jan 20, 2025

Recipes to train reward model for RLHF.

Python 1,533 110 Updated Apr 24, 2025

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS 2025]

Jupyter Notebook 60 5 Updated Oct 11, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,891 71 Updated Jun 22, 2025
Python 4,522 492 Updated Apr 22, 2026

Schedule-Free Optimization in PyTorch

Python 2,300 79 Updated May 18, 2026

한국어 언어모델 다분야 사고력 벤치마크

Python 209 43 Updated Oct 17, 2024

terashuf shuffles multi-terabyte text files using limited memory

C++ 233 15 Updated Feb 5, 2023

Large Context Attention

Python 773 53 Updated Oct 13, 2025

Ring attention implementation with flash attention

Python 1,025 98 Updated Sep 10, 2025

Public Inflection Benchmarks

67 2 Updated Mar 6, 2024

Reward Model을 이용하여 언어모델의 답변을 평가하기

Python 30 2 Updated Feb 23, 2024

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Python 1,116 59 Updated Feb 2, 2025

The official PyTorch implementation of Google's Gemma models

Python 5,689 599 Updated May 30, 2025

An open collection of implementation tips, tricks and resources for training large language models

Python 501 23 Updated Mar 8, 2023
Next