Official repository for EXAONE 4.0 built by LG AI Research

103 8 Updated Aug 4, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,843 110 Updated Mar 18, 2025

Recipes to scale inference-time compute of open models

Python 1,131 130 Updated Apr 2, 2026

Official repository for EXAONE built by LG AI Research

180 14 Updated Aug 8, 2024

Official repository for EXAONE 3.5 built by LG AI Research

205 23 Updated Dec 16, 2024

Official implementation of the paper "Process Reward Model with Q-value Rankings"

Python 66 7 Updated Feb 5, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 21,291 2,275 Updated Mar 11, 2025

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

Python 6,214 799 Updated Apr 10, 2026

"Improving Mathematical Reasoning with Process Supervision" by OpenAI

Python 114 11 Updated Feb 3, 2026

A flexible and efficient training framework for large-scale alignment tasks

Python 452 39 Updated Oct 23, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,902 368 Updated Dec 17, 2025

[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Python 187 16 Updated Jun 25, 2025
Python 308 23 Updated Jul 15, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 701 49 Updated Jan 20, 2025

Recipes to train reward models for RLHF.

Python 1,527 108 Updated Apr 24, 2025

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS 2025]

Jupyter Notebook 60 5 Updated Oct 11, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,877 70 Updated Jun 22, 2025
Python 4,436 480 Updated Jul 31, 2025

Schedule-Free Optimization in PyTorch

Python 2,272 74 Updated May 21, 2025

Multi-domain reasoning benchmark for Korean language models

Python 207 43 Updated Oct 17, 2024

terashuf shuffles multi-terabyte text files using limited memory

C++ 232 15 Updated Feb 5, 2023

Large Context Attention

Python 770 52 Updated Oct 13, 2025

Ring attention implementation with flash attention

Python 1,004 97 Updated Sep 10, 2025

Public Inflection Benchmarks

67 2 Updated Mar 6, 2024

Evaluating language model responses using a reward model

Python 29 2 Updated Feb 23, 2024

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Python 1,108 60 Updated Feb 2, 2025

The official PyTorch implementation of Google's Gemma models

Python 5,648 592 Updated May 30, 2025

An open collection of implementation tips, tricks and resources for training large language models

Python 496 22 Updated Mar 8, 2023