Skip to content
View SeungyounShin's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report SeungyounShin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

slime is an LLM post-training framework for RL Scaling.

Python 2,918 351 Updated Dec 19, 2025
Python 612 57 Updated Dec 19, 2025

Deepagents is an agent harness built on langchain and langgraph. Deep agents are equipped with a planning tool, a filesystem backend, and the ability to spawn subagents - making them well-equipped …

Python 7,289 1,120 Updated Dec 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,643 2,856 Updated Dec 20, 2025

Large-scale language modeling tutorials with PyTorch

Jupyter Notebook 6 4 Updated Jul 14, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,140 191 Updated Oct 9, 2025

[ACL 2025] DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues

Python 25 1 Updated Jul 10, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,662 1,355 Updated Dec 17, 2025
Jupyter Notebook 4 1 Updated Jun 3, 2024

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,049 75 Updated Nov 25, 2025
Python 26 5 Updated Feb 11, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,443 1,991 Updated Nov 1, 2025

Nano vLLM

Python 9,833 1,236 Updated Nov 3, 2025

Demo of a customer service use case implemented with the OpenAI Agents SDK

Python 5,891 912 Updated Dec 18, 2025

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,664 272 Updated Nov 26, 2025
Jupyter Notebook 280 36 Updated Sep 28, 2025

Official Repository of Absolute Zero Reasoner

Python 1,779 291 Updated Aug 24, 2025

Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers

Python 57 11 Updated May 17, 2025
Python 321 16 Updated May 24, 2025

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

Python 56 3 Updated Apr 14, 2025

A live stream development of RL tunning for LLM agents

Python 3,683 514 Updated Oct 8, 2025
Python 3 Updated May 21, 2025
Python 3 Updated May 21, 2025
Jupyter Notebook 27 6 Updated Sep 12, 2024

Simple RL training for reasoning

Python 3,810 281 Updated Aug 3, 2025

Fully open reproduction of DeepSeek-R1

Python 25,742 2,405 Updated Nov 24, 2025

Recipes to scale inference-time compute of open models

Python 1,121 131 Updated May 22, 2025

Accepted LLM Papers in NeurIPS 2024

37 2 Updated Oct 13, 2024

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

Python 412 29 Updated Sep 15, 2025
Next