SeungyounShin

🎯

Focusing

Seungyoun, Shin SeungyounShin

110 followers · 62 following

Achievements

x2 x3

slime Public
Forked from THUDM/slime

slime is an LLM post-training framework for RL Scaling.

Python Apache License 2.0 Updated Dec 10, 2025
yet-another-claude-code Public

A minimal, hackable implementation of Claude’s code

Python 11 Updated Dec 7, 2025
minimal-web-browser Public

Python Updated Dec 4, 2025
deepagents Public
Forked from langchain-ai/deepagents

Deepagents is an agent harness built on langchain and langgraph. Deep agents are equipped with a planning tool, a filesystem backend, and the ability to spawn subagents - making them well-equipped …

Python MIT License Updated Dec 3, 2025
rl-learned-plan-search Public

From Prompted Plan-and-Act to RL-Incentivized, In-Weights Planning for Multi-Turn RAG

Python Apache License 2.0 Updated Nov 28, 2025
tau2-bench Public
Forked from sierra-research/tau2-bench

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python MIT License Updated Nov 28, 2025
NeMo Public
Forked from NVIDIA-NeMo/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python Apache License 2.0 Updated Nov 26, 2025
Qwen3-Omni Public
Forked from QwenLM/Qwen3-Omni

Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook Apache License 2.0 Updated Nov 25, 2025
qwen3_computer_use Public

Python 20 3 Updated Nov 23, 2025
trl Public
Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python Apache License 2.0 Updated Nov 23, 2025
tau-retail-rl Public

End-to-end reinforcement learning for retail domain tasks focused on exchange and cancel actions, inspired by the τ-bench framework.

Python 2 Apache License 2.0 Updated Nov 2, 2025
gpt-oss Public
Forked from openai/gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python Apache License 2.0 Updated Nov 1, 2025
SeungyounShin.github.io Public template
Forked from alshedivat/al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML MIT License Updated Oct 27, 2025
ALF-bench Public

Python Updated Oct 22, 2025
FlashCosyVoice Public
Forked from xingchensong/FlashCosyVoice

FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.

Python Apache License 2.0 Updated Sep 15, 2025
verl Public
Forked from volcengine/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python Apache License 2.0 Updated Sep 13, 2025
tau-bench Public
Forked from sierra-research/tau-bench

Code and Data for Tau-Bench

Python MIT License Updated Aug 12, 2025
higgs-audio Public
Forked from boson-ai/higgs-audio

Text-audio foundation model from Boson AI

Python Apache License 2.0 Updated Jul 31, 2025
Kimi-Audio Public
Forked from MoonshotAI/Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python Updated Jul 22, 2025
icefall Public
Forked from k2-fsa/icefall

Python Apache License 2.0 Updated Jun 19, 2025
EasyAgentRL Public

verl-based search + code agent like o3

Python 1 Apache License 2.0 Updated Jun 11, 2025
LLaSA_training Public
Forked from zhenye234/LLaSA_training

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python Other Updated Jun 9, 2025
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 1 Apache License 2.0 Updated May 4, 2025
moshi-finetune Public
Forked from nu-dialogue/moshi-finetune

Fine-tuning Moshi/J-Moshi on your own spoken dialogue data

Python Apache License 2.0 Updated Apr 11, 2025
Search-R1 Public
Forked from PeterGriffinJin/Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python Apache License 2.0 Updated Mar 27, 2025
qwen-flax Public

Python Updated Feb 27, 2025
minimal-r1 Public

Python 26 5 Updated Feb 11, 2025
open-r1 Public
Forked from huggingface/open-r1

Fully open reproduction of DeepSeek-R1

Python Apache License 2.0 Updated Feb 7, 2025
minimal-mcts-llm Public

minimal implementation of mcts-llm

Python 1 Updated Jan 22, 2025
search-and-learn Public
Forked from huggingface/search-and-learn

Python Apache License 2.0 Updated Dec 18, 2024
