SeungyounShin

🎯

Focusing

Seungyoun, Shin SeungyounShin

Hi

110 followers · 62 following

Lists (1)

Channel Talk

Company repos

Stars

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,918 351 Updated Dec 19, 2025

radixark / miles

Python 612 57 Updated Dec 19, 2025

langchain-ai / deepagents

Deepagents is an agent harness built on langchain and langgraph. Deep agents are equipped with a planning tool, a filesystem backend, and the ability to spawn subagents - making them well-equipped …

Python 7,289 1,120 Updated Dec 19, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,643 2,856 Updated Dec 20, 2025

hwang2006 / large-scale-lm-tutorials

Forked from tunib-ai/large-scale-lm-tutorials

Large-scale language modeling tutorials with PyTorch

Jupyter Notebook 6 4 Updated Jul 14, 2025

QwenLM / Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,140 191 Updated Oct 9, 2025

snuhcc / DICE-Bench

[ACL 2025] DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues

Python 25 1 Updated Jul 10, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,662 1,355 Updated Dec 17, 2025

elu-lab / matcha_tts_e

Jupyter Notebook 4 1 Updated Jun 3, 2024

0russwest0 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,049 75 Updated Nov 25, 2025

SeungyounShin / minimal-r1

Python 26 5 Updated Feb 11, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,443 1,991 Updated Nov 1, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 9,833 1,236 Updated Nov 3, 2025

openai / openai-cs-agents-demo

Demo of a customer service use case implemented with the OpenAI Agents SDK

Python 5,891 912 Updated Dec 18, 2025

kyutai-labs / delayed-streams-modeling

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,664 272 Updated Nov 26, 2025

jeongeun980906 / lerobot-mujoco-tutorial

Jupyter Notebook 280 36 Updated Sep 28, 2025

channel-io / ch-tts-llasa-rl-grpo

Python 41 4 Updated Aug 28, 2025

LeapLabTHU / Absolute-Zero-Reasoner

Official Repository of Absolute Zero Reasoner

Python 1,779 291 Updated Aug 24, 2025

thomasgauthier / csm-hf

Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers

Python 57 11 Updated May 17, 2025

GAIR-NLP / ToRL

Python 321 16 Updated May 24, 2025

VITA-MLLM / LUCY

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

Python 56 3 Updated Apr 14, 2025

OpenManus / OpenManus-RL

A live stream development of RL tuning for LLM agents

Python 3,683 514 Updated Oct 8, 2025

Ch-talk-SKM / audio_refiner

Python 3 Updated May 21, 2025

Ch-talk-SKM / MOSHI_forTPU

Python 3 Updated May 21, 2025

ra9hur / Decision-Transformers-For-Trading

Jupyter Notebook 27 6 Updated Sep 12, 2024

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,810 281 Updated Aug 3, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,742 2,405 Updated Nov 24, 2025

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python 1,121 131 Updated May 22, 2025

Persdre / NeurIPS-2024-LLM-Papers

Accepted LLM Papers in NeurIPS 2024

37 2 Updated Oct 13, 2024

Stability-AI / stable-codec

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

Python 412 29 Updated Sep 15, 2025