Jaewoo (Kyle) Song devjwsong

Applied Scientist @ Amazon

71 followers · 68 following

Achievements

x2 x2

Stars

IuryAlves / system-design-primer

Forked from donnemartin/system-design-primer

Learn how to design large-scale systems. Prep for the system design interview.

Python 31 4 Updated Mar 18, 2018

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 39,068 4,952 Updated Dec 9, 2025

SamsungSAILMontreal / TinyRecursiveModels

Python 6,070 928 Updated Dec 2, 2025

seannyD / VideoGameDialogueCorpusPublic

Python 63 8 Updated Oct 29, 2025

raghavc / LLM-RLHF-Tuning-with-PPO-and-DPO

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 181 19 Updated Mar 18, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,320 2,131 Updated Dec 18, 2025

vwxyzjn / lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Python 197 12 Updated Jan 14, 2024

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,379 174 Updated Jul 25, 2023

ash80 / RLHF_in_notebooks

RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks

Jupyter Notebook 224 20 Updated Jun 20, 2025

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 47,988 3,360 Updated Dec 20, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 24,463 1,913 Updated Jun 3, 2025

jquesnelle / yarn

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,652 126 Updated Apr 17, 2024

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,947 288 Updated May 15, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,058 4,669 Updated Dec 22, 2025

kaist-ina / stellatrain

Official GitHub repository for the SIGCOMM '24 paper "Accelerating Model Training in Multi-cluster Environments with Consumer-grade GPUs"

C++ 73 18 Updated Jul 13, 2024

Unity-Technologies / ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …

C# 18,962 4,402 Updated Dec 19, 2025

undreamai / LLMUnity

Create characters in Unity with LLMs!

C# 1,398 155 Updated Dec 17, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,434 7,024 Updated Dec 23, 2025

adewynter / Doom

Repository for the paper "Will GPT-4 Run DOOM?"

Python 24 4 Updated Nov 27, 2024

pygame / pygame

🐍🎮 pygame (the library) is a Free and Open Source python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, Ope…

C 8,533 3,937 Updated Nov 1, 2025