Skip to content
View devjwsong's full-sized avatar

Highlights

  • Pro

Block or report devjwsong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Learn how to design large-scale systems. Prep for the system design interview.

Python 31 4 Updated Mar 18, 2018

The best ChatGPT that $100 can buy.

Python 39,068 4,952 Updated Dec 9, 2025

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 181 19 Updated Mar 18, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,320 2,131 Updated Dec 18, 2025

RLHF implementation details of OAI's 2019 codebase

Python 197 12 Updated Jan 14, 2024

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,379 174 Updated Jul 25, 2023

RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks

Jupyter Notebook 224 20 Updated Jun 20, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 47,988 3,360 Updated Dec 20, 2025

Official inference framework for 1-bit LLMs

Python 24,463 1,913 Updated Jun 3, 2025

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,652 126 Updated Apr 17, 2024

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,947 288 Updated May 15, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,058 4,669 Updated Dec 22, 2025

Official Github repository for the SIGCOMM '24 paper "Accelerating Model Training in Multi-cluster Environments with Consumer-grade GPUs"

C++ 73 18 Updated Jul 13, 2024

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …

C# 18,962 4,402 Updated Dec 19, 2025

Create characters in Unity with LLMs!

C# 1,398 155 Updated Dec 17, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,434 7,024 Updated Dec 23, 2025

Repository for the paper "Will GPT-4 Run DOOM?"

Python 24 4 Updated Nov 27, 2024

🐍🎮 pygame (the library) is a Free and Open Source python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, Ope…

C 8,533 3,937 Updated Nov 1, 2025

Godot Engine – Multi-platform 2D and 3D game engine

C++ 104,317 23,874 Updated Dec 22, 2025

A Data Streaming Library for Efficient Neural Network Training

Python 1,433 182 Updated Oct 27, 2025
C 4,490 392 Updated Dec 27, 2023

Decompilation of The Legend of Zelda: Twilight Princess (GCN)

C++ 1,296 137 Updated Dec 22, 2025

A list of open source games.

11,210 875 Updated Nov 28, 2025

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dmls-book`

HTML 9,709 1,507 Updated Apr 15, 2023

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 45,226 7,071 Updated Aug 18, 2024

Fully open reproduction of DeepSeek-R1

Python 25,746 2,405 Updated Nov 24, 2025

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,335 2,128 Updated Oct 27, 2025

Train Models Contrastively in Pytorch

Python 765 63 Updated Mar 26, 2025
Next