Amazon
- Seattle, WA
- songstudio.info
- @devjwsong
- in/jaewoo-song-13b375196
- devjwsong
Stars
Learn how to design large-scale systems. Prep for the system design interview.
Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
RLHF implementation details of OAI's 2019 codebase
Code for the paper Fine-Tuning Language Models from Human Preferences
RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Official inference framework for 1-bit LLMs
YaRN: Efficient Context Window Extension of Large Language Models
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Official GitHub repository for the SIGCOMM '24 paper "Accelerating Model Training in Multi-cluster Environments with Consumer-grade GPUs"
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
🐍🎮 pygame (the library) is a Free and Open Source Python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, Ope…
Godot Engine – Multi-platform 2D and 3D game engine
A Data Streaming Library for Efficient Neural Network Training
Decompilation of The Legend of Zelda: Twilight Princess (GCN)
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dmls-book`
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Fully open reproduction of DeepSeek-R1
A very simple framework for state-of-the-art Natural Language Processing (NLP)