Skip to content
View deep-diver's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@deepbaksu @fast-ai-kr @codingpot

Block or report deep-diver

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Lightweight LLM Post-Training Library

Python 3 Updated Jun 14, 2026
Python 3 1 Updated Jun 9, 2026

Agentic AutoResearch scaffold for modular Diffusers experiments on JarvisLabs

Python 3 Updated Jun 3, 2026

https://deep-diver.github.io/tunix-accel/

Python 1 Updated Jun 16, 2026

dLLM training implementation on pure jax/flax (w/o pytorch) for Google TPUs(v4/v5e/v6e). #TPUSprint #TRC

Python 7 1 Updated Apr 25, 2026

Diffusion model research harness built on Keras 3 + JAX — unconditional DDPM, classifier-free guidance, and more

Python 2 1 Updated May 1, 2026

[ACL'26] Official Code for "ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning"

Python 21 1 Updated Apr 24, 2026

[arXiv'26] Official Code for "WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning"

Python 19 2 Updated Jun 22, 2026

Run ML workloads seamlessly on cloud TPUs and GPUs with a single Python decorator. No infrastructure management required.

Python 51 10 Updated Jun 23, 2026

🖥 Neural Computers' Data Engine

Python 202 27 Updated May 19, 2026

A library for Multilingual Unsupervised or Supervised word Embeddings

Python 3,245 558 Updated Aug 31, 2022

A meta-skill that designs domain-specific agent teams, defines specialized agents, and generates the skills they use.

HTML 8,166 1,106 Updated Jun 10, 2026

Algorithm powering the For You feed on X

Rust 26,358 4,517 Updated May 15, 2026
JavaScript 4 1 Updated Jan 12, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,498 719 Updated May 17, 2026
TypeScript 15,579 1,213 Updated Jul 2, 2026

Build realtime AI voice agents using FastRTC for low-latency streaming, Superlinked for vector search, Twilio for live phone calls, and Runpod for scalable GPU deployment.

Python 1,004 229 Updated Jan 10, 2026

Generate unique fashion moodboards to find your next design inspiration.

Python 4 3 Updated Feb 15, 2026
Python 6 3 Updated Dec 30, 2025

Agentic RL Training at Scale

Python 1,579 329 Updated Jul 3, 2026

Dream 7B, a large diffusion language model

Python 1,250 79 Updated Nov 21, 2025

Train transformer language models with reinforcement learning.

Python 18,749 2,817 Updated Jul 3, 2026

Easy and Efficient dLLM Fine-Tuning

Python 261 17 Updated Mar 2, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,265 4,164 Updated Jul 3, 2026

AlexNet model from ILSVRC 2012

Jupyter Notebook 64 28 Updated Oct 29, 2022

The best ChatGPT that $100 can buy.

Python 55,708 7,679 Updated Jul 2, 2026

A Lightweight LLM Post-Training Library

Python 2,359 316 Updated Jul 2, 2026
Next