dtunai

dthinky! dtunai

stealth mode rl @SelfPlayAI

72 followers · 14 following

Self Play Computer
dtunai.blog
@dtthinky

Highlights

Developer Program Member

parameter-golf-board Public

TypeScript Updated Mar 21, 2026
agent-skills-for-compute Public

Agent-optimized skills for the full LLM lifecycle — pre-training, post-training (RL/DPO/RLHF), inference, and autonomous research — plus GPU/TPU/QPU kernel programming, simulation, and scientific c…

2 MIT License Updated Mar 11, 2026
Mem-RLM Public

Memory augmented inference library for Recursive Language Models (RLMs), built on top of rlm.

Python 1 Updated Feb 21, 2026
ContextJira Public

Chrome Extension for extracting AI-ready Markdown from Jira Cloud & Server. Copy issue context — metadata, descriptions, comments, linked issues, attachments. Built for Claude, ChatGPT, Copilot and…

JavaScript 3 1 Updated Feb 19, 2026
continual_learning_via_sparse_memory_finetuning Public

Implementation of Lin et al., 2025.

Python 1 Updated Dec 2, 2025
batch-invariant-ops-jax Public

Python Updated Nov 30, 2025
sglang Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python Apache License 2.0 Updated Nov 26, 2025
mirage Public
Forked from mirage-project/mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ Apache License 2.0 Updated Nov 8, 2025
Streaming-DeepAgents Public

Streaming and task delegation for Langchain's Deepagents

Python 20 3 Updated Oct 20, 2025
awesome-gemini-cli Public

A curated list of awesome resources, tools, workflows, and guides for Google's Gemini CLI

gemini gemini-cli

Shell 35 6 Creative Commons Zero v1.0 Universal Updated Jun 26, 2025
SynthToT Public

SynthToT: Generate synthetic datasets for training through deliberate problem-solving (Tree of Thoughts, Yao et al., 2023).

prompt-toolkit ai-agents langchain tree-of-thoughts

Python 9 Apache License 2.0 Updated Jan 3, 2025
cpp-langchain Public

Tool for executing C/C++ code snippets with Langchain Agents.

code-execution ai-agents langchain

Python 6 1 Apache License 2.0 Updated Oct 14, 2024
xLSTM-Jax Public

Jax implementation of x-LSTM: Extended Long Short-Term Memory by Beck et al. (2024)

lstm neural-networks jax x-lstm

Python 16 Apache License 2.0 Updated Aug 6, 2024
Mixture-of-Depths-Jax Public

Jax module for the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Python 3 Apache License 2.0 Updated Jun 28, 2024
LongConv-Jax Public

Jax/Flax/Linen implementation of "Simple Hardware-Efficient Long Convolutions for Sequence Modeling"

machine-learning artificial-intelligence sequence-models

Python 3 Apache License 2.0 Updated Jun 10, 2024
Tri-RMSNorm Public

Efficient kernel for RMS normalization with fused operations; includes both forward and backward passes and is compatible with PyTorch.

machine-learning ai triton rmsnorm

Python 13 2 Apache License 2.0 Updated Jun 5, 2024
GradientAscent-Jax Public

Custom gradient ascent solver (optimizer) for JAX/Flax models

Python 2 Apache License 2.0 Updated Jun 4, 2024
Ring-Attention-Jax Public

Packaged Ring Attention with Blockwise Transformers for Near-Infinite Context implemented in Jax + Flax.

jax ring-attention

Python 1 Apache License 2.0 Updated May 10, 2024
Griffin-Jax Public

Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"

flax deepmind griffin jax

Python 15 Apache License 2.0 Updated May 10, 2024
triton-activations Public

Collection of neural network activation function kernels for Triton Language Compiler by OpenAI

machine-learning neural-network

Python 8 Updated Apr 11, 2024
PaLM-rlhf-pytorch-DS Public
Forked from lucidrains/PaLM-rlhf-pytorch

Modified fork, with a DeepSpeed training setup, of lucidrains' RLHF (Reinforcement Learning with Human Feedback) implementation on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python MIT License Updated Apr 11, 2024
MEGABYTE-pytorch-DS Public
Forked from lucidrains/MEGABYTE-pytorch

Modified fork, with a DeepSpeed training setup, of MEGABYTE - PyTorch by lucidrains: Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch

Python MIT License Updated Apr 11, 2024
kmeansops Public

PyKeOps-powered K-Means clustering algorithms module for both CPU and GPU

statistics clustering pytorch

Python 1 Updated Apr 11, 2024
jax-triton Public
Forked from jax-ml/jax-triton

jax-triton contains integrations between JAX and OpenAI Triton

Python Apache License 2.0 Updated Mar 12, 2024
mpi-ds Public

MPI Operator DeepSpeed Base Configuration for CIFAR-10

Dockerfile 4 Updated Feb 22, 2024
miniF2F-code Public

Dataset of formal Olympiad-level mathematics problems solved with Python code instructions.

dataset instruction

Shell 3 Updated Feb 22, 2024
smooth-activations Public

Smooth ReLU activations in CUDA. Shamir, G. I., et al.

machine-learning cuda

Python 1 Updated Feb 22, 2024
