zxteloiv

Haruki Kirigaya zxteloiv

God is in his heaven, all is right with the world.

138 followers · 77 following

Stars

chenditc / investment_data

Scripts and doc for https://www.dolthub.com/repositories/chenditc/investment_data

Python 1,198 166 Updated May 17, 2026

allen4747 / ZOPO

An efficient prompt optimization method that uses a zeroth-order method to optimize prompts for black-box LLMs.

Python 9 Updated Oct 21, 2025

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 16,208 1,527 Updated May 9, 2026

treeverse / dvc

🦉 Data Versioning and ML Experiments

Python 15,607 1,296 Updated Apr 28, 2026

dCaples / AutoDidact

Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.

Jupyter Notebook 689 63 Updated Mar 22, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 6,319 417 Updated Apr 23, 2026

GAIR-NLP / LIMR

Python 219 9 Updated Feb 20, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,650 1,034 Updated Apr 30, 2026

lsdefine / simple_GRPO

A very simple GRPO implementation for reproducing R1-like LLM thinking.

Python 1,673 132 Updated Nov 21, 2025

open-thoughts / open-thoughts

Fully open data curation for reasoning models

Python 2,259 187 Updated Dec 2, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,521 944 Updated May 15, 2026

facebookresearch / blt

Code for BLT research paper

Python 2,041 192 Updated Nov 3, 2025

schwartz-lab-NLP / TOVA

Token Omission Via Attention

Python 128 6 Updated Oct 13, 2024

virattt / ai-financial-agent

A financial agent for investment research

TypeScript 1,967 396 Updated Aug 19, 2025

24mlight / A_Share_investment_Agent

Python 2,403 612 Updated Jul 25, 2025

seal-rg / recurrent-pretraining

Pretraining and inference code for a large-scale depth-recurrent language model

Python 885 79 Updated Dec 29, 2025

eddycmu / demystify-long-cot

Python 337 18 Updated May 31, 2025

sail-sg / oat-zero

A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.

Python 250 9 Updated Apr 15, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 6,653 761 Updated Jun 25, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 26,017 2,418 Updated Apr 2, 2026

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 13,099 1,587 Updated Feb 27, 2026

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,450 165 Updated Mar 20, 2025

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,416 371 Updated May 18, 2026

MiuLab / SynData-Survey

9 Updated Jul 8, 2024

nlpxucan / WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python 9,480 750 Updated Jun 7, 2025

ZitongYang / Synthetic_Continued_Pretraining

Code implementation of synthetic continued pretraining

Jupyter Notebook 159 19 Updated Jan 6, 2025

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 27,934 5,958 Updated May 18, 2026

deepseek-ai / DeepSeek-R1

92,017 11,733 Updated Jun 27, 2025

ahmedkhaleel2004 / gitdiagram

Free, simple, fast interactive diagrams for any GitHub repository

TypeScript 15,609 1,197 Updated May 14, 2026

GAIR-NLP / O1-Journey

O1 Replication Journey

2,000 61 Updated Jan 14, 2025
