Skip to content
View DyeKuu's full-sized avatar
😻
😻

Block or report DyeKuu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Code, data and weights for the paper **What drives success in physical planning with Joint-Embedding Predictive World Models?**

Python 407 42 Updated Apr 11, 2026

[ICLR 2026] Tina: Tiny Reasoning Models via LoRA

Python 335 43 Updated Sep 23, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,548 359 Updated Jan 5, 2026

A Gym for Agentic LLMs

Python 494 33 Updated Jan 21, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,172 2,096 Updated Jun 9, 2026

A JAX-based Differentiable Density Functional Theory Framework for Materials

Python 50 1 Updated Apr 20, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,032 4,095 Updated Jun 18, 2026

BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated code.

Python 41 5 Updated Apr 15, 2025

The Ultimate program analysis framework.

Java 244 50 Updated Jun 18, 2026

Everything about the SmolLM and SmolVLM family of models

Python 3,819 298 Updated May 26, 2026

LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management

Python 77 5 Updated Jan 15, 2025

Stackfish is an open-source LLM-powered pipeline designed to automatically solve competitive programming problems.

C++ 53 5 Updated Dec 14, 2024

Mutation testing for Python

Python 636 71 Updated Apr 2, 2026

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,578 1,067 Updated Jul 1, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

4,212 288 Updated May 23, 2026

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

73 3 Updated Aug 31, 2024

Automatic Functional Differentiation in JAX

Python 87 2 Updated Sep 18, 2025

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,768 200 Updated Oct 2, 2025

What would you do with 1000 H100s...

Jupyter Notebook 1,178 73 Updated Jan 10, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,541 4,860 Updated Jun 16, 2026

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 1,001 58 Updated Jan 30, 2024

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Python 819 69 Updated Jun 8, 2025

how to optimize some algorithm in cuda.

Cuda 3,090 279 Updated Jun 9, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 161,704 33,540 Updated Jun 18, 2026

PyTorch extensions for high performance and large scale training.

Python 3,409 298 Updated Apr 26, 2025

Inference code for Llama models

Python 59,465 9,790 Updated Jan 26, 2025

Ongoing research training transformer models at scale

Python 16,749 4,101 Updated Jun 18, 2026
Python 22 Updated Mar 30, 2023

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,830 90 Updated Jun 13, 2026

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Jupyter Notebook 756 75 Updated Oct 26, 2022
Next