🐵
Unacquainted with machine learning

Starred repositories

Video generation via code

Python 1,376 185 Updated Nov 25, 2025

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

TypeScript 20,991 926 Updated Dec 15, 2025

A stable & generalizable GRPO method for AR image generation

Python 26 1 Updated Oct 1, 2025

A python module to repair invalid JSON from LLMs

Python 4,194 161 Updated Dec 17, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,516 58 Updated Jun 14, 2025
Python 328 17 Updated May 31, 2025
Lean 66 25 Updated Nov 7, 2025
2 Updated Oct 9, 2023

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 412 6 Updated Aug 8, 2025

Block Puzzle is a classic puzzle game, made in Unity, in which you place randomly spawned blocks in suitable positions.

C# 56 24 Updated May 27, 2021

Efficient Triton Kernels for LLM Training

Python 5,962 452 Updated Dec 20, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,654 2,859 Updated Dec 21, 2025

A series of math-specific large language models of our Qwen2 series.

Python 1,054 151 Updated Jan 11, 2025

[AAAI 2025] Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning

Python 41 Updated Apr 14, 2025

A flexible and efficient training framework for large-scale alignment tasks

Python 444 39 Updated Oct 23, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,436 328 Updated Dec 19, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,408 635 Updated Dec 20, 2025

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 3,093 558 Updated Apr 15, 2024

KenLM: Faster and Smaller Language Model Queries

C++ 2,706 531 Updated Mar 30, 2025

Data and tools for generating and inspecting OLMo pre-training data.

Python 1,367 162 Updated Nov 5, 2025

Modeling, training, eval, and inference code for OLMo

Python 6,241 690 Updated Nov 24, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,421 7,020 Updated Dec 20, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,630 838 Updated Dec 18, 2025

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

284 8 Updated Aug 20, 2023

The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"

Python 39 4 Updated Jan 12, 2024

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,600 3,625 Updated Dec 20, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,837 583 Updated May 3, 2024