
A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,399 128 Updated Nov 9, 2025

🚀 "Large models": train a 26M-parameter vision-language multimodal model (VLM) from scratch in just 1 hour! 🌏

Python 7,006 752 Updated Feb 4, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,426 246 Updated Mar 6, 2026

🚀🚀 "Large models": train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 43,566 5,231 Updated Mar 24, 2026

Everyone Can Use English

TypeScript 33,778 4,741 Updated Feb 3, 2026

One-Person Business Methodology

HTML 4 Updated Aug 17, 2025

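One-Person Business Methodology, second edition; also suitable for non-technical people pursuing other side businesses (such as self-media, e-commerce, or digital goods).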

PHP 14,136 1,582 Updated Oct 10, 2025

Create Epic Math and Physics Animations & Study Notes From Text and Images.

Python 1,741 199 Updated Mar 13, 2026

A community-maintained Python framework for creating mathematical animations.

Python 37,379 2,745 Updated Mar 18, 2026
Python 1,117 53 Updated Jan 10, 2026

a-m-team's exploration in large language modeling

195 3 Updated May 29, 2025

adds Sequence Parallelism into LLaMA-Factory

Python 605 42 Updated Feb 5, 2026

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 289 15 Updated Sep 25, 2025

Fully open data curation for reasoning models

Python 2,233 186 Updated Dec 2, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,346 1,296 Updated Mar 25, 2026
Python 762 47 Updated Dec 23, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,903 367 Updated Dec 17, 2025
Python 969 110 Updated Jan 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,026 8,415 Updated Mar 25, 2026

A series of math-specific large language models of our Qwen2 series.

Python 1,074 156 Updated Jan 11, 2025

Train transformer language models with reinforcement learning.

Python 17,782 2,586 Updated Mar 25, 2026

Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers g…

152 8 Updated Jul 12, 2024

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 392 16 Updated Jan 19, 2025

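This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.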

Python 102 9 Updated Sep 14, 2024

[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 121 7 Updated Dec 10, 2024

[ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python 187 11 Updated Jun 8, 2025
Jupyter Notebook 489 36 Updated Jul 22, 2024

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 591 35 Updated Dec 9, 2024

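AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qwen/kimi/ollama] that can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine…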

Python 4,314 652 Updated Jul 29, 2025