View aishoot's full-sized avatar

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,320 129 Updated Nov 9, 2025

🚀 [Large Models] Train a 26M-parameter vision multimodal VLM from scratch in just 1 hour!

Python 6,314 687 Updated Feb 4, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,313 229 Updated Jan 29, 2026

🚀🚀 [Large Models] Train a small 26M-parameter GPT entirely from scratch in just 2 hours!

Python 38,897 4,683 Updated Feb 6, 2026

Everyone Can Use English

TypeScript 33,479 4,711 Updated Feb 3, 2026

One-Person Business Methodology

HTML 4 Updated Aug 17, 2025

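"One-Person Business Methodology", second edition; also suitable for non-technical people running other side businesses (such as self-media, e-commerce, or digital goods).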

PHP 12,950 1,462 Updated Oct 10, 2025

Create Epic Math and Physics Animations & Study Notes From Text and Images.

Python 1,654 188 Updated Feb 1, 2026

A community-maintained Python framework for creating mathematical animations.

Python 36,681 2,647 Updated Feb 6, 2026
Python 1,087 51 Updated Jan 10, 2026

a-m-team's exploration in large language modeling

195 3 Updated May 29, 2025

adds Sequence Parallelism into LLaMA-Factory

Python 604 43 Updated Feb 5, 2026

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 283 15 Updated Sep 25, 2025

Fully open data curation for reasoning models

Python 2,206 185 Updated Dec 2, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,576 1,195 Updated Feb 7, 2026
Python 762 49 Updated Dec 23, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,893 371 Updated Dec 17, 2025
Python 970 111 Updated Jan 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 67,009 8,142 Updated Feb 4, 2026

A series of math-specific large language models of our Qwen2 series.

Python 1,065 152 Updated Jan 11, 2025

Train transformer language models with reinforcement learning.

Python 17,304 2,476 Updated Feb 7, 2026

Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers g…

150 8 Updated Jul 12, 2024

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 391 16 Updated Jan 19, 2025

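This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.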

Python 100 9 Updated Sep 14, 2024

[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 120 7 Updated Dec 10, 2024

[ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python 180 11 Updated Jun 8, 2025
Jupyter Notebook 481 36 Updated Jul 22, 2024

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 587 35 Updated Dec 9, 2024

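AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qwen/kimi/ollama] that can interact with viewers in real time during livestreams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine…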

Python 4,262 647 Updated Jul 29, 2025