
🚀🚀 [LLM] Train a 26M-parameter GPT from scratch in just 2h! 🌏

Python 35,842 4,232 Updated Dec 14, 2025

Everyone Can Use English

TypeScript 32,991 4,657 Updated Nov 25, 2025

One-Person Business Methodology

HTML 4 Updated Aug 17, 2025

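One-Person Business Methodology, second edition; also suited to non-technical people running other side businesses (such as self-media, e-commerce, or digital goods).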

PHP 12,301 1,379 Updated Oct 10, 2025

Create Epic Math and Physics Animations & Study Notes From Text and Images.

Python 1,452 167 Updated Dec 19, 2025

A community-maintained Python framework for creating mathematical animations.

Python 36,046 2,569 Updated Dec 16, 2025

a-m-team's exploration in large language modeling

195 3 Updated May 29, 2025

adds Sequence Parallelism into LLaMA-Factory

Python 599 41 Updated Oct 14, 2025

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Python 280 13 Updated Sep 25, 2025

Fully open data curation for reasoning models

Python 2,171 182 Updated Dec 2, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,750 1,070 Updated Dec 19, 2025
Python 754 49 Updated Sep 3, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,869 371 Updated Dec 17, 2025
Python 970 111 Updated Jan 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,236 7,787 Updated Dec 19, 2025

A series of math-specific large language models of our Qwen2 series.

Python 1,054 151 Updated Jan 11, 2025

Train transformer language models with reinforcement learning.

Python 16,706 2,370 Updated Dec 19, 2025

Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers g…

147 8 Updated Jul 12, 2024

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 388 16 Updated Jan 19, 2025

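This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.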

Python 98 9 Updated Sep 14, 2024

[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 119 7 Updated Dec 10, 2024

[ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.

Python 175 9 Updated Jun 8, 2025
Jupyter Notebook 477 34 Updated Jul 22, 2024

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 579 33 Updated Dec 9, 2024

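AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qwen/kimi/ollama] that can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on a local machine…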

Python 4,192 639 Updated Jul 29, 2025

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

561 34 Updated Oct 28, 2024

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python 443 24 Updated Oct 11, 2023

⚡ OVM for Planning in Mathematical Reasoning

Python 10 Updated Feb 20, 2024