Skip to content
View chibohe's full-sized avatar

Block or report chibohe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 705 46 Updated Nov 4, 2025

The best ChatGPT that $100 can buy.

Python 36,040 4,181 Updated Nov 5, 2025

轻量级大语言模型MiniMind的源码解读,包含tokenizer、RoPE、MoE、KV Cache、pretraining、SFT、LoRA、DPO等完整流程

376 38 Updated Jun 16, 2025

Use interactive notebook to break down MiniMind code and learn from scratch.

Jupyter Notebook 102 17 Updated Mar 28, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,698 3,795 Updated Nov 7, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

1,999 111 Updated Nov 5, 2025

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 358 15 Updated Sep 15, 2025
Python 72 4 Updated May 22, 2025

Implement a reasoning LLM in PyTorch from scratch, step by step

Python 1,914 238 Updated Nov 7, 2025

Reinforcement Learning of Vision Language Models with Self Visual Perception Reward

Python 139 18 Updated Sep 23, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,726 111 Updated Sep 16, 2025

"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…

Python 7,601 851 Updated Sep 30, 2025

The best workflows and configurations I've developed, having heavily used Claude Code since the day of it's release. Workflows are based off applied learnings from our AI-native startup.

3,056 461 Updated Sep 14, 2025

《解构大语言模型:从线性回归到通用人工智能》配套代码

Jupyter Notebook 255 50 Updated Oct 21, 2025

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 302 19 Updated Nov 2, 2025

minLoRA: a minimal PyTorch library that allows you to apply LoRA to any PyTorch model.

Jupyter Notebook 482 28 Updated Jun 21, 2023

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

712 28 Updated Nov 7, 2025

Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python 127 6 Updated Jun 30, 2025

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python 206 8 Updated Oct 15, 2025

YOLOE: Real-Time Seeing Anything [ICCV 2025]

Python 1,883 179 Updated Jun 26, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,252 58 Updated Oct 18, 2025

Code for Retrieval-Augmented Perception (ICML 2025)

Python 60 4 Updated Aug 10, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,093 37 Updated Oct 4, 2025

OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.

Jupyter Notebook 323 5 Updated Jun 1, 2025

Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grained visual understanding".

97 2 Updated Aug 21, 2025
Python 94 3 Updated Aug 14, 2025
Python 923 55 Updated Oct 20, 2025

Context engineering is the new vibe coding - it's the way to actually make AI coding assistants work. Claude Code is the best for this so that's what this repo is centered around, but you can apply…

Python 11,294 2,386 Updated Sep 24, 2025

Advanced Python Mastery (course by @dabeaz)

Python 12,230 2,064 Updated Oct 22, 2025

All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.

Python 1,067 42 Updated Nov 7, 2025
Next