Skip to content
View chibohe's full-sized avatar

Block or report chibohe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
752 results for source starred repositories
Clear filter

The repository containing tools and information about the WoodScape dataset.

Python 669 131 Updated Aug 26, 2023

💫 Toolkit to help you get started with Spec-Driven Development

Shell 47,310 4,018 Updated Nov 7, 2025

A MCP server that supports mainstream eBook formats including EPUB, PDF and more. Simplify your eBook user experience with LLM.

Python 131 23 Updated Sep 7, 2025

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 730 49 Updated Nov 10, 2025

The best ChatGPT that $100 can buy.

Python 36,398 4,349 Updated Nov 5, 2025

轻量级大语言模型MiniMind的源码解读,包含tokenizer、RoPE、MoE、KV Cache、pretraining、SFT、LoRA、DPO等完整流程

394 40 Updated Jun 16, 2025

Use interactive notebook to break down MiniMind code and learn from scratch.

Jupyter Notebook 105 16 Updated Mar 28, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 33,278 3,875 Updated Nov 10, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,021 112 Updated Nov 9, 2025

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 359 15 Updated Sep 15, 2025
Python 72 4 Updated May 22, 2025

Implement a reasoning LLM in PyTorch from scratch, step by step

Python 1,935 241 Updated Nov 11, 2025

Reinforcement Learning of Vision Language Models with Self Visual Perception Reward

Python 141 18 Updated Sep 23, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,731 112 Updated Sep 16, 2025

"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…

Python 7,639 854 Updated Sep 30, 2025

The best workflows and configurations I've developed, having heavily used Claude Code since the day of it's release. Workflows are based off applied learnings from our AI-native startup.

3,090 465 Updated Sep 14, 2025

《解构大语言模型:从线性回归到通用人工智能》配套代码

Jupyter Notebook 256 50 Updated Oct 21, 2025

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 302 19 Updated Nov 2, 2025

minLoRA: a minimal PyTorch library that allows you to apply LoRA to any PyTorch model.

Jupyter Notebook 482 28 Updated Jun 21, 2023

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

723 30 Updated Nov 7, 2025

Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python 127 6 Updated Jun 30, 2025

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python 206 8 Updated Oct 15, 2025

YOLOE: Real-Time Seeing Anything [ICCV 2025]

Python 1,893 180 Updated Jun 26, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,258 58 Updated Oct 18, 2025

Code for Retrieval-Augmented Perception (ICML 2025)

Python 60 4 Updated Aug 10, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,108 37 Updated Oct 4, 2025

OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.

Jupyter Notebook 323 6 Updated Jun 1, 2025

Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grained visual understanding".

97 2 Updated Aug 21, 2025
Python 95 3 Updated Aug 14, 2025
Python 959 58 Updated Oct 20, 2025
Next