Skip to content
View th-z's full-sized avatar

Block or report th-z

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.

Rust 179,666 106,531 Updated Apr 9, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,806 1,414 Updated Mar 3, 2026

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 52,459 5,558 Updated Feb 19, 2026
Python 481 24 Updated Mar 26, 2026

🔥 🔥 🔥 Awesome MLLMs/Benchmarks for Short/Long/Streaming Video Understanding 📹

65 1 Updated Sep 1, 2025

A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.

587 53 Updated Apr 1, 2026

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,583 202 Updated May 7, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,420 129 Updated Nov 9, 2025

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 353,264 71,292 Updated Apr 9, 2026

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Python 398 Updated Dec 16, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,829 367 Updated Apr 6, 2026

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 10,060 808 Updated Mar 30, 2026

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,899 2,421 Updated Apr 7, 2026

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 7,877 807 Updated Mar 24, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,597 5,261 Updated Apr 9, 2026

翻墙-科学上网

Kotlin 43,100 7,699 Updated Feb 7, 2026

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 5,541 892 Updated May 12, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,472 112 Updated Jan 19, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 75,917 15,382 Updated Apr 9, 2026

量化代码

Python 317 94 Updated Aug 10, 2025

[CVPR 2023 Highlight] Perspective Fields for Single Image Camera Calibration

Jupyter Notebook 307 23 Updated Nov 2, 2024

结巴中文分词

Python 34,843 6,705 Updated Aug 21, 2024

🚀 「大模型」1小时从0训练67M参数的视觉多模态VLM!🌏 Train a 67M-parameter VLM from scratch in just 1 hours!

Python 7,362 807 Updated Apr 4, 2026

🚀🚀 「大模型」2小时完全从0训练64M的小参数GPT!🌏 Train a 64M-parameter GPT from scratch in just 2h!

Python 46,274 5,695 Updated Apr 9, 2026

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,304 549 Updated May 5, 2025

A Framework of Small-scale Large Multimodal Models

Python 975 99 Updated Mar 29, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,078 1,971 Updated Jan 9, 2026

Everything about the SmolLM and SmolVLM family of models

Python 3,703 285 Updated Apr 2, 2026

The framework to prune LLMs to any size and any config.

Python 95 3 Updated Mar 1, 2024
Next