[Notice] The repo is temporarily locked during an ownership transfer. In the meantime, we maintain it here: https://github.com/ultraworkers/claw-code-parity. The fastest repo in history to surpass 100K sta…

Rust 139,752 101,584 Updated Apr 2, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,764 1,406 Updated Mar 3, 2026

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 50,452 5,264 Updated Feb 19, 2026
Python 459 23 Updated Mar 26, 2026

🔥 🔥 🔥 Awesome MLLMs/Benchmarks for Short/Long/Streaming Video Understanding 📹

64 1 Updated Sep 1, 2025

A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.

584 52 Updated Apr 1, 2026

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,577 203 Updated May 7, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,411 128 Updated Nov 9, 2025

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 345,484 68,659 Updated Apr 2, 2026

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Python 429 Updated Dec 16, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,796 365 Updated Mar 26, 2026

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,999 799 Updated Mar 30, 2026

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,838 2,409 Updated Mar 20, 2026

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 7,836 798 Updated Mar 24, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,347 5,127 Updated Apr 2, 2026

Bypassing the firewall: unrestricted internet access

Kotlin 42,997 7,687 Updated Feb 7, 2026

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 5,542 894 Updated May 12, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,455 111 Updated Jan 19, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,989 15,081 Updated Apr 2, 2026

Quant code

Python 317 94 Updated Aug 10, 2025

[CVPR 2023 Highlight] Perspective Fields for Single Image Camera Calibration

Jupyter Notebook 307 23 Updated Nov 2, 2024

Jieba Chinese text segmentation

Python 34,835 6,709 Updated Aug 21, 2024

🚀 [Large Models] Train a 67M-parameter vision multimodal VLM from scratch in just 1 hour! 🌏

Python 7,237 786 Updated Apr 2, 2026

🚀🚀 [Large Models] Train a small 64M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 45,390 5,513 Updated Apr 2, 2026

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,288 549 Updated May 5, 2025

A Framework of Small-scale Large Multimodal Models

Python 973 99 Updated Mar 29, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,028 1,957 Updated Jan 9, 2026

Everything about the SmolLM and SmolVLM family of models

Python 3,688 284 Updated Apr 2, 2026

The framework to prune LLMs to any size and any config.

Python 95 3 Updated Mar 1, 2024