Skip to content
View zw-ruan's full-sized avatar

Block or report zw-ruan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
51 stars written in Jupyter Notebook
Clear filter

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 24,298 5,620 Updated Dec 17, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,735 1,683 Updated Jan 30, 2026

A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸

Jupyter Notebook 16,188 1,385 Updated Sep 7, 2023

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,576 1,199 Updated Mar 12, 2026

PRML algorithms implemented in Python

Jupyter Notebook 11,720 3,228 Updated Apr 5, 2025

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 7,408 1,100 Updated Aug 6, 2024

A course on aligning smol models.

Jupyter Notebook 6,614 2,300 Updated Feb 6, 2026

COCO API - Dataset @ http://cocodataset.org/

Jupyter Notebook 6,363 3,758 Updated Apr 17, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,837 549 Updated Aug 29, 2025

Efficient Image Captioning code in Torch, runs on GPU

Jupyter Notebook 5,578 1,266 Updated Nov 7, 2017

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,877 356 Updated Mar 3, 2026

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,348 390 Updated Nov 11, 2025

GTSAM is a library of C++ classes that implement smoothing and mapping (SAM) in robotics and vision, using factor graphs and Bayes networks as the underlying computing paradigm rather than sparse m…

Jupyter Notebook 3,346 917 Updated Mar 22, 2026

Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022

Jupyter Notebook 2,843 406 Updated May 31, 2024

Metric depth estimation from a single image

Jupyter Notebook 2,807 270 Updated May 5, 2025

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 2,460 179 Updated Mar 10, 2026

Efficient neural feature detector and descriptor

Jupyter Notebook 2,400 468 Updated May 5, 2025

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 2,122 352 Updated Feb 8, 2026

Tracking Any Point (TAP)

Jupyter Notebook 1,820 176 Updated Jan 22, 2026

A Modular Framework for 3D Gaussian Splatting and Beyond

Jupyter Notebook 1,733 98 Updated Nov 5, 2025

Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!

Jupyter Notebook 1,570 204 Updated Jan 15, 2025

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Jupyter Notebook 1,467 377 Updated Feb 3, 2023

assistant tools for attention visualization in deep learning

Jupyter Notebook 1,265 93 Updated Jun 9, 2022
Jupyter Notebook 1,218 548 Updated May 13, 2024

[ECCV`24&ICLR`25] CityGaussian Series for High-quality Large-Scale Scene Reconstruction with Gaussians

Jupyter Notebook 1,126 94 Updated Feb 7, 2026

Code for "Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed", CVPR 2024

Jupyter Notebook 968 97 Updated Jul 30, 2025

Superpoint Implemented in PyTorch: https://arxiv.org/abs/1712.07629

Jupyter Notebook 925 197 Updated Aug 11, 2023

[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation

Jupyter Notebook 897 43 Updated Jul 10, 2024

Code for "PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation" CVPR 2019 oral

Jupyter Notebook 882 149 Updated Jul 15, 2024

D2-Net: A Trainable CNN for Joint Description and Detection of Local Features

Jupyter Notebook 846 170 Updated Apr 8, 2024
Next