woqk

32 stars written in Jupyter Notebook

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 54,069 6,332 Updated Sep 18, 2024

Google Research

Jupyter Notebook 37,830 8,397 Updated Apr 30, 2026

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,707 2,163 Updated Apr 13, 2026

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 19,048 2,430 Updated Apr 7, 2026

StableLM: Stability AI Language Models

Jupyter Notebook 15,717 1,015 Updated Apr 8, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 14,013 1,730 Updated Feb 29, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,782 1,220 Updated Apr 8, 2026

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,214 1,103 Updated Nov 18, 2024

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook 10,138 1,325 Updated Nov 9, 2023

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 9,919 1,049 Updated Feb 5, 2025

Best Practices, code samples, and documentation for Computer Vision.

Jupyter Notebook 9,847 1,205 Updated Feb 16, 2024

LLM-powered multiagent persona simulation for imagination enhancement and business insights.

Jupyter Notebook 7,425 652 Updated Apr 28, 2026

A unified framework for 3D content generation.

Jupyter Notebook 7,017 550 Updated Dec 16, 2024

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Jupyter Notebook 5,905 363 Updated Apr 16, 2026

Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

Jupyter Notebook 5,127 1,356 Updated Mar 21, 2020

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

Jupyter Notebook 5,027 589 Updated Feb 24, 2026

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,597 272 Updated Dec 14, 2025

[ICCV 2019] Monocular depth estimation from a single image

Jupyter Notebook 4,479 988 Updated Aug 10, 2024

[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

Jupyter Notebook 3,598 448 Updated Oct 25, 2023

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,993 197 Updated Apr 23, 2026

Incredibly fast Whisper-large-v3

Jupyter Notebook 1,877 110 Updated Feb 16, 2024

[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Jupyter Notebook 1,623 99 Updated May 29, 2025

A high-fidelity 3D face reconstruction library from monocular RGB image(s)

Jupyter Notebook 817 103 Updated Oct 18, 2023

Self-Supervised Learning of 3D Human Pose using Multi-view Geometry (CVPR2019)

Jupyter Notebook 610 94 Updated Jun 26, 2019

[ECCV 2024] Tokenize Anything via Prompting

Jupyter Notebook 602 27 Updated Dec 11, 2024

Joint deep network for feature line detection and description

Jupyter Notebook 586 77 Updated Dec 26, 2023

Large dataset of hand-object contact, hand- and object-pose, and 2.9 M RGB-D grasp images.

Jupyter Notebook 410 36 Updated May 7, 2025

Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)

Jupyter Notebook 401 27 Updated Dec 2, 2023

Low latency JSON generation using LLMs ⚡️

Jupyter Notebook 397 14 Updated Mar 10, 2024

[ICCV 2019] Depth Hints are complementary depth suggestions which improve monocular depth estimation algorithms trained from stereo pairs

Jupyter Notebook 188 20 Updated May 17, 2021