Skip to content
View yuiseki's full-sized avatar
🍻
I want to drink
🍻
I want to drink

Organizations

@nota @gyazo @sinsai @maltine-records @UNopenGIS @arakawatomonori @unvt

Block or report yuiseki

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Computer Vision

41 repositories

Open Source Computer Vision Library

C++ 85,373 56,418 Updated Dec 20, 2025

Datasets, Transforms and Models specific to Computer Vision

Python 17,387 7,185 Updated Dec 20, 2025

We write your reusable computer vision tools. 💜

Python 36,180 3,054 Updated Dec 15, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,073 1,088 Updated Nov 18, 2024

C-based/Cached/Core Computer Vision Library, A Modern Computer Vision Library

C++ 7,180 1,707 Updated Dec 19, 2025

An open source library and framework for deep learning on satellite and aerial imagery.

Python 2,186 394 Updated Sep 29, 2025

A full-body keyboard using gestures to type through computer vision

Python 1,945 27 Updated Jun 29, 2023

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Python 1,974 121 Updated Nov 30, 2023

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Python 863 53 Updated May 8, 2025

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

Python 897 67 Updated Jul 22, 2025

GIT: A Generative Image-to-text Transformer for Vision and Language

Python 578 71 Updated Dec 2, 2023

VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models

Python 338 54 Updated May 16, 2023

Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral

Python 239 18 Updated Mar 4, 2023

Inference Vision Transformer (ViT) in plain C/C++ with ggml

C++ 31 2 Updated Nov 23, 2023

This is the official repository for M2UGen

Jupyter Notebook 507 38 Updated Jan 2, 2025

CLIP inference in plain C/C++ with no extra dependencies

C++ 543 49 Updated Jun 19, 2025

Tesseract Open Source OCR Engine (main repository)

C++ 71,497 10,431 Updated Dec 15, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,963 6,181 Updated Sep 18, 2024

Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos.

Swift 2,906 444 Updated Jan 4, 2025

This repository aims to implement an Image Search engine powered by the CLIP model.

Python 46 4 Updated Jul 15, 2022

LLaVA-JP is a Japanese VLM trained by LLaVA method

Python 64 13 Updated Jul 3, 2024

Fast Segment Anything

Python 8,200 746 Updated Jul 30, 2024

A Python package for segmenting geospatial data with the Segment Anything Model (SAM)

Python 3,802 405 Updated Dec 18, 2025

SAM with text prompt

Python 2,510 291 Updated Aug 28, 2025

Fine-tune Segment-Anything Model with Lightning Fabric.

Python 568 54 Updated Mar 25, 2024

A SAM-based model for instance segmentation of images of grains

Jupyter Notebook 516 73 Updated Nov 19, 2025

A QGIS plugin tool using Segment Anything Model (SAM) to accelerate segmenting or delineating landforms in geospatial raster images.

Python 380 47 Updated Dec 3, 2025

Webassembly compilation of https://github.com/ImageMagick/ImageMagick & samples

TypeScript 913 91 Updated Oct 23, 2023