-
HUST | Research intern at ByteDance
- Wuhan, China
- https://wjf5203.github.io/
Stars
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
✔(已完结)超级全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】【大飞 大模型Agent】
High-Resolution Image Synthesis with Latent Diffusion Models
LAVIS - A One-stop Library for Language-Vision Intelligence
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Taming Transformers for High-Resolution Image Synthesis
CoTracker is a model for tracking any point (pixel) on a video.
Open-source and strong foundation image recognition models.
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
pytorch1.0 updated. Support cpu test and demo. (Use detectron2, it's a masterpiece)
Official PyTorch repo for GAN's N' Roses. Diverse im2im and vid2vid selfie to anime translation.
This repo contains the code for 1D tokenizer and generator
This repository is intended to host tools and demos for ActivityNet
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
A Simple pytorch implementation of GradCAM and GradCAM++
Evaluating text-to-image/video/3D models with VQAScore
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
Official PyTorch implementation of FlowMo.
qjy981010 / cocoapi
Forked from youtubevos/cocoapiCOCO API Customized for OVIS evaluation