Starred repositories
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
1st Place solution to the Cornell Birdcall Identification competition.
Code for BHI 2023 Paper on EEG Seizure Detection
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
PyTorch implementations of several SOTA backbone deep neural networks (such as ResNet, ResNeXt, RegNet) on one-dimensional (1D) signal/time-series data.
A PyTorch implementation of EfficientNet
C++/CUDA/Python multimedia utilities for NVIDIA Jetson
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
A simple face detect and alignment method, which is easy and stable.
Official repo for consistency models.
An open source implementation of CLIP.
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
An simple and effective segmentation framework
Python (Pytorch) and Matlab (MatConvNet) implementations of CVPR 2021 Image Matching Workshop paper DFM: A Performance Baseline for Deep Feature Matching
sketch + style = paints 🎨 (TOG2018/SIGGRAPH2018ASIA)
[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime
A Tensorflow implementation of AnimeGAN for fast photo animation ! This is the Open source of the paper 「AnimeGAN: a novel lightweight GAN for photo animation」, which uses the GAN framwork to trans…
Converts a 2D Color Image 🖼 into a Hand drawn Sketch ✏ Using Novel Technique.
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
implementation of "Combining Sketch and Tone for Pencil Drawing Production"
Fine-tuning StyleGAN2 for Cartoon Face Generation
Torch implementation of neural style algorithm
Code and data for paper "Deep Photo Style Transfer": https://arxiv.org/abs/1703.07511
Contrastive unpaired image-to-image translation, faster and lighter training than cyclegan (ECCV 2020, in PyTorch)