-
The Pennsylvania State University
-
17:59
(UTC -05:00) - www.linkedin.com/in/anh-phan-2705
- @anhphan2705
- anhphan2705
Highlights
Stars
Using YOLO object detection, this program will detect if a person is drowning. This project is still a work in progress, so it can only be implemented with a computer's webcam, and doesn't work com…
Drowning Detector - A computer vision project using OpenCV and deep learning to detect drowning incidents in videos.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Torchreid: Deep learning person re-identification in PyTorch.
Scripts to stabilize video stream from wifibroadcast for FPV
Startup is free Next.js template for SaaS startups comes with all the essential pages, components, and sections you need to launch a complete business website.
RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning.
Learn Low Level Design (LLD) and prepare for interviews using free resources.
An autonomous drone to follow a person, using the OAK-D Lite and MAVLink
We write your reusable computer vision tools. 💜
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Lightweight stereo matching network based on MobileNet blocks
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Using Temporal Fusion Transformer for Book sales forecasting use case. We use the model implementation available in Pytorch Forecasting library.
Productive, portable, and performant GPU programming in Python.
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
This is the official repository for our recent work: PIDNet
AsymFormer: Asymmetrical Cross-Modal Representation Learning for Mobile Platform Real-Time RGB-D Semantic Segmentation
Add bisenetv2. My implementation of BiSeNet