A Simple Three Dimensional Convolutional Neural Networks approach
-
Updated
Mar 6, 2020 - Python
A Simple Three Dimensional Convolutional Neural Networks approach
A python class compatible with TensorFlow to perform data augmentation on 3D objects during CNN training.
An experimental project for autonomous vehicle driving perception with steering angle prediction and semantic segmentation using a combination of UNet, attention and transformers.
Fight Detection From Surveillance Cameras by fine-tuning a PyTorch Pretrained Model
어린이집 CCTV로 학대상황 감지
lock mechanism with face recognition and liveness detection
This repository contains my personal code for the paper Learning Spatiotemporal Features with 3D Convolutional Networks by Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri.
My experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et al. and it includes video processing pipelines coded using mPyPl package. Model is being benchmarked on popular UCF101 dataset and achieves result…
Deep learning lip-reading model using Conv3D + BiLSTM + CTC architecture. Transcribes speech from mouth region video clips for accessibility applications.
🔍 Read lips in videos with an end-to-end deep learning model, enhancing accessibility and transcribing speech from mouth movements effectively.
Add a description, image, and links to the conv3d topic page so that developers can more easily learn about it.
To associate your repository with the conv3d topic, visit your repo's landing page and select "manage topics."