Stars
[EMNLP 2024🔥] Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.
Comparatively fine-tuning pretrained BERT models on downstream text classification tasks with different architectural configurations in PyTorch.
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Code for our CVPR 2018 paper "Learning Latent Super-Events to Detect Multiple Activities in Videos"
We ranked 1st in the DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD) paper "Bridging Text and Video: A Universal Multimodal Tra…
A repository of common methods, datasets, and tasks for video research
Official PyTorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"
Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Starter code in PyTorch for the Visual Dialog challenge
Activity Recognition Algorithms for the Charades Dataset