-
TFE-DCN Public
Forked from jianxiong-zhou/TFE-DCN[WACV 2023] Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization
Python UpdatedSep 9, 2024 -
PHATE Public
Forked from KrishnaswamyLab/PHATEPHATE (Potential of Heat-diffusion for Affinity-based Transition Embedding) is a tool for visualizing high dimensional data.
Python GNU General Public License v2.0 UpdatedAug 21, 2024 -
Awesome-Multimodal-Large-Language-Models Public
Forked from BradyFU/Awesome-Multimodal-Large-Language-Models✨✨Latest Advances on Multimodal Large Language Models
UpdatedAug 7, 2024 -
Awesome-LLMs-for-Video-Understanding Public
Forked from yunlong10/Awesome-LLMs-for-Video-Understanding🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
UpdatedAug 1, 2024 -
textgrad Public
Forked from zou-group/textgradTextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
Python MIT License UpdatedJul 23, 2024 -
LLMs-from-scratch Public
Forked from rasbt/LLMs-from-scratchImplementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Jupyter Notebook Other UpdatedJul 14, 2024 -
-
facenet-pytorch Public
Forked from timesler/facenet-pytorchPretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Python MIT License UpdatedJun 26, 2024 -
awesome-multimodal-ml Public
Forked from pliang279/awesome-multimodal-mlReading list for research topics in multimodal machine learning
MIT License UpdatedJun 19, 2024 -
SER-on-WER-and-Fusion Public
Forked from yc-li20/SLT2024-SER-on-WER-and-FusionCode for paper "Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques"
Python UpdatedJun 17, 2024 -
ECCV2022-DELU Public
Forked from MengyuanChen21/ECCV2022-DELU[ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization
Python MIT License UpdatedJun 14, 2024 -
conclugen Public
Forked from tub-cv-group/conclugenOfficial repository for our CVPR 2024 Workshop paper "Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition".
Python UpdatedJun 3, 2024 -
RJCMA Public
Forked from praveena2j/RJCMAABAW6 (CVPR-W) We achieved second place in the valence arousal challenge of ABAW6
Python UpdatedMay 21, 2024 -
EACL Public
Forked from Yu-Fangxu/EACLOfficial code of "Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation" (Findings of NAACL 2024)
Python UpdatedMay 10, 2024 -
maskedmultiqueryslot Public
Forked from rishavpramanik/maskedmultiqueryslotPython MIT License UpdatedMay 1, 2024 -
Awesome-Open-Vocabulary-Semantic-Segmentation Public
Forked from Qinying-Liu/Awesome-Open-Vocabulary-Semantic-SegmentationA curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
UpdatedApr 27, 2024 -
LibreFace Public
Forked from ihp-lab/LibreFace[WACV 2024] LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis
Python Other UpdatedApr 26, 2024 -
CASE Public
Forked from Qinying-Liu/CASEAccepted by ICCV2023, Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach
Python UpdatedApr 24, 2024 -
flowsam Public
Forked from Jyxarthur/flowsamOfficial Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman
Python Apache License 2.0 UpdatedApr 19, 2024 -
MedCLIP Public
Forked from RyanWangZf/MedCLIPEMNLP'22 | MedCLIP: Contrastive Learning from Unpaired Medical Images and Texts
Python UpdatedApr 12, 2024 -
TOP Public
Forked from miccaiif/TOP[NeurIPS 2023] The Rise of AI Language Pathologists: Exploring Two-level Prompt Learning for Few-shot Weakly-supervised Whole Slide Image Classification
Python UpdatedApr 10, 2024 -
CorrMatch Public
Forked from BBBBchan/CorrMatchOfficial code for "CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation"
Python UpdatedApr 7, 2024 -
Macaw-LLM Public
Forked from lyuchenyang/Macaw-LLMMacaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
Python Apache License 2.0 UpdatedApr 3, 2024 -
dinov2 Public
Forked from facebookresearch/dinov2PyTorch code and models for the DINOv2 self-supervised learning method.
Jupyter Notebook Apache License 2.0 UpdatedMar 29, 2024 -
-
LoRA-ViT Public
Forked from JamesQFreeman/LoRA-ViTLow rank adaptation for Vision Transformer
Python GNU General Public License v3.0 UpdatedMar 18, 2024 -
MM-Align Public
Forked from declare-lab/MM-Align[EMNLP 2022] This repository contains the official implementation of the paper "MM-Align: Learning Optimal Transport-based Alignment Dynamics for Fast and Accurate Inference on Missing Modality Seq…
Python MIT License UpdatedMar 10, 2024 -
MELD Public
Forked from declare-lab/MELDMELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation
Python GNU General Public License v3.0 UpdatedMar 10, 2024 -
papers_for_protein_design_using_DL Public
Forked from Peldom/papers_for_protein_design_using_DLList of papers about Proteins Design using Deep Learning
GNU General Public License v3.0 UpdatedMar 8, 2024 -
Awesome-Segment-Anything Public
Forked from liliu-avril/Awesome-Segment-AnythingThis repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
MIT License UpdatedMar 8, 2024