Stars
[AAAI 2025] SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery (EMNLP'24)
A collection of AWESOME things about domain adaptation
🏃♀️ A curated list about human motion capture, analysis and synthesis.
State-of-the-Art Text Embeddings
All about FineGym (CVPR 2020 Oral): models, features, data, and more... keep starring and stay tuned!
Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A PyTorch implementation of the Transformer model in "Attention is All You Need".
A pytorch reproduction of { Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation }.
Skeleton-based Action Recognition
An open-source toolbox for action understanding based on PyTorch
OpenMMLab Detection Toolbox and Benchmark
Some basic examples of playing with RL
Implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models"
PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"
Code & Models for Temporal Segment Networks (TSN) in ECCV 2016