- FAIR @ Meta
- Seattle
- https://jayleicn.github.io/
- @jayleicn
Stars
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
QLoRA: Efficient Finetuning of Quantized LLMs
Toolkit for Elevater Benchmark
PyTorch code for "Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners"
PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)
PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)
PyTorch code for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
程序员延寿指南 | A programmer's guide to live longer
Rich is a Python library for rich text and beautiful formatting in the terminal.
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
My tools for the Slurm HPC workload manager
It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant, will benefit from reading it; however, it is my hope that even the most experienced research…
Examples of how to create colorful, annotated equations in LaTeX using TikZ.
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Comparing speed of different implementations of reading video into numpy arrays
[ACL 2021] mTVR: Multilingual Video Moment Retrieval