-
Singapore University of Technology And Design
- Singapore
Lists (1)
Sort Name ascending (A-Z)
Stars
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
GRID: Generative Recommendation with Semantic IDs
This is an official PyTorch implementation of our NeurIPS 2023 paper "GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization"
Another API-less Instagram pictures and videos downloader. (defunct)
😜Constrative Learning of Sentence Embedding using LoRA (EECS487 final project)
Code for the ALiBi method for transformer language models (ICLR 2022)
Code associated with the Don't Stop Pretraining ACL 2020 paper
novel deep learning research works with PaddlePaddle
Official implementation of "Can Language Understand Depth?"
MixGen: A New Multi-Modal Data Augmentation
Official implementation for "Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition" (ICML'22 Long Presentation)
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
MGeo: Multi-Modal Geographic Language Model Pre-Training
The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
LAVIS - A One-stop Library for Language-Vision Intelligence
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
TensorFlow code and pre-trained models for BERT
Code for paper "Identifying places using multimodal social network data"
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
Scrape Facebook public pages without an API key
SimVLM ---SIMPLE VISUAL LANGUAGE MODEL PRETRAINING WITH WEAK SUPERVISION