Starred repositories
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
[arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs
Fast and memory-efficient exact attention
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
The official code for the paper 'Structured Knowledge Distillation for Semantic Segmentation'. (CVPR 2019 ORAL) and extension to other tasks.
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
A curated list of foundation models for vision and language tasks
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
(CVPR 2023) The official project of "3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds"
✨✨Latest Advances on Multimodal Large Language Models
Awesome papers for markerless animal motion capture and 3D reconstruction.
awesome grounding: A curated list of research papers in visual grounding
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
QLoRA: Efficient Finetuning of Quantized LLMs
CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarchy.
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files in a chatbot!
Official code for VisProg (CVPR 2023 Best Paper!)
Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples" https://arxiv.org/abs/…
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX