Stars
Official Repo of From Masks to Worlds: A Hitchhiker’s Guide to World Models.
Collection of AWESOME vision-language models for vision tasks
Fully open reproduction of DeepSeek-R1
Solve Visual Understanding with Reinforced VLMs
This repository is built in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the informati…
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Code for the paper: Prompts have evil twins (EMNLP 2024)
Resources for cultural NLP research
Famous Vision Language Models and Their Architectures
Reading list for research topics in multimodal machine learning
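Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments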
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
A collection of AWESOME things about mixture-of-experts
Cross-modal few-shot adaptation with CLIP
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
PyTorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
The simplest, fastest repository for training/finetuning medium-sized GPTs.
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Optimized Stable Diffusion modified to run on lower GPU VRAM
Google Research
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)
This is the official implementation of NeurIPS 2021 "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval".
PyTorch original implementation of Cross-lingual Language Model Pretraining.
Links to conference/journal publications in automated fact-checking (resources for the TACL22/EMNLP23 paper).