Skip to content
View VegB's full-sized avatar
🈚
🈚
  • UC Santa Barbara
  • Santa Barbara, CA

Organizations

@asyml

Block or report VegB

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Python 146 4 Updated Aug 23, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,673 598 Updated May 30, 2025
Python 46 7 Updated Dec 8, 2024

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,815 395 Updated Mar 27, 2026

Project webpage of LayoutGPT

JavaScript 2 Updated Jun 9, 2023

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,928 382 Updated Mar 14, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,724 1,124 Updated Apr 24, 2026

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,541 190 Updated Apr 2, 2025

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,729 2,905 Updated Sep 2, 2024

Official repo for LayoutGPT

Python 402 30 Updated Apr 10, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,373 212 Updated Mar 5, 2024

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 954 38 Updated Mar 19, 2025

An open-source framework for training large multimodal models.

Python 4,087 318 Updated Aug 31, 2024

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)

Jupyter Notebook 143 6 Updated Jun 10, 2025

Reverse engineered ChatGPT API

Python 27,934 4,395 Updated Aug 2, 2023

Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

Python 1,348 125 Updated Dec 1, 2023

Intuitive Annotation Tool for Information Extraction / Named Entity Recognition using localturk / Amazon Mechanical Turk

JavaScript 264 26 Updated Aug 25, 2019

Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training

Python 170 18 Updated Apr 27, 2023

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Python 1,267 66 Updated Oct 18, 2022

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

Python 1,376 78 Updated Jul 11, 2024

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 2,203 235 Updated May 20, 2024
Jupyter Notebook 231 30 Updated Dec 18, 2023

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,412 375 Updated Oct 19, 2025

Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.

Python 309 28 Updated Jul 12, 2024

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,234 618 Updated Jul 19, 2024

Simple image captioning model

Jupyter Notebook 1,417 224 Updated Jun 9, 2024

LaTeX template for dissertations in Peking University

TeX 606 200 Updated Apr 25, 2024

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 186,584 26,363 Updated Apr 28, 2026