The open source coding agent.

TypeScript 144,661 16,363 Updated Apr 17, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,282 314 Updated Jan 14, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 359,093 73,054 Updated Apr 17, 2026

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,611 2,367 Updated Mar 16, 2026

[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale

Jupyter Notebook 209 24 Updated Nov 13, 2023

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 21,345 2,227 Updated Apr 4, 2026

The official Python library for the OpenAI API

Python 30,525 4,723 Updated Apr 16, 2026

LLM101n: Let's build a Storyteller

36,791 2,010 Updated Aug 1, 2024
Python 4,637 457 Updated Apr 15, 2026

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

78,353 9,108 Updated Feb 5, 2026

[arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs

Python 1,525 111 Updated Aug 19, 2024

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 18,288 2,723 Updated Apr 1, 2026

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,365 211 Updated Mar 5, 2024

Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach

Jupyter Notebook 468 29 Updated Dec 29, 2023

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,691 2,760 Updated Aug 12, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,774 456 Updated Aug 19, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 17,525 1,590 Updated Sep 5, 2024

Instruction Tuning with GPT-4

HTML 4,335 309 Updated Jun 11, 2023

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,204 3,624 Updated Jul 4, 2024

Open-Set Grounded Text-to-Image Generation

Python 2,220 166 Updated Mar 6, 2024

A playbook for systematically maximizing the performance of deep learning models.

30,031 2,418 Updated Jun 18, 2024

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Python 1,343 161 Updated Oct 5, 2023
Jupyter Notebook 3,052 287 Updated Feb 27, 2023

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Jupyter Notebook 7,738 801 Updated Dec 8, 2022

A compilation of network architectures for vision and others without usage of self-attention mechanism

81 7 Updated Jan 18, 2023

Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"

Python 106 8 Updated Aug 7, 2023

[NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222

Python 53 2 Updated Jun 12, 2023

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,873 552 Updated Mar 31, 2026

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

1,367 57 Updated Mar 14, 2024

Toolkit for Elevater Benchmark

Python 77 19 Updated Oct 17, 2023