
implementing minimal versions of joint-embedding predictive architecture (JEPA)

Python 202 19 Updated May 19, 2026

Official implementation of "StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues", CVPR 2026.

Python 36 1 Updated Mar 9, 2026
Python 1,238 108 Updated Jan 25, 2026

An open-source implementation for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,919 218 Updated May 26, 2026

Official Codebase for "Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights" (ICML 2026 Spotlight)

Python 615 66 Updated May 20, 2026

AI agents running research on single-GPU nanochat training automatically

Python 88,050 12,749 Updated Mar 26, 2026

[ICLR 2026 🔥 ] Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"

Python 151 5 Updated Jan 26, 2026

A course on aligning smol models.

Jupyter Notebook 6,661 2,280 Updated May 26, 2026

PyTorch Lightning implementation of Generative Recommenders

Python 113 27 Updated Sep 17, 2024

Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

Markdown 3 1 Updated Dec 11, 2023

(ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"

Jupyter Notebook 116 9 Updated Mar 13, 2024

The best ChatGPT that $100 can buy.

Python 55,315 7,591 Updated May 5, 2026

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 995 1,128 Updated Jul 4, 2024

simple pytorch pipeline for pretraining/finetuning vision models on imagenet-1k

Python 5 Updated Nov 17, 2024

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 4,243 603 Updated Jun 22, 2026

DeepConf: Deep Think with Confidence

Python 403 59 Updated Jun 20, 2026

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

452 24 Updated May 8, 2026

Official implementation of the paper "Transfer between Modalities with MetaQueries"

Python 323 14 Updated Oct 12, 2025

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 351 19 Updated Nov 2, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 97,529 14,941 Updated Jun 2, 2026

[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study

Python 16 Updated Nov 22, 2024

Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"

Jupyter Notebook 204 13 Updated Jun 10, 2025

A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

527 26 Updated Jun 3, 2025

👋 Overcomplete is a Vision-based SAE Toolbox

Python 145 8 Updated Dec 4, 2025

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 4,233 722 Updated Jun 17, 2026

Open-source implementation of AlphaEvolve

Python 6,583 1,055 Updated Mar 18, 2026

A final sanity checklist to help your CS paper get accepted, not desk rejected.

1,586 146 Updated May 25, 2026

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,914 496 Updated Oct 27, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,866 94 Updated Apr 18, 2025