Skip to content
View fujingling's full-sized avatar

Block or report fujingling

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The code implementation for UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings (ICLR 2026).

Python 51 2 Updated Feb 25, 2026

ClawPhD is an agent for research that can turn academic papers into publication-ready diagrams, posters, videos, and more.

Python 149 10 Updated Mar 25, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 340,029 67,020 Updated Mar 29, 2026

[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 398 13 Updated Mar 26, 2025

A curated list of research based on CLIP.

295 20 Updated Nov 17, 2024

[ICCV2025] A Token-level Text Image Foundation Model for Document Understanding

Python 132 7 Updated Aug 27, 2025

Visual Spatial Tuning

Jupyter Notebook 189 8 Updated Mar 25, 2026

Q-Insight Family: Q-Insight, VQ-Insight and RALI (NeurIPS 2025 Spotlight, AAAI 2026 Oral, and ICLR 2026 Oral)

Python 277 11 Updated Mar 3, 2026

Public repository for Agent Skills

Python 105,414 11,650 Updated Mar 25, 2026

SigLIP-based Aesthetic Score Predictor

Python 391 9 Updated Dec 18, 2024

[CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Python 229 4 Updated Dec 16, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,833 73 Updated Feb 25, 2026

Official PyTorch implementation of "Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization" (ECCV 2024)

Python 32 Updated Mar 10, 2025

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 553 23 Updated Jan 4, 2026

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 99,408 12,712 Updated Mar 29, 2026

The Universe of Evaluation. All about the evaluation for LLMs.

Python 233 25 Updated Jul 9, 2024

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,241 159 Updated Mar 29, 2026

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 519 11 Updated Nov 14, 2025

Open-source unified multimodal model

Python 5,780 511 Updated Oct 27, 2025

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

Python 252 12 Updated Apr 3, 2024

Official inference repo for FLUX.1 models

Python 25,363 1,872 Updated Jul 31, 2025

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,539 189 Updated Apr 2, 2025

[ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"

Python 190 36 Updated Sep 23, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,569 1,266 Updated Nov 4, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,793 1,701 Updated Jan 30, 2026

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,903 89 Updated Jan 8, 2026

Index of URLs to pdf files all over the internet and scripts

Shell 25 3 Updated May 2, 2023

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,806 751 Updated Mar 27, 2026

A high-performance inference system for large language models, designed for production environments.

C++ 495 40 Updated Dec 19, 2025
Python 4,620 453 Updated Sep 14, 2025
Next