Skip to content
View fujingling's full-sized avatar

Block or report fujingling

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[Notice] The repo temporarily locked while ownership transfer. in the meantime we maintain on here: https://github.com/ultraworkers/claw-code-parity. The fastest repo in history to surpass 100K sta…

Rust 140,786 101,558 Updated Apr 2, 2026

The code implementation for UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings (ICLR 2026).

Python 51 2 Updated Feb 25, 2026

ClawPhD is an agent for research that can turn academic papers into publication-ready diagrams, posters, videos, and more.

Python 149 10 Updated Mar 25, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 345,545 68,679 Updated Apr 2, 2026

[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 398 13 Updated Mar 26, 2025

A curated list of research based on CLIP.

296 20 Updated Nov 17, 2024

[ICCV2025] A Token-level Text Image Foundation Model for Document Understanding

Python 132 7 Updated Aug 27, 2025

Visual Spatial Tuning

Jupyter Notebook 191 8 Updated Mar 25, 2026

Q-Insight Family: Q-Insight, VQ-Insight and RALI (NeurIPS 2025 Spotlight, AAAI 2026 Oral, and ICLR 2026 Oral)

Python 280 12 Updated Mar 3, 2026

Public repository for Agent Skills

Python 108,990 12,188 Updated Mar 25, 2026

SigLIP-based Aesthetic Score Predictor

Python 393 9 Updated Dec 18, 2024

[CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Python 229 4 Updated Dec 16, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,840 75 Updated Feb 25, 2026

Official PyTorch implementation of "Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization" (ECCV 2024)

Python 32 Updated Mar 10, 2025

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 554 22 Updated Jan 4, 2026

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 99,984 12,826 Updated Apr 2, 2026

The Universe of Evaluation. All about the evaluation for LLMs.

Python 235 25 Updated Jul 9, 2024

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,250 160 Updated Apr 1, 2026

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 520 11 Updated Nov 14, 2025

Open-source unified multimodal model

Python 5,782 512 Updated Oct 27, 2025

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

Python 252 12 Updated Apr 3, 2024

Official inference repo for FLUX.1 models

Python 25,373 1,874 Updated Jul 31, 2025

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,539 190 Updated Apr 2, 2025

[ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"

Python 190 36 Updated Sep 23, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,583 1,268 Updated Nov 4, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,854 1,713 Updated Jan 30, 2026

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,906 89 Updated Jan 8, 2026

Index of URLs to pdf files all over the internet and scripts

Shell 25 3 Updated May 2, 2023

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,828 756 Updated Mar 30, 2026

A high-performance inference system for large language models, designed for production environments.

C++ 495 40 Updated Dec 19, 2025
Next