fujingling

fujingling fujingling

Stars

ultraworkers / claw-code

[Notice] The repo temporarily locked while ownership transfer. in the meantime we maintain on here: https://github.com/ultraworkers/claw-code-parity. The fastest repo in history to surpass 100K sta…

Rust 140,786 101,558 Updated Apr 2, 2026

XMUDeepLIT / UME-R1

The code implementation for UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings (ICLR 2026).

Python 51 2 Updated Feb 25, 2026

ZhihaoAIRobotic / ClawPhD

ClawPhD is an agent for research that can turn academic papers into publication-ready diagrams, posters, videos, and more.

Python 149 10 Updated Mar 25, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 345,545 68,679 Updated Apr 2, 2026

zai-org / VisionReward

[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 398 13 Updated Mar 26, 2025

TalkUHulk / Awesome-CLIP

A curated list of research based on CLIP.

296 20 Updated Nov 17, 2024

Token-family / TokenFD

[ICCV2025] A Token-level Text Image Foundation Model for Document Understanding

Python 132 7 Updated Aug 27, 2025

Yangr116 / VST

Visual Spatial Tuning

Jupyter Notebook 191 8 Updated Mar 25, 2026

bytedance / Q-Insight

Q-Insight Family: Q-Insight, VQ-Insight and RALI (NeurIPS 2025 Spotlight, AAAI 2026 Oral, and ICLR 2026 Oral)

Python 280 12 Updated Mar 3, 2026

anthropics / skills

Public repository for Agent Skills

Python 108,990 12,188 Updated Mar 25, 2026

discus0434 / aesthetic-predictor-v2-5

SigLIP-based Aesthetic Score Predictor

Python 393 9 Updated Dec 18, 2024

zhiyuanyou / DeQA-Score

[CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Python 229 4 Updated Dec 16, 2025

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,840 75 Updated Feb 25, 2026

YeolJ00 / Personalized-Aesthetics

Official PyTorch implementation of "Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization" (ECCV 2024)

Python 32 Updated Mar 10, 2025

yongliang-wu / DFT

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 554 22 Updated Jan 4, 2026

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 99,984 12,826 Updated Apr 2, 2026

UpstageAI / evalverse

The Universe of Evaluation. All about the evaluation for LLMs.

Python 235 25 Updated Jul 9, 2024

zai-org / GLM-V

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,250 160 Updated Apr 1, 2026

FoundationVision / UniTok

[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding

Python 520 11 Updated Nov 14, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,782 512 Updated Oct 27, 2025

OpenGVLab / MM-Interleaved

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

Python 252 12 Updated Apr 3, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 25,373 1,874 Updated Jul 31, 2025

X-PLUG / mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,539 190 Updated Apr 2, 2025

steven-ccq / ViLAMP

[ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"

Python 190 36 Updated Sep 23, 2025

zai-org / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,583 1,268 Updated Nov 4, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,854 1,713 Updated Jan 30, 2026

showlab / Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,906 89 Updated Jan 8, 2026

applicaai / CCpdf

Index of URLs to pdf files all over the internet and scripts

Shell 25 3 Updated May 2, 2023

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,828 756 Updated Mar 30, 2026

vectorch-ai / ScaleLLM

A high-performance inference system for large language models, designed for production environments.

C++ 495 40 Updated Dec 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly