-
PicsArt & University of Bonn
- Berlin
- dsx0511.github.io
- https://orcid.org/0000-0002-4040-5585
- in/shuxiao-ding-790197142
- https://scholar.google.com/citations?user=QPLytlUAAAAJ&hl
Lists (2)
Sort Name ascending (A-Z)
Stars
A Claude Code research loop that keeps reading, practice, ideation, experiments, and writing connected through shared plain-text state.
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Official Repo For Pixel-LLM Codebase: Sa2VA (Arxiv-25), SAMTok (CVPR-26), VRT, SaSaSa2VA (1-st solution for LSVOS)
A Claude Code custom skill that serves as an ADHD-friendly companion assistant.
Strategic research thinking agents for Claude Code — idea evaluation, project triage, and structured brainstorming. Helps you decide which papers to write, not just how to write them.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
[CVPR2026] DriveLaW: Unifying Planning and Video Generation in a Latent Driving World
[EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
A general purpose scientific writer
Tracking the systems that automate scientific research — from literature scrapers to full paper-writing pipelines
FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity co…
Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows — particularly Claude Code
Agent skills for Obsidian. Teach your agent to use Markdown, Bases, JSON Canvas, and use the CLI.
An agentic skills framework & software development methodology that works.
A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
Evaluation code for Ref-L4, a new REC benchmark in the LMM era
Segment Anything in High Quality [NeurIPS 2023]
Multimodal Referring Segmentation
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
A benchmark dataset for GREx: GRES, GREC, and GREG [CVPR 2023 & IJCV 2026]