Skip to content
View zx-pan's full-sized avatar
  • USA

Highlights

  • Pro

Block or report zx-pan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 9,962 743 Updated Jun 16, 2026

Academic Research Skills for Claude Code: research → write → review → revise → finalize

Python 33,657 2,769 Updated Jun 21, 2026

[ICLR 2026 Oral] FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging

Python 107 Updated Apr 30, 2026
Python 585 28 Updated Jun 8, 2026

[ICLR26] Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Python 197 7 Updated Jan 26, 2026

[ICML 2026] d3LLM: Ultra-Fast Diffusion LLM 🚀

Python 143 8 Updated May 1, 2026

CVPR and NeurIPS poster examples and templates

2,001 172 Updated May 9, 2023

[CVPR2026] LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories

Python 53 1 Updated Jun 13, 2026

AI agent for Microscopy Image Analysis

Python 13 3 Updated May 13, 2026

A lightweight napari plugin that exposes the viewer over MCP (Message-Control Protocol) via a Python socket server. Built on top of FastMCP, it lets external MCP-speaking clients—such as autonomous…

Python 3 1 Updated Feb 9, 2026

Experiments for bioagent benchmark

Python 3 1 Updated May 26, 2026

Benchmark for evaluating LLM agents in bioinformatics

Python 30 Updated May 2, 2026

[ECCV 2026] MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Python 1,145 49 Updated Feb 26, 2026

Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex

Python 790 41 Updated Jun 18, 2026

Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Python 272 14 Updated Feb 10, 2026

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 8,446 1,485 Updated Jun 20, 2026

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 17,731 4,631 Updated Jan 9, 2026

[ICLR'26] Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Python 541 34 Updated Jan 27, 2026

[CVPR2026] PosterOmni: One model for poster creation—unifying local edits and global design for generalized multi-task image/poster-to-poster generation.

Python 201 12 Updated Feb 22, 2026

[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 412 14 Updated Mar 26, 2025

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Python 1,683 92 Updated Oct 29, 2025

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,615 83 Updated Oct 16, 2025

[CVPR 2026 Highlight] SGI: Structured 2D Gaussians for Efficient and Compact Large Image Representation

Python 7 Updated Jun 12, 2026

Machine Learning and Computer Vision Engineer - Technical Interview Questions

4,710 763 Updated Jan 24, 2026

[AAAI 2026] SlideTailor: Personalized Presentation Slide Generation for Scientific Papers

Python 56 3 Updated Apr 18, 2026

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 87,984 16,982 Updated Jun 22, 2026

"Paper2Slides: From Paper to Presentation in One Click"

Python 3,728 470 Updated May 20, 2026
Python 11,596 790 Updated Feb 9, 2026

Automatic Video Generation from Scientific Papers

Python 2,319 328 Updated Mar 5, 2026

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,484 47 Updated Mar 9, 2026
Next