-
Fudan University
- Shanghai, China
- https://zane-zyqiu.github.io/
- https://orcid.org/0009-0001-5159-2128
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Ultra-light Harness scaffolding for AI agents, a mini version of claude code
[Notice] The repo temporarily locked while ownership transfer. in the meantime we maintain on here: https://github.com/ultraworkers/claw-code-parity. The fastest repo in history to surpass 100K sta…
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
AI-Powered Goal Planner — Break down any goal into daily actionable tasks / AI 驱动的目标规划工具
A curated list of papers on reasoning by video generation, organized by reasoning capability.
A curated list of papers on reinforcement learning for video generation
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform stron…
Enjoy the magic of Diffusion models!
The official GitHub page for ''Beyond the Last Frame: Process-aware Evaluation for Generative Video Reasoning''
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.
The open-source CapCut alternative
[CVPR'25] Official Implementations for Paper - AniDoc: Animation Creation Made Easier
[ICLR 2026] Streamlining Cartoon Production with Generative Post-Keyframing
[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation
Wan: Open and Advanced Large-Scale Video Generative Models
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
A Framework of Small-scale Large Multimodal Models
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A unified framework for 3D content generation.
Code for Text2Performer. Paper: Text2Performer: Text-Driven Human Video Generation
[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
A work list of recent human video generation method. This repository focus on half/full body human video generation method, The Nerf, Gaussian splashing, Motion Pose, and talking head/Portrait is n…