- Pixocial
- Singapore
- https://bxz9200.github.io/
- https://orcid.org/0000-0003-0372-8198
Stars
Real-time global intelligence dashboard — AI-powered news aggregation, geopolitical monitoring, and infrastructure tracking in a unified situational awareness interface
A Next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
Light Image Video Generation Inference Framework
Wan: Open and Advanced Large-Scale Video Generative Models
[CVPR2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.
[NeurIPS 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
A curated list of recent diffusion models for video generation, editing, and various other applications.
cjeen / LoRAEdit
Forked from tdrussell/diffusion-pipe
We achieve high-quality first-frame-guided video editing given a reference image, while maintaining flexibility for incorporating additional reference conditions.
[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…
Awesome work on hand pose estimation/tracking
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
PyTorch implementation of Unimotion: Unifying 3D Human Motion Synthesis and Understanding.
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"
TradingAgents: Multi-Agents LLM Financial Trading Framework
[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer" (ICCV 2025)
Enjoy the magic of Diffusion models!