👋 Hi there, I'm Jinheng Xie, final-year PhD student at National University of Singapore, working with Professor Mike Shou. I’ve had the privilege of interning at Google Research, Google DeepMind, ByteDance, and Tencent.
My research focuses on unifying multimodal understanding and generation within a unified multimodal architecture. I have trained two unified multimodal models—Show-o and Show-o2—with up to 7B parameters on billion-scale datasets.