alexzms/README.md

Hi there 👋 This is Minshen Zhang

GitHub | LinkedIn | Personal Website | Gmail

I am a first-year M.S. student in Computer Science at UC San Diego, advised by Prof. Hao Zhang. I hold a B.S. from ShanghaiTech University, where I was advised by Prof. Kewei Tu. My research lies at the intersection of Natural Language Processing and Machine Learning Systems. I am particularly interested in designing efficient architectures for Long-Context Modeling and in exploring World Models to bridge system efficiency with model capability.

Currently, I focus on scalable training and inference for generative models. I am the lead author of FlashMHF (under review), in which I proposed a novel Multi-Head FFN architecture backed by IO-aware Triton/CUDA kernels. As a core contributor to FastVideo at Hao AI Lab, I am also working on new model aggregation and optimized kernel implementations to accelerate video generation systems.

Looking ahead, I aim to extend my work on FlashMHF to broader LLM backbones and delve deeper into World Models within the FastVideo framework. I am also actively exploring retrieval-based methods and Continual Learning to solve the challenges of long-context understanding in foundation models.


Technical Focus: NLP, Triton/CUDA, LLM Architecture, Video Generation

Pinned

  1. FoundationResearch/FlashMHF (Public)

    Flash Multi-Head Feed-Forward Networks

    Python 1

  2. hao-ai-lab/FastVideo (Public)

    A unified inference and post-training framework for accelerated video generation.

    Python 2.9k 228

  3. learn_cuda (Public)

    Notes and exercises from learning CUDA (work in progress)

    C 3

  4. ray_tracing_cpp (Public)

    Ray tracing renderer written in C++

    C++ 2