Skip to content
View andrew-miao's full-sized avatar
🤗
Focusing
🤗
Focusing
  • University of Waterloo
  • Waterloo, ON, CA

Highlights

  • Pro

Block or report andrew-miao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 193,669 109,959 Updated Jun 8, 2026

A framework for few-shot evaluation of language models.

Python 12,930 3,335 Updated Jun 2, 2026

A reproduction of the Deepseek-OCR model including training

Python 209 21 Updated Nov 21, 2025
Python 11,527 784 Updated Feb 9, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,297 1,992 Updated Jan 9, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 19,363 1,787 Updated Jan 30, 2026

[ICLR 2026] Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"

Python 429 28 Updated Feb 8, 2026

official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"

Python 195 11 Updated May 31, 2024

Fast and memory-efficient exact attention

Python 24,123 2,826 Updated Jun 10, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,671 17,979 Updated Jun 12, 2026

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,994 502 Updated Feb 10, 2026

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 105,186 14,044 Updated Jun 11, 2026

Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.

Python 1,151 65 Updated Dec 17, 2025

A curated list for awesome discrete diffusion models resources.

562 23 Updated Sep 9, 2025

[NeurIPS 2025] Open-source Multi-agent Poster Generation from Papers

Python 3,777 278 Updated Jun 8, 2026

✨✨Latest Advances on Multimodal Large Language Models

17,874 1,127 Updated May 1, 2026

Enable Comprehensive LLM Evaluation on Graph Reasoning

Python 79 2 Updated Jun 12, 2025

PyTorch implementation for RPO https://arxiv.org/abs/2407.12164

Python 5 Updated Nov 10, 2024

Minimal reproduction of DeepSeek R1-Zero

Python 13,156 1,584 Updated Feb 27, 2026

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,748 2,232 Updated Feb 1, 2025

EDM2 and Autoguidance -- Official PyTorch implementation

Python 844 57 Updated Dec 9, 2024

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,159 67 Updated Mar 20, 2025

Implementation of MagViT2 Tokenizer in Pytorch

Python 665 34 Updated Jan 12, 2025

Vector (and Scalar) Quantization, in Pytorch

Python 3,963 329 Updated Jun 5, 2026

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,617 791 Updated May 31, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 14,063 1,730 Updated Feb 29, 2024

A latent text-to-image diffusion model

Jupyter Notebook 73,097 10,603 Updated Jun 18, 2024

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,324 361 Updated Dec 4, 2025

Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.

Python 538 35 Updated Dec 6, 2024

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 937 43 Updated Sep 27, 2024
Next