ujay-zheng
  • USTC
  • HeiFei/JinHua

Starred repositories

Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.

48 1 Updated Nov 11, 2025

Contexts Optical Compression

Python 21,510 1,925 Updated Oct 25, 2025
Python 23 3 Updated Oct 11, 2025

Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration

Python 26 2 Updated Dec 2, 2025

Dynamic Memory Management for Serving LLMs without PagedAttention

C 449 34 Updated May 30, 2025

A low-latency & high-throughput serving engine for LLMs

Python 457 58 Updated Oct 16, 2025

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,740 2,049 Updated Nov 19, 2024

An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"

Python 179 20 Updated Apr 6, 2024

Zero-Shot Detection via Vision and Language Knowledge Distillation

Python 8 Updated Mar 11, 2022

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 411 17 Updated Apr 25, 2025

[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Jupyter Notebook 94 3 Updated Mar 1, 2025

Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).

Python 399 10 Updated Aug 26, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,897 2,680 Updated Dec 15, 2025

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

Python 315 18 Updated Aug 10, 2023
Python 47 5 Updated Jul 30, 2025

Contrastive Language-Audio Pretraining

Python 1,942 198 Updated May 15, 2025

[WACV 2023] Audio-Visual Efficient Conformer (AVEC) for Robust Speech Recognition

Jupyter Notebook 100 10 Updated Feb 21, 2023

Grounded Language-Image Pre-training

Python 2,559 213 Updated Jan 24, 2024
Python 1,041 137 Updated Oct 3, 2022

An up-to-date list of works on Multi-Task Learning

375 28 Updated Oct 10, 2025

awesome-autonomous-driving

1,060 99 Updated Aug 19, 2024

Official implementation of CrossViT. https://arxiv.org/abs/2103.14899

Python 413 53 Updated Jan 12, 2022

Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.

C 47 3 Updated Nov 24, 2025

Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS

C++ 32 Updated Feb 10, 2025

Official Pytorch implementations for "SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation" (NeurIPS 2022)

Python 863 90 Updated Nov 22, 2022

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,021 3,864 Updated Jul 23, 2024

[ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks

Python 15 2 Updated May 18, 2022

[ICLR 2020] Lite Transformer with Long-Short Range Attention

Python 610 82 Updated Jul 11, 2024

Code for "Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation"

Python 30 3 Updated Jun 15, 2025