Skip to content
View zyuanbing's full-sized avatar

Highlights

  • Pro

Block or report zyuanbing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

178,016 18,174 Updated Apr 20, 2026

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 12,301 1,126 Updated Jun 18, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,312 79,408 Updated Jun 18, 2026

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 11,282 1,160 Updated Jun 18, 2026

An autonomous agent that conducts deep research on any data using any LLM providers

Python 27,771 3,743 Updated May 28, 2026

The best ChatGPT that $100 can buy.

Python 55,182 7,586 Updated May 5, 2026

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,316 246 Updated Jun 8, 2026

Trainable Compression For LLMs

8 1 Updated Mar 3, 2025

Paper list for Efficient Reasoning.

889 45 Updated May 29, 2026

[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filli…

Python 1,221 78 Updated Apr 8, 2026

Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.

448 29 Updated Jun 17, 2026

[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation

Python 118 5 Updated Mar 26, 2025

A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..

880 39 Updated May 20, 2026

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 19,378 2,477 Updated May 30, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,239 18,185 Updated Jun 18, 2026

Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

Python 616 28 Updated Sep 27, 2024

[ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning

Python 49 5 Updated May 12, 2024

FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)

Python 51 4 Updated Aug 28, 2024

Official repository of paper "Subobject-level Image Tokenization" (ICML-25)

Python 93 9 Updated Jul 4, 2025

Recent LLM-based CV and related works. Welcome to comment/contribute!

869 39 Updated Mar 8, 2025

Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024

Python 71 7 Updated Jun 14, 2024

A curated list of Large Language Model (LLM) Interpretability resources.

1,539 114 Updated Feb 24, 2026

[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models

Python 350 19 Updated Dec 1, 2025

The official Meta Llama 3 GitHub site

Python 29,287 3,528 Updated Jan 26, 2025

PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation"

Python 78 7 Updated Sep 23, 2024

official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"

Jupyter Notebook 234 28 Updated Jun 1, 2025

DiffSeg is an unsupervised zero-shot segmentation method using attention information from a stable-diffusion model. This repo implements the main DiffSeg algorithm and additionally includes an expe…

Jupyter Notebook 330 28 Updated Jul 9, 2024

Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…

Python 495 23 Updated Jun 8, 2026
Next