Starred repositories
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
TurboQuant: Near-optimal KV cache quantization for LLM inference (3-bit keys, 2-bit values) with Triton kernels + vLLM integration
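The TurboQuant description mentions 3-bit keys and 2-bit values. As a rough illustration of what low-bit KV cache quantization means, here is a minimal symmetric round-to-nearest sketch in NumPy; it is a generic toy, not TurboQuant's actual (near-optimal) scheme or its Triton kernels:

```python
import numpy as np

def quantize(x, bits):
    # Symmetric uniform quantization to `bits` bits (toy per-tensor scale;
    # real KV quantizers typically use per-channel or per-token scales).
    qmax = 2 ** (bits - 1) - 1
    scale = np.max(np.abs(x)) / qmax
    q = np.clip(np.round(x / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Recover approximate float values from the integer codes.
    return q.astype(np.float32) * scale

np.random.seed(0)
keys = np.random.randn(4, 8).astype(np.float32)  # stand-in for cached keys
qk, sk = quantize(keys, 3)                       # 3-bit keys, as in the blurb
recon = dequantize(qk, sk)
max_err = np.abs(keys - recon).max()             # bounded by half a quant step
```

With round-to-nearest, the reconstruction error for in-range values is at most `scale / 2`, which is why fewer bits (a larger step) trade memory for accuracy.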
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skills, subagents, and a message gateway, it handles different levels of…
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
An open-source implementation for fine-tuning the Qwen-VL series by Alibaba Cloud.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Strong and Open Vision Language Assistant for Mobile Devices
My learning notes for ML SYS.
[IEEE TCSVT'26] 🂡 AceVFI: A Comprehensive Survey of Advances in Video Frame Interpolation
This repository contains low-bit quantization papers from 2020 to 2025 at top conferences.
An official implementation of "Scheduling Weight Transitions for Quantization-Aware Training" (ICCV 2025) in PyTorch.
Virtual whiteboard for sketching hand-drawn like diagrams
[Information Fusion 2025] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.
DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space
Efficient vision foundation models for high-resolution generation and perception.
EfficientSAM3 compresses SAM3 into lightweight, edge-friendly models via progressive knowledge distillation for fast promptable concept segmentation and tracking.
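EfficientSAM3's "progressive knowledge distillation" builds on the standard distillation objective: train a small student to match a large teacher's softened output distribution. A minimal sketch of that loss (the classic temperature-scaled KL term, not EfficientSAM3's specific pipeline):

```python
import numpy as np

def softmax(z, T=1.0):
    # Temperature-scaled softmax; higher T softens the distribution.
    z = z / T
    z = z - z.max(axis=-1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, T=2.0):
    # KL(teacher || student) at temperature T, scaled by T^2 so gradient
    # magnitudes stay comparable across temperatures (Hinton-style KD).
    p = softmax(teacher_logits, T)
    q = softmax(student_logits, T)
    return float((p * (np.log(p) - np.log(q))).sum(axis=-1).mean() * T * T)
```

The loss is zero when student and teacher logits agree and grows as their softened distributions diverge; "progressive" schemes typically apply such a loss stage by stage through the network.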
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
A curated list of foundation models for vision and language tasks
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
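The core loop behind speculative decoding: a cheap draft model proposes a block of tokens, and the target model verifies them, keeping the longest agreeing prefix plus its own correction at the first mismatch. A toy greedy version with hypothetical stand-in models (real systems like SGLang verify probabilistically and batch the target's forward pass):

```python
def draft_model(prefix):
    # Hypothetical cheap model: just repeats the last token.
    return prefix[-1]

def target_model(prefix):
    # Hypothetical strong model: repeats the last token on even-length
    # prefixes, emits 0 otherwise (arbitrary, for illustration only).
    return prefix[-1] if len(prefix) % 2 == 0 else 0

def speculative_step(prefix, k=4):
    # 1) Draft proposes k tokens autoregressively.
    proposal = list(prefix)
    for _ in range(k):
        proposal.append(draft_model(proposal))
    # 2) Target verifies each proposed position in order.
    accepted = list(prefix)
    for i in range(len(prefix), len(proposal)):
        t = target_model(proposal[:i])
        if t == proposal[i]:
            accepted.append(t)          # draft token accepted
        else:
            accepted.append(t)          # target's correction replaces the
            break                       # first mismatch; rest is discarded
    return accepted
```

For example, `speculative_step([1, 1])` accepts one drafted `1` (the target agrees there) and then substitutes the target's `0` at the first disagreement. When the draft agrees often, one target verification pass advances the sequence by several tokens.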
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
yangrudan / CUDA-Learn-Note
Forked from xlite-dev/LeetCUDA 🎉 CUDA notes / a compilation of frequently asked interview questions / C++ notes; personal notes, updated occasionally: sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.
This repository offers tools and guidance for fine-tuning the Siglip2 Vision Transformer (ViT) model. It includes scripts and best practices to adapt the model for custom datasets and tasks. Design…
ACL 2025: Synthetic data generation pipelines for text-rich images.