🎯
Focusing
  • The University of Hong Kong
  • Hong Kong
  • 18:44 (UTC +08:00)

Highlights

  • Pro

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 181,673 46,259 Updated Feb 6, 2026

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 34,048 7,895 Updated Nov 17, 2025

A collection of CVPR 2025 papers and open-source projects

21,848 2,779 Updated Jul 2, 2025

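Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow by using ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.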

Python 19,248 1,949 Updated Nov 19, 2025

A markdown version emoji cheat sheet

TypeScript 13,596 4,601 Updated Feb 6, 2026

Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.

Python 9,118 1,825 Updated Apr 22, 2022

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python 8,219 1,344 Updated Jul 23, 2024

PointNet and PointNet++ implemented in PyTorch (pure Python), for ModelNet, ShapeNet and S3DIS.

Python 4,712 1,002 Updated Apr 24, 2024

Awesome Incremental Learning

4,391 625 Updated Jan 29, 2026

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,353 377 Updated Feb 3, 2026

A collection of loss functions for medical image segmentation

Python 3,998 613 Updated Nov 1, 2023

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,793 273 Updated Feb 13, 2025

A collection of papers on transformers for vision. Awesome Transformer with Computer Vision (CV)

3,567 400 Updated Jan 7, 2025

An Index for Medical Imaging Datasets

3,465 426 Updated Aug 15, 2024

VMamba: Visual State Space Models; the code is based on Mamba

Python 3,039 219 Updated Mar 7, 2025

DeepLab v3+ model in PyTorch. Support different backbones.

Python 3,004 778 Updated Aug 4, 2024

Pointcept: Perceive the world with sparse points, a codebase for point cloud perception research. Latest works: Concerto (NeurIPS'25), Sonata (CVPR'25 Highlight), PTv3 (CVPR'24 Oral)

Python 2,814 341 Updated Feb 4, 2026

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Python 2,788 259 Updated Mar 25, 2025

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 2,582 5,268 Updated Feb 6, 2026

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

Python 2,260 373 Updated Oct 17, 2024

PaperBanana: Automating Academic Illustration For AI Scientists

JavaScript 2,068 92 Updated Feb 2, 2026

SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.

Python 1,898 295 Updated Feb 4, 2026

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python 1,674 159 Updated Dec 8, 2023

Awesome Papers related to Mamba.

1,388 74 Updated Oct 17, 2024

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

Python 1,214 103 Updated Sep 2, 2023

[CVPR 2024 & TPAMI 2025] UniRepLKNet

Python 1,065 60 Updated Aug 10, 2025

Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)

Python 938 92 Updated Apr 24, 2024
Python 821 56 Updated Oct 19, 2023

A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.

749 38 Updated Dec 1, 2025