Skip to content
View Hello123hello123's full-sized avatar
  • 06:44 (UTC +08:00)

Highlights

  • Pro

Block or report Hello123hello123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
114 results for source starred repositories
Clear filter

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 19,547 2,026 Updated Sep 11, 2025

Fully open reproduction of DeepSeek-R1

Python 25,412 2,372 Updated Sep 8, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 57,740 10,056 Updated Sep 11, 2025

Official implementation of paper "Query2Label: A Simple Transformer Way to Multi-Label Classification".

Python 446 68 Updated Mar 18, 2022

[NeurIPS 2020] Released code for Interventional Few-Shot Learning

Python 169 23 Updated Jul 28, 2021

[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide

7,531 498 Updated Sep 9, 2025

deep learning for image processing including classification and object-detection etc.

Python 25,390 8,223 Updated Jan 12, 2025

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 12,154 1,657 Updated Apr 7, 2025

Code for "Leveraging Bilateral Correlations for Multi-Label Few-Shot Learning" in TNNLS 2024.

Python 4 Updated Oct 23, 2024

Ready-to-use code and tutorial notebooks to boost your way into few-shot learning for image classification.

Python 1,245 169 Updated Nov 13, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,269 208 Updated Mar 5, 2024

The new spin-off of Visual Language Navigation.

25 Updated Jul 7, 2025

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

539 24 Updated May 2, 2024

Code for the Molmo Vision-Language Model

Python 744 71 Updated Dec 12, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 8,866 908 Updated Aug 12, 2024

✨✨Latest Advances on Multimodal Large Language Models

16,224 1,055 Updated Sep 4, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,949 129 Updated Oct 30, 2024

Diffusion model papers, survey, and taxonomy

3,245 269 Updated Jun 13, 2025

本人的科研经验

7,414 427 Updated Aug 12, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,216 461 Updated Aug 7, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 12,445 962 Updated May 15, 2025
Jupyter Notebook 6 Updated Sep 22, 2023

A summary of recent semi-supervised semantic segmentation methods

246 26 Updated Aug 7, 2025

OpenMMLab Foundational Library for Training Deep Learning Models

Python 1,370 414 Updated Aug 17, 2025

A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..

710 31 Updated Sep 9, 2025

[ECCV 2024] Tokenize Anything via Prompting

Jupyter Notebook 593 24 Updated Dec 11, 2024

Fast and memory-efficient exact attention

Python 19,431 1,982 Updated Sep 6, 2025

[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation

Python 103 5 Updated Mar 26, 2025

Single-Stage Semantic Segmentation from Image Labels (CVPR 2020)

Python 383 42 Updated Nov 10, 2021

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

Python 629 55 Updated Nov 28, 2024
Next