Highlights
- Pro
Stars
Open-source Windows and Office activator featuring HWID, Ohook, TSforge, and Online KMS activation methods, along with advanced troubleshooting.
The Open-Source Data Annotation Platform
A Comprehensive Toolkit for High-Quality PDF Content Extraction
[CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models
Evaluation code for Ref-L4, a new REC benchmark in the LMM era
[CVPR'24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gra…
[ICML 2025] Official PyTorch implementation of LongVU
Densely Captioned Images (DCI) dataset repository.
🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire AI team.
📚 A collection of papers about Referring Image Segmentation.
[ECCV'24] Official Implementation of SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"
[CVPR 2024] PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding.
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
[NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrument, verb, target> labels for every surgical fine-grained act…
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Machine Learning Engineering Open Book
A curated list of recent diffusion models for video generation, editing, and various other applications.
The open source codebase powering HuggingChat
QA Bot for Hugging Face documentation to accelerate development within the ecosystem.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.