Cosmos-Reason2 models understand physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning.

Python 422 88 Updated Jun 7, 2026

Geekgineer / YOLOs-CPP-TensorRT

YOLOs-TRT is a header-only C++ library for running all YOLO models, across all tasks, with NVIDIA TensorRT on CUDA GPUs and Jetson. It features GPU preprocessing (letterbox/normalize/HWC→NCHW), CUDA Gr…

C++ 69 3 Updated Apr 28, 2026

Intellindust-AI-Lab / DEIMv2

[DEIMv2] Real Time Object Detection Meets DINOv3

Jupyter Notebook 1,870 201 Updated Mar 24, 2026

ahmetkumass / yolo-gen

Train YOLO + VLM with one command. Auto-generate vision-language training data from YOLO labels - no extra labeling needed.

Python 33 7 Updated Apr 21, 2026

cheng-haha / ScConv

🕹️SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy

Python 335 18 Updated Aug 14, 2024

LeapLabTHU / CODA

CODA: Repurposing Continuous VAEs for Discrete Tokenization

Python 36 3 Updated Jul 4, 2025

tiiuae / Falcon-Perception

Inference repo for the Falcon-Perception and Falcon-OCR models: early-fusion, natively multimodal, dense autoregressive Transformer models.

Python 720 68 Updated Apr 27, 2026

open-edge-platform / anomalib

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Python 5,856 944 Updated Jun 18, 2026

mlzxy / devit

CoRL 2024

Python 481 60 Updated Oct 29, 2024

microsoft / RegionCLIP

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Python 816 58 Updated Mar 20, 2024

xiaobiaodu / Mobile-GS

[ICLR 2026] Mobile-GS: Real-time Gaussian Splatting for Mobile Devices

Python 290 37 Updated Mar 30, 2026

Robbyant / lingbot-map

A feed-forward 3D foundation model for reconstructing scenes from streaming data

Python 7,257 712 Updated Jun 17, 2026

Topdu / OpenOCR

OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful re…

Python 1,369 129 Updated May 20, 2026

visinf / INSID3

[CVPR 2026 Oral] "INSID3: Training-Free In-Context Segmentation with DINOv3"

Python 586 51 Updated May 29, 2026

Genesis-Embodied-AI / genesis-world

Simulation platform for general-purpose robotics & embodied AI learning.

Python 29,372 2,786 Updated Jun 17, 2026

facebookresearch / EUPE

Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized experts across multiple task domains.

Python 666 38 Updated Apr 14, 2026

cvg / resplat

ReSplat: Learning Recurrent Gaussian Splatting

Python 276 22 Updated Mar 24, 2026

HenryNdubuaku / maths-cs-ai-compendium

Become a cracked AI/ML Research Engineer

TypeScript 4,535 629 Updated Jun 14, 2026

Tencent / YOLO-Master

[CVPR2026]🚀🚀🚀Official code for the paper "YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection." *(YOLO = You Only Look Once)* 🔥🔥🔥

Python 540 69 Updated Jun 17, 2026

zai-org / GLM-OCR

GLM-OCR: Accurate × Fast × Comprehensive

Python 6,989 643 Updated Apr 21, 2026

FriedFeid / OnlineVideoDepthAnything

Python 10 Updated Mar 9, 2026

MiracleDance / CAR

CAR: Controllable AutoRegressive Modeling for Visual Generation

Python 129 3 Updated Nov 29, 2024

Eyeline-Labs / FlashDepth

The official implementation of ICCV'25 paper "FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution"

Python 393 28 Updated Sep 7, 2025

jerpelhan / GECO2

Official implementation of GeCo2 (AAAI 2026) -- Generalized-Scale Object Counting with Gradual Query Aggregation

Jupyter Notebook 150 22 Updated Apr 13, 2026

Stars

mcarfagno / mpc_python

NVlabs / Eagle

leochlon / ntkmirror

bytedance / SIFThinker

JIA-Lab-research / VisionReasoner

rohitg00 / ai-engineering-from-scratch

nvidia-cosmos / cosmos-reason2