Skip to content
View japb11's full-sized avatar

Block or report japb11

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A simple MPC controller for path tracking implemented in python

Jupyter Notebook 437 72 Updated Jun 17, 2026

Eagle: Frontier Vision-Language Models with Data-Centric Strategies

Python 2,622 239 Updated Jun 15, 2026
Python 341 48 Updated Jun 9, 2026

SIFThinker: Spatially-Aware Image Focus for Visual Reasoning

Python 13 1 Updated Dec 2, 2025

[ICLR 2026] VisionReasoner: Unified Reasoning-Integrated Visual Perception via Reinforcement Learning

Python 348 15 Updated Feb 9, 2026

Learn it. Build it. Ship it for others.

Python 34,289 5,582 Updated Jun 14, 2026

Cosmos-Reason2 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 422 88 Updated Jun 7, 2026

YOLOs-TRT is a header-only C++ library for running all YOLO models with all tasks with NVIDIA TensorRT on CUDA GPUs and Jetson. It features GPU preprocessing (letterbox/normalize/HWC→NCHW), CUDA Gr…

C++ 69 3 Updated Apr 28, 2026

[DEIMv2] Real Time Object Detection Meets DINOv3

Jupyter Notebook 1,870 201 Updated Mar 24, 2026

Train YOLO + VLM with one command. Auto-generate vision-language training data from YOLO labels - no extra labeling needed.

Python 33 7 Updated Apr 21, 2026

🕹️SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy

Python 335 18 Updated Aug 14, 2024

CODA: Repurposing Continuous VAEs for Discrete Tokenization

Python 36 3 Updated Jul 4, 2025

Inference repo for Falcon-Perception and Falcon-OCR model, early-fusion, natively multimodal, dense Autoregressive Transformer models.

Python 720 68 Updated Apr 27, 2026

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Python 5,856 944 Updated Jun 18, 2026

CoRL 2024

Python 481 60 Updated Oct 29, 2024

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Python 816 58 Updated Mar 20, 2024

[ICLR 2026] Mobile-GS: Real-time Gaussian Splatting for Mobile Devices

Python 290 37 Updated Mar 30, 2026

A feed-forward 3D foundation model for reconstructing scenes from streaming data

Python 7,257 712 Updated Jun 17, 2026

OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful re…

Python 1,369 129 Updated May 20, 2026

[CVPR 2026 Oral] "INSID3: Training-Free In-Context Segmentation with DINOv3"

Python 586 51 Updated May 29, 2026

Simulation platform for general-purpose robotics & embodied AI learning.

Python 29,372 2,786 Updated Jun 17, 2026

Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized experts across multiple task domains.

Python 666 38 Updated Apr 14, 2026

ReSplat: Learning Recurrent Gaussian Splatting

Python 276 22 Updated Mar 24, 2026

Become a cracked AI/ML Research Engineer

TypeScript 4,535 629 Updated Jun 14, 2026

[CVPR2026]🚀🚀🚀Official code for the paper "YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection." *(YOLO = You Only Look Once)* 🔥🔥🔥

Python 540 69 Updated Jun 17, 2026

GLM-OCR: Accurate × Fast × Comprehensive

Python 6,989 643 Updated Apr 21, 2026

CAR: Controllable AutoRegressive Modeling for Visual Generation

Python 129 3 Updated Nov 29, 2024

The official implementation of ICCV'25 paper "FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution"

Python 393 28 Updated Sep 7, 2025

Official implementation of GeCo2 (AAAI 2026) -- Generalized-Scale Object Counting with Gradual Query Aggregation

Jupyter Notebook 150 22 Updated Apr 13, 2026
Next