-
University of Alberta
- Edmonton
- https://cjiang2.github.io/
Highlights
- Pro
Stars
Python bindings for real-time control of Franka Emika robots.
NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024
Run Segment Anything Model 2 on a live video stream
zdata-inc / sam2_realtime
Forked from facebookresearch/sam2The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2) in real-time.
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).
Target Refocusing via Attention Redistribution for Open-Vocabulary Semantic Segmentation: An Explainability Perspective (AAAI 2026)
Learning to use this device. You'll find edge AI applications and useful commands to optimize your Jetson
Repository for "PointSt3R: Point Tracking through 3D Grounded Correspondence"
[RSS 2025] CLIP-RT : Learning Language-Conditioned Robotic Policies from Natural Language Supervision
Benchmarking Knowledge Transfer in Lifelong Robot Learning
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"
Web-based 3D visualization + Python
SegDINO: An Efficient Design for Medical and Natural Image Segmentation with DINO-V3
Completed research on semantic retrieval augmented generation through novel semantic similarity graph traversal algorithms.
EntityNet: Using Knowledge Graphs to harvest datasets for efficient CLIP model training
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
[DEIMv2] Real Time Object Detection Meets DINOv3
Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)
Codebase for Kinova Gen3 training in Isaac Lab, sim2real transfer and ROS deployment.
VLA-0: Building State-of-the-Art VLAs with Zero Modification
Official PyTorch Implementation of "Latent Diffusion Model Without Variational Autoencoder".
The simplest, fastest repository for training/finetuning medium-sized GPTs.
(ICML 2024) Spider: A Unified Framework for Context-dependent Concept Segmentation