- University of Chinese Academy of Sciences
- Beijing
- (UTC +08:00)
- https://luoxubo.github.io/
- @xubo_luo
- in/xubo-luo-1124a31b3
Highlights
- Pro
Lists (25)
Attention mechanism
Autonomous driving
clip
Efficiency
ekf
Event Camera
Facial expression recognition
flow matching
Homography Estimation
IELTS
Image fusion
Image matching
Some nice image matching related works
Image retrieval
Lab homepage
Some nice templates of homepage of labs
Learning
Multi-sensor localization
NeRF
Paper codes
Pose estimation
Segmentation
SLAM with deep learning
Tracking
Object tracking repos.
TTT
Test Time Training
Visual localization
world model
Starred repositories
😎 Awesome lists about all kinds of interesting topics
All Algorithms implemented in Python
List of Computer Science courses with video lectures.
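"Dive into Deep Learning": aimed at Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.
"Deep Learning 500 Questions": explains frequently asked questions on probability, linear algebra, machine learning, deep learning, computer vision, and other hot topics in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued. For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06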
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻 A list of projects by independent developers in China -- sharing what everyone is working on
Collection of Summer 2026 tech internships!
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Official Code for DragGAN (SIGGRAPH 2023)
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
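"A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment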
Python sample codes and textbook for robotics algorithms.
A generative world for general-purpose robotics & embodied AI learning.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
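Use ChatGPT to summarize arXiv papers. Accelerates the whole research workflow, using ChatGPT for full-paper summarization, professional translation, polishing, reviewing, and review responses.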
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
A beautiful, simple, clean, and responsive Jekyll theme for academics
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
PyTorch code and models for the DINOv2 self-supervised learning method.
[Lumina Embodied AI Community] Embodied AI technical guide: Embodied-AI-Guide