- Institute of Automation, Chinese Academy of Sciences
- BEIJING, CHINA
- https://bitcats.github.io/
Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Code release for NeRF (Neural Radiance Fields)
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
PyTorch🍊🍉 is delicious, just eat it! 😋😋
CoTracker is a model for tracking any point (pixel) on a video.
Segment Anything in High Quality [NeurIPS 2023]
Open-source and strong foundation image recognition models.
GTSAM is a library of C++ classes that implement smoothing and mapping (SAM) in robotics and vision, using factor graphs and Bayes networks as the underlying computing paradigm rather than sparse m…
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
[ECCV'2024] Gaussian Grouping for open-world Anything reconstruction, segmentation and editing.
Code for "Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed", CVPR 2024
This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing the classic linear transformation of the convolution to learna…
[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation
Joint Deep Matcher for Points and Lines 🖼️💥🖼️ (ICCV 2023)
Implementation of the paper "DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients"
Joint deep network for feature line detection and description
MatterSim: A deep learning atomistic model across elements, temperatures and pressures.
OpenEQA: Embodied Question Answering in the Era of Foundation Models
[NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
ELSED: Enhanced Line SEgment Drawing
Dataset for MetaSLAM Challenge
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)
Code for the paper "Unsupervised Moving Object Segmentation with Atmospheric Turbulence"