Inès Hyeonsu Kim1* · Seokju Cho1* · Jiahui Huang2 · Jung Yi1 · Joon-Young Lee2 · Seungryong Kim1
1KAIST AI 2Adobe Research
CVPR 2025
Point tracking models often rely on feature backbones that lack temporal awareness, requiring computationally expensive refiners to correct errors and ensure coherence across frames. What if your backbone itself could model long-term temporal dynamics?
✨ Introducing Chrono – a novel feature backbone designed for point tracking, integrating a long-range temporal adapter for enhanced temporal consistency and efficiency.
🔨 Filling the Gap: Chrono addresses the lack of temporally-aware feature backbones and reduces reliance on expensive refinement processes.
⏳ Long-Range Temporal Awareness: Our temporal adapter enables feature extraction with extended temporal context, improving tracking quality.
⚡ Smooth & Efficient Tracking: Chrono produces smoother initial tracks in a simple and effective manner, reducing the need for refiners.
📈 Refiner-Free Performance: Chrono achieves accuracy comparable to refiner-based pipelines, proving that temporally-aware features can be just as effective.
Prepare the environment by cloning the repository and installing the required dependencies:
git clone https://github.com/google-research/kubric.git
conda create -y -n chrono python=3.11
conda activate chrono
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
pip3 install -U xformers --index-url https://download.pytorch.org/whl/cu118
pip3 install -U lightning tensorflow_datasets tensorflow matplotlib mediapy tensorflow_graphics einops wandbFirst, download the evaluation datasets:
# TAP-Vid-DAVIS dataset
wget https://storage.googleapis.com/dm-tapnet/tapvid_davis.zip
unzip tapvid_davis.zip
# TAP-Vid-RGB-Stacking dataset
wget https://storage.googleapis.com/dm-tapnet/tapvid_rgb_stacking.zip
unzip tapvid_rgb_stacking.zipFor downloading TAP-Vid-Kinetics, please refer to official TAP-Vid repository.
To evaluate Chrono on the benchmarks, first download the pre-trained weights.
| Model | Pre-trained Weights |
|---|---|
| Chrono (ViT-S) | Link |
| Chrono (ViT-B) | Link |
You can download the weights using the following commands:
pip install gdown
gdown 1Q-rqNl1ZkYhH4UtOjwcMH0oKkcCxMi7K
gdown 1XYOr5pVncEAgyWcQZ_TjgvqLTcexdUQrIn config/dino.ini (or any other config file), add the path to the evaluation datasets to [TRAINING]-val_dataset_path. Additionally, adjust the model size for evaluation in [MODEL]-model_kwargs-model_size.
To evaluate the Chrono, use the experiment.py script with the following command-line arguments:
python experiment.py --config config/dino.ini --mode eval_{dataset_to_eval_1}_..._{dataset_to_eval_N}[_q_first] --ckpt_path /path/to/checkpoint --save_path ./path_to_save_checkpoints/--config: Specifies the path to the configuration file. Default isconfig/dino.ini.--mode: Specifies the mode to run the script. Useevalto perform evaluation. You can also include additional options for query first mode (q_first), and the name of the evaluation datasets. For example:- Evaluation of the DAVIS dataset:
eval_davis - Evaluation of DAVIS and Kinetics in query first mode:
eval_davis_kinetics_q_first
- Evaluation of the DAVIS dataset:
--ckpt_path: Specifies the path to the checkpoint file. If not provided, the script will use the default checkpoint.--save_path: Specifies the path to save logs.
Replace /path/to/checkpoint with the actual path to your checkpoint file. This command will run the evaluation process and save the results in the specified save_path.
Download the panning-MOVi-E dataset used for training (approximately 273GB) from Huggingface using the following script. Git LFS should be installed to download the dataset. To install Git LFS, please refer to this link. Additionally, downloading instructions for the Huggingface dataset are available at this link.
git clone git@hf.co:datasets/hamacojr/LocoTrack-panning-MOVi-EAdd the path to the downloaded panning-MOVi-E to the [TRAINING]-kubric_dir entry in config/dino.ini (or any other config file). Then, run the training with the following script:
python experiment.py --config config/dino.ini --mode train_davis --save_path ./path_to_save_checkpoints/Please use the following bibtex to cite our work:
@article{kim2025exploring,
title={Exploring Temporally-Aware Features for Point Tracking},
author={Kim, In{\`e}s Hyeonsu and Cho, Seokju and Huang, Jiahui and Yi, Jung and Lee, Joon-Young and Kim, Seungryong},
journal={arXiv preprint arXiv:2501.12218},
year={2025}
}
This project is largely based on the TAP repository and LocoTrack repository. Thanks to the authors for their invaluable work and contributions.