Exploring Temporally-Aware Features for Point Tracking

Inès Hyeonsu Kim^1* · Seokju Cho^1* · Jiahui Huang² · Jung Yi¹ · Joon-Young Lee² · Seungryong Kim¹

¹KAIST AI ²Adobe Research

CVPR 2025

Point tracking models often rely on feature backbones that lack temporal awareness, requiring computationally expensive refiners to correct errors and ensure coherence across frames. What if your backbone itself could model long-term temporal dynamics?

✨ Introducing Chrono – a novel feature backbone designed for point tracking, integrating a long-range temporal adapter for enhanced temporal consistency and efficiency.

🔍 Why Chrono?

🔨 Filling the Gap: Chrono addresses the lack of temporally-aware feature backbones and reduces reliance on expensive refinement processes.

⏳ Long-Range Temporal Awareness: Our temporal adapter enables feature extraction with extended temporal context, improving tracking quality.

⚡ Smooth & Efficient Tracking: Chrono produces smoother initial tracks in a simple and effective manner, reducing the need for refiners.

📈 Refiner-Free Performance: Chrono achieves accuracy comparable to refiner-based pipelines, proving that temporally-aware features can be just as effective.

Environment

Prepare the environment by cloning the repository and installing the required dependencies:

git clone https://github.com/google-research/kubric.git

conda create -y -n chrono python=3.11
conda activate chrono

pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
pip3 install -U xformers --index-url https://download.pytorch.org/whl/cu118
pip3 install -U lightning tensorflow_datasets tensorflow matplotlib mediapy tensorflow_graphics einops wandb

Evaluation

0. Evaluation Dataset Preparation

First, download the evaluation datasets:

# TAP-Vid-DAVIS dataset
wget https://storage.googleapis.com/dm-tapnet/tapvid_davis.zip
unzip tapvid_davis.zip

# TAP-Vid-RGB-Stacking dataset
wget https://storage.googleapis.com/dm-tapnet/tapvid_rgb_stacking.zip
unzip tapvid_rgb_stacking.zip

For downloading TAP-Vid-Kinetics, please refer to official TAP-Vid repository.

1. Download Pre-trained Weights

To evaluate Chrono on the benchmarks, first download the pre-trained weights.

Model	Pre-trained Weights
Chrono (ViT-S)	Link
Chrono (ViT-B)	Link

You can download the weights using the following commands:

pip install gdown

gdown 1Q-rqNl1ZkYhH4UtOjwcMH0oKkcCxMi7K
gdown 1XYOr5pVncEAgyWcQZ_TjgvqLTcexdUQr

2. Adjust the Config File

In config/dino.ini (or any other config file), add the path to the evaluation datasets to [TRAINING]-val_dataset_path. Additionally, adjust the model size for evaluation in [MODEL]-model_kwargs-model_size.

3. Run Evaluation

To evaluate the Chrono, use the experiment.py script with the following command-line arguments:

python experiment.py --config config/dino.ini --mode eval_{dataset_to_eval_1}_..._{dataset_to_eval_N}[_q_first] --ckpt_path /path/to/checkpoint --save_path ./path_to_save_checkpoints/

--config: Specifies the path to the configuration file. Default is config/dino.ini.
--mode: Specifies the mode to run the script. Use eval to perform evaluation. You can also include additional options for query first mode (q_first), and the name of the evaluation datasets. For example:
- Evaluation of the DAVIS dataset: eval_davis
- Evaluation of DAVIS and Kinetics in query first mode: eval_davis_kinetics_q_first
--ckpt_path: Specifies the path to the checkpoint file. If not provided, the script will use the default checkpoint.
--save_path: Specifies the path to save logs.

Replace /path/to/checkpoint with the actual path to your checkpoint file. This command will run the evaluation process and save the results in the specified save_path.

Training

Training Dataset Preparation

Download the panning-MOVi-E dataset used for training (approximately 273GB) from Huggingface using the following script. Git LFS should be installed to download the dataset. To install Git LFS, please refer to this link. Additionally, downloading instructions for the Huggingface dataset are available at this link.

git clone git@hf.co:datasets/hamacojr/LocoTrack-panning-MOVi-E

Training Script

Add the path to the downloaded panning-MOVi-E to the [TRAINING]-kubric_dir entry in config/dino.ini (or any other config file). Then, run the training with the following script:

python experiment.py --config config/dino.ini --mode train_davis --save_path ./path_to_save_checkpoints/

📚 Citing this Work

Please use the following bibtex to cite our work:

@article{kim2025exploring,
  title={Exploring Temporally-Aware Features for Point Tracking},
  author={Kim, In{\`e}s Hyeonsu and Cho, Seokju and Huang, Jiahui and Yi, Jung and Lee, Joon-Young and Kim, Seungryong},
  journal={arXiv preprint arXiv:2501.12218},
  year={2025}
}

🙏 Acknowledgement

This project is largely based on the TAP repository and LocoTrack repository. Thanks to the authors for their invaluable work and contributions.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
config		config
data		data
models		models
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
experiment.py		experiment.py
model_utils.py		model_utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Exploring Temporally-Aware Features for Point Tracking

🔍 Why Chrono?

Environment

Evaluation

0. Evaluation Dataset Preparation

1. Download Pre-trained Weights

2. Adjust the Config File

3. Run Evaluation

Training

Training Dataset Preparation

Training Script

📚 Citing this Work

🙏 Acknowledgement

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

cvlab-kaist/Chrono

Folders and files

Latest commit

History

Repository files navigation

Exploring Temporally-Aware Features for Point Tracking

🔍 Why Chrono?

Environment

Evaluation

0. Evaluation Dataset Preparation

1. Download Pre-trained Weights

2. Adjust the Config File

3. Run Evaluation

Training

Training Dataset Preparation

Training Script

📚 Citing this Work

🙏 Acknowledgement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages