Data Version Control Blog

Insights and updates from the DVC team. Explore best practices in data versioning, machine learning workflows, and model management. Stay informed with our latest news, tutorials, and community highlights.

A Shared Vision for the Future of DVC

Dmitry Petrov
Dmitry Petrov
November 18, 2025
4 minutes read
DVC Joins lakeFS: Your Questions Answered!
Jeny De Figueiredo
Jeny De Figueiredo
November 18, 2025
5 minutes read
Transforming a Jupyter Notebook into a Reproducible Pipeline for Experiments with DVC
Rob De Wit shares his Pokémon Generator project to demonstrate how you can move from a Jupyter Notebook prototype to a production-ready pipeline with DVC.
Rob de Wit
Rob de Wit
October 30, 2025
8 minutes read
Community Spotlight: Akash Mane – How to Label and Rotate Training and Feedback Datasets Cleanly
Raise your labeling standards with this great guide from Community member, Akash Mane!
Akash Mane
Akash Mane
October 7, 2025
28 minutes read
Tutorial: Scalable and Distributed ML Workflows with DVC and Ray on AWS (Part 2)
Need to setup DVC to work with Ray Cluster on AWS? This tutorial has you covered!
Mikhail Rozhkov
Mikhail Rozhkov
March 13, 2024
19 minutes read
Tutorial: Scalable and Distributed ML Workflows with DVC and Ray (Part 1)
This tutorial introduces you to integrating DVC (Data Version Control) with Ray, turning them into your go-to toolkit for creating automated, scalable, and distributed ML pipelines.
Mikhail Rozhkov
Mikhail Rozhkov
March 12, 2024
20 minutes read
Running DVC on a SLURM cluster
Learn how Exscientia uses DVC experiments on a cloud-deployed SLURM cluster to scale their ML experimentation.
Dom Miketa
Dom Miketa
March 11, 2024
11 minutes read