How to Do Data Labeling, Versioning, and Management for ML

A case study of enriching food dataset

Magdalena Konkiewicz
Towards Data Science
8 min readSep 23, 2022

--

Introduction

It has been months ago when Toloka and ClearML met together to create this joint project. Our goal was to showcase to other ML practitioners how to first gather data and then version and manage data before it is fed to an ML model.

We believe that following those best practices will help others build better and more robust AI solutions. If you are curious, have a look at the project we have created…

--

--