The Big Book of Data Engineering

A collection of technical blogs, including code samples and notebooks

Data labeling for ML

Ebooks

This is a must-have e-book for anyone doing data labeling for Machine Learning/AI. It is about the pros, cons, and use cases of two popular data labeling approaches from our partner Toloka.


It covers the following topics:

-How data labeling is carried out today

-The pros and cons of in-house vs. crowdsourced data labeling:

  • Real-life use cases: crowdsourced data labeling

  • Identity verification: ID R&D 8

  • Geo-analysis and hot spot prediction: BestPlace

  • E-commerce catalogue localization: AliExpress 10

-Questions to consider when choosing a data-labeling method