2.5M Data Samples in OmniArt

- 1 min read

Series: Early Phd Blog

Today OmniArt hit the 2.5M mark in the number of data samples it contains. Our system is listening for changes as you are reading this post and expanding the dataset even further. In the current form, OmniArt features more than 2 million different faces in paintings, sketches and drawing. Our model’s gender estimation is that 70% of the faces are male and 30% are female. Female portraits mostly contain more than one person in their content while male subjects are mostly alone.

We are working hard on improving the quality of the dataset and providing extensive information on the entities contained within.

If you use this dataset in your research, make sure to cite this paper:

and we are interested in your feedback.