Back to Engineering

OmniArt

2018

Researcher • University of Amsterdam

DatasetComputer VisionArtMulti-Task LearningACM TOMM

Overview

Baselines are the starting point of any quantitative multimedia research, and benchmarks are essential for pushing those baselines further. OmniArt is a large-scale artistic benchmark dataset featuring over 2 million images with rich structured metadata, aggregated from multiple museum collections around the world.

OmniArt contains annotations for dozens of attribute types and features semantic context information through concepts, IconClass labels, color information, and object-level bounding boxes. The dataset establishes baseline scores on multiple tasks such as artist attribution, creation period estimation, type, style, and school prediction.

In addition to metadata-related experiments, we explore the color spaces of art through different types and evaluate a transfer learning object recognition pipeline. The dataset was designed for seamless integration with popular deep learning frameworks including PyTorch, Keras, and TensorFlow.

Dataset Statistics

2M+
Images
10+
Museums
7
Benchmark Tasks
256px
Pre-resized

Metadata Types

ArtistTitleDateMediumDimensionsStyleGenreSchoolIconClass LabelsColor HistogramsObject Bounding BoxesConcepts

Featured Collections

OmniArt aggregates artwork from some of the world's most prestigious institutions:

Rijksmuseum, Amsterdam
Metropolitan Museum of Art, NYC
Museum of Modern Art, NYC
Van Gogh Museum
British National Gallery
Carnegie Museum of Art
WikiArt
Web Gallery of Art
Google Art Project

Benchmark Tasks

The dataset includes seven specialized subsets for evaluating different aspects of artistic understanding:

Artist Attribution: Identify the creator of an artwork

Artwork Type: Classify the medium (painting, sculpture, print, etc.)

Genre Classification: Categorize by subject matter (portrait, landscape, still life)

Creation Period: Estimate the historical period of creation

Geographical Origin: Determine the region of origin

Style Classification: Identify artistic movement (Impressionism, Baroque, etc.)

Creation Year Regression: Predict the exact year of creation

ArtSight Exploration Tool

The integrated ArtSight visualization tool allows interactive navigation of the entire collection. Explore artworks by:

HueSaturationLuminescenceCreation PeriodGenreArtwork TypeDepicted Objects

Framework Support

OmniArt provides custom data loaders for seamless integration with popular deep learning frameworks:

PyTorch
Keras
TensorFlow

Publications

OmniArt: A Large-scale Artistic Benchmark
G. Strezoski, M. Worring
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)
Volume 14, Issue 4, Article 88, November 2018, Pages 1-21
ACM DLDOI: 10.1145/3273022
OmniArt: Multi-task Deep Learning for Artistic Data Analysis
G. Strezoski, M. Worring
arXiv preprint, August 2017
arXiv

Citation

@article{strezoski2018omniart,
  title     = {OmniArt: A Large-scale Artistic Benchmark},
  author    = {Strezoski, Gjorgji and Worring, Marcel},
  journal   = {ACM Trans. Multimedia Comput. Commun. Appl.},
  volume    = {14},
  number    = {4},
  articleno = {88},
  pages     = {1--21},
  year      = {2018},
  publisher = {ACM},
  doi       = {10.1145/3273022}
}