Generalized overlap measures for evaluation and validation in medical image analysis

Research output: Contribution to journal › Article › peer-review

590 Citations (Scopus)


Measures of overlap of labelled regions of images, such as the Dice and Tanimoto coefficients, have been extensively used to evaluate image registration and segmentation algorithms. Modern studies can include multiple labels defined on multiple images, yet most evaluation schemes report one overlap per labelled region, simply averaged over multiple images. In this paper, common overlap measures are generalized to measure the total overlap of ensembles of labels defined on multiple test images and to account for fractional labels using fuzzy set theory. This framework allows a single "figure-of-merit" to be reported that summarises the results of a complex experiment by image pair, by label or overall. A complementary measure of error, the overlap distance, is defined; it captures the spatial extent of the nonoverlapping part and is related to the Hausdorff distance computed on grey-level images. The generalized overlap measures are validated on synthetic images for which the overlap can be computed analytically, and are used as similarity measures in nonrigid registration of three-dimensional magnetic resonance imaging (MRI) brain images. Finally, a pragmatic segmentation ground truth is constructed by registering a magnetic resonance atlas brain to 20 individual scans, and used with the overlap measures to evaluate publicly available brain segmentation algorithms.
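The abstract does not state the formulas, so the following is only a minimal sketch of one way such a pooled, fuzzy overlap could be computed. It assumes the standard fuzzy-set convention in which intersection is the voxel-wise minimum and union the voxel-wise maximum of membership values, and it pools numerators and denominators over all labels and image pairs before dividing, rather than averaging per-label scores. The names `overlap_sums`, `generalized_overlap` and the optional per-label `weights` are hypothetical and chosen for illustration.

```python
from typing import List, Optional, Sequence, Tuple

# A fuzzy label map: one membership value in [0, 1] per voxel (flattened).
FuzzyLabel = Sequence[float]
# One image pair: a list of (label in image A, label in image B), one per label.
ImagePair = Sequence[Tuple[FuzzyLabel, FuzzyLabel]]


def overlap_sums(a: FuzzyLabel, b: FuzzyLabel) -> Tuple[float, float, float]:
    """Return (fuzzy intersection, fuzzy union, summed size) for one label."""
    if len(a) != len(b):
        raise ValueError("label maps must cover the same voxels")
    inter = sum(min(x, y) for x, y in zip(a, b))  # assumed fuzzy intersection
    union = sum(max(x, y) for x, y in zip(a, b))  # assumed fuzzy union
    return inter, union, sum(a) + sum(b)


def generalized_overlap(
    pairs: Sequence[ImagePair],
    weights: Optional[List[float]] = None,
) -> Tuple[float, float]:
    """Pool all labels and image pairs into one (Tanimoto, Dice) result.

    `weights` holds one hypothetical weight per label; equal weights by default.
    """
    total_inter = total_union = total_size = 0.0
    for pair in pairs:
        w = weights if weights is not None else [1.0] * len(pair)
        for weight, (a, b) in zip(w, pair):
            inter, union, size = overlap_sums(a, b)
            total_inter += weight * inter
            total_union += weight * union
            total_size += weight * size
    if total_union == 0:
        raise ValueError("all labels are empty; overlap is undefined")
    return total_inter / total_union, 2.0 * total_inter / total_size


# Toy data: one image pair, two labels, four voxels; the second label is fractional.
example = [[
    ([1.0, 1.0, 0.0, 0.0], [1.0, 0.0, 0.0, 0.0]),
    ([0.0, 0.0, 0.5, 1.0], [0.0, 0.0, 1.0, 1.0]),
]]
tanimoto, dice = generalized_overlap(example)
print(round(tanimoto, 3), round(dice, 3))  # 0.625 0.769
```

Restricting `pairs` to a single image pair, or each pair to a single label, gives a per-pair or per-label figure in the same way; passing everything gives the overall figure.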

Original language: English
Article number: 1717643
Pages (from-to): 1451-1461
Number of pages: 11
Journal: IEEE Transactions on Medical Imaging
Issue number: 11
Publication status: Published - Nov 2006

