Why Is Greenwich so Common? Quantifying the Uniqueness of Multivariate Observations

Andrea Ballatore*, Stefano Cavazzi*

*Corresponding author for this work

Research output: Contribution to conference typesPaperpeer-review

Abstract

The concept of uniqueness can play an important role when the assessment of an observation’s distinctiveness is essential. This article introduces a distance-based uniqueness measure that quantifies the relative rarity or commonness of a multi-variate observation within a dataset. Unique observations exhibit rare combinations of values, and not necessarily extreme values. Taking a cognitive psychological perspective, our measure defines uniqueness as the sum of distances between a target observation and all other observations. After presenting the measure u and its corresponding standardised version uz, we propose a method to calculate a p value through a probability density function. We then demonstrate the measure’s behaviour in a case study on the uniqueness of Greater London boroughs, based on real-world socioeconomic variables. This initial investigation indicates that u can support exploratory data analysis.

Original languageEnglish
DOIs
Publication statusPublished - Sept 2023
Event12th International Conference on Geographic Information Science, GIScience 2023 - Leeds, United Kingdom
Duration: 12 Sept 202315 Sept 2023

Conference

Conference12th International Conference on Geographic Information Science, GIScience 2023
Country/TerritoryUnited Kingdom
CityLeeds
Period12/09/202315/09/2023

Keywords

  • distinctiveness
  • multivariate data
  • outlier detection
  • similarity
  • uniqueness

Fingerprint

Dive into the research topics of 'Why Is Greenwich so Common? Quantifying the Uniqueness of Multivariate Observations'. Together they form a unique fingerprint.

Cite this