Abstract
The concept of uniqueness can play an important role when the assessment of an observation’s distinctiveness is essential. This article introduces a distance-based uniqueness measure that quantifies the relative rarity or commonness of a multi-variate observation within a dataset. Unique observations exhibit rare combinations of values, and not necessarily extreme values. Taking a cognitive psychological perspective, our measure defines uniqueness as the sum of distances between a target observation and all other observations. After presenting the measure u and its corresponding standardised version uz, we propose a method to calculate a p value through a probability density function. We then demonstrate the measure’s behaviour in a case study on the uniqueness of Greater London boroughs, based on real-world socioeconomic variables. This initial investigation indicates that u can support exploratory data analysis.
Original language | English |
---|---|
DOIs | |
Publication status | Published - Sept 2023 |
Event | 12th International Conference on Geographic Information Science, GIScience 2023 - Leeds, United Kingdom Duration: 12 Sept 2023 → 15 Sept 2023 |
Conference
Conference | 12th International Conference on Geographic Information Science, GIScience 2023 |
---|---|
Country/Territory | United Kingdom |
City | Leeds |
Period | 12/09/2023 → 15/09/2023 |
Keywords
- distinctiveness
- multivariate data
- outlier detection
- similarity
- uniqueness