King's College London

Research portal

Characterising dataset search ? an analysis of search logs and data requests

Research output: Contribution to journalArticlepeer-review

Magdalena Kacprzak Emilia, Mylena Koesten Laura, Luis Ibanez Gonzalez, Tom Blount, Jeni Tennison, Elena Simperl

Original languageEnglish
Pages (from-to)37-55
Number of pages19
JournalJournal of Web Semantics
Volume55
Early online date19 Nov 2018
DOIs
Accepted/In press14 Nov 2018
E-pub ahead of print19 Nov 2018
PublishedMar 2019

Documents

Links

King's Authors

Abstract

Large amounts of data are becoming increasingly available online. In order to benefit from it we need tools to retrieve the most relevant datasets that match ones data needs. Several vocabularies have been developed to describe datasets in order to increase their discoverability, but for data publishers is costly to cumbersome to annotate them using all, leading to the question of what properties are more important. In this work we contribute with a systematic study of the patterns and specific attributes that data consumers use to search for data and how it compares with general web search. We performed a query log analysis based on logs from four national open data portals and conducted a qualitative analysis of user data requests for requests issued to one of them. Search queries issued on data portals differ from those issued to web search engines in their length, topic, and structure. Based on our findings we hypothesise that portals search functionalities are currently used in an exploratory manner, rather than to retrieve a specific resource. In our study of data requests we found that geospatial and temporal attributes, as well as information on the required granularity of the data are the most common features. The findings of both analyses suggest that these features are of higher importance in dataset retrieval in contrast to general web search, suggesting that efforts of dataset publishers should focus on generating dataset descriptions including them.

Download statistics

No data available

View graph of relations

© 2020 King's College London | Strand | London WC2R 2LS | England | United Kingdom | Tel +44 (0)20 7836 5454