An Analysis of Concept Lattice Usability for Open Data Categorization within Evolving Metadata Environment

XVII International Conference on Systems, Automatic Control and Measurements, SAUM 2024 (pp. 43-46)

АУТОР(И) / AUTHOR(S): Milena Frtunić Gligorijević , Miloš Bogdanović , Leonid Stoimenov 

Download Full Pdf  

DOI:  10.46793/SAUM24.043FG

САЖЕТАК / ABSTRACT:

Due to the open data initiatives a substantial number of datasets has been published on open data portals. In order to support data findability and usability these portals provide various search options based on datasets metadata. However, often metadata incompleteness directly impacts the quality of search results and consequently data discoverability. Therefore, a need has arisen for approaches for filling in the missing metadata information, like missing dataset category values. One of these approaches is focused on categorizing datasets based on the tags used to describe them. This approach relies on a knowledge base consisting of concept lattices created for each of the categories created using the Formal Concept Analysis method. However, with the increase in the number of datasets on the portal and consequently number of tags and combinations of tags used to describe them, the knowledge base has to be recreated. Therefore, within this research, we present the analysis of concept lattice usability for the categorization of datasets within evolving metadata environment. We analyze two sets of concept lattices created for Ireland’s open data portal datasets in 2020 and 2021 and their usability for categorizing new datasets that were available on the portal in 2021 and 2023.

КЉУЧНЕ РЕЧИ / KEYWORDS:

open data portal, categorization, Formal Concept Analysis, concept lattice

ПРОЈЕКАТ/ ACKNOWLEDGEMENT:

The research presented in this paper was funded by the Ministry of Science, Technological Development and Innovation of the Republic of Serbia [grant number 451-03-66/2024-03/200102].

ЛИТЕРАТУРА / REFERENCES

  1. Braunschweig K, Eberius J, Thiele M et al. The State of Open Data Limits of Current Open Data Platforms, 2012.
  2. Reiche KJ and Höfig E. Implementation of metadata quality metrics and application on public government data. In 2013 IEEE 37th Annual Computer Software and Applications Conference Workshops. pp. 236–241. DOI: 10.1109/COMPSACW.2013. 32.
  3. Zuiderwijk A, Volten C, Kroesen M et al. Motivation perspectives on opening up municipality data: Does municipality size matter? Information 2018; 9: 267. DOI: https://doi.org/10.3390/info9110267.
  4. Milena Frtunić Gligorijević, Miloš Bogdanović, and Leonid Stoimenov. Tracking metadata changes in the government open data portals. In Zdravković, M., Trajanović, M., Konjović, Z. (Eds.) ICIST 2022 Proceedings, pp.180-184, 2022. ISBN 978-86-85525-24-7
  5. Wille, “Restructuring lattice theory: An approach based on hierar[1]chies of concepts”, Ordered Sets, Springer, Dordrecht, pp. 445–470, 1982.
  6. Birkhoff, Lattice theory, American Mathematical Society Coll. Publ. 25, Providence, RI, 1973
  7. M. Barbut, B. Monjardet, Ordre et classification, algèbre et combinatoire, Paris, Hachette, 1970.