3rd International Conference on Chemo and BioInformatics, Kragujevac, September 25-26, 2025. (pp. 71-74)
AUTOR(I) / AUTHOR(S): Miloš Jovičić, Dragutin Ostojić, Nikola Prodanović, Nenad Djordjević, Nenad Janković
DOI: 10.46793/ICCBIKG25.071J
SAŽETAK / ABSTRACT:
Electronic health records (EHRs) contain rich relational data such as individual patient data, encounters, diagnoses, and medications. Healthcare systems often store EHR data in tabular form. However, traditional flat representations (“bag of features”) can lose critical context. For example, treating a patient encounter as an unordered set of codes obscures the fact that a specific combination of drugs might have caused an adverse outcome. Knowledge graphs offer a robust alternative by organizing medical data into interconnected entities and relationships, capturing complex associations (e.g. between symptoms, treatments, and diagnoses) for a more holistic understanding of patient history. In this work, we transform the Diabetes 130-US Hospitals dataset (a collection of ~100,000 inpatient encounters from 130 hospitals over 10 years) into a labeled property graph (LPG) and demonstrate both the conceptual and the quantitative advantages of graph-based analysis in a medical informatics context. Each encounter in this dataset includes patient demographics, diagnoses (ICD-9 codes), lab results (e.g. HbA1c), and 24 diabetes-related medications with change indicators (“up”, “down”, “steady” or “no change”), among other features. Notably, the original study focused on 30-day readmissions, highlighting that poor glycemic control and suboptimal inpatient diabetes management lead to higher readmission rates and complications. Our graph model makes these clinical relationships explicit, enabling multi-hop reasoning (e.g. linking a patient’s lab result to medication changes and the subsequent readmission outcome) that is cumbersome with relational tables. We show that converting such EHR data into a graph can improve predictive modeling of readmissions and uncover insightful patterns of comorbidities and care processes that would be difficult to extract using SQL alone, in line with the recent trend in biomedical informatics of using networks for clinical data analysis.
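To illustrate the idea of multi-hop reasoning over a labeled property graph, the following is a minimal sketch in plain Python, not the authors' Neo4j implementation. The node labels (`Patient`, `Encounter`, `Diagnosis`, `LabResult`, `Medication`), the relationship types (`HAD_ENCOUNTER`, `HAS_DIAGNOSIS`, `HAS_LAB`, `PRESCRIBED`), and the three sample rows are assumptions made for illustration; the column names are assumed to follow the public UCI release of the dataset.

```python
from collections import defaultdict

# Hypothetical rows; all values are invented for illustration only.
ROWS = [
    {"encounter_id": 1, "patient_nbr": 100, "diag_1": "250.01",
     "A1Cresult": ">8", "insulin": "Up", "readmitted": "<30"},
    {"encounter_id": 2, "patient_nbr": 100, "diag_1": "428",
     "A1Cresult": "None", "insulin": "Steady", "readmitted": "NO"},
    {"encounter_id": 3, "patient_nbr": 200, "diag_1": "250.01",
     "A1Cresult": ">8", "insulin": "No", "readmitted": "NO"},
]


class PropertyGraph:
    """Tiny in-memory labeled property graph (illustrative only)."""

    def __init__(self):
        self.nodes = {}                     # (label, id) -> properties
        self.out_edges = defaultdict(list)  # source -> [(type, target, properties)]

    def add_node(self, label, ident, **props):
        key = (label, ident)
        self.nodes.setdefault(key, {}).update(props)
        return key

    def add_edge(self, src, rel_type, dst, **props):
        self.out_edges[src].append((rel_type, dst, props))

    def neighbors(self, src, rel_type):
        return [(dst, props) for rel, dst, props in self.out_edges[src]
                if rel == rel_type]


def build_graph(rows):
    """Turn flat encounter rows into nodes and typed relationships."""
    graph = PropertyGraph()
    for row in rows:
        patient = graph.add_node("Patient", row["patient_nbr"])
        encounter = graph.add_node("Encounter", row["encounter_id"],
                                   readmitted=row["readmitted"])
        graph.add_edge(patient, "HAD_ENCOUNTER", encounter)
        diagnosis = graph.add_node("Diagnosis", row["diag_1"])
        graph.add_edge(encounter, "HAS_DIAGNOSIS", diagnosis)
        if row["A1Cresult"] != "None":      # the test was actually performed
            lab = graph.add_node("LabResult", ("HbA1c", row["A1Cresult"]))
            graph.add_edge(encounter, "HAS_LAB", lab)
        if row["insulin"] != "No":          # the drug was actually prescribed
            med = graph.add_node("Medication", "insulin")
            # The change indicator is a property of the relationship.
            graph.add_edge(encounter, "PRESCRIBED", med, change=row["insulin"])
    return graph


def readmitted_after_uptitration(graph):
    """Multi-hop pattern: Patient -> Encounter -> (HbA1c > 8, insulin up),
    restricted to encounters followed by readmission within 30 days."""
    hits = []
    for label, ident in graph.nodes:
        if label != "Patient":
            continue
        for enc, _ in graph.neighbors((label, ident), "HAD_ENCOUNTER"):
            high_a1c = any(lab[1] == ("HbA1c", ">8")
                           for lab, _ in graph.neighbors(enc, "HAS_LAB"))
            dose_up = any(med[1] == "insulin" and props["change"] == "Up"
                          for med, props in graph.neighbors(enc, "PRESCRIBED"))
            if high_a1c and dose_up and graph.nodes[enc]["readmitted"] == "<30":
                hits.append((ident, enc[1]))
    return hits


graph = build_graph(ROWS)
print(readmitted_after_uptitration(graph))  # [(100, 1)]
```

In a relational layout the same question typically needs several joins or a wide table scan; here it is a traversal along named relationships, which is the property the abstract refers to. A graph database such as Neo4j would express the traversal declaratively instead of with nested loops.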
KLJUČNE REČI / KEYWORDS:
Knowledge graph, Electronic health records, Labeled property graph, Neo4j, Graph algorithms
PROJEKAT / ACKNOWLEDGEMENT:
This work has been supported by the Serbian Ministry of Science, Technological Development, and Innovations, Agreement No. 451-03-137/2025-03/200122. We gratefully acknowledge use of the “Diabetes 130-US Hospitals for Years 1999–2008” dataset, provided by the UCI Machine Learning Repository (DOI: 10.24432/C5230J). When citing this dataset, please also reference the introductory study describing its construction and variables: Strack B., DeShazo J.P., Gennings C., Olmo J.L., Ventura S., Cios K.J., Clore J.N. Impact of HbA1c Measurement on Hospital Readmission Rates: Analysis of 70,000 Clinical Database Patient Records, BioMed Research International, 2014, Article ID 781670.
LITERATURA / REFERENCES:
- J. Schaff, C.C. Fink, B. Slepchenko, J.H. Carson, L.M. Loew., A General Computational Framework for Modeling Cellular Structure and Function, Biophysical Journal, 73 (1997) 1135–1146.
- B. Strack, J.P. DeShazo, C. Gennings, J.L. Olmo, S. Ventura, K.J. Cios, J.N. Clore., Impact of HbA1c measurement on hospital readmission rates: analysis of 70,000 clinical database patient records, BioMed Research International, 2014 (2014) 781670.
- Y. Li, H. Qiu., Disease network analysis to reveal comorbidity patterns in hospitalized patients with COPD using large-scale administrative health data, Scientific Reports, 15 (2025) 27082.
- R. Alotaibi, C. Lei, A. Quamar, V. Efthymiou, F. Özcan., Property Graph Schema Optimization for Domain-Specific Knowledge Graphs, 2021 IEEE 37th International Conference on Data Engineering (ICDE), (2021) 924–935.
- I. Robinson, J. Webber, E. Eifrem., Graph Databases: New Opportunities for Connected Data, 2nd ed., O’Reilly Media, (2015) 23–24.
- P. Marjai, B. Szabari, A. Kiss., An Experimental Study on Centrality Measures Using Clustering, Computers, 10 (2021) 115.
- T. Bratanič., Graph Algorithms for Data Science: With examples in Neo4j, Manning Publications, (2024).
- M.E.J. Newman, M. Girvan., Finding and evaluating community structure in networks, Physical Review E, 69 (2004) 026113.