IRTUM – Institutional Repository of the Technical University of Moldova

Comparative evaluation of k-anonymity, differential privacy, and pseudonymization for data protection in rare disease registries

Show simple item record

dc.contributor.author SIMINIUC, Sergiu
dc.contributor.author ȚURCANU, Dinu
dc.contributor.author SIMINIUC, Rodica
dc.date.accessioned 2026-02-15T14:28:31Z
dc.date.available 2026-02-15T14:28:31Z
dc.date.issued 2025
dc.identifier.citation SIMINIUC, Sergiu; Dinu ȚURCANU and Rodica SIMINIUC. Comparative evaluation of k-anonymity, differential privacy, and pseudonymization for data protection in rare disease registries. In: 7th International Conference on Nanotechnologies and Biomedical Engineering, ICNBME 2025, Biomedical Engineering and New Technologies for Diagnosis, Treatment, and Rehabilitation, Chisinau, Republic of Moldova, 7-10 October, 2025. Technical University of Moldova. Springer Nature, 2025, vol. 2, pp. 566-580. ISBN 978-3-032-06496-7, eISBN 978-3-032-06497-4, ISSN 1680-0737, eISSN 1433-9277. en_US
dc.identifier.isbn 978-3-032-06496-7
dc.identifier.isbn 978-3-032-06497-4
dc.identifier.issn 1680-0737
dc.identifier.issn 1433-9277
dc.identifier.uri https://doi.org/10.1007/978-3-032-06497-4_56
dc.identifier.uri https://repository.utm.md/handle/5014/35212
dc.description Acces full text: https://doi.org/10.1007/978-3-032-06497-4_56 en_US
dc.description.abstract Protecting patient confidentiality in rare disease research presents unique challenges due to small population sizes and the increased risk of re-identification through quasi-identifiers. This study presents a comparative evaluation of three anonymization techniques—k-anonymity, differential privacy, and pseudonymization—applied to a fully synthetic dataset of 10,000 rare disease patients, calibrated using real epidemiological distributions. Each method was assessed across three dimensions: confidentiality protection (residual risk, NCP, AECS, ε–δ guarantee), analytical utility (impact on descriptive statistics and logistic regression AUC), and computational efficiency (execution time, RAM usage). The results show that differential privacy (ε = 1.0) achieved the lowest re-identification risk (< 0.1%) with a negligible loss of utility (ΔAUC ≈ 0), making it suitable for open data dissemination. K-anonymity (k = 5) reduced risk to 2% while introducing moderate information loss (NCP ≈ 0.12), offering a compromise where interpretability is prioritized. Pseudonymization preserved full analytical utility and minimal processing cost, but remained insufficient under GDPR due to the potential for re-linkage. A hybrid anonymization framework is proposed: pseudonymization for internal operations and longitudinal tracking, k-anonymity for interpretable analysis, and differential privacy for public dissemination. This integrated approach ensures compliance with GDPR while preserving analytical usability in rare disease research. en_US
dc.language.iso en en_US
dc.publisher Springer Nature en_US
dc.rights Attribution-NonCommercial-NoDerivs 3.0 United States *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/us/ *
dc.subject data anonymizatio en_US
dc.subject data confidentiality en_US
dc.subject differential privacy en_US
dc.subject k-anonymity en_US
dc.subject pseudonymization en_US
dc.subject rare diseases en_US
dc.title Comparative evaluation of k-anonymity, differential privacy, and pseudonymization for data protection in rare disease registries en_US
dc.type Article en_US


Files in this item

The following license files are associated with this item:

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 United States Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 United States

Search DSpace


Browse

My Account