Locational privacy-preserving distance computations with intersecting sets of randomly labeled grid points

GND
133653633
ORCID
0000-0001-7843-4974
LSF
49684
Zugehörige Organisation
Research Methodology Group, University of Duisburg-Essen, Duisburg, Germany
Schnell, Rainer;
GND
1143269616
ORCID
0000-0002-4545-9136
LSF
58467
Zugehörige Organisation
Methodology R&D, Statistics Netherlands (CBS), Heerlen, The Netherlands
Klingwort, Jonas;
ORCID
0000-0002-4916-6111
Zugehörige Organisation
Farrow Norris, Sydney, Australia
Farrow, James M.

Background: We introduce and study a recently proposed method for privacy-preserving distance computa- tions which has received little attention in the scientific literature so far. The method, which is based on intersecting sets of randomly labeled grid points, is henceforth denoted as ISGP allows calculating the approximate distances between masked spatial data. Coordinates are replaced by sets of hash values. The method allows the computation of distances between locations L when the locations at different points in time t are not known simultaneously. The distance between L1 and L2 could be computed even when L2 does not exist at t1 and L1has been deleted at t2. An example would be patients from a medical data set and locations of later hospitalizations. ISGP is a new tool for privacy-preserving data handling of geo-referenced data sets in general. Furthermore, this technique can be used to include geographical identifiers as additional information for privacy-preserving record-linkage. To show that the technique can be implemented in most high-level programming languages with a few lines of code, a complete implementation within the statistical programming language R is given. The properties of the method are explored using simulations based on large-scale real-world data of hospitals (n = 850) and residential locations (n = 13, 000). The method has already been used in a real-world application.

Results: ISGP yields very accurate results. Our simulation study showed that—with appropriately chosen parameters – 99 % accuracy in the approximated distances is achieved.

Conclusion: We discussed a new method for privacy-preserving distance computations in microdata. The method is highly accurate, fast, has low computational burden, and does not require excessive storage. Keywords: Geographical data, Geo-referenced data, Geo-masking, Record-linkage, ISGP

Zitieren

Zitierform:
Zitierform konnte nicht geladen werden.

Rechte

Rechteinhaber:

© The Author(s) 2021

Nutzung und Vervielfältigung:
Dieses Werk kann unter einer
CC BY 4.0 LogoCreative Commons Namensnennung 4.0 Lizenz (CC BY 4.0)
genutzt werden.