Preserving temporal relations in clinical data while maintaining privacy

George Hripcsak, Parsa Mirhaji, Alexander F H Low, Bradley A. Malin

Research output: Contribution to journal (Article)

2 Citations (Scopus)

Abstract

Objective: Maintaining patient privacy is a challenge in large-scale observational research. To assist in reducing the risk of identifying study subjects through publicly available data, we introduce a method for obscuring date information for clinical events and patient characteristics.

Methods: The method, which we call Shift and Truncate (SANT), obscures date information to any desired granularity. Shift and Truncate first assigns each patient a random shift value, such that all dates in that patient's record are shifted by that amount. Data are then truncated from the beginning and end of the data set.

Results: The data set can be proven to not disclose temporal information finer than the chosen granularity. Unlike previous strategies such as a simple shift, it remains robust to frequent, even daily, updates and robust to inferring dates at the beginning and end of date-shifted data sets. Time-of-day may be retained or obscured, depending on the goal and anticipated knowledge of the data recipient.

Conclusions: The method can be useful as a scientific approach for reducing re-identification risk under the Privacy Rule of the Health Insurance Portability and Accountability Act and may contribute to qualification for the Safe Harbor implementation.
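The two steps named in the abstract (a per-patient shift, then truncation at both ends of the data set) can be sketched as follows. This is a minimal illustration, not the authors' published implementation. The abstract does not say how shifts are drawn or how much is truncated, so the sketch assumes a uniform shift of 0 to g-1 days for a granularity of g days, and it keeps only the date window that would contain data under every possible shift. The function name `shift_and_truncate` and its parameters are hypothetical.

```python
import random
from datetime import date, timedelta


def shift_and_truncate(records, start, end, granularity_days, rng=None):
    """Hypothetical sketch of a shift-then-truncate step.

    records: dict mapping patient id -> list of datetime.date events
    start, end: first and last real calendar dates covered by the data set
    granularity_days: desired granularity g (assumed: shifts span 0..g-1 days)
    """
    rng = rng or random.Random()
    max_shift = granularity_days - 1

    # Assumed truncation rule: release only the window covered under every
    # possible shift, so data at the edges cannot reveal a patient's shift.
    keep_from = start + timedelta(days=max_shift)
    keep_to = end

    released = {}
    for patient_id, dates in records.items():
        # One shift per patient, so intervals within a record are preserved.
        shift = timedelta(days=rng.randint(0, max_shift))
        shifted = [d + shift for d in dates]
        released[patient_id] = [d for d in shifted if keep_from <= d <= keep_to]
    return released
```

Because every date in a record moves by the same amount, the interval between any two retained events of one patient is unchanged, while events of different patients are no longer aligned to the same calendar.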

Original language: English (US)
Article number: ocw001
Pages (from-to): 1040-1045
Number of pages: 6
Journal: Journal of the American Medical Informatics Association
Volume: 23
Issue number: 6
DOI: 10.1093/jamia/ocw001
State: Published - Nov 1 2016

Fingerprint

Privacy
Health Insurance Portability and Accountability Act
Research
Datasets

ASJC Scopus subject areas

  • Health Informatics

Cite this

Preserving temporal relations in clinical data while maintaining privacy. / Hripcsak, George; Mirhaji, Parsa; Low, Alexander F H; Malin, Bradley A.

In: Journal of the American Medical Informatics Association, Vol. 23, No. 6, ocw001, 01.11.2016, p. 1040-1045.

Hripcsak, George ; Mirhaji, Parsa ; Low, Alexander F H ; Malin, Bradley A. / Preserving temporal relations in clinical data while maintaining privacy. In: Journal of the American Medical Informatics Association. 2016 ; Vol. 23, No. 6. pp. 1040-1045.
@article{2500a4875a3d4b0eb8e600bec3d4b213,
title = "Preserving temporal relations in clinical data while maintaining privacy",
abstract = "Objective: Maintaining patient privacy is a challenge in large-scale observational research. To assist in reducing the risk of identifying study subjects through publicly available data, we introduce a method for obscuring date information for clinical events and patient characteristics. Methods: The method, which we call Shift and Truncate (SANT), obscures date information to any desired granularity. Shift and Truncate first assigns each patient a random shift value, such that all dates in that patient's record are shifted by that amount. Data are then truncated from the beginning and end of the data set. Results: The data set can be proven to not disclose temporal information finer than the chosen granularity. Unlike previous strategies such as a simple shift, it remains robust to frequent, even daily, updates and robust to inferring dates at the beginning and end of date-shifted data sets. Time-of-day may be retained or obscured, depending on the goal and anticipated knowledge of the data recipient. Conclusions: The method can be useful as a scientific approach for reducing re-identification risk under the Privacy Rule of the Health Insurance Portability and Accountability Act and may contribute to qualification for the Safe Harbor implementation.",
author = "Hripcsak, George and Mirhaji, Parsa and Low, {Alexander F H} and Malin, {Bradley A.}",
year = "2016",
month = "11",
day = "1",
doi = "10.1093/jamia/ocw001",
language = "English (US)",
volume = "23",
pages = "1040--1045",
journal = "Journal of the American Medical Informatics Association : JAMIA",
issn = "1067-5027",
publisher = "Oxford University Press",
number = "6",

}

TY - JOUR

T1 - Preserving temporal relations in clinical data while maintaining privacy

AU - Hripcsak, George

AU - Mirhaji, Parsa

AU - Low, Alexander F H

AU - Malin, Bradley A.

PY - 2016/11/1

Y1 - 2016/11/1

AB - Objective: Maintaining patient privacy is a challenge in large-scale observational research. To assist in reducing the risk of identifying study subjects through publicly available data, we introduce a method for obscuring date information for clinical events and patient characteristics. Methods: The method, which we call Shift and Truncate (SANT), obscures date information to any desired granularity. Shift and Truncate first assigns each patient a random shift value, such that all dates in that patient's record are shifted by that amount. Data are then truncated from the beginning and end of the data set. Results: The data set can be proven to not disclose temporal information finer than the chosen granularity. Unlike previous strategies such as a simple shift, it remains robust to frequent, even daily, updates and robust to inferring dates at the beginning and end of date-shifted data sets. Time-of-day may be retained or obscured, depending on the goal and anticipated knowledge of the data recipient. Conclusions: The method can be useful as a scientific approach for reducing re-identification risk under the Privacy Rule of the Health Insurance Portability and Accountability Act and may contribute to qualification for the Safe Harbor implementation.

UR - http://www.scopus.com/inward/record.url?scp=84994700819&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84994700819&partnerID=8YFLogxK

U2 - 10.1093/jamia/ocw001

DO - 10.1093/jamia/ocw001

M3 - Article

C2 - 27013522

AN - SCOPUS:84994700819

VL - 23

SP - 1040

EP - 1045

JO - Journal of the American Medical Informatics Association : JAMIA

JF - Journal of the American Medical Informatics Association : JAMIA

SN - 1067-5027

IS - 6

M1 - ocw001

ER -