TY - JOUR
T1 - Privacy-preserving GWAS analysis on federated genomic datasets
AU - Constable, Scott D.
AU - Tang, Yuzhe
AU - Wang, Shuang
AU - Jiang, Xiaoqian
AU - Chapin, Steve
N1 - Publisher Copyright:
© 2015 Constable et al.
PY - 2015/12/21
Y1 - 2015/12/21
N2 - Background: The biomedical community benefits from the increasing availability of genomic data to support meaningful scientific research, e.g., Genome-Wide Association Studies (GWAS). However, high quality GWAS usually requires a large amount of samples, which can grow beyond the capability of a single institution. Federated genomic data analysis holds the promise of enabling cross-institution collaboration for effective GWAS, but it raises concerns about patient privacy and medical information confidentiality (as data are being exchanged across institutional boundaries), which becomes an inhibiting factor for the practical use. Methods: We present a privacy-preserving GWAS framework on federated genomic datasets. Our method is to layer the GWAS computations on top of secure multi-party computation (MPC) systems. This approach allows two parties in a distributed system to mutually perform secure GWAS computations, but without exposing their private data outside. Results: We demonstrate our technique by implementing a framework for minor allele frequency counting and χ 2 statistics calculation, one of typical computations used in GWAS. For efficient prototyping, we use a state-of-the-art MPC framework, i.e., Portable Circuit Format (PCF) [1]. Our experimental results show promise in realizing both efficient and secure cross-institution GWAS computations.
AB - Background: The biomedical community benefits from the increasing availability of genomic data to support meaningful scientific research, e.g., Genome-Wide Association Studies (GWAS). However, high quality GWAS usually requires a large amount of samples, which can grow beyond the capability of a single institution. Federated genomic data analysis holds the promise of enabling cross-institution collaboration for effective GWAS, but it raises concerns about patient privacy and medical information confidentiality (as data are being exchanged across institutional boundaries), which becomes an inhibiting factor for the practical use. Methods: We present a privacy-preserving GWAS framework on federated genomic datasets. Our method is to layer the GWAS computations on top of secure multi-party computation (MPC) systems. This approach allows two parties in a distributed system to mutually perform secure GWAS computations, but without exposing their private data outside. Results: We demonstrate our technique by implementing a framework for minor allele frequency counting and χ 2 statistics calculation, one of typical computations used in GWAS. For efficient prototyping, we use a state-of-the-art MPC framework, i.e., Portable Circuit Format (PCF) [1]. Our experimental results show promise in realizing both efficient and secure cross-institution GWAS computations.
KW - GWAS
KW - Genomic data privacy protection
KW - Secure multi-party computation
KW - Statistical analysis
UR - http://www.scopus.com/inward/record.url?scp=84977270885&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84977270885&partnerID=8YFLogxK
U2 - 10.1186/1472-6947-15-S5-S2
DO - 10.1186/1472-6947-15-S5-S2
M3 - Article
C2 - 26733045
AN - SCOPUS:84977270885
SN - 1472-6947
VL - 15
JO - BMC Medical Informatics and Decision Making
JF - BMC Medical Informatics and Decision Making
IS - 5
M1 - S2
ER -