TY - GEN
T1 - Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data
AU - Liu, Ying
AU - Logan, Brent
AU - Liu, Ning
AU - Xu, Zhiyuan
AU - Tang, Jian
AU - Wang, Yangzhi
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/9/8
Y1 - 2017/9/8
N2 - In this paper, we propose the first deep reinforce-ment learning framework to estimate the optimal Dynamic Treat-ment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real life complexity in heterogeneous disease progression and treatment choices, with the goal to provide doctor and patients the data-driven personalized decision recommendations. The proposed deep reinforcement learning framework contains a supervised learning step to predict the most possible expert actions; and a deep reinforcement learning step to estimate the long term value function of Dynamic Treatment Regimes. We motivated and implemented the proposed framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatments for acute and chronic graft versus host disease. We showed results of the initial implementation that demonstrates promising accuracy in predicting human expert decisions and initial implementation for the reinforcement learning step.
AB - In this paper, we propose the first deep reinforce-ment learning framework to estimate the optimal Dynamic Treat-ment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real life complexity in heterogeneous disease progression and treatment choices, with the goal to provide doctor and patients the data-driven personalized decision recommendations. The proposed deep reinforcement learning framework contains a supervised learning step to predict the most possible expert actions; and a deep reinforcement learning step to estimate the long term value function of Dynamic Treatment Regimes. We motivated and implemented the proposed framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatments for acute and chronic graft versus host disease. We showed results of the initial implementation that demonstrates promising accuracy in predicting human expert decisions and initial implementation for the reinforcement learning step.
UR - http://www.scopus.com/inward/record.url?scp=85032350439&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85032350439&partnerID=8YFLogxK
U2 - 10.1109/ICHI.2017.45
DO - 10.1109/ICHI.2017.45
M3 - Conference contribution
AN - SCOPUS:85032350439
T3 - Proceedings - 2017 IEEE International Conference on Healthcare Informatics, ICHI 2017
SP - 380
EP - 385
BT - Proceedings - 2017 IEEE International Conference on Healthcare Informatics, ICHI 2017
A2 - Cummins, Mollie
A2 - Facelli, Julio
A2 - Meixner, Gerrit
A2 - Giraud-Carrier, Christophe
A2 - Nakajima, Hiroshi
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th IEEE International Conference on Healthcare Informatics, ICHI 2017
Y2 - 23 August 2017 through 26 August 2017
ER -