Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network

Ning Liu, Ying Liu, Brent Logan, Zhiyuan Xu, Jian Tang, Yanzhi Wang

Research output: Contribution to journal › Article

Abstract

This paper presents a deep reinforcement learning (DRL) framework for estimating optimal dynamic treatment regimes from observational medical data. Compared with existing reinforcement learning methods, the framework is more flexible and adaptive in high-dimensional action and state spaces, which lets it model the real-life complexity of heterogeneous disease progression and treatment choices, with the goal of providing doctors and patients with data-driven, personalized decision recommendations. The proposed DRL framework comprises (i) a supervised learning step that predicts expert actions and (ii) a deep reinforcement learning step that estimates the long-term value function of dynamic treatment regimes. Both steps rely on deep neural networks. As a key motivating example, we implemented the framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatment for acute and chronic graft-versus-host disease after transplantation. The experimental results show promising accuracy in predicting human experts' decisions, as well as a high expected reward for the DRL-based dynamic treatment regimes.
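
The two steps named in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the paper uses deep neural networks for both steps, whereas the sketch below swaps in a per-state frequency count for the supervised step and tabular Q-learning over logged transitions for the value-estimation step. The transitions, state names, action names, rewards, and all function names are hypothetical and chosen only for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical logged transitions: (state, action, reward, next_state, done).
# These are made-up illustrative records, not registry data.
TRANSITIONS = [
    ("stable", "standard", 0.0, "stable", False),
    ("stable", "standard", 0.0, "flare", False),
    ("flare", "intensive", 1.0, "recovered", True),
    ("flare", "standard", 0.0, "relapse", True),
    ("stable", "standard", 1.0, "recovered", True),
    ("flare", "intensive", 1.0, "recovered", True),
]
ACTIONS = ["standard", "intensive"]


def fit_expert_policy(transitions):
    """Step (i) stand-in: predict the expert's most frequent action per state."""
    counts = defaultdict(Counter)
    for state, action, _, _, _ in transitions:
        counts[state][action] += 1
    return {s: c.most_common(1)[0][0] for s, c in counts.items()}


def batch_q_learning(transitions, actions, gamma=0.9, alpha=0.1, sweeps=500):
    """Step (ii) stand-in: tabular Q-learning replayed over the logged data."""
    q = defaultdict(float)  # unseen (state, action) pairs default to 0.0
    for _ in range(sweeps):
        for state, action, reward, next_state, done in transitions:
            if done:
                target = reward
            else:
                target = reward + gamma * max(q[(next_state, a)] for a in actions)
            q[(state, action)] += alpha * (target - q[(state, action)])
    return q


def greedy_policy(q, states, actions):
    """Pick the action with the highest estimated long-term value per state."""
    return {s: max(actions, key=lambda a: q[(s, a)]) for s in states}


expert = fit_expert_policy(TRANSITIONS)
q_values = batch_q_learning(TRANSITIONS, ACTIONS)
learned = greedy_policy(q_values, ["stable", "flare"], ACTIONS)

print("expert policy:", expert)
print("learned policy:", learned)
```

In this toy setting, actions that never appear in the log keep a value of zero, so the greedy policy can only prefer them when every observed action looks worse. A function approximator such as the deep Q-network described in the abstract would instead generalize across states and actions, which is the property the tabular stand-in cannot show.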

Original language: English (US)
Article number: 1495
Journal: Scientific Reports
Volume: 9
Issue number: 1
State: Published - Dec 1 2019

ASJC Scopus subject areas

  • General