Neurorobotics Lab - University of Freiburg

Workshop Paper at AAAI 2020: Maria Hügle, Gabriel Kalweit, Thomas Hügle and Joschka Boedecker

AdaptiveNet: A Dynamic Deep Neural Network For Multimodal Clinical Data Analysis

Workshop Paper Proceedings (Springer) Cite

Introduction

Fig. 1: Scheme of AdaptiveNet, which projects visits and medication adjustments to the same latent space using encoder networks $\phi^{\text{visit}}$ and $\phi^{\text{med}}$, where the output vectors $\phi^{(\cdot)}{(\cdot)}$ have the same length. The sorted list of encoded events are pooled by an LSTM to compute a fixed-length encoded patient history. The final output $\hat y$ is computed by the network module $\rho$.}

Clinical data from electronic medical records, registries or trials provide a large source of information to apply machine learning methods in order to foster precision medicine, e.g. by finding new disease phenotypes or performing individual disease prediction. However, to take full advantage of deep learning methods on clinical data, architectures are necessary that 1) are robust with respect to missing and wrong values, and 2) can deal with highly variable-sized lists and long-term dependencies of individual diagnosis, procedures, measurements and medication prescriptions. In this work, we elaborate limitations of fully-connected neural networks and classical machine learning methods in this context and propose AdaptiveNet, a novel recurrent neural network architecture, which can deal with multiple lists of different events, alleviating the aforementioned limitations. We employ the architecture to the problem of disease progression prediction in rheumatoid arthritis using the Swiss Clinical Quality Management registry, which contains over 10.000 patients and more than 65.000 patient visits. Our proposed approach leads to more compact representations and outperforms the classical baselines.

Technical Approach

Disease Progression Prediction

Disease progression prediction based on a clinical dataset can be modeled as time-series prediction, where for a patient at time point $t$ the future disease activity at time point $ t + \Delta t$ is predicted. The dataset consists of records for a set of patients $\mathcal{R} = \bigcup_j \mathcal{R}_j$. Records can contain general patient information (e.g. age, gender, antibody-status) and multiple list of events. The subset $R_j(t) \subseteq \mathcal{R}_j$, denotes all records of a patient $j$ collected until time point $t$ with $$R_j(t) = \{e^\text{patient}(t)\} \cup E^\text{visit}(t) \cup E^{\text{med}}(t),$$ where $e^\text{patient}(t)$ contains general patient information collected until time point $t$. $E^{k}(t)$ is a list of events collected until time point $t$, where $$E^{k}(t) = \{e^{k}(t_e)\ | \ e^{k}(t_e) \in \mathcal{R}_j \text{ and } t_e \le t \} ,$$ for all event types $k \in \{\text{visit}, \text{med}\}$. Visit events contain information like joint swelling or patient reported outcomes (e.g. joint pain, morning stiffness and HAQ) and lab values (e.g. CRP, BSR). Medication events contain adjustment information, such as drug type and dose. In the same manner, further lists of other event types could be added, such as imaging data (e.g. MRI, x-ray).
Considering records $R_j(t)$ as input, we aim to learn a function $ f: (R_j(t), \Delta t) \rightarrow \mathbb{R}$ that maps the records at time point $t$ to the expected change of the disease level $\text{score}_j$ until time $\Delta t$ with $$f( R_j(t), \Delta t) = \text{score}_j(t + \Delta t) - \text{score}_j(t).$$ To account for variable time spans between entries, for all events, the time distance $\Delta t$ to the prediction time point is added as additional input feature. Records with included time feature are denoted as $R_j(t, \Delta t)$ and event lists with $E^{(\cdot)}(t, \Delta t)$. If not denoted explicitly in the following, we assume that $\Delta t$ is included in all records and lists.

AdaptiveNet

In order to deal with patient records $R_j(t)$ and the corresponding variable-sized event lists $E^{(\cdot)}(t)$ for a time point $t$, we propose the neural network architecture AdaptiveNet, which is able to deal with $K$ input sets $E^{1}, ..., E^{K}$, where every set can have variable length and different feature representations. With neural network modules $\phi^1, ..., \phi^K$, every element of the $K$ lists can be projected to a latent space and a sorted list of latent events can be computed as $$ \Psi(R_j(t)) = \Psi(E^{1}, ..., E^{K}) = sort \left( \bigcup_{k} \bigcup_{e \in E^{ k}(t)}\phi^k(e) \right),$$ with $1 \le k \le K$, sorted according to the time points of the events. The output vectors $\phi^k(\cdot) \in \mathbb{R}^F $ of the encoder networks $\phi^k$ have the same length $F$. These network modules can have an arbitrary architecture. In this work, we use fully-connected network modules to deal with numerical and categorical input features. We additionally propose to share the parameters of the last layer over all encoder networks. Then, $\phi^k(\cdot)$ can be seen as a projection of all input objects to the same encoded \textit{event space} (effects of which we investigate further in the results). The prediction $\hat y$ of the network is then computed as $$\hat y(R_j(t)) = \rho\bigg( \text{LSTM} \bigg[ \Psi(R_j(t)) \bigg] \ || \ e^\text{patient} \bigg),$$ where $||$ denotes concatenation of the vectors and $\rho$ is a fully-connected network module. In this work, we use an LSTM to pool the events by a recurrent unit. To tackle the disease progression prediction problem, in this work, we use two encoder modules $\phi^1, \phi^2$ for the set of event types $\{\text{visit}, \text{med}\}$. It is straightforward to add other event types, for example imaging data. In this case, the encoder module could consist of a convolutional neural network, which can optionally be pre-trained and have fixed weights.

Dataset

In this work, we use the Swiss Clinical Quality Management (SCQM) database for rheumatic diseases, which includes data of over 9500 patients with RA, assessed during consultations and via the mySCQM online application. The database consists of general patient information, clinical data, disease characteristics, ultrasound, radiographs, lab values, medication treatments and patient reported outcome (HAQ, RADAI-5, SF12, EuroQol). Patients were followed-up with one to four visits yearly and clinical information was updated every time. The data collection was approved by a national review board and all individuals willing to participate, signing an informed consent form before enrolment in accordance with the Declaration of Helsinki.

Disease Progression Prediction in Rheumatoid Arthritis

To represent the disease level, we use the hybrid score DAS28-BSR as prediction target, which contains DAS28 (Disease Activity Score 28) and the inflammation bloodmarker BSR (blood sedimentation rate). DAS28 defines the disease activity based on 28 joints (number of swollen joints, number of painfull joints and questionnaires (HAQ). For both training and evaluation, we consider only visits with available DAS28-BSR score. We focus on 13 visit features, selected by a medical expert. Additionally, we consider eight medications. As general patient information, we include gender, rheumatoid factor, anti-CCP, time since first symptoms and age.

Results

To evaluate the performance of all models, we use 5-fold cross validation. The results are shown in Fig. 2 for a prediction horizon of one year for all methods and different maximum history lengths of 6 months to 5 years.

Fig. 2: Mean squared error of the disease progression prediction for different maximum history lengths in a range from 5 years
to 6 months. The prediction horizon is 1 year (left). T-SNE visualization of the latent representations $\phi^\text{visit}(\cdot)$ and $\phi^\text{med}(\cdot)$
with shared parameters in the last layer (right).

BibTeX

			
@misc{hgle2020dynamic,
    title={A Dynamic Deep Neural Network For Multimodal Clinical Data Analysis},
    author={Maria Hügle and Gabriel Kalweit and Thomas Huegle and Joschka Boedecker},
    year={2020},
    eprint={2008.06294},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

Health

SeizureNet: Early Seizure Detection with an Energy-Efficient Convolutional Neural Network on an Implantable Microcontroller

Introduction

Technical Approach

Hardware

Seizure Detection Performance Evaluation

Results

Demonstration

BibTeX

AdaptiveNet: A Dynamic Deep Neural Network For Multimodal Clinical Data Analysis

Introduction

Technical Approach

Disease Progression Prediction

AdaptiveNet

Dataset

Disease Progression Prediction in Rheumatoid Arthritis

Results

BibTeX

People

Maria Hügle

Gabriel Kalweit

Joschka Bödecker