
Detection of mild cognitive impairment in Parkinson’s disease using gradient boosting decision tree models based on multilevel DTI indices



Cognitive dysfunction is the most common non-motor symptom in Parkinson’s disease (PD), and timely detection of a slight cognitive decline is crucial for early treatment and prevention of dementia. This study aimed to build a machine learning model based on intra- and/or intervoxel metrics extracted from diffusion tensor imaging (DTI) to automatically classify PD patients without dementia into mild cognitive impairment (PD-MCI) and normal cognition (PD-NC) groups.


We enrolled PD patients without dementia (52 PD-NC and 68 PD-MCI subtypes), who were assigned to the training and test datasets in an 8:2 ratio. Four intravoxel metrics, namely fractional anisotropy (FA), mean diffusivity (MD), axial diffusivity (AD), and radial diffusivity (RD), and two novel intervoxel metrics, namely local diffusion homogeneity (LDH) using Spearman’s rank correlation coefficient (LDHs) and Kendall’s coefficient of concordance (LDHk), were extracted from the DTI data. Decision tree, random forest, and eXtreme gradient boosting (XGBoost) models based on individual and combined indices were built for classification, and model performance was assessed and compared via the area under the receiver operating characteristic curve (AUC). Finally, feature importance was evaluated using SHapley Additive exPlanation (SHAP) values.


The XGBoost model based on a combination of the intra- and intervoxel indices achieved the best classification performance, with an accuracy of 91.67%, sensitivity of 92.86%, and AUC of 0.94 in the test dataset. SHAP analysis showed that the LDH of the brainstem and MD of the right cingulum (hippocampus) were important features.


More comprehensive information on white matter changes can be obtained by combining intra- and intervoxel DTI indices, improving classification accuracy. Furthermore, machine learning methods based on DTI indices can be used as alternatives for the automatic identification of PD-MCI at the individual level.


Parkinson’s disease (PD) is the second most common neurodegenerative disease. Cognitive dysfunction, which includes PD with mild cognitive impairment (PD-MCI) and PD with dementia (PDD), is one of the most common nonmotor symptoms of PD [1]. PD-MCI is an intermediate state between PD with normal cognition (PD-NC) and PDD, with a prevalence of approximately 30% [2]; over time, it can either revert to PD-NC or progress to PDD [1, 3]. However, cognitive decline tends to be slow and insidious, and PD-MCI is often overlooked by patients and clinicians. Once the disease progresses to PDD, it seriously affects the quality of life of the patient. Accurate diagnosis of PD-MCI is therefore essential for effective intervention and prevention of PDD.

The current diagnosis of PD-MCI mainly depends on clinical symptoms and neuropsychological tests, but some challenges remain in terms of the homogeneity of the neuropsychological test results, and the testing process is time consuming and labour intensive. Therefore, there is a need for an easier and more accurate method to establish the diagnosis of PD-MCI.

The classical mechanism underlying PD-MCI is the abnormal accumulation of Lewy bodies [4] and β-amyloid (Aβ) [5] in neuronal cell bodies and axons accompanied by damage to glial cells, demyelination of axons, and increased microglial concentrations in the extracellular space. Structural MR studies [6,7,8] have confirmed that the progression of cognitive impairment in PD is closely related to white matter (WM) damage and that the range of WM hyperintensity is a moderate risk factor for cognitive impairment [7]. Moreover, WM microstructural changes occur prior to grey matter volume atrophy [9].

Diffusion tensor imaging (DTI) techniques are currently recognized as the most reliable noninvasive methods for quantifying WM fibre integrity, demonstrating greater sensitivity than conventional MRI in revealing WM microstructural damage [10]. Previous DTI studies calculated a series of intravoxel DTI indices, including fractional anisotropy (FA), mean diffusivity (MD), axial diffusivity (AD), and radial diffusivity (RD), based on a diffusion tensor model and confirmed that the WM microstructure deteriorates across stages of cognitive decline in PD patients [8]. Specifically, decreased FA and increased MD, mainly in the bilateral frontal white matter [9, 11], corpus callosum [12], and temporal regions [13], are related to cognitive dysfunction. Recently, Gong et al. proposed a novel intervoxel metric named local diffusion homogeneity (LDH) [14]. LDH is independent of the diffusion model and can reveal intervoxel diffusion properties by capturing the overall coherence of water molecule diffusion within a neighbourhood; it reflects the microstructural coherence of the underlying WM fibres and provides additional insights beyond traditional intravoxel metrics. Some recent studies have applied LDH parameters to the detection of WM microstructure abnormalities in vascular cognitive impairment [15], epilepsy [16], type 2 diabetes [17], and blepharospasm [18], demonstrating regions of variation that differed from those of intravoxel diffusion metrics. Moreover, LDH can help predict the prognosis of stroke patients [19]. However, LDH alterations in PD or PD-MCI patients have not been fully explored.
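To make the intervoxel idea concrete, the following is a minimal sketch of how a neighbourhood coherence value could be computed from the diffusivity series (one value per gradient direction) of a voxel and its neighbours. It assumes that LDHk is Kendall's coefficient of concordance (W) over all series in the neighbourhood and that LDHs is the mean Spearman correlation between the centre voxel and each neighbour; the neighbourhood definition, the averaging scheme, and the function names `ldh_k` and `ldh_s` are assumptions made for illustration and are not taken from [14] or from PANDA.

```python
def rank(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either series is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    denom = (sxx * syy) ** 0.5
    return sxy / denom if denom > 0 else 0.0


def spearman(x, y):
    """Spearman correlation = Pearson correlation of the ranks."""
    return pearson(rank(x), rank(y))


def ldh_s(centre, neighbours):
    """Assumed LDHs: mean Spearman correlation of centre with each neighbour."""
    return sum(spearman(centre, nb) for nb in neighbours) / len(neighbours)


def ldh_k(series):
    """Assumed LDHk: Kendall's W over m series of n values (no tie correction)."""
    m, n = len(series), len(series[0])
    ranked = [rank(s) for s in series]
    totals = [sum(r[j] for r in ranked) for j in range(n)]
    mean_total = m * (n + 1) / 2
    s = sum((t - mean_total) ** 2 for t in totals)
    return 12 * s / (m ** 2 * (n ** 3 - n))
```

Under these assumptions, a neighbourhood in which every voxel has the same ordering of diffusivities across directions gives a value of 1, whereas voxels with opposing orderings pull the value towards 0 (Kendall's W) or −1 (Spearman).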

In summary, traditional statistical methods for comparing groups have demonstrated significant differences in WM microstructure between PD-MCI patients and PD-NC patients, providing new evidence for understanding the pathophysiological mechanisms underlying cognitive dysfunction in PD. However, these findings have not been translated into suitable biomarkers for identifying PD-MCI at the individual level. Additionally, it is unknown which metric is the most accurate and useful for predicting PD-MCI. In particular, the role of LDH is unclear. Machine learning classification provides a powerful method for predicting an individual’s disease status based on MRI data and has been applied to generate imaging biomarkers for various neurodegenerative diseases, such as Alzheimer’s disease [20] and Parkinson’s disease [21]. Tree model algorithms, such as decision tree (DT), random forest (RF), and eXtreme gradient boosting (XGBoost), are relatively basic and widely used classes of models in machine learning. These tree models can be built with a small amount of data, have moderate computational complexity, and are more interpretable than neural network algorithms. Previous work has confirmed the potential of tree model algorithms for automatic PD identification [21].

This study aimed to develop a machine learning model based on DTI data to automatically classify PD patients without dementia as PD-MCI and PD-NC, thus providing a more convenient method for the early clinical detection of MCI. We hypothesized that tree models employing the means of DTI indices extracted from atlas-based WM segmentation as input features would be helpful for PD-MCI diagnosis, and combining intra- and intervoxel DTI metrics could improve prediction precision. Finally, we assessed the correlations between the regional DTI parameter values of selected features and neuropsychological scores and calculated the importance of the features of the best model using the SHapley Additive exPlanation (SHAP) method to validate and explain the model.

Materials and methods

Participants and ethics

A total of 133 PD patients were recruited from the Department of Neurology of the First Hospital of China Medical University from June 2013 to June 2019. All subjects were right-handed and had no contraindications for MR. The inclusion criteria were as follows: (1) the PD clinical diagnostic criteria of the Movement Disorder Society (MDS) were met; (2) age older than 45 years; and (3) Hoehn and Yahr stage < 5. The exclusion criteria were as follows: (1) Parkinson’s dementia [22]; (2) severe heart, liver, kidney, or endocrine system diseases; (3) severe mental illness; (4) inability to cooperate with the MRI examination and clinical assessment; and (5) unusual structural MR findings. MR scans and clinical symptom assessments were conducted on patients in the “off” state (i.e., discontinued antiparkinsonian medications for at least 12 h). Additionally, 100 sex-, age-, and education year-matched healthy people without neurological or mental diseases were included as the healthy control group.

This study was approved by the Ethics Committee of the First Hospital of China Medical University, and all subjects gave informed consent prior to participation.

Clinical evaluation

Each subject underwent a battery of neuropsychological tests. Motor symptom severity was measured by the MDS revision of the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS) [23] Part III. Disease staging was performed using Hoehn and Yahr (H&Y) staging. The Mini-Mental State Examination (MMSE) and Montreal Cognitive Assessment (MoCA) were used to assess the patients’ global cognitive status, and the Hamilton depression scale (HAMD) was used to assess patients’ level of depression. The levodopa equivalent daily dose (LEDD) was used to summarize the patients’ medication received. In addition, the Auditory Verbal Learning Test (AVLT), Clock Drawing Test (CDT), and Trail Making Test A and B (TMT-A, TMT-B) were used to evaluate patients’ verbal memory function, visuospatial function, and executive function, respectively.

Diagnosis of PD-MCI and PD-NC

PD-MCI was diagnosed according to the MDS Task Force level 1 criteria [2, 24], which entailed MoCA scores < 26 [25] or at least two neuropsychological test scores 2 standard deviations (SD) below the healthy control group mean and reports from the patient or family members of subjective cognitive decline, defined by a score of ≥ 1 on item 1 (cognitive impairment) of the MDS-UPDRS Part I. Participants who did not qualify for the above criteria were defined as PD patients with normal cognition (PD-NC).

DTI data acquisition and preprocessing

A Magnetom Verio 3.0 T MRI scanner (Siemens Medical Solutions, Erlangen, Germany) equipped with a 32-channel head coil was used to obtain MRI scans of all subjects. The scanning parameters were as follows: repetition time (TR)/echo time (TE) = 10,300/95 ms, field of view (FOV) = 256 × 256 mm², matrix = 128 × 128, voxel size = 2.0 × 2.0 × 2.0 mm³, slice thickness = 2 mm, number of directions = 64, b = 1000 s/mm². DTI data preprocessing and atlas-based analysis (ABA) were carried out with FSL 5.0.9 and PANDA (Pipeline for Analysing braiN Diffusion imAges). The preprocessing steps included format conversion, mask generation and cropping, head motion and eddy correction, and spatial registration. Details for data acquisition and preprocessing are presented in Additional file 1.

Feature extraction

In this study, we calculated six different DTI indicators. Four commonly evaluated intravoxel diffusivity metrics, FA (a normalized SD of the eigenvalues), MD (a direction-averaged measure), AD (apparent diffusivity parallel to the underlying tissue tract), and RD (apparent diffusivity perpendicular to the underlying tissue tract), were obtained from the tensor matrix. In addition, we calculated two intervoxel diffusivity metrics, local diffusion homogeneity (LDH) using Spearman’s rank correlation coefficient (LDHs) and LDH using Kendall’s coefficient of concordance (LDHk); the specific calculations were performed according to a previous study [14]. The atlas-based analysis (ABA) method in the PANDA software package was selected for feature extraction. According to the Johns Hopkins University ICBM-DTI-81 White Matter Labels and Johns Hopkins University White Matter Tractography atlases [26], the whole-brain WM was divided into 70 regions of interest, and the mean DTI parameters were extracted for each region. Ultimately, 280 intravoxel [(FA, MD, AD, RD) × 70 regions] and 140 intervoxel [(LDHs, LDHk) × 70 regions] indices were extracted for each subject.
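As an illustration of the four intravoxel indices, the sketch below computes FA, MD, AD, and RD from the three eigenvalues of a diffusion tensor using the standard textbook definitions, and checks the feature counts stated above. The function name `tensor_scalars` and the example eigenvalues are hypothetical; in the study itself these maps were produced by FSL and PANDA rather than by code of this kind.

```python
import math


def tensor_scalars(l1, l2, l3):
    """Scalar indices from tensor eigenvalues, with l1 >= l2 >= l3."""
    md = (l1 + l2 + l3) / 3          # mean diffusivity
    ad = l1                          # axial diffusivity (along the tract)
    rd = (l2 + l3) / 2               # radial diffusivity (across the tract)
    num = (l1 - md) ** 2 + (l2 - md) ** 2 + (l3 - md) ** 2
    den = l1 ** 2 + l2 ** 2 + l3 ** 2
    fa = math.sqrt(1.5 * num / den) if den > 0 else 0.0
    return {"FA": fa, "MD": md, "AD": ad, "RD": rd}


# Hypothetical eigenvalues (in 10^-3 mm^2/s) for a coherent WM voxel.
voxel = tensor_scalars(1.7, 0.4, 0.3)

# Feature counts per subject for 70 atlas regions.
n_regions = 70
n_intravoxel = 4 * n_regions   # FA, MD, AD, RD
n_intervoxel = 2 * n_regions   # LDHs, LDHk
```

FA ranges from 0 for isotropic diffusion (all eigenvalues equal) to 1 when diffusion occurs along a single axis only.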

Feature selection

First, we randomly divided the data into training and test datasets (80%:20%), preserving the ratio of PD-MCI to PD-NC in both subsets. The training dataset was used for feature selection and model construction, and the test dataset was used to evaluate the performance of the model. A feature selection procedure was then performed to remove redundant features and prevent model overfitting. All features were first normalized by L2 normalization. Next, the random forest (RF) feature selection algorithm was applied to rank the importance of each feature, 10-fold cross-validation was performed, and the top 3% most important features were retained. Finally, the Pearson correlation coefficient was used to analyse the correlations among the remaining features. When the absolute value of the correlation coefficient was ≥ 0.7 and the p value was < 0.05, the feature with the lower importance was excluded. This procedure was applied separately to the intravoxel metrics, the intervoxel metrics, and the combined metrics to construct the optimal subset of features for each.
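The split and the redundancy filter described above can be sketched as follows. This is a simplified illustration, not the pipeline used in the study: the importance scores are taken as given (the RF ranking and cross-validation are not reproduced), the p-value condition is omitted, and the names `stratified_split` and `correlation_filter` are hypothetical.

```python
import random


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Split indices so each class keeps roughly the same train/test ratio."""
    rng = random.Random(seed)
    train, test = [], []
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        n_test = round(len(idx) * test_fraction)
        test.extend(idx[:n_test])
        train.extend(idx[n_test:])
    return sorted(train), sorted(test)


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either series is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    denom = (sxx * syy) ** 0.5
    return sxy / denom if denom > 0 else 0.0


def correlation_filter(columns, importance, threshold=0.7):
    """Visit features from most to least important; keep a feature only if
    its |r| with every already-kept feature is below the threshold."""
    kept = []
    for name in sorted(columns, key=lambda k: importance[k], reverse=True):
        if all(abs(pearson(columns[name], columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Cohort sizes from the study: 52 PD-NC (label 0) and 68 PD-MCI (label 1).
labels = [0] * 52 + [1] * 68
train_idx, test_idx = stratified_split(labels)
```

With these cohort sizes and simple rounding, the 20% test subset contains 24 subjects; this figure follows from the sketch and is not stated explicitly in the text.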

Model construction, evaluation and interpretation

We selected decision tree (DT), random forest (RF), and extreme gradient boosting (XGBoost) as the machine learning algorithms to build classifiers to distinguish PD-MCI from PD-NC. The hyperparameters were tuned with the gradient descent method and are shown in Additional file 4: Table S1.

For each model, the receiver operating characteristic (ROC) curve was plotted, and the area under the curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were calculated. To compare model performance, the DeLong test was used to assess the differences between AUCs, and p < 0.05 (two-tailed) was considered statistically significant. Afterwards, the values of each selected feature were compared between the two groups.
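The evaluation metrics listed above all derive from the confusion matrix, and the AUC can be computed as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. The sketch below illustrates both. The confusion-matrix counts are an assumption: they were chosen to be consistent with a 24-subject test set of 14 PD-MCI and 10 PD-NC patients and are not reported in the article; the function names are hypothetical.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Standard metrics with PD-MCI as the positive class."""
    return {
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


def auc(scores_pos, scores_neg):
    """Rank-based AUC: fraction of positive/negative pairs ordered correctly,
    counting ties as half."""
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in scores_pos
        for n in scores_neg
    )
    return wins / (len(scores_pos) * len(scores_neg))


# Assumed counts: 13 of 14 PD-MCI and 9 of 10 PD-NC classified correctly.
m = diagnostic_metrics(tp=13, fn=1, tn=9, fp=1)
percentages = {k: round(100 * v, 2) for k, v in m.items()}
```

Under the assumed counts, the accuracy, sensitivity, and specificity round to 91.67%, 92.86%, and 90.00%, respectively.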

Finally, an additional feature attribution method, SHAP, was used to characterize the optimal model and identify the top contributing DTI index for classification. SHAP analysis, a model-independent method, provides insights into the model by calculating the global influence (positive or negative, feature importance ranking) of each feature on the model prediction. The workflow of this study is presented in Fig. 1.
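The idea behind SHAP values can be illustrated with a brute-force computation of Shapley values for a toy model. This sketch is not the SHAP library and not the procedure applied to the XGBoost model in this study: it enumerates every feature ordering, which is feasible only for a handful of features, and it uses a single baseline vector to represent "absent" features. The toy model and the name `shapley_values` are hypothetical.

```python
from itertools import permutations


def shapley_values(predict, x, baseline):
    """Exact Shapley values of predict at x relative to a baseline input.

    For every ordering of the features, features are switched from their
    baseline value to their actual value one at a time, and the resulting
    change in the prediction is credited to the feature just switched.
    """
    n = len(x)
    phi = [0.0] * n
    orderings = list(permutations(range(n)))
    for order in orderings:
        current = list(baseline)
        previous = predict(current)
        for i in order:
            current[i] = x[i]
            updated = predict(current)
            phi[i] += updated - previous
            previous = updated
    return [p / len(orderings) for p in phi]


def toy_model(v):
    """Hypothetical model: one additive term and one interaction term."""
    return 2 * v[0] + v[1] * v[2]


phi = shapley_values(toy_model, x=[1, 1, 1], baseline=[0, 0, 0])
```

The attributions sum to the difference between the prediction at `x` and the prediction at the baseline, and the interaction term is shared equally between the two features that produce it.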

Fig. 1

Flowchart of the study. First, a total of 420 features were extracted for each subject, and an intravoxel feature group (280 features), an intervoxel feature group (140 features) and their combination, an intra- and intervoxel feature group, were generated. After standardizing the features, the random forest algorithm and Spearman’s correlation were carried out to reduce the dimensionality of the dataset. Finally, decision tree, random forest, and extreme gradient boosting (XGBoost) were used to discriminate between PD-MCI and PD-NC subjects. SHapley Additive exPlanation (SHAP) analysis was performed to interpret the predictive model

Statistical analysis

All statistical analyses were performed using SPSS 22.0 software, and a two-tailed p < 0.05 was considered significant. The Shapiro‒Wilk test (S‒W test) was conducted to assess the normality of the distributions of continuous variables. Depending on the normality of the data, the t test or the Mann‒Whitney U test was conducted to assess the differences between groups. Measurement data with or without a normal distribution are expressed as the mean ± standard deviation (x̄ ± s) or the median and interquartile range [M (P25–P75)], respectively, while enumeration data are expressed as counts (n). Finally, partial correlation analysis was used to evaluate the relationships between the values of the selected features and MoCA scores, MDS-UPDRS-III scores, H&Y stage, and disease duration.
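For readers unfamiliar with partial correlation, the sketch below computes it with the standard recursive formula, removing one control variable at a time. It is an illustration only; the analyses in this study were run in SPSS, and the function names and toy data are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either series is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    denom = math.sqrt(sxx * syy)
    return sxy / denom if denom > 0 else 0.0


def partial_corr(x, y, controls):
    """Correlation of x and y after removing the linear effect of controls."""
    if not controls:
        return pearson(x, y)
    z, rest = controls[0], controls[1:]
    r_xy = partial_corr(x, y, rest)
    r_xz = partial_corr(x, z, rest)
    r_yz = partial_corr(y, z, rest)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Toy data: x and y are related only through the shared variable z.
z = [1, 2, 3, 4]
x = [2, 1, 2, 5]
y = [2, -1, 6, 3]
raw = pearson(x, y)
adjusted = partial_corr(x, y, [z])
```

In the toy data, x and y show a positive raw correlation that disappears once z is controlled for, which is the situation partial correlation is designed to detect.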


Demographic characteristics

Five patients were excluded due to a diagnosis of PDD, three patients were excluded due to structural MR abnormalities, and five patients were excluded due to the inability to cooperate with MR or clinical assessments. Finally, 120 PD patients without dementia (including 52 PD-NC patients and 68 PD-MCI patients) were included in this study. There were no significant differences in sex, age, education level, disease duration, H&Y stage, MDS-UPDRS-III, LEDD, or HAMD between the two groups. The MMSE, MoCA, CDT, AVLT, TMT-A and TMT-B scores of patients in the PD-MCI group were lower than those of patients in the PD-NC group. The demographic characteristics of all participants are detailed in Table 1.

Table 1 Participant demographics and clinical information

Feature selection

For the intravoxel metrics model, 8 features were retained after RF feature selection, and 2 features were excluded after Spearman’s rank correlation analysis. For the intervoxel metrics model, 5 features were retained after RF feature selection, and no features were excluded after Spearman’s rank correlation analysis. For the combined metrics model, the RF feature selection retained the top 12 features in terms of feature importance. After Spearman’s rank correlation analysis, 5 features were excluded. Finally, the 7 most discriminative DTI features were retained (including three LDHs, two LDHk and two MD values). The WM structural connectivity areas with classification significance were mainly located in the brainstem—pontine crossing tract (PCT), medial lemniscus (ML), right cingulum (hippocampus) and left fornix (cres)/stria terminalis. Between-group comparisons showed that patients with PD-MCI exhibited greater MD in the PCT than patients in the PD-NC group. Table 2 lists the details of the feature group for the combined metrics model, and the corresponding brain region locations are shown in Fig. 2.

Table 2 Statistical descriptions and p values for all 7 selected features
Fig. 2

Seven features were selected for discriminating between patients with PD-MCI and PD-NC. The results are displayed on a canonical FMRIB58_FA template. “.R” and “.L” indicate the right and left sides, respectively. Neurologic conventions and MNI coordinates are used. MD mean diffusivity, LDHs local diffusion homogeneity (LDH) using Spearman’s rank correlation coefficient, LDHk LDH using Kendall’s coefficient of concordance, RIC retrolenticular part of the internal capsule, PCT pontine crossing tract, ML medial lemniscus, ST fornix (cres)/stria terminalis, SLF superior longitudinal fasciculus, CH cingulum (hippocampus)

Comparison between the models

The evaluation scores for accuracy, sensitivity, specificity, and area under the curve (AUC) for each model are shown in Table 3 and Fig. 3. The XGBoost model based on the combined intra- and intervoxel DTI indices had the highest classification performance; the test set AUC was 0.94, the accuracy was 91.67%, the sensitivity was 92.86% and the specificity was 90.00%. The AUC difference between combined_XGBoost and intravoxel_XGBoost did not reach statistical significance (P = 0.07, DeLong test), and the AUC of combined_XGBoost was significantly higher than that of all other models except intravoxel_XGBoost (P < 0.01, DeLong test). In addition, the intravoxel_RF model had the highest specificity (90.00%) and moderate sensitivity (64.29%), and the intervoxel_XGBoost model had the highest sensitivity (100.00%) and very low specificity (10.00%).

Table 3 Classification performance of the different models
Fig. 3

ROC curves of each model index in the test datasets. The area under the ROC curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated and are shown in Table 3

Feature importance

The SHAP summary plot of each prediction in the combined XGBoost model is presented in Additional file 2: Fig. S1. According to the SHAP values, the LDHk of the PCT was the most important feature. The MD of the right cingulum (hippocampus) and the LDHs of the right ML were also important for model prediction (Additional file 3: Fig. S2).

Correlation analysis

The correlations of each DTI metric value with MoCA score, MDS-UPDRS-III score, H&Y stage, and disease duration are summarized in Additional file 5: Table S2. After controlling for disease duration and MDS-UPDRS-III score, the LDHk values in the PCT were positively correlated with the MoCA scores (r = 0.246, P = 0.007), the MD values in the right cingulum (hippocampus) were negatively correlated with the MoCA scores (r = −0.206, P < 0.001), and the LDHk values in the left fornix (cres)/stria terminalis were positively correlated with the MoCA scores (r = 0.223, P = 0.015).


This study succeeded in developing a machine learning model based on DTI metrics to accurately discriminate PD-MCI from PD-NC among PD patients without dementia. The main contributions were as follows. First, to our knowledge, this was the first time that LDHs and LDHk (novel intervoxel DTI indices) were used as classification metrics. We confirmed that these metrics can be used as a complement to intravoxel metrics to improve the classification accuracy for PD-NC and PD-MCI patients. Second, the XGBoost model based on the combined intra- and intervoxel metrics achieved a classification accuracy of 91.67% and an AUC of 0.94 in the test dataset and was the best performing model. Finally, by applying the SHAP method to interpret the best model, the LDHk values of the pontine crossing tract (PCT) were found to be important features, as were the MD values of the right cingulum (hippocampus).

Although neuropsychological testing remains the primary method for assessing the presence or absence of cognitive decline in PD patients, neuroimaging studies have observed brain structural and functional changes in patients with PD-MCI by measuring grey matter volume, white matter damage, and resting-state functional activity [27]. For years, machine learning studies have used neuroimaging or electrophysiology data to build classifiers for PD-MCI. One study found that electroencephalogram (EEG) signals achieved 84% classification accuracy in identifying PD-MCI patients [28]. Zhang et al. [29] combined EEG and grey matter structural MRI as input features to identify patients with PD-MCI, achieving a highest accuracy of 77%. Moreover, Lenfeldt et al. found that white matter DTI values are more sensitive neuroimaging indicators than grey matter atrophy [11]. To the best of our knowledge, previous DTI studies have mainly focused on the differences in intravoxel diffusion parameters, finding that cognitive impairment is associated with decreased FA and increased MD, AD, and RD values in multiple WM regions, particularly in the anterior WM tracts [12, 30]. Haller et al. attempted to use machine learning algorithms based on intravoxel DTI indicators to diagnose Parkinson’s disease [31], with accuracies of up to 97%. However, we have not found any research utilizing machine learning models with DTI features to automatically identify PD-MCI patients. Our study confirms that intervoxel metrics that reflect the microstructural consistency of white matter fibre tracts can complement traditional intravoxel metrics to reveal a comprehensive picture of WM alterations. The reason this phenomenon has not been described appears to be that previous works did not consider intervoxel DTI metrics. To our knowledge, this is the first study to automatically identify PD-MCI using machine learning methods based on both intra- and intervoxel DTI features.

Regarding the machine learning algorithms, we found that XGBoost-based models achieved better performance, which is consistent with previous research. Lee et al. confirmed that an XGBoost model based on electroencephalography signals performed well in the diagnosis of PD [21], with a highest accuracy rate of 71.4%. Shibata et al. applied the XGBoost model based on quantitative susceptibility mapping (a type of MR method reflecting iron deposition) features to classify PD-MCI and PD-NC patients, achieving an accuracy of 79.1% [32].

This study used SHAP analysis to interpret the best model, which revealed that the MD values of the right cingulum (hippocampus) and the LDHk values of the PCT were the most important features, together with the LDHs values of the right ML, another region of the brainstem. Statistical analysis revealed that the LDHk value of the PCT in the PD-MCI group was lower than that in the PD-NC group and that, in general, the LDHk value of the PCT was positively correlated with the MoCA score. One of the pathological hallmarks of PD is dopaminergic neuron loss, and postmortem studies have confirmed that human brainstem regions, such as the substantia nigra, red nucleus, medial lemniscus, and pontine nucleus, highly express D2 dopamine receptor mRNAs [33]. Furthermore, the PCT and ML are the main structural connections of the cerebello-thalamo-cortical (CTC) circuits. Various lines of evidence suggest that the CTC circuits play a critical role in the cognitive symptoms of PD. Pathological studies have confirmed the presence of Lewy body aggregates in the cerebellar nuclei and adjacent white matter of PD patients [34, 35]. Neuroimaging studies have confirmed that the CTC loop mediates the involvement of the cerebellum in higher-order cognitive processes, such as planning, verbal fluency, mental flexibility, abstract reasoning, and working memory; its dysfunction contributes to cognitive dysfunction in PD [36, 37]. Therefore, our results further support the notion that the CTC circuitry is affected by disease-specific impairments in PD and contributes to cognitive dysfunction in PD. Moreover, the PCT and ML contain topologically arranged projection fibres, and adjacent voxels may project to very different neocortices. Thus, it is possible that when one voxel is damaged, its neighbours remain normal. Because LDH estimates the overall consistency of water molecule diffusion between a voxel and its neighbours, it may be more sensitive than intravoxel metrics to abnormalities in the PCT and ML.

In addition, the MD value of the right cingulum (hippocampus) was the second most important feature and was significantly negatively correlated with MoCA scores. Several studies have shown susceptibility alterations in the hippocampus in patients with PD-MCI. The hippocampus plays an important role in the interaction between dopamine transmission and hippocampal synaptic remodelling, and an imbalance in this interaction leads to dementia [38]. Neuropathological studies have observed Lewy body pathology (accumulations of the protein alpha-synuclein) in the hippocampus of PD patients, and the degree of cognitive impairment is correlated with the degree of Lewy body deposition in the hippocampus [39]. Increased MD indicates extensive cellular damage, including oedema and necrosis [10]. Multimodal MRI studies have confirmed that injury to the structural integrity and connectivity of the fornix-hippocampal projections is associated with decreased memory test scores in PD patients [40]. DTI studies have confirmed that PD-NC patients showed increased fornix MD values compared with those of HCs [40], and PDD patients showed lower hippocampal FA values than PD patients without dementia [13]. In this study, the LDHk value of the left fornix (cres)/stria terminalis was also decreased in the PD-MCI group. This finding indicates that microstructural damage to the fornix-hippocampal projection plays an important role in the cognitive impairment in PD. The pathological changes in the hippocampus may be relatively dispersed, and the transition from the normal area to the abnormal area is not as sudden as that in the PCT and ML. Therefore, the MD index of the hippocampus has classification importance.

Several limitations of our study should be noted. First, this work is a retrospective study, and prospective studies are needed in the future to validate whether the proposed method can predict the conversion of PD-NC to PD-MCI. Additionally, this study adopted the simpler level I diagnostic criteria for PD-MCI; however, one study confirmed that level II criteria did not add value to the level I criteria [24]. Second, this study did not further subdivide patients according to the type of cognitive impairment, specifically (1) frontal-dominant impairment and (2) posterior-cortical-dominant impairment, which is a future research direction we plan to pursue. Finally, the present study only explored the predictive ability of DTI parameters in white matter regions for PD-MCI, which may be a rather one-sided analysis; multimodal data (including spatial and temporal features) are needed in the future to fully explore the mechanisms of PD-MCI and increase the accuracy of machine learning models, as studied by Bianchetti et al. [41].


In conclusion, a machine learning model trained with DTI metrics extracted from atlas-based WM segmentation shows potential in differentiating individuals with PD-MCI from those with PD-NC among PD patients without dementia. Specifically, the combined application of intra- and intervoxel diffusion measures can provide more comprehensive information about white matter alterations and improve classification accuracy. XGBoost models based on combined DTI indices are particularly promising classifiers with high classification accuracy. After further validation, the model may become a valuable tool in supporting PD-MCI clinical diagnostic systems.

Availability of data and materials

The datasets from the current study are available from the corresponding author upon reasonable request.



Abbreviations

AUC: Area under the curve
DTI: Diffusion tensor imaging
FA: Fractional anisotropy
RF: Random forest
LDHk: Local diffusion homogeneity using Kendall’s coefficient of concordance
LDHs: Local diffusion homogeneity using Spearman’s rank correlation coefficient
MD: Mean diffusivity
MDS-UPDRS-III: Movement Disorder Society Unified Parkinson’s Disease Rating Scale Part III
MoCA: Montreal Cognitive Assessment
PCT: Pontine crossing tract
PD: Parkinson’s disease
PD-MCI: Parkinson’s disease with mild cognitive impairment
PD-NC: Parkinson’s disease with normal cognition
ROC: Receiver operating characteristic
WM: White matter
XGBoost: Extreme gradient boosting


  1. Aarsland D, Batzu L, Halliday GM, Geurtsen GJ, Ballard C, Ray Chaudhuri K, Weintraub D: Parkinson disease-associated cognitive impairment. Nat Rev Dis Primers 2021, 7:47.


  2. Litvan I, Goldman JG, Troster AI, Schmand BA, Weintraub D, Petersen RC, Mollenhauer B, Adler CH, Marder K, Williams-Gray CH, et al: Diagnostic criteria for mild cognitive impairment in Parkinson’s disease: Movement Disorder Society Task Force guidelines. Mov Disord 2012, 27:349–356.


  3. Pedersen KF, Larsen JP, Tysnes OBR, Alves G: Natural course of mild cognitive impairment in Parkinson disease: a 5-year population-based study. Neurology 2017, 88:767–774.


  4. Braak H, Rüb U, Tredici KD: Cognitive decline correlates with neuropathological stage in Parkinson’s disease. Journal of the Neurological Sciences 2006, 248:255–258.


  5. Winer JR, Maass A, Pressman P, Stiver J, Schonhaut DR, Baker SL, Kramer J, Rabinovici GD, Jagust WJ: Associations between tau, beta-amyloid, and Cognition in Parkinson Disease. JAMA Neurol 2018, 75:227–235.


  6. Taylor KI, Sambataro F, Boess F, Bertolino A, Dukart J: Progressive decline in Gray and White Matter Integrity in de novo Parkinson’s Disease: an analysis of longitudinal Parkinson progression markers Initiative Diffusion Tensor Imaging Data. Front Aging Neurosci 2018, 10:318.


  7. Scamarcia PG, Agosta F, Spinelli EG, Basaia S, Stojkovic T, Stankovic I, Sarasso E, Canu E, Markovic V, Petrovic I, et al: Longitudinal white matter damage evolution in Parkinson’s Disease. Mov Disord 2022, 37:315–324.

    Article  PubMed  Google Scholar 

  8. Melzer TR, Watts R, Macaskill MR, Pitcher TL, Livingston L, Keenan RJ, Dalrymple-Alford JC, Anderson TJ: White matter microstructure deteriorates across cognitive stages in Parkinson disease. Neurology 2013, 80:1841–1849.

    Article  CAS  PubMed  Google Scholar 

  9. Duncan GW, Firbank MJ, Yarnall AJ, Khoo TK, Brooks DJ, Barker RA, Burn DJ, O’Brien JT: Gray and white matter imaging: a biomarker for cognitive impairment in early Parkinson’s disease? Mov Disord 2016, 31:103–110.

    Article  PubMed  Google Scholar 

  10. Pierpaoli C, Jezzard P, Basser PJ, Barnett A, Chiro GD: Diffusion tensor MR imaging of the human brain. Radiology 1996, 201:637–648.

    Article  CAS  PubMed  Google Scholar 

  11. Agosta F, Canu E, Stefanova E, Sarro L, Tomic A, Spica V, Comi G, Kostic VS, Filippi M: Mild cognitive impairment in Parkinson’s disease is associated with a distributed pattern of brain white matter damage. Hum Brain Mapp 2014, 35:1921–1929.

    Article  PubMed  Google Scholar 

  12. Bledsoe IO, Stebbins GT, Merkitch D, Goldman JG: White matter abnormalities in the corpus callosum with cognitive impairment in Parkinson disease. Neurology 2018, 91:e2244-e2255.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Chen B, Guo GF, Hu L, Wang S: Changes in anatomical and functional connectivity of Parkinson’s disease patients according to cognitive status. European Journal of Radiology 2015, 84:1318–1324.

    Article  PubMed  Google Scholar 

  14. Gong G: Local diffusion homogeneity (LDH): an inter-voxel diffusion MRI metric for assessing inter-subject white matter variability. PLoS One 2013, 8:e66366.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Chen HJ, Gao YQ, Che CH, Lin H, Ruan XL: Diffusion Tensor Imaging with Tract-Based spatial Statistics reveals White Matter Abnormalities in patients with vascular cognitive impairment. Front Neuroanat 2018, 12:53.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Liu HH, Wang J, Chen XM, Li JP, Ye W, Zheng J: Reduced local diffusion homogeneity as a biomarker for temporal lobe epilepsy. Medicine 2016, 95:e4032.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Liang Y, Zhang H, Tan X, Liu J, Qin C, Zeng H, Zheng Y, Liu Y, Chen J, Leng X, et al: Local Diffusion Homogeneity provides supplementary information in T2DM-Related WM Microstructural Abnormality Detection. Front Neurosci 2019, 13:63.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Liu G, Gao Y, Liu Y, Guo Y, Yan Z, Ou Z, Zhong L, Xie C, Zeng J, Zhang W, et al: Machine Learning for Predicting Individual Severity of Blepharospasm using Diffusion Tensor Imaging. Front Neurosci 2021, 15:670475.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Liu G, Tan S, Dang C, Peng K, Xie C, Xing S, Zeng J: Motor Recovery Prediction with Clinical Assessment and local Diffusion Homogeneity after Acute Subcortical Infarction. Stroke 2017, 48:2121–2128.

    Article  PubMed  Google Scholar 

  20. Ruiz-Gómez S, Gómez C, Poza J, Gutiérrez-Tobal G. 2018. Automated Multiclass Classification of Spontaneous  Activity in Alzheimer’s Disease and Mild Cognitive Impairment. Entropy20: 35.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Lee SB, Kim YJ, Hwang S, Son H, Sang KL, Park KI, Kim YG: Predicting Parkinson’s disease using gradient boosting decision tree models with electroencephalography signals. Parkinsonism & Related Disorders 2022, 95:77–85.

    Article  PubMed  Google Scholar 

  22. Dubois B, Burn D, Goetz C, Aarsland D, Brown RG, Broe GA, Dickson D, Duyckaerts C, Cummings J, Gauthier S, et al: Diagnostic procedures for Parkinson’s disease dementia: recommendations from the movement disorder society task force. Mov Disord 2007, 22:2314–2324.

    Article  PubMed  Google Scholar 

  23. Goetz CG, Tilley BC, Shaftman SR, Stebbins GT, Zweig RM: Movement Disorder Society-Sponsored revision of the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS): Scale Presentation and Clinimetric Testing results. Movement Disorders 2008, 23:2129–2170.

    Article  PubMed  Google Scholar 

  24. Hoogland J, Boel JA, de Bie RMA, Schmand BA, Geskus RB, Dalrymple-Alford JC, Marras C, Adler CH, Weintraub D, Junque C, et al: Risk of Parkinson’s disease dementia related to level I MDS PD-MCI. Mov Disord 2019, 34:430–435.

    Article  PubMed  Google Scholar 

  25. Dalrymple-Alford JC, Macaskill MR, Nakas CT, Livingston L, Graham C, Crucian GP, Melzer TR, Kirwan J, Keenan R, Wells S. 2010 The MoCA: well-suited screen for cognitive impairment in Parkinson disease. Neurology. doi: 10.1212/WNL.0b013e3181fc29c9

    Article  PubMed  Google Scholar 

  26. Hua K, Zhang J, Wakana S, Jiang H, Li X, Reich DS, Calabresi PA, Pekar JJ, Zijl P, Mori S: Tract probability maps in stereotaxic spaces: analyses of white matter anatomy and tract-specific quantification. Neuroimage 2008, 39:336–347.

    Article  PubMed  Google Scholar 

  27. Chen B, Wang S, Sun W, Shang X, Liu H, Liu G, Gao J, Fan G: Functional and structural changes in gray matter of parkinson’s disease patients with mild cognitive impairment. Eur J Radiol 2017, 93:16–23.

    Article  PubMed  Google Scholar 

  28. Betrouni N, Delval A, Chaton L, Defebvre L, Duits A, Moonen A, Leentjens AFG, Dujardin K: Electroencephalography-based machine learning for cognitive profiling in Parkinson’s disease: preliminary results. Mov Disord 2019, 34:210–217.

    Article  PubMed  Google Scholar 

  29. Zhang J, Gao Y, He X, Feng S, Hu J, Zhang Q, Zhao J, Huang Z, Wang L, Ma G, et al: Identifying Parkinson’s disease with mild cognitive impairment by using combined MR imaging and electroencephalogram. Eur Radiol 2021, 31:7386–7394.

    Article  PubMed  Google Scholar 

  30. Chougar L, Faouzi J, Pyatigorskaya N, Yahia-Cherif L, Gaurav R, Biondetti E, Villotte M, Valabregue R, Corvol JC, Brice A, et al: Automated categorization of parkinsonian Syndromes using magnetic resonance imaging in a clinical setting. Mov Disord 2021, 36:460–470.

    Article  CAS  PubMed  Google Scholar 

  31. Haller S, Badoud S, Nguyen D, Garibotto V, Lovblad KO, Burkhard PR: Individual detection of patients with Parkinson disease using support vector machine analysis of diffusion tensor imaging data: initial results. AJNR Am J Neuroradiol 2012, 33:2123–2128.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Shibata H, Uchida Y, Inui S, Kan H, Sakurai K, Oishi N, Ueki Y, Oishi K, Matsukawa N: Machine learning trained with quantitative susceptibility mapping to detect mild cognitive impairment in Parkinson’s disease. Parkinsonism Relat Disord 2022, 94:104–110.

    Article  CAS  PubMed  Google Scholar 

  33. Hurd YL, Suzuki M, Sedvall GC: D1 and D2 dopamine receptor mRNA expression in whole hemisphere sections of the human brain. Journal of Chemical Neuroanatomy 2001, 22:127–137.

    Article  CAS  PubMed  Google Scholar 

  34. Seidel K, Bouzrou M, Heidemann N, Krüger R, Schols L, Dunnen WD, Korf HW, Rüb U. 2017 Involvement of the cerebellum in Parkinson disease and dementia with Lewy bodies. Ann Neurol. 81: 898.

    Article  CAS  PubMed  Google Scholar 

  35. Zhong Y, Liu H, Liu G, Zhao L, Dai C, Liang Y, Du J, Zhou X, Mo L, Tan C, et al: A review on pathology, mechanism, and therapy for cerebellum and tremor in Parkinson’s disease. NPJ Parkinsons Dis 2022, 8:82.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Maiti B, Koller JM, Snyder AZ, Tanenbaum AB, Perlmutter JS. 2019 Cognitive correlates of cerebellar resting-state functional connectivity in Parkinson disease. Neurology.

    Article  PubMed  Google Scholar 

  37. Riou A, Houvenaghel JF, Dondaine T, Drapier S, Sauleau P, Drapier D, Duprez J, Guillery M, Jeune FL, Verin M. 2021 Functional role of the Cerebellum in Parkinson Disease: a PET study. Neurology.

    Article  PubMed  Google Scholar 

  38. Valero J, Bernardino L, Cardoso F, Silva AP, Fontesribeiro C, Ambrósio AF, Malva JO: Impact of neuroinflammation on hippocampal neurogenesis: relevance toAging and Alzheimer’s Disease. Journal of Alzheimers Disease Jad 2017, 60:1–8.

    Article  Google Scholar 

  39. Hall H, Reyes S, Landeck N, Bye C, Kirik D: Hippocampal Lewy pathology and cholinergic dysfunction are associated with dementia in Parkinson’s disease. Brain 2014, 137:2493–2508.

    Article  PubMed  Google Scholar 

  40. Gargouri F, Gallea C, Mongin M, Pyatigorskaya N, Valabregue R, Ewenczyk C, Sarazin M, Yahia-Cherif L, Vidailhet M, Lehericy S: Multimodal magnetic resonance imaging investigation of basal forebrain damage and cognitive deficits in Parkinson’s disease. Mov Disord 2019, 34:516–525.

    Article  PubMed  Google Scholar 

  41. Bianchetti G, Taralli S, Vaccaro M, Indovina L, Mattoli MV, Capotosti A, Scolozzi V, Calcagni ML, Giordano A, De Spirito M, Maulucci G: Automated detection and classification of tumor histotypes on dynamic PET imaging data through machine-learning driven voxel classification. Comput Biol Med 2022, 145:105423.

    Article  CAS  PubMed  Google Scholar 

Download references


We thank all patients of the study for their patience and cooperation. This manuscript was edited for proper English language, grammar, punctuation, spelling, and overall style by one or more of the highly qualified native English-speaking editors at AJE.


This study was funded by the National Natural Science Foundation of China (No. 82071909, Guo Guang Fan).

Author information

Authors and Affiliations



Dr. GF: conceptualization and supervision of the study. BC: validation, investigation and writing. MX, JH: methodology and visualization. HMY, YL: neuropsychological assessment. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Guo Guang Fan.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Ethics Committee of the First Hospital of China Medical University (No. AF-SOP-07-1.1-01).

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

MRI data acquisition and preprocessing.

Additional file 2:

 Figure S1. SHapley Additive exPlanation. SHAP summary plot showing the values of features in every sample. Each line represents a feature, and the abscissa represents the SHAP value. Each dot represents a sample. Feature Importance: The mean absolute SHAP value of each feature.

Additional file 3:

Figure S2. Overview of correlations of the regional mean DTI values with MoCA scores. “.R” and “.L” indicate the right and left sides, respectively. Abbreviations: PD-MCI = Parkinson's disease with mild cognitive impairment; PD-NC = Parkinson's disease with normal cognition; MoCA = Montreal Cognitive Assessment; CH = cingulum; ST = fornix/stria terminalis; MD = mean diffusivity; LDHk = local diffusion homogeneity using Kendall's coefficient of concordance.

Additional file 4:

 Table S1. Hyperparameters of different models for classifying PD-MCI vs. PD-NC.

Additional file 5:

 Table S2. All correlations between the DTI values and clinical scores among all participants.
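The feature importance described for Figure S1 (the mean absolute SHAP value of each feature) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the feature labels, and the toy SHAP values below are all hypothetical and chosen only to show the arithmetic. It assumes the SHAP values are already available as a samples-by-features table.

```python
from typing import Dict, List, Sequence, Tuple


def mean_abs_shap(shap_values: Sequence[Sequence[float]],
                  feature_names: Sequence[str]) -> Dict[str, float]:
    """Average the absolute SHAP value of each feature over all samples.

    shap_values[i][j] is the SHAP value of feature j for sample i.
    """
    n_samples = len(shap_values)
    if n_samples == 0:
        raise ValueError("at least one sample is required")
    importance = {}
    for j, name in enumerate(feature_names):
        importance[name] = sum(abs(row[j]) for row in shap_values) / n_samples
    return importance


def rank_features(importance: Dict[str, float]) -> List[Tuple[str, float]]:
    """Sort features from most to least important."""
    return sorted(importance.items(), key=lambda kv: kv[1], reverse=True)


# Toy SHAP values for four samples and two features.
# Illustrative numbers only; they are NOT results from the study.
toy_shap = [
    [0.5, -0.25],
    [-0.5, 0.25],
    [1.0, 0.0],
    [-1.0, -0.5],
]
# Hypothetical feature labels, made up for this example.
toy_names = ["LDHk_brainstem", "MD_cingulum_hippocampus_R"]

ranking = rank_features(mean_abs_shap(toy_shap, toy_names))
for name, value in ranking:
    print(f"{name}: {value:.2f}")
```

Because the absolute value is taken before averaging, a feature that pushes predictions strongly in both directions still ranks as important, which is why the summary plot and the importance ranking are usually read together.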

Rights and permissions

Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated in a credit line to the data.


About this article


Cite this article

Chen, B., Xu, M., Yu, H. et al. Detection of mild cognitive impairment in Parkinson’s disease using gradient boosting decision tree models based on multilevel DTI indices. J Transl Med 21, 310 (2023).
