Hostname: page-component-669899f699-8p65j Total loading time: 0 Render date: 2025-04-26T03:28:08.345Z Has data issue: false hasContentIssue false

Spatiotemporal filtering modeling of hand, foot, and mouth disease: a case study from East China, 2009–2015

Published online by Cambridge University Press:  16 April 2025

Xi Chen
Affiliation:
Department of Epidemiology and Health Statistics, School of Public Health, Fudan University, Shanghai, China; Key Laboratory of Public Health Safety, Ministry of Education, Shanghai, China
Jianbo Ba
Affiliation:
Naval Medical Center, Naval Medical University, No.880 Xiangyin Road, Yangpu District, Shanghai, China
Yuanhua Liu
Affiliation:
Department of Epidemiology and Health Statistics, School of Public Health, Fudan University, Shanghai, China; Key Laboratory of Public Health Safety, Ministry of Education, Shanghai, China
Jiaqi Huang
Affiliation:
Department of Epidemiology and Health Statistics, School of Public Health, Fudan University, Shanghai, China; Key Laboratory of Public Health Safety, Ministry of Education, Shanghai, China
Ke Li
Affiliation:
Department of Epidemiology and Health Statistics, School of Public Health, Fudan University, Shanghai, China; Key Laboratory of Public Health Safety, Ministry of Education, Shanghai, China
Yun Yin
Affiliation:
Department of Epidemiology and Health Statistics, School of Public Health, Fudan University, Shanghai, China; Key Laboratory of Public Health Safety, Ministry of Education, Shanghai, China
Jin Shi
Affiliation:
Department of Epidemiology and Health Statistics, School of Public Health, Fudan University, Shanghai, China; Key Laboratory of Public Health Safety, Ministry of Education, Shanghai, China
Jiayao Xu
Affiliation:
Department of Epidemiology and Health Statistics, School of Public Health, Fudan University, Shanghai, China; Key Laboratory of Public Health Safety, Ministry of Education, Shanghai, China
Rui Yuan
Affiliation:
Department of Epidemiology and Health Statistics, School of Public Health, Fudan University, Shanghai, China; Key Laboratory of Public Health Safety, Ministry of Education, Shanghai, China
Michael P. Ward
Affiliation:
Sydney School of Veterinary Science, The University of Sydney, Camden, NSW, Australia
Wei Tu
Affiliation:
Department of Geology and Geography, Georgia Southern University, Statesboro, GA 30460, USA
Lili Yu
Affiliation:
Peace Center for Biostatistics, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, GA 30460, USA
Quanyi Wang
Affiliation:
Beijing Center for Disease Prevention and Control
Xiaoli Wang
Affiliation:
Beijing Center for Disease Prevention and Control
Zhaorui Chang*
Affiliation:
Division of Infectious Disease, Key Laboratory of Surveillance and Early-Warning on Infectious Disease, Chinese Center for Disease Control and Prevention, 155 Changbai Rd, Changping District, Beijing 102206, China
Zhijie Zhang*
Affiliation:
Department of Epidemiology and Health Statistics, School of Public Health, Fudan University, Shanghai, China; Key Laboratory of Public Health Safety, Ministry of Education, Shanghai, China
*
Corresponding authors: Zhijie Zhang and Zhaorui Chang; Emails: [email protected]; [email protected]
Corresponding authors: Zhijie Zhang and Zhaorui Chang; Emails: [email protected]; [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Hand, foot, and mouth disease (HFMD) shows spatiotemporal heterogeneity in China. A spatiotemporal filtering model was constructed and applied to HFMD data to explore the underlying spatiotemporal structure of the disease and determine the impact of different spatiotemporal weight matrices on the results. HFMD cases and covariate data in East China were collected between 2009 and 2015. The different spatiotemporal weight matrices formed by Rook, K-nearest neighbour (KNN; K = 1), distance, and second-order spatial weight matrices (SO-SWM) with first-order temporal weight matrices in contemporaneous and lagged forms were decomposed, and spatiotemporal filtering model was constructed by selecting eigenvectors according to MC and the AIC. We used MI, standard deviation of the regression coefficients, and five indices (AIC, BIC, DIC, R2, and MSE) to compare the spatiotemporal filtering model with a Bayesian spatiotemporal model. The eigenvectors effectively removed spatial correlation in the model residuals (Moran’s I < 0.2, p > 0.05). The Bayesian spatiotemporal model’s Rook weight matrix outperformed others. The spatiotemporal filtering model with SO-SWM was superior, as shown by lower AIC (92,029.60), BIC (92,681.20), and MSE (418,022.7) values, and higher R2 (0.56) value. All spatiotemporal contemporaneous structures outperformed the lagged structures. Additionally, eigenvector maps from the Rook and SO-SWM closely resembled incidence patterns of HFMD.

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

Introduction

Hand, foot, and mouth disease (HFMD) is an infectious disease caused by more than 20 types of enteroviruses [Reference Liu1, Reference Zhang, Xu and Xiao2], among which coxsackievirus A16 (CV-A16) and enterovirus 71 (EV-71) are the most common. This disease primarily occurs in children under 5 years of age [Reference Liu1, Reference Xu, Zhang and Xiao3]. A total of 11,814,830 cases of HFMD were reported in China between 2008 and 2014, with the average growth rate was 125.55/100,000 [Reference Li4]. Both non-pharmaceutical interventions (such as regular disinfection) and vaccination strategies are used to control the occurrence and spread of HFMD; however, the disease remains a major public health burden in China [Reference Liu5].

Previous studies have indicated spatiotemporal aggregation and heterogeneity (spatially local heterogeneity and spatially stratified heterogeneity) of HFMD [Reference Li4, Reference Zhang6Reference Wang8], highlighting the importance of understanding its distribution and identifying high-risk areas and risk factors, which are prerequisites for undertaking targeted prevention and control measures to reduce the impact of HFMD [Reference Li4].

Various spatiotemporal models, including the spatiotemporal autoregressive model [Reference Truong9, Reference Phung10], spatiotemporal model with variable coefficients [Reference Hong11, Reference Li12], and Bayesian spatiotemporal model [Reference Zhang, Xu and Xiao2, Reference Hong13, Reference Xu14], have been utilized to evaluate the spatiotemporal distribution of HFMD and its associations with factors like meteorological patterns, socioeconomic status, and health conditions [Reference Wang15Reference Tian17]. For instances, Ding et al. used an econometric panel model to explore the relationship between HFMD and meteorological factors [Reference Ding18]. Hong et al. constructed a monthly time-lag geographically weighted regression model to investigate spatiotemporal variations due to the effect of climatic factors on HFMD occurrence in the Inner Mongolia Autonomous Region, China [Reference Hong11]. Zhang et al. used a Bayesian spatiotemporal hierarchy model to identify the spatiotemporal heterogeneity (Supplementary Information S1) of HFMD [Reference Zhang, Xu and Xiao2]. However, these models suffer from mathematical complexities, intensive computation requirements, and limitations in dealing with spatial autocorrelation (e.g. the choice of spatial lag form or spatial error form in traditional autoregressive models) [Reference Griffith, Chun, Li, Griffith, Chun and Li19]. Additionally, they may fail to reveal the underlying spatiotemporal structure (Supplementary Information S1) and require the specification of the model’s parameter distribution [Reference Griffith20]. Moreover, these models used only a Rook [Reference Wang21] or Queen [Reference Liao22] weight matrix in spatial autocorrelation analysis [Reference Chen23Reference Zhu25], without considering other possible configurations of weight matrix [Reference li and Gui26].

Griffith [Reference Griffith27] proposed an eigenvector spatiotemporal filtering method to overcome the flaws in the existing models. This method first breaks down geographical and temporal relationships into a set of eigenvectors [Reference Griffith and Li28, Reference Park29], a subset of the eigenvectors was then incorporated into a conventional nonspatial model as covariates. The biggest advantage of this new method is that the linear combination of the selected eigenvectors can filter the spatial autocorrelation out of the observations so that the observations in the model can be treated as if they are independent [Reference Griffith20]. The spatiotemporal filtering method can be applied to standard statistical models to construct spatiotemporal filtering models, making the modelling process simpler than that of other spatiotemporal models. In addition, the spatiotemporal filtering model allows visualization of the spatial autocorrelation of geo-referenced datasets which describes how map patterns evolve in space and time. Most importantly, compared to other models, the spatial and temporal weights can be flexibly set in a spatiotemporal filtering model to determine the optimal weight matrix better [Reference Tan30, Reference Lin and Ma31].

In this study, we developed spatiotemporal filtering models based on HFMD case data and compared their performance against Bayesian spatiotemporal models. We explored the influence of different weight matrices on model results and visualized the eigenvector’s maps to identify potential high-risk areas for HFMD.

Methods

Data source

Data collection was conducted in eastern China between 2009 and 2015, focusing on HFMD cases and relevant covariates at 78 prefectural levels. HFMD case data were obtained from the national infectious disease reporting system (NIDRS) of the China CDC. To assess potential risk factors, meteorological data were retrieved from China’s National Meteorological Information Center (https://data.cma.cn). The meteorological variables included daily average temperature (°C), average humidity (%), sunlight duration (h), and average wind speed (m/s) for prefecture-level cities. Additionally, socioeconomic variables, such as GDP per capita and the number of hospitals, were collected from local official demographic yearbooks. For a comprehensive understanding of the data collection process, a more detailed description can be found in our previous study [Reference Hong13].

To ensure data quality and avoid multicollinearity issues, all data were summarized on a monthly basis. Covariates that displayed a correlation greater than 0.6 were scrutinized, and the one with the weaker association with HFMD case occurrence was removed from the dataset [Reference Stone32]. Detailed data processing is provided in the Appendix (Supplementary Information S2).

Statistical analysis

We compared three spatiotemporal models of HFMD: the generalized linear mixed-effects model (GLMM), the spatiotemporal filtering model, and the Bayesian spatiotemporal model. We first examined the spatial correlation in the model residuals using Moran’s I (MI) to determine whether the eigenvectors could effectively remove the spatial correlation from the model residuals and then compared the model fit and accuracy between the spatiotemporal filtering model and the Bayesian spatiotemporal model.

Generalized linear mixed-effects model (GLMM)

The number of HFMD cases at time point t in each prefecture-level city i is denoted as $ {\gamma}_{it} $ (i = 1, 2, …, 78). The GLMM is formulated with a logarithmic link function, and the expression is as follows:

(1) $$ \mathit{\log}\left({\gamma}_{it}\right)=\mathit{\log}\left({n}_{it}\right)+{\beta}_0+{\beta}_t{X}_t+\sum_k^m{\beta}_k{X}_k+{RE}_{it} $$

where $ {n}_{it} $ represents the population of the i-th region; $ {\beta}_0 $ denotes the intercept term; $ {\beta}_{\mathrm{t}} $ denotes the regression coefficient of the time trend; $ {X}_t $ denotes the time trend (e.g. $ {X}_t $ = 1, 2, …, 84); $ {\beta}_k $ is the regression coefficient of the covariate; $ {X}_k $ is the k-th covariate; and $ {RE}_{it} $ is random effect term. The variance and mean of the HFMD incidence data were unequal, which indicates that the incidence data were excessively dispersed. Therefore, a negative binomial distribution was chosen as the final link function for our GLMM based on the suggestion of the previous studies [Reference Sun33, Reference Chen and Huang34].

Spatiotemporal filtering model

The spatiotemporal filtering model is based on a GLMM. The $ {RE}_{it} $ terms are divided into two components: spatial structural effect (SSRE) and spatial unstructured effect (SURE). The linear combinations of filtered eigenvectors form the SSRE terms (the screening process can be found below), while the SURE represents the model residuals.

(2) $$ \mathit{\log}\left({\gamma}_{it}\right)=\mathit{\log}\left({n}_{it}\right)+{\beta}_0+{\beta}_t{X}_t+\sum_{k=1}^K{\beta}_k{X}_k+\sum_{m=1}^M{\gamma}_{it m}{E}_{it m}+ SURE $$

where $ {E}_{itm} $ is the m-th eigenvector and $ {\gamma}_{itm} $ is the regression coefficient of the eigenvector.

The decomposition of a transformed connectivity matrix constitutes the core element of the spatiotemporal filtering method. More formally, decomposition yields

(3) $$ \boldsymbol{MVM}=\boldsymbol{E}\boldsymbol{\varLambda } \boldsymbol{E}', $$

where $ \boldsymbol{M} $ is a symmetric and idempotent projection matrix. If only the intercept is included in the construction of the projection matrix, the equation simplifies to $ \boldsymbol{M}=\frac{\boldsymbol{I}-\mathbf{11}'}{\boldsymbol{nT}} $ , where $ \boldsymbol{I} $ is the identity matrix and $ \mathbf{1} $ is an n × 1 one-dimensional vector. $ \boldsymbol{V} $ is the exogenously specified connectivity matrix (Supplementary Information S3) represented by the spatiotemporal weight matrix $ W $ , which will be introduced below. The columns of matrix $ \boldsymbol{E} $ are n mutually independent eigenvectors obtained from the $ \boldsymbol{MVM} $ , and each eigenvector in $ \boldsymbol{E} $ represents a distinct map pattern permitted by the units’ spatial arrangement and is associated with a certain level of spatial autocorrelation. The eigenvectors in $ \boldsymbol{E} $ are independent and orthogonal (the advantage of orthogonality is the absence of multicollinearity) [Reference Griffith35], which is screened first for significance before it is included in the model. $ \boldsymbol{\Lambda} $ is the corresponding eigenvalue on the main diagonal of the matrix, and $ \boldsymbol{E}' $ is the transposition matrix of $ \boldsymbol{E} $ .

Construction of the spatiotemporal weight matrix W

Spatial weight matrices, the temporal weight matrix, and the spatiotemporal weight matrix for the spatiotemporal filtering model were constructed as follows.

First, we built four types of spatial weight matrices [Reference Chen and Huang34]: the Rook weight matrix, KNN (K = 1) weight matrix, the distance weight matrix (the list of distance vectors corresponding to the neighbouring objects was first calculated, and 1.5 times the maximum distance of neighbouring objects was specified as the distance threshold), and the second-order weight matrix. All the matrices were row normalized before modelling.

Next, we constructed a first-order temporal weight matrix to quantify the relationship between disease incidences over time. The weight between the two adjacent time periods was set to be 1 if there was a temporal dependence between the values of the two adjacent periods or if the value of a period depended exclusively on the value of the preceding period; otherwise, the weight was set to be 0. Finally, a spatiotemporal weight matrix was constructed and two sets of temporal dependence structures were specified [Reference Chen36].

  1. 1) Spatiotemporal lag structure:

The value of a position at a point in time is determined by its value at the previous point in time and the value of adjacent positions at the previous point in time (Figure 1a). The spatiotemporal time-lag dependence structure is expressed as a matrix by

(4) $$ {W}_1={I}_T\hskip0.5em \otimes \hskip0.5em {W}_N+{W}_T\hskip0.5em \otimes \hskip0.5em {I}_N $$

Figure 1. Spatiotemporal lag structure (a) and spatiotemporal contemporaneous structure (b).

where $ {I}_T $ is the T*T (84*84) order unit matrix; $ \otimes $ is the Kronecker; $ {W}_N $ is the N*N (78*78) order spatial weight matrix; $ {W}_T $ is the T*T (84*84) order temporal weight matrix; and $ {I}_N $ is the N*N (78*78) order unit matrix.

  1. 2) Spatiotemporal contemporaneous structure:

The value of a position at a given time point is determined by the value of all its adjacent positions and the value of that position at the previous time point (Figure 1b). In this instance, the contemporaneous spatiotemporal dependence structure is expressed as a matrix by

(5) $$ {W}_2={W}_T\otimes {W}_N+{W}_T\otimes {I}_N $$

Thus, our formula (3) $ \boldsymbol{MVM} $ becomes either.

Lag:

(6) $$ \left[\frac{\boldsymbol{I}-\mathbf{11}'}{\boldsymbol{nT}}\right]\left[{W}_1\right]\left[\frac{\boldsymbol{I}-\mathbf{11}'}{\boldsymbol{nT}}\right] $$

or

Contemporaneous:

(7) $$ \left[\frac{\boldsymbol{I}-\mathbf{11}'}{\boldsymbol{nT}}\right]\left[{W}_2\right]\left[\frac{\boldsymbol{I}-\mathbf{11}'}{\boldsymbol{nT}}\right] $$
Screening of the eigenvectors

For the spatiotemporal filtering method, the identification and selection of the relevant eigenvectors ( $ \boldsymbol{E} $ ) are decisive step before building the model [Reference Griffith37]. The candidate set i∈{1,2,…,n} is determined by calculating MCi/MCmax for all eigenvectors [Reference Tiefelsdorf and Griffith38], where MC is Moran coefficient ( $ {MC}_i=\frac{n}{1V1'}{\lambda}_{\mathrm{n}} $ ).

$ \boldsymbol{E} $ with eigenvector, MCi/MC max ≥ 0.25, enters the candidate set preliminarily. After that, a useful criterion for selecting eigenvectors from the candidate set is to combine stepwise regression and use a significance level criterion of 0.10 (the Akaike’s information criterion [AIC] = 0.1) [Reference Griffith39, Reference Griffith, Chun, Li, Griffith, Chun and Li40].

Bayesian spatiotemporal model

(8) $$ {\gamma}_{it}\sim \mathrm{n} Binomial\left({\mathrm{E}}_{it}{\lambda}_{it}\right) $$

where $ {E}_{it} $ is the expected value for area i and time t. $ {\lambda}_{it} $ is the target estimated variable and is explained as the standard morbidity ratio (SMR) [Reference Griffith39].

(9) $$ {\eta}_{it}=\mathit{\log}\left({\lambda}_{it}\right)={\beta}_0+\sum_k^m{\beta}_k{\chi}_k+{\delta}_i+{\upsilon}_i+{\varphi}_t+{\tau}_t+{\varPsi}_t $$
(10) $$ {\delta}_i\mid {\delta}_{-i}\sim Normal\left(\frac{1}{N_i}\sum_{j=1}^n{\alpha}_{ij}{\delta}_j,{s}_i^2\right) $$

where $ {\eta}_{it} $ is the structured additive linear predictor; $ {\lambda}_{it} $ is estimated SMR of HFMD in space i and time t; $ {\beta}_0 $ is the intercept term of the model; $ {\beta}_k $ is the regression coefficient of the covariates; $ {\delta}_i $ is the structured spatial influence term; $ {\upsilon}_i $ is the unstructured spatial influence term (i.e. the $ {\mathrm{RE}}_{\mathrm{it}} $ ); $ {\varphi}_t $ is the temporally structured effect; $ {\tau}_t $ is a temporally non-structured effect; $ {\varPsi}_t $ is the spatiotemporal interaction structure; and $ {s}_i^2={\sigma}_{\delta}^2/{N}_i $ is the variance of region i influenced by the spatial effects of the $ N $ neighbouring regions, which depends on the number of its neighbouring regions $ {N}_i $ . $ {\alpha}_{ij} $ is 1 if region i and region j are adjacent, else it is 0. Here, we set four different weight matrices (Rook, KNN, distance, and second-order spatial weight matrices) to explore their effects on the model results.

The uninformative prior distribution N (0, 106) are used for $ {\beta}_0 $ , $ {\beta}_k $ , $ {\tau}_t $ , and $ {\upsilon}_i $ .The prior distribution of the spatial structural effect $ {\delta}_i $ is modelled by conditional autoregression. Prior information for time effects ( $ {\varphi}_t $ ) was modelled through a first-order autoregressive process.

Model evaluation

First, the spatial correlation in the residuals of the three models was examined using MI, where MI is used to determine whether spatial entities within a certain range are correlated with each other, with values distributed in [−1, 1]. Here, [0, 1] indicates a positive correlation between geographic entities, [−1,0] indicates a negative correlation, and 0 indicates no correlation [Reference Su41]. Subsequently, AIC, Bayesian information criterion (BIC), deviation information criterion (DIC), mean square error (MSE), and goodness of fit (R2) were used to compare the differences between the models with different weight settings. Larger R2 values and smaller AIC, BIC, DIC, and MSE values indicate better model performanceReference Musio, Sauleau and Buemi[42].

R software (version 4.1.3; R Foundation for Statistical Computing, Vienna, Austria) was used for the analysis, and the INLA, dplyr, spdep, and lme4 packages were utilized for data processing, matrix creation, and model development. QGIS (version 3.1.4; QGIS Development Team) was used to create maps.

Results

Spatial and temporal characteristics of HFMD in eastern China

Between 2009 and 2015, a total of 4,058,702 HFMD cases were reported. The incidence rates varied across space, but the disease was concentrated mainly in the south-eastern region of the study area. The incidence rate was also increasing over time but the rate was lower in 2015 than 2014 (Figure 2). The time-series plot of the HFMD incidence rate further revealed distinct seasonal patterns, indicating major incidence peaks occurring from May to July and minor peaks from September to November each year (Supplementary Information S4). Meanwhile, the incidence of HFMD in the East China showed spatially local heterogeneity (Supplementary Information S5). These findings suggest that HFMD is both temporally and spatially correlated and heterogeneous, necessitating further spatiotemporal analysis.

Figure 2. Incidence maps of HFMD in east Mainland China, 2009–2015.

Model specification, assessment, and comparison

The RE term of the GLMM was decomposed into SSRE and SURE, as illustrated in Figure 3. The SSRE showed spatial aggregation, whereas the SURE had no obvious spatial distribution pattern.

Figure 3. SSRE and SURE terms after decomposition of random effect terms.

*The values of the colour in the graph are relative sizes.

The MI of the original GLMM’s residuals demonstrated spatial correlation and was statistically significant (p < 0.05). The residuals of the spatiotemporal filtering model were effectively free of spatial correlation after adding the eigenvectors (p > 0.05) (Table 1).

Table 1. Comparison of the results from the spatiotemporal filtering models and Bayesian spatiotemporal models

a MI-NON: MI value of generalized linear mixed-effects model (GLMM) residuals.

b P-NON: P value of MI value in GLMM residuals.

c N (initial): the number of eigenvectors with MIi/MImax ≥ 0.25.

d N (added): the number of eigenvectors after stepwise regression screening.

e MI: MI value of spatiotemporal filtering model residuals.

f P: P value of MI value in spatiotemporal filtering model residuals.

g R: Rook.

h K: K-nearest neighbour (KNN; K = 1).

i D: Distance.

j S: Second-order.

k C: Contemporaneous.

l L: Lagging.

The Bayesian spatiotemporal model with Rook weight matrix performed better based on the DIC and MSE values. The spatiotemporal filtering model with second-order contemporaneous weight matrix outperformed other models, as evidenced by lower AIC, BIC, and MSE values, and higher R 2 value. Furthermore, all spatiotemporal contemporaneous structures were superior to the models with lagged structures (Table 1).

Comparing the results of the optimal spatiotemporal filtering model with those of the Bayesian spatiotemporal model, the AIC and MSE values of the former model were 92,029.6 and 418,022.7, respectively, which were lower than those of the latter model. In addition, the R 2 value of the former model was about 30% higher than that of the latter model. These results suggest that the spatiotemporal filtering model performed better than the Bayesian spatiotemporal model (Table 1).

Model coefficients

Table 2 shows that the model coefficients from the spatiotemporal filtering models and Bayesian spatiotemporal models were overall consistent, sunlight and average humidity were negatively related to HFMD, whereas average wind speed and average temperature were positively associated with HFMD. However, an important observation is that the standard deviations of all the coefficients in the spatiotemporal filtering models were smaller compared to those in the Bayesian spatiotemporal models. This indicates a higher precision of estimation in the spatiotemporal filtering models. The smaller standard deviations suggest that the coefficients estimated by the spatiotemporal filtering models are more reliable and less affected by variability in the data.

Table 2. Comparison of coefficients of the model

a indicates statistical significance (p < 0.05).

Exploration of map patterns based on eigenvectors

Figure 4 shows three statistically significant eigenvectors generated by models using different spatiotemporal matrices. These eigenvectors capture varying levels of spatiotemporal information. The first two eigenvectors contain the most spatiotemporal information, while the last eigenvector represents the least spatiotemporal information. The mapped geographical patterns are related to the eigenvectors at different geographical scales. While the first two eigenvector maps show a regional (larger) clustering pattern with a north–south orientation, the last eigenvector map shows a local (smaller) clustering pattern with much less visible orientation.

Figure 4. Eigenvector decomposition of the spatiotemporal simultaneous structure matrix. (a), (b), (c), and (d) represent the Rook, KNN (K = 1), distance, and second-order matrices, respectively, where (a1), (b1), (c1), and (d1) represent the first eigenvectors; (a2), (b2), (c2), and (d2) are the second eigenvectors; and (a3), (b3), (c3), and (d3) are the last eigenvectors.

* The colours in the figure represent map modes with different MI values. The darker the colour, the more concentrated the place is. All values are greater than 0.

Moreover, on the first eigenvector maps with the Rook and the second-order matrices (Figure 4: a1, d1) and on the HFMD incidence map (Figure 2), HFMD appeared to be concentrated in the south-eastern region of the study area.

Discussion

In this study, spatiotemporal filtering models were constructed to analyse the HFMD incidence in East China. The eigenvectors in these models effectively removed the spatial correlation from the residuals to better reveal the underlying spatiotemporal structure of the HFMD data. This information can help identify the risk factors and design effective interventions and control programs. We also compared the spatiotemporal filtering models and GLMM and the Bayesian spatiotemporal models. Based on the evaluation index, model coefficients, and the standard deviations of the model coefficients, goodness of fit, and precision of estimation, the spatiotemporal filtering models overall overperformed the Bayesian spatiotemporal models and therefore can be considered a more optimal approach to conduct the spatiotemporal analysis of HFMD in East China.

Identifying appropriate spatial weight matrices that can best represent the spatial relationship between the outcomes in the neighbouring spatial units is a critical step to investigate the spatial autocorrelation of HFMD [Reference Wang43]. Different from most of the previous studies that used only one spatial weight matrix, we applied four different spatial weight matrices to examine how and to what extent the spatial weight matrices could influence the model results. We also extended the investigation to include spatiotemporal weight matrices in the models. Our findings are as follows. First, the Rook spatial weight matrix was the best choice for the Bayesian spatiotemporal models, and this conclusion is consistent with some earlier findings [Reference Chen and Huang34, Reference Wang43]. HFMD spreads mainly through close contact, so people nearby are more susceptible to the disease. Second, expanding to the spatiotemporal matrix, the second-order weight matrix performed the best in the spatiotemporal filtering models. This might be because the spatiotemporal weight matrix captures the spatiotemporal interactions, whereas in the Bayesian spatiotemporal model, only the spatial weight matrix can be used. Third, the spatiotemporal contemporaneous structure was better than the lagging structure; this might be explained by the relatively short incubation period of HFMD [Reference Anon44] (approximately 5–7 days). Therefore, the monthly HFMD incidence was more significantly influenced by contemporaneous structure than by lag structure. Our eigenvector plots also showed that the Rook and second-order weight matrices were more consistent with the observed incidence of HFMD, suggesting that in our study area, there may be slight differences between the Rook and the second-order weight matrices. The first two eigenvectors of each matrix in Figure 4 contain relatively more spatiotemporal information and show clustering patterns for areas in the north–south direction (larger); whereas the last eigenvector represents the least spatiotemporal information and shows clustering patterns for places where the direction is not obvious (smaller). The map pattern that contains the most spatiotemporal information is most likely the major spatial distribution (e.g. east–west or north–south structure). This also verifies our conjecture and suggests that our eigenvector visualization can reveal the potential important areas that are the priorities for disease prevention and control.

Compared to GLMM and the Bayesian spatiotemporal model, the spatiotemporal filtering model could reveal underlying spatial patterns by extracting and visualizing eigenvectors. It detected spatial structures and patterns at finer spatial scales. Such information could potentially be used to reveal directions of disease spread and inform disease control priority, although the underlying mechanism of the discerned spatiotemporal structure and pattern will need to be investigated in separate studies.

The eigenvectors decomposed from the spatiotemporal filtering models are independent of each other; they can easily be applied to traditional statistical models to avoid the mathematical intractability issue that many traditional statistical models such as the spatial autoregression model suffer from. In addition, the spatiotemporal filtering models do not rely on the Markov chain Monte Carlo simulation as do Bayesian models [Reference Griffith, Chun, Li, Griffith, Chun and Li19], which make them require much less computational resources. In addition, several studies have begun to explore the spatiotemporal filtering method in practical applications. For example, using the eigenvector spatial filtering method, Zhang et al. attempted to estimate ground PM2.5 concentrations [Reference Zhang45], and Chen et al. built a spatiotemporal eigenvector filtering model based on population flow to examine the spatiotemporal pattern of health risk factors associated with COVID-19 [Reference Chen36] and effectively improved the model prediction accuracy. Results of the current study indicate that the spatiotemporal filtering model offers a promising alternative to traditional complex spatiotemporal models in disease-related applications.

In this study, we found that there is global spatial correlation and spatial heterogeneity in the incidence of HFMD in East China. The risk of HFMD in East China is influenced by meteorological factors (average temperature, average humidity, sunshine hours, and average wind speed) and economic factors (GDP), as indicated by our modelling results. So, we measured spatially stratified heterogeneity of the model’s residuals; we found that the heterogeneity became rather weak, indicating that the spatially stratified heterogeneity of HFMD may be attributable to the included factors in our model. Hence, the models used in this study are reasonable.

However, this study had several limitations. First, we compared model results of a few configurations of spatial weight matrices that are based on geometric adjacency; however, adjacency can also be based on the various geographic characteristics (demographic, socioeconomic, etc.), which should be explored in future studies. Also, in subsequent studies, it is necessary to investigate more innovative computational methods to further improve the efficiency of calculation (e.g. by using parallel computing). Second, how and to what extent the number of eigenvectors impact model results also require detailed studies.

Conclusions

The spatiotemporal filtering model is an effective spatiotemporal analysis method that can be used as a suitable alternative to traditional complex spatiotemporal models for practical application in the control of HFMD. More importantly, the generated eigenvectors can be used to reveal the potential direction of disease spread, which has not been possible with previous spatiotemporal models. However, the appropriate spatial and spatiotemporal weight matrices must be carefully considered.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1017/S0950268824001080.

Data availability statement

The data that support the findings of this study are available on request from the corresponding author (Zhijie Zhang). The data are not publicly available due to restrictions (e.g. their containing information that could compromise the privacy of research participants).

Acknowledgements

We would like to thank the CCDC for assistance in research design and data collection; the views expressed are those of the authors alone.

Author contribution

L.Y.H., C.X., L.K., Y.Y., X.J.Y., Y.R., S.J., H.J.Q., and Z.Z.J. were involved in conceptualization; C.Z.R. and Z.Z.J. were involved in data curation; C.X., B.J.B., M.P.W., T.W., Y.L.L., W.Q.Y., and W.X.L. were involved in formal analysis; C.X., B.J.B., C.Z.R., and Z.Z.J. were involved in methodology; and C.X., B.J.B., C.Z.R., and Z.Z.J. involved in writing the review and editing the article.

Funding statement

This research was supported by the National Key Research and Development Program of China (grant number 2022YFC2602900); the Natural Science Foundation of Shanghai (147); the National Natural Science Foundation of China (grant number 82473736); Key research projects of Beijing Natural Science Foundation-Haidian District Joint Fund (grant number L192012), and Beijing Natural Science Foundation (grant number 7202073).

Competing interest

All authors have no competing interests to declare.

Footnotes

Xi Chen, Jianbo Ba and Yuanhua Liu contributed equally to this work.

References

Liu, YX et al. (2015) Spatio-temporal analysis of the relationship between climate and hand, foot, and mouth disease in Shandong province, China, 2008-2012. BMC Infectious Diseases 15, 146. https://doi.org/10.1186/s12879-015-0901-4.CrossRefGoogle ScholarPubMed
Zhang, XX, Xu, CD and Xiao, GX (2020) Spatial heterogeneity of the association between temperature and hand, foot, and mouth disease risk in metropolitan and other areas. Science of the Total Environment 713, 136623. https://doi.org/10.1016/j.scitotenv.2020.136623.CrossRefGoogle ScholarPubMed
Xu, C, Zhang, X and Xiao, G. (2019) Spatiotemporal decomposition and risk determinants of hand, foot and mouth disease in Henan, China. Science of the Total Environment 657, 509516. https://doi.org/10.1016/j.scitotenv.2018.12.039.CrossRefGoogle ScholarPubMed
Li, P et al. (2018) Temporal-spatial variation of hand-foot-mouth disease in 2008 to 2014, China. Journal of Nanjing Medical University(Natural Science) 38, 380385. http://doi.org/10.7655/NYDXBNS20180321.Google Scholar
Liu, H et al. (2019) Spatial-temporal variation and risk factor analysis of hand, foot, and mouth disease in children under 5 years old in Guangxi, China. BMC Public Health 19, 1491. https://doi.org/10.1186/s12889-019-7619-y.CrossRefGoogle ScholarPubMed
Zhang, XX et al. (2019) Spatiotemporal variation analysis and risk determinants of hand, foot and mouth disease in Beijing-Tianjin-Tangshan, China. Journal of Geo-information Science 21, 398406. https://doi.org/10.12082/dqxxkx.2019.180517.Google Scholar
Wang, L et al. (2023) Spatiotemporal associations between hand, foot and mouth disease and meteorological factors over multiple climate zones. International Journal of Biometeorology 67, 14931504. https://doi.org/10.1007/s00484-023-02519-y.CrossRefGoogle ScholarPubMed
Wang, JF et al. (2016) A measure of spatial stratified heterogeneity. Ecological Indicators 67, 250256. https://doi.org/10.1016/j.ecolind.2016.02.052.CrossRefGoogle Scholar
Truong, PN et al. (2019) A spatial-temporal statistical analysis of health seasonality: Explaining HFMD infections within a children population along the Vietnamese south central coast. BMC Public Health 19, 937. https://doi.org/10.1186/s12889-019-7281-4.CrossRefGoogle Scholar
Phung, D et al. (2018) Spatiotemporal variation of hand-foot-mouth disease in relation to socioecological factors: A multiple-province analysis in Vietnam. Science of the Total Environment, 610, 983991. https://doi.org/10.1016/j.scitotenv.2017.08.158.Google ScholarPubMed
Hong, Z et al. (2018) Exploration of potential risks of hand, foot, and mouth disease in Inner Mongolia Autonomous Region, China using geographically weighted regression model. Scientific Reports 8, 17707. https://doi.org/10.1038/s41598-018-35721-9.CrossRefGoogle ScholarPubMed
Li, LF et al. (2018) A spatiotemporal mixed model to assess the influence of environmental and socioeconomic factors on the incidence of hand, foot and mouth disease. BMC Public Health 18, 274. https://doi.org/10.1186/s12889-018-5169-3.Google ScholarPubMed
Hong, S et al. (2021) Intra-area factors dominate the spatio-temporal transmission heterogeneity of hand, foot, and mouth disease in China: A modelling study. Science of the Total Environment 775, 145859. https://doi.org/10.1016/j.scitotenv.2021.145859.Google Scholar
Xu, CD et al. (2020) Effects of temperature fluctuations on spatial-temporal transmission of hand, foot, and mouth disease. Scientific Reports 10, 2541. https://doi.org/10.1038/s41598-020-59265-z.CrossRefGoogle ScholarPubMed
Wang, JF et al. (2011) Hand, foot and mouth disease: Spatiotemporal transmission and climate. International Journal of Health Geographics 10, 25. https://doi.org/10.1186/1476-072x-10-25.CrossRefGoogle ScholarPubMed
Zhang, X, Xu, C and Xiao, G. (2018) Space-time heterogeneity of hand, foot and mouth disease in children and its potential driving factors in Henan, China. BMC Infectious Diseases 18, 638. https://doi.org/10.1186/s12879-018-3546-2.CrossRefGoogle ScholarPubMed
Tian, L et al. (2018) Spatio-temporal analysis of the relationship between meteorological factors and hand-foot-mouth disease in Beijing, China. BMC Infectious Diseases 18, 158. https://doi.org/10.1186/s12879-018-3071-3.CrossRefGoogle ScholarPubMed
Ding, L et al. (2021) Spatiotemporal characteristics and meteorological determinants of hand, foot and mouth disease in Shaanxi Province, China: A county-level analysis. BMC Public Health 21, 374. https://doi.org/10.1186/s12889-021-10385-9.CrossRefGoogle Scholar
Griffith, DA, Chun, Y and Li, B. (2019) Spatial autocorrelation. In Griffith, DA, Chun, Y and Li, B (eds.), Spatial Regression Analysis Using Eigenvector Spatial Filtering. Academic Press, pp. 127.Google Scholar
Griffith, DA. (1996) Spatial autocorrelation and eigenfunctions of the geographic weights matrix accompanying geo-referenced data. Canadian Geographer-Geographe Canadien 40, 351367. https://doi.org/10.1111/j.1541-0064.1996.tb00462.x.CrossRefGoogle Scholar
Wang, Y et al. (2019) Spatiotemporal distribution of hand, foot, and mouth disease in Guangdong Province, China and potential predictors, 2009-2012. International Journal of Environmental Research and Public Health 16, 1191. https://doi.org/10.3390/ijerph16071191.CrossRefGoogle ScholarPubMed
Liao, J et al. (2016) Spatial-temporal mapping of hand foot and mouth disease and the long-term effects associated with climate and socio-economic variables in Sichuan Province, China from 2009 to 2013. Science of the Total Environment 563–564, 152159. https://doi.org/10.1016/j.scitotenv.2016.03.159.CrossRefGoogle ScholarPubMed
Chen, YS (2009) Inversitigation of Local Indexes of Spatial Autocorrelation Application-Take Sex Ratio of Jiangsu for Example(Dissertation). Shanghai, CHN: East China Normal University, pp. 1352.Google Scholar
Lin, JJ, Zhang, TW and Li, XY (2020) Research progress on spatiotemporal clustering of disease. Chinese Journal of Epidemiology 41, 11651170.10.3760/cma.j.cn112338-20190806-00582.Google ScholarPubMed
Zhu, WK (2014) Construction and Analyzing of Spatial Weight Matrix of Spatial Econometric Model (Dissertation). Shanghai, CHN: Shanghai Jiao Tong University, pp. 1032.Google Scholar
li, Y and Gui, YF (2021) A new spatial weight matrix and its application to provincial carbon emission. Journal of Luoyang Normal University 40, 59.https://doi.org/10.16594/j.cnki.41-1302/g4.2021.02.002.Google Scholar
Griffith, DA (2008) A comparison of four model specifications for describing small heterogeneous space-time datasets: Sugar cane production in Puerto Rico, 1958/59-1973/74. Papers in Regional Science 87, 341356. https://doi.org/10.1111/j.1435-5957.2008.00188.x.CrossRefGoogle Scholar
Griffith, D and Li, B (2021) Spatial-temporal modeling of initial COVID-19 diffusion: The cases of the Chinese mainland and conterminous United States. Geo-Spatial Information Science 24, 340362. https://doi.org/10.1080/10095020.2021.1937338.CrossRefGoogle Scholar
Park, Y et al. (2022) Spatial autocorrelation may bias the risk estimation: An application of eigenvector spatial filtering on the risk of air pollutant on asthma. Science of the Total Environment 843, 157053. https://doi.org/10.1016/j.scitotenv.2022.157053.CrossRefGoogle ScholarPubMed
Tan, H et al. (2021) Self-adaptive bandwidth eigenvector spatial filtering model for estimating. Environmental Science and Pollution Research 28, 6780067813. https://doi.org/10.1007/s11356-021-15196-4.Google ScholarPubMed
Lin, GW and Ma, JX (2013) Study on the spatial statistical analysis of leukemia using different spatial weight matrixes. Chinese Journal Disease Control Prevention 17, 617620. https://doi.org/10.5732/cjc.011.10122.Google Scholar
Stone, GW (2011) A prospective natural-history study of coronary atherosclerosis. New England Journal of Medicine 364, 226235. https://doi.org/10.1056/nejmoa1002358.CrossRefGoogle ScholarPubMed
Sun, JW et al. (2011) Analysis on the spatial distribution of severe hand foot-mouth disease cases in Lushan county Henan province in 2010. Chinese Journal of Epidemiology 32, 10531054. https://doi.org/10.3760/cma.j.issn.0254-6450.2011.%2010.024.Google Scholar
Chen, TT and Huang, YY (2016) Analyzing the impact of different spatial weight matrix for the western coast of the Taiwan straits. Journal of Shangqiu Normal University 32, 6771.Google Scholar
Griffith, DA (2000) Eigenfunction properties and approximations of selected incidence matrices employed in spatial analyses. Linear Algebra and Its Applications 321, 95112. https://doi.org/10.1016/s0024-3795(00)00031-8.CrossRefGoogle Scholar
Chen, M et al. (2022) Population flow based spatial-temporal eigenvector filtering modeling for exploring effects of health risk factors on COVID-19. Sustainable Cities and Society 87, 104256. https://doi.org/10.1016/j.scs.2022.104256.CrossRefGoogle ScholarPubMed
Griffith, DA. (2012) Spatial statistics: A quantitative geographer’s perspective. Spatial Statistics 1, 315. https://doi.org/10.1016/j.spasta.2012.03.005.Google Scholar
Tiefelsdorf, M and Griffith, DA (2007) Semiparametric filtering of spatial auto correlation: The eigenvector approach. Environment and Planning A: Economy and Space 39, 11931221. https://doi.org/10.1068/a37378.CrossRefGoogle Scholar
Griffith, DA (2004) Distributional properties of georeferenced random variables based on the eigenfunction spatial filter. Journal of Geographical Systems 6, 263288. https://doi.org/10.1007/s10109-004-0134-3.CrossRefGoogle Scholar
Griffith, DA, Chun, Y, Li, B. (2019) MESF and linear regression. In Griffith, DA, Chun, Y and Li, B (eds), Spatial Regression Analysis Using Eigenvector Spatial Filtering. Academic Press, pp. 5979.CrossRefGoogle Scholar
Su, Q et al. (2010) An analysis of spatial distribution pattern of malaria using different spatial weight matrixes. Chinese Journal Disease Control Prevention 14, 419422.Google Scholar
Musio, M, Sauleau, EA and Buemi, A (2010) Bayesian semi-parametric ZIP models with space-time interactions: An application to cancer registry data. Mathematical Medicine and Biolog 27, 181194. https://doi.org/10.1093/imammb/dqp025.y.CrossRefGoogle ScholarPubMed
Wang, W et al. (2021) Influences of using different spatial weight matrices in analyzing spatial autocorrelation of cardiovascular diseases mortality in China. Chinese Journal of Epidemiology 42, 14371444. https://doi.org/10.3760/cma.j.cn112338-20201102-01293.Google ScholarPubMed
Anon, (2018) Guidelines for diagnosis and treatment of hand, foot and mouth disease (2018 version). Chinese Journal Viral Disease 8, 347352. https://doi.org/10.16505/j.2095-0136.2018.0063.Google Scholar
Zhang, J et al. (2018) Eigenvector spatial filtering regression modeling of ground PM(2.5) concentrations using remotely sensed data. International Journal of Environmental Research and Public Health 15, 1228. https://doi.org/10.3390/ijerph15061228.CrossRefGoogle ScholarPubMed
Figure 0

Figure 1. Spatiotemporal lag structure (a) and spatiotemporal contemporaneous structure (b).

Figure 1

Figure 2. Incidence maps of HFMD in east Mainland China, 2009–2015.

Figure 2

Figure 3. SSRE and SURE terms after decomposition of random effect terms.*The values of the colour in the graph are relative sizes.

Figure 3

Table 1. Comparison of the results from the spatiotemporal filtering models and Bayesian spatiotemporal models

Figure 4

Table 2. Comparison of coefficients of the model

Figure 5

Figure 4. Eigenvector decomposition of the spatiotemporal simultaneous structure matrix. (a), (b), (c), and (d) represent the Rook, KNN (K = 1), distance, and second-order matrices, respectively, where (a1), (b1), (c1), and (d1) represent the first eigenvectors; (a2), (b2), (c2), and (d2) are the second eigenvectors; and (a3), (b3), (c3), and (d3) are the last eigenvectors.* The colours in the figure represent map modes with different MI values. The darker the colour, the more concentrated the place is. All values are greater than 0.

Supplementary material: File

Chen et al. supplementary material

Chen et al. supplementary material
Download Chen et al. supplementary material(File)
File 461.1 KB