Spatio-temporal prediction of regional land subsidence via ConvLSTM

LENG Jing; GAO Mingliang; GONG Huili; CHEN Beibei; ZHOU Chaofan; SHI Min; CHEN Zheng; LI Xiang

doi:10.1007/s11442-023-2169-8

Journal of Geographical Sciences >

2023 , Vol. 33 >Issue 10: 2131 - 2156

DOI: https://doi.org/10.1007/s11442-023-2169-8

Research Articles

Spatio-temporal prediction of regional land subsidence via ConvLSTM

LENG Jing ^,¹^,²^,³ ,
GAO Mingliang ^,¹^,²^,³^,⁴^,^* ,
GONG Huili ¹^,²^,³^,⁴ ,
CHEN Beibei ¹^,²^,³^,⁴ ,
ZHOU Chaofan ¹^,²^,³^,⁴ ,
SHI Min ⁵ ,
CHEN Zheng ⁶ ,
LI Xiang ¹^,²^,³

Expand

1. Beijing Laboratory of Water Resources Security, Capital Normal University, Beijing 100048, China
2. Key Laboratory of Mechanism, Prevention and Mitigation of Land Subsidence, MOE, Capital Normal University, Beijing 100048, China
3. College of Resources Environment and Tourism, Capital Normal University, Beijing 100048, China
4. Hebei Cangzhou Groundwater and Land Subsidence National Observation and Research Station, Cangzhou 061000, Hebei, China
5. School of Electrical Engineering, Nantong University, Nantong 226019, Jiangsu, China
6. Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China

*Gao Mingliang (1989-), Lecturer, specialized in evolution of regional land subsidence. E-mail: b312@cnu.edu.cn

Leng Jing (1997-), Master Candidate, specialized in prediction of regional land subsidence. E-mail: lj906425232@163.com

Received date: 2022-10-22

Accepted date: 2023-05-16

Online published: 2023-10-08

Supported by

National Natural Science Foundation of China(41930109/D010702)

Beijing Outstanding Young Scientist Program(BJJWZYJH01201910028032)

R&D Program of Beijing Municipal Education Commission(KM202210028009)

Fold

Abstract

Land subsidence is a geohazard phenomenon caused by the lowering of land elevation due to the compression of the sinking land soil body, thus creating an excessive constraint on the safe construction and sustainable development of cities. The use of accurate and efficient means for land subsidence prediction is of remarkable importance for preventing land subsidence and ensuring urban safety. Although the current time-series prediction method can accomplish relatively high accuracy, the predicted settlement points are independent of each other, and the existence of spatial dependence in the data itself is lost. In order to unlock this problem, a spatial convolutional long short-term memory neural network (ConvLSTM) based on the spatio-temporal prediction method for land subsidence is constructed. To this end, a cloud platform is employed to obtain a long time series deformation dataset from May 2017 to November 2021 in the understudied area. A convolutional structure to extract spatial features is utilized in the proposed model, and an LSTM structure is linked to the model for time-series prediction to achieve unified modeling of temporal and spatial correlation, thereby rationally predicting the land subsidence progress trend and distribution. The experimental results reveal that the prediction results of the ConvLSTM model are more accurate than those of the LSTM in about 62% of the understudied area, and the overall mean absolute error (MAE) is reduced by about 7%. The achieved results exhibit better prediction in the subsidence center region, and the spatial distribution characteristics of the subsidence data are effectively captured. The present prediction results are more consistent with the distribution of real subsidence and could provide more accurate and reasonable scientific references for subsidence prevention and control in the Beijing-Tianjin-Hebei region.

Key words： land subsidence; deep learning; ConvLSTM; spatio-temporal prediction; cloud platform

Cite this article

LENG Jing , GAO Mingliang , GONG Huili , CHEN Beibei , ZHOU Chaofan , SHI Min , CHEN Zheng , LI Xiang . Spatio-temporal prediction of regional land subsidence via ConvLSTM[J]. Journal of Geographical Sciences, 2023 , 33(10) : 2131 -2156 . DOI: 10.1007/s11442-023-2169-8

1 Introduction

Land subsidence is a regional decrease in land elevation caused by natural or man-made factors, and is a slow-changing geological hazard (Chen et al., 2020). Land subsidence is characterized by slow recovery or difficult recovery, the development process is basically irreversible, and its impact is long-term, representing the leading geological hazard in the China plain areas. Due to its widespread, intricate identification, and most occurrence in large and medium-sized cities with an active economy, it has become a crucial safety hazard in modern cities (Shi et al., 2021). How to capture the dynamic and complex spatio-temporal relationships based on the historical state of subsidence spatial information is of great practical importance for predicting the future development process and its possible direction, for the prevention and control of land subsidence (Zhai et al., 2012; Guo et al., 2021).

Currently, the existing methods for predicting land subsidence can be divided into three main categories: methods based on physical mechanisms (deterministic models), methods based on mathematical-statistics (mathematical-statistical models), and methods based on machine learning (machine learning models) (Liu et al., 2021b). The methods based on physical mechanisms often require obtaining a series of complex physical parameters, such as hydrological characteristics and lithological characteristics, which are problematic and laborious to obtain and have limitations in use, causing it hard to make timely and quick predictions (Zhang and Zhang, 2013; Chen et al., 2016; Ren et al., 2018). The mathematical statistics-based approaches include regression models, gray models, biological models, and other mathematical models, but they commonly depend more on the accuracy of the data, and the conditions are more limited and difficult to develop (Shearer, 1998; Wang and Yang, 2014). In contrast, machine learning-based methodologies rely on data-driven, enabling training, and learning functions based on historical data without the need for multiple experimental parameters. In cases where only observed subsidence values or synthetic aperture radar (InSAR) deformation value data are obtained, prediction by machine learning approaches is often the best option. At the same time, the neural network also has excellent mapping ability and can complete more accurate mapping between input and output in the absence of quantitative influence relationships among the factors (Yi and Gao, 2021).

With the rapid development of land observation, the volume of remote sensing image data has grown exponentially. The speed of data acquisition has increased, the update cycle has shortened, and the data timeliness has become stronger. This provides strong support for machine learning-based land subsidence prediction (Guo et al., 2014; Li et al., 2014). In recent years, machine learning methods have performed very well in predicting land subsidence. Yue et al. (2020) employed the recurrent neural network (RNN) to estimate the subsidence of radar monitoring data after data clustering, which confirmed the advantages of RNN in subsidence prediction of large samples and also revealed the existence of trend correlation between subsidence points. Liu et al. (2021b) developed the long short-term memory (LSTM) artificial neural network to capture surface subsidence in the Cangzhou region with a single element, demonstrating that deep learning leads to more accurate prediction results under data scarcity conditions on subsidence drivers. A wavelet transform-random forest (WT-RF) prediction model was developed by Zhou et al. (2021), which decomposes the subsidence trend component and the random component by wavelet transform to achieve an accurate prediction of land subsidence along the Jinbao high-speed rail line. Li et al. (2021) developed the geographically weighted long short-term memory (GW-LSTM) model to estimate the subsidence of the Chaobai River alluvial fan in northeastern Beijing Plain, China, and confirmed that incorporating spatial correlation can effectively enhance the accuracy of the subsidence prediction.

From the above analysis, it is understandable that the existing machine learning subsidence prediction models are mostly time-series prediction models or combined models, which can achieve more accurate results in subsidence point prediction, but there are still lacking in considering the spatial correlation of subsidence points. The processing of subsidence data by the time-series model is only limited to the temporal dimension of data points, ignores the learning of spatial features, and does not have the limitations of spatial autocorrelation constraints in training. The hybrid model develops various structures to extract spatio-temporal features separately, often employing the results of the model extracting spatial features as input to the temporal model, integrating to obtain spatio-temporal data features that capture both spatial structure and temporal information. However, because the two structures are almost independent during processing, it is difficult to fully outline the spatial correlation in the extracted feature vectors, and the model is still trained at points that cannot appropriately capture the interaction between spatial features and temporal dynamics.

According to Tobler's first law (all attribute values on a geographic surface are correlated, but closer values are more strongly correlated than more distant values), changes in features are a combination of large-scale spatial trends and small-scale spatial correlations (Wang et al., 2000). The development of land subsidence is influenced by various factors such as groundwater extraction, surface loading, and geological structure. This fact demonstrates strong-regional characteristics (Pan et al., 2004; Gong et al., 2017, 2009), and the dependence of the data on the spatial and temporal dimensions should be simultaneously considered while making predictions. With the change of time, the correlation of spatial dimensions also changes somewhat dynamically (Nanni et al., 2008). For subsidence images, the data of each pixel at any moment is influenced not only by its own data at the historical moment, but also by the neighboring elements at the current moment. If the vectors are constructed in terms of data points, even though some degree of prediction can be made by utilizing a time-series model; as a result, the spatio-temporal structure present in the data is apparently ignored, so integrated modeling of the features of temporal and spatial features is essential.

Spatio-temporal forecasting (STF) extends the traditional time series forecasting or spatial interpolation problem to spatial and temporal dimensions and models through considering the spatio-temporal dependence of the forecast target and predictor variables. It implies that the linear and non-linear features in spatio-temporal data can be effectively captured to enhance the accuracy of simulation and forecasting (Liu et al., 2021; Xu et al., 2021). Accurate STF can efficiently process large-scale spatial and temporal data, provides a scientific reference for decision-making in various divisions, and reduce or avoid socio-economic damages caused by geological disasters. Deep learning models in STF are capable of modeling the spatio-temporal dependencies from a data-driven perspective and remarkably outperform traditional prediction approaches in dealing with long-term forecasting problems and dynamic change scenarios (Pan and Li, 2021). Spatio-temporal deep learning models can effectually capture both local and global spatial dependencies while dealing with long- and short-term temporal dependencies (Zhang et al., 2016; Zhang et al., 2020). Such predictive models exhibit good adaptability to various complex STF tasks and have demonstrated excellent performance in traffic prediction (Lv et al., 2015; Polson and Sokolov, 2017), weather forecasting (Akbari Asanjan et al., 2018; Huang and Kuo, 2018), and prediction of disasters (DeVries et al., 2018; Ham et al., 2019). We have not yet fully realized the complex internal mechanism of the spatial and temporal distribution of land subsidence; furthermore, it is difficult and complicated to achieve and match the data of groundwater, artificial engineering, and other influencing elements with complex factors. Based on this situation, direct prediction of their future spatial and temporal variations via deep learning approaches provides relatively accurate results.

With these views, incorporating the extraction of spatial correlations into regression prediction for fitting complex nonlinear runaway relationships can provide a more rational spatial and temporal prediction of the development and distribution of land subsidence. In the present work, we use a land subsidence prediction model based on a convolutional long short-term memory network (ConvLSTM), which is trained and predicted in the form of an image tensor by integrating convolutional and LSTM algorithms for spatio-temporal relationships simulations. In this way, spatial features could be effectively extracted on the basis of the time-series learning to capture the temporal and spatial dependence of regional land subsidence, provide a scientific reference for spatial and temporal predictions of large-scale land subsidence, and give effective support for the prevention and control of land subsidence.

2 Understudied area and data

2.1 Understudied area

The understudied area of this paper is located in Hebei (37°36′-39°03′N, 114°30′-117°42′E), mainly including Cangzhou, Langfang, Baoding, Shijiazhuang, and Hengshui (Figure 1). The Hebei Plain (location of the understudied area) is one of the largest and most complex areas of land subsidence in the North China Plain and even in China (Zhang et al., 2014). Due to the severe shortage of surface water, groundwater is the chief source of water supply for production and living in the area (Wang et al., 2009; Wang and Guo, 2015), and long-term groundwater over-extraction has triggered serious land subsidence. Due to the overall low and flat terrain, land subsidence has a substantial impact on the economic development and production life of the area. This issue has been becoming one of the crucial factors limiting the sustainable development of the local economy and causing extensive and long-term damage to urban construction and all aspects.

Radar image	Sentinel-1A (S1A)
Flight direction	Ascending
Polarization	VV+VH
Band	C-Band
Beam mode	Interferometric wide swath (IW)
Wave length (cm)	5.6
Ground resolution (m)	5×20
Revisit cycle (d)	12
Number of images (scene)	132
Time range	2017.05.20-2021.11.19

Model	MAE (mm)	RMSE (mm)	MSE (mm²)	SSIM	MS-SSIM
ARIMA	17.96	24.06	597.62	—	—
SVR	16.02	21.31	470.85	—	—
RNN	14.61	19.46	393.06	—	—
LSTM	11.62	15.61	243.59	0.9518	0.9795
ConvLSTM	10.73	14.31	204.96	0.9654	0.9822

模态框（Modal）标题

Abstract

Cite this article

1 Introduction

2 Understudied area and data

2.1 Understudied area

Figure 1 Geographic location of the understudied area in the Hebei Plain

2.2 Data preparation

Table 1 Radar image information

Figure 2 Spatio-temporal subsidence dataset production process

Figure 3 Spatial distribution of land subsidence in the understudied area from 2017 to 2021 (Note: the presented box indicates the typical deformation area.)

3 Methodology

3.1 ConvLSTM

Figure 4 Cells structure of the models: (a) LSTM cells, (b) ConvLSTM cells

Figure 5 Schematic representation of the convolution calculation of the ConvLSTM

3.2 Evaluation index

4 Experiment

4.1 Model construction and output results

Figure 6 The stacked ConvLSTM prediction model structure

Figure 7 The flowchart of the ConvLSTM network prediction

Figure 8 Parameter tuning process curve graph: (a) Loss due to various loss functions, (b) Loss due to various learning rates, (c) Loss due to various optimizers, (d) Loss due to various kernels

Figure 9 Cumulative shape variable forecast results for the understudied area from January 11, 2021 to November 19, 2021

4.2 Results

4.2.1 Prediction accuracy comparison

Table 2 Evaluation of the accuracy of the prediction results

Figure 10 Comparison of MAE distributions between ConvLSTM and LSTM predictions: (a) ConvLSTM, (b) LSTM

Figure 11 Comparison of MAE proportion differences between ConvLSTM and LSTM predictions: (a) ConvLSTM, (b) LSTM

Figure 12 Distribution of MAE differences between ConvLSTM and LSTM prediction results

4.2.2 Spatial distribution comparison

Figure 13 Spatial distribution of cumulative deformation (CD) in the subsidence center: (a) original data, (b) LSTM prediction results, (c) ConvLSTM prediction results

Figure 14 Spatial distribution of cumulative deformation (CD) in the subsidence edge: (a) original data, (b) LSTM prediction results, (c) ConvLSTM prediction results

4.3 Discussion

4.3.1 Analysis of the spatial features

Figure 15 Clustering result of time-series subsidence data features with ConvLSTM prediction error overlay (note: the black areas indicate regions of relatively low prediction error)

Figure 16 Autocorrelation of adjacent data points at the center of subsidence: (a) original data, (b) LSTM prediction results, (c) ConvLSTM prediction results (note: different colors represent different points of data.)

4.3.2 Deformation mechanisms and impact on prediction results

Figure 17 Distribution of groundwater decline funnel and surface settlement in the understudied area (2016): (a) elevation of the shallow groundwater table, (b) elevation of the deep groundwater table

5 Conclusions

References