 Original Article
 Open Access
 Published:
Novel Hybrid PhysicsInformed Deep Neural Network for Dynamic Load Prediction of Electric Cable Shovel
Chinese Journal of Mechanical Engineering volume 35, Article number: 147 (2022)
Abstract
Electric cable shovel (ECS) is a complex production equipment, which is widely utilized in openpit mines. Rational valuations of load is the foundation for the development of intelligent or unmanned ECS, since it directly influences the planning of digging trajectories and energy consumption. Load prediction of ECS mainly consists of two types of methods: physicsbased modeling and datadriven methods. The former approach is based on known physical laws, usually, it is necessarily approximations of reality due to incomplete knowledge of certain processes, which introduces bias. The latter captures features/patterns from data in an endtoend manner without dwelling on domain expertise but requires a large amount of accurately labeled data to achieve generalization, which introduces variance. In addition, some parts of load are nonobservable and latent, which cannot be measured from actual system sensing, so they can’t be predicted by datadriven methods. Herein, an innovative hybrid physicsinformed deep neural network (HPINN) architecture, which combines physicsbased models and datadriven methods to predict dynamic load of ECS, is presented. In the proposed framework, some parts of the theoretical model are incorporated, while capturing the difficulttomodel part by training a highly expressive approximator with data. Prior physics knowledge, such as Lagrangian mechanics and the conservation of energy, is considered extra constraints, and embedded in the overall loss function to enforce model training in a feasible solution space. The satisfactory performance of the proposed framework is verified through both synthetic and actual measurement dataset.
1 Introduction
Electric cable shovel (ECS) is one of the most essential production equipment, which is widely applied in surfacemining operations to peel off the surface cover and load ore materials [1]. Currently, the increasing requirements of high production efficiency and low operating costs have boosted the intelligent or unmanned operation of ECS [2, 3]. Complex construction machinery automation involves a major challenge, that is, the accurate measurement and estimation of the load, which is a crucial precondition for the development of intelligent or unmanned ECS, since it directly influences the planning of digging trajectories and energy consumption.
The load acting on the ECS generally refers to the resistive force between the dipper and surrounding media and digging forces, which includes the crowd force and hoist force provided by the crowd and hoist motor, respectively. For modeling of the resistive force due to mediadipper interactions, there is a considerable number of methods have been developed [4]. Reece [5] proposed a universal earthmoving equation for the cutting model. Based on Reece’s model, McKyes et al. [6] added the inertia. The aforementioned methods are mainly based on the analysis of mining principles and theoretically deduced, have the advantage of universality in calculation, and provide insights for resistive force prediction. However, the resistive force due to mediadipper interactions not only depends on the physical mechanical properties of the excavated medium and the geometrical characteristics of the dipper but also is related to other effects, such as digging strategies. Therefore, some parts of the abovementioned resistive force model are difficult to model with a constant analytical expression. For example, the normal cutting force, which is perpendicular to the speed of dipper teeth, is difficult to express analytically due to compression because it depends on both the digging operation and the medium’s hardness [7].
Digging forces as driving forces are mainly employed to overcome two components of external effects, namely, the dynamic part of ECS and resistive force between the dipper and surrounding media. In recent decades, researchers have focused on using the estimated resistive force to predict digging forces by establishing kinetics or dynamic models of the digging process [8, 9]. For example, Stavropoulou et al. [10] calculated digging forces in excavation based on mechanical equilibrium. Wang et al. [11] adopted an analytic model to calculate digging forces in the mining process based on an empirical resistance equation and the Lagrange method. To realize the accurate prediction of digging forces, it is necessary to accurately establish the kinetics or dynamic models of the digging process under the premise of accurately simulating the resistive force due to mediadipper interaction. In terms of establishing the kinetics and dynamic models of the digging process, Li et al. [12], Rasuli et al. [13], and Frimpong et al. [14] developed dynamic models of the ECS to describe the evolution of the excavator motion with time using the NewtonEuler formulation or iterative NewtonEuler method. The formulation of the dynamic models presented above depends on precise knowledge of the various kinematic and dynamic parameters. When modeling the ECS system’s dynamic parts, such methods make many idealized assumptions about the equipment, such as its kinematic structure, inertia properties, and assumptions regarding the forces acting on the system, which may not capture hardtomodel effects, leading to inaccuracies via model bias.
Based on this analysis, the classical mechanistic method of predicting load of ECS can be used to determine only certain parts and aspects, such as the average value and approximate range, which is insufficiently detailed for the optimal digging trajectory planning of an unmanned ECS. Recently, with the development of sensing and measurement technology, operation data generated in practical digging processes can be effectively recorded. Furthermore, artificial neural networks (ANNs) have been proven to be a powerful tool and approximator to capture potential nonlinear coupling between input and states of complex systems [15]. These results provide many opportunities for the practical application of datadriven methods to improve the accuracy of dynamic load prediction and identification [16]. Datadriven methods typically reduce the problem of predicting load to that of optimizing the parameters of an expressive function class by minimize some form of a prediction loss in an endtoend way [17]. However, there are many circumstances where datadriven approaches can reach their limits or lead to unsatisfactory results, due to model variance [18]. Additionally, some parts of load, such as the resistive force due to mediadipper interactions, are nonobservable and latent which cannot be measured from actual system sensing, so they can’t be directly predicted by datadriven methods. A large number of studies have shown that neural networks that are endowed with good physical priors, namely, physicsinformed neural networks (PINNs), can constrain and boost learning within a feasible solution space [19, 20]. PINNs require substantially less training data than other models and can produce simpler neural network structures while achieving high accuracy. Lutter et al. [21]incorporated the physical prior originating from Lagrange mechanics into the model architecture of simulated and real robot systems. Jia et al. [22] and Muralidhar et al. [23] incorporated monotonic physical relationships as additional constraints in loss functions for a neural network that models lake temperature. Zhang et al. [24] embedded the physics constraints in the loss function to capture latent nonlinear state variables, where measurement is unavailable.
Herein, we follow this line of research and develop a hybrid physicsinformed deep neural network (HPINN) framework that combines firstprinciples models and datadriven methods, to improve the performance of load predictor. More concretely, for modeling of the resistive force due to mediadipper interactions, it delicately incorporates some parts of the theoretical model of the resistive force, while capturing the difficulttomodel parts by training a highly expressive approximator, such as an LSTM neural network from data. In terms of modeling the digging forces, it establishes the kinetics and dynamic models of the digging process, and a novel extended deep Lagrangian networks (DeLaN) is applied to model the system’s dynamic part, including the inertial effect, quality matrix, and gravity. The proposed framework presents salient features that include ① the combination of the flexibility of deep learning with theoretical insights, ② distinct interpretability with physical meaning, and ③ some variables with specific physical meaning that cannot be directly measured can be accurately modeled.
The remainder of this paper is organized as follows: Section 2 describes the mathematical definition of load modeling, theoretical background for the resistant force model, foundations of Lagrangian dynamics, and some preliminaries on LSTM. Section 3 introduces the proposed HPINN architectures for dynamic load prediction. In Section 4, a synthetic dataset is used to test the performance of the proposed framework. In Section 5, the effectiveness of the proposed framework is validated with an actual measurement dataset from an ECS prototype and compared with the classical mechanistic method and pure datadriven method. Section 6 summarizes the conclusions.
2 Background
The framework for dynamic load prediction proposed in this paper draws inspiration from various fields, including the resistive force model, classical mechanics, and advanced machine learning methods, such as PINNs.
2.1 Problem Definition
As a complex multidisciplinary system, an ECS consists of three major assemblies, namely, the upper assembly, lower assembly, and frontend attachments, as illustrated in Figure 1. Among them, the attachments, consisting of the boom, hoist ropes, crowd machinery, dipper handle, and dipper, etc., are the main operating mechanism that directly contacts the media to complete the digging task.
While digging, two types of motions are simultaneously performed by the dipper handle: extension/retraction motion in the direction parallel to the major axes of dipper handle and circular motion around the axis of crowd gear. Therefore, the digging mechanism of ECS can be considered as a two degree of freedom (DOF) mechanism. Based on the motion characteristics, a polar coordinate system is established to describe the dynamic system, as illustrated in Figure 2, where the axis of the crowd gear, O, is set to be the origin; the stretching length of dipper handle is set to be the polar diameter (r), and the angle between vertical direction and axes of dipper handle is set to be the independent variable (ψ).
Generally, three main factors affect the dynamic load of ECS in practice: trajectory parameters (e.g., velocity and acceleration), ore pile parameters (e.g., material mechanical properties and terrain of the ore pile), and structural parameters (e.g., dipper width and boom length). Since the structural parameters are static and remain relatively unchanged during the digging process, the mapping relationship among the digging trajectory, ore pile parameters, and corresponding dynamic load is considered in this paper.
Based on sensor measurements, the dipper motion information (consisting of the angle between the vertical direction and the axis of dipper handle \(\psi\), stretching length of dipper handle \(r\), the angular velocity of dipper handle \(\dot{\psi }\), stretching velocity of dipper handle \(\dot{r}\), the angular acceleration of dipper handle \(\ddot{\psi }\), the stretching acceleration of dipper handle \(\ddot{r}\)), and the corresponding digging forces (consisting of the hoist force \(F_{r}\) and crowd force \(F_{h}\)) can be synchronously acquired at every moment. A 3D scanner is used to measure the profile of the ore pile being excavated, and the obtained laser scanning data can be applied to establish an accurate geometric model, which is capable of describing the dynamic shape of the ore pile, as shown in Figure 3. The digging is a dynamic process, and based on the obtained 3D model, the digging depth d corresponding to the digging trajectory can be obtained. In the digging process, the medium loaded into the dipper continues to accumulate. Therefore, the mass, moment of inertia, and centroid of the dippermaterial system are constantly changing, and the required digging force depends not only on the current working conditions but also on the historical digging trajectory. Essentially, dynamic load prediction can be summarized as a time series modeling task. For simplicity, let \({\varvec{x}} = [\psi ,r,\dot{\psi },\dot{r},\ddot{\psi },\ddot{r},d]\) and \({\varvec{y}}{\mathbf{ = }}[F_{r} ,F_{h} ,F_{t} ,F_{n} ]\). The goal of load prediction is to find a suitable function \(f\) that can efficiently and accurately map from input \({\varvec{x}}\) to digging forces \({\varvec{y}}\), e.g.,
In addition, the nonobservable variable (e.g., the resistive force due to mediadipper interactions) which is not measured in data, and cannot be directly modeled and solved. We can introduce available prior physics knowledge (e.g., Lagrangian mechanics, the conservation of energy) as extra constraints into the function \(f\) for indirectly solving the latent variable.
2.2 Digging Resistance Model
During the digging process, the dipper interacts with the excavated material, and the excavated material produces a great resistance to the dipper originating from the presence of surcharge, cohesion, etc. As mentioned, empirical equations of modeling the resistance of complex digging process may not be reliable, but give insights on digging resistant force prediction. Figure 4 shows the various forces components due to the interaction of media tools that the dipper needs to overcome during the digging process. \(F_{t}\) is the tangential resistance parallel to the direction of tip motion; \(F_{n}\) is the normal resistance normal to the direction of tip motion, and \(G\) is the gravity of the medium loaded in the dipper.
Herein, the dynamic prediction model of the digging resistant force based on the method of trial wedges proposed by McKyes et al. [6] is selected to predict the tangential resistance. With this method, the tangential resistance can be divided into three parts, including the cutting resistance tangential components \(F_{c}\), the velocity effect resistance \(F_{v}\), and the resistance caused by the extrusion from the two sides of the dipper \(F_{s}\).
where \(F_{c}\) can be obtained as Eq. [6].
where ω is the dipper width, γ is medium specific mass, d is the digging depth, c is the medium cohesion, v is the speed of dipper teeth, β is the digging angle, ρ is the failure plane angle, ψ denotes the internal friction angle of the medium, and δ denotes the external friction angle.\(F_{v}\) and \(F_{s}\) can be obtained through Eqs. (4) and (5).
When the dipper cuts through the medium, the bottom of the tip compresses the medium, thus, the normal resistance due to the extrusion reaction arises whose orientation is perpendicular to the speed of dipper teeth. And it’s difficult to obtain an analytical expression of the normal resistance \(F_{n}\) because it depends on both digging operations and the medium’s hardness. Usually, the value is obtained by multiplying the tangential cutting resistance by a factor obtained from experience [7]. However, in this article, we use a neural network to represent the normal resistance:
2.3 Deep Lagrangian Networks (DeLaN)
The purpose of establishing dynamic models of the digging process is to predict how the state of the system evolves over time by a vector of generalized variables \({\varvec{q}} \in {\mathbb{R}}^{N}\) and velocities \(\dot{\user2{q}} \in {\mathbb{R}}^{N}\), where N is the number of coordinates. DeLaN uses knowledge originating from the EulerLagrange equation and encodes this prior within a flexible deep learning architecture [21]. Based on this architecture, all learned models adhere to Lagrangian mechanics. Specifically, the Lagrangian of a rigid body is generally defined as
where \(T = {1 \mathord{\left/ {\vphantom {1 2}} \right. \kern\nulldelimiterspace} 2}\dot{\user2{q}}\user2{H}({\varvec{q}})\dot{\user2{q}}\) is the kinetic energy, V is the potential energy, which can be defined as a scalar function V(q), and H is the positive definite mass matrix. Substituting L into the EulerLagrange differential equation yields the second order ordinary differential equation (ODE) described by:
where τ represents the nonconservative generalized forces, for ECS system, mainly refers to motors force effects and the resistive force due to mediadipper interaction, c describes Coriolis and centripetal effects, and \({{  \partial V} \mathord{\left/ {\vphantom {{  \partial V} {\partial {\varvec{q}}}}} \right. \kern\nulldelimiterspace} {\partial {\varvec{q}}}}\) is gravity [25]. In DeLaN, the unknown functions \({\varvec{H}}({\varvec{q}})\) and \(V({\varvec{q}})\) are represented as a feedforward network, i.e.,
where \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{.}\) refers to an approximation, \(\user2{\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{L} }\) is a lower triangular matrix with a nonnegative diagonal, and θ_{1} and θ_{2} are the network parameters, and one can encode the ODE by exploiting the full differentiability of the neural networks. The parameters θ_{1} and θ_{2} can be learned online and endtoend by minimizing the violation of the physical law described by the ODE. The basic architecture of the DeLaN [26] can be found in Figure 5.
2.4 LongShortTerm Memory Networks (LSTM)
LSTMs have achieved stateoftheart performance in a range of different domains comprising sequential data, such as natural language processing (NLP) [27], load prediction [28], and remaining useful life (RUL) estimation [29]. In Figure 6, we show a typical structure of an LSTM's hidden nodes incorporating four interacting units, including an internal cell, an input gate, a forget gate, and an output gate [30]. The internal cell memorizes the cell state at the previous time step via a selfrecurrent connection. The input gate controls the flow of input activation into the internal cell state. The output gate regulates the flow of output activation into the LSTM cell output. The forget gate scales the internal cell state, enabling the LSTM cell to adaptively forget or reset the cell’s memory [24]. Specifically, given the previous hidden output \(h_{{t{  1}}}\), cell state memory \(C_{t  1}\), and current input \(x_{t}\), the current hidden output \(h_{t}\) can be computed in the following way:
where \(\sigma\) is the logistic sigmoid function, \(\odot\) represents for the Hadamard product, and \({\varvec{U}}^{[\zeta ]}\) (\({\varvec{W}}^{[\zeta ]}\)) denotes the weight matrix between the current input \({\varvec{x}}_{t}\)(\({\varvec{h}}_{t  1}\)) and the operations \(\zeta (\zeta \in i,f,o,g)\).
3 Methodology
In the proposed framework, we incorporate available physics information (e.g., theoretical model of digging resistant force, Lagrangian mechanics, and energy conservation) into the PINN architecture to predict the dynamic load under different positions and postures of the ECS during digging process.
3.1 Overall Framework
In its most general form, we model the continuoustime dynamic digging process of an ECS by modeling ① the digging resistant force between the dipper and the surrounding medium, and ② the dynamic part, including the inertial effect, Coriolis and centripetal effects, and gravity. The overall framework of the HPINN mainly consists of four components, namely, an empirical resistance equation for modeling the tangential digging resistance, a deep LSTM network for modeling the normal digging resistance, and a central finite difference filterbased numerical differentiator for calculating the derivatives of the kinetic energy with respect to time, as described in Figure 7.
For modelling of the digging resistant force, we use an empirical resistance equation proposed by McKyes et al. [6] to model the tangential resistance, which is mainly related to the physical mechanical properties of the excavated medium, and has an analytical expression. Different from the classical mechanistic model, which calculates the normal digging resistance based on the tangential digging resistance multiplied by a proportionality coefficient, in this paper, we present a methodology for modeling the difficulttomodel normal digging resistance \(F_{n}\) using an LSTM network with the trainable weights and biases contained in \(\theta\). The normal digging resistance is a function of the digging trajectory, digging speed and digging depth. Therefore, the input of the LSTM network includes digging trajectory, digging speed and the digging depth corresponding to the digging profile. The digging trajectory and digging speed can be obtained directly through sensor acquisition, and the digging depth can be calculated according to the digging trajectory and the shape of ore pile. Since the digging resistance is generated by the contact between the dipper and the surrounding medium, which cannot be directly measured, in order to establish endtoend supervised training, the output of the LSTM network needs to be projected into the generalized coordinates to participate in the calculation as part of the mechanical equation.
Akin to Lutter et al. [21], we incorporate the physical prior of Lagrangian mechanics into the framework to calculate the dynamic part of the ECS. Meanwhile, there are key distinctions between the approaches. As previously mentioned, during the digging process, the mass, rotational inertia, and centroid of the dippermedium system are related not only to the current position but also to the digging profile that the dipper has traveled, which can be summarized as a time series modeling task. It is known that LSTM is uniquely capable of modeling longterm dependence in time series data. In order to capture the sequencetosequence inputoutput relationship, we extended the DeLaN using an LSTM neural network instead of a feedforward network as the baseline neural network model. All parameters of the extended DeLaN are contained in \(\vartheta\). The input of DeLaN is the sensor data sequence collected during the digging process, including digging trajectory, digging speed, digging acceleration and digging depth information calculated according to the digging profile and the shape of ore pile, and the output is the dynamic part of the system.
3.2 Loss Function
Having established the parameterization for the model of the complex digging process, we now discuss how to optimize these parameters from data using standard endtoend optimization techniques. With the available training data, the “Lagrangian mechanics loss function” of the HPINN can be formulated as Eq. (11).
where N represents the number of measurement (data) samples, T is the number of sampling steps in the time series, \(f^{  1} (\vartheta )\) denotes the inverse model of Eq. (9), which is the dynamic part of ECS, and \({\varvec{\tau}}_{{\varvec{M}}}\) denotes the external forces, including motor forces \(\{ F_{r} ,F_{h} \}\) projected to the generalized coordinate and the interaction force between the dipper and the surrounding medium \(\{ F_{t} ,F_{n} \}\). \(\theta\) and \(\vartheta\) are the trainable weights and biases of the extended deep Lagrangian network and LSTM, respectively. \(\lambda\) is a suitable hyperparameter that controls the regularization strength. \({\varvec{\varOmega}}\) denotes the regularization term, and \({\varvec{\varOmega}}= \sum\nolimits_{{\omega \in \{ \theta ,\vartheta \} }} {{\varvec{\omega}}^{2} }\) , where ω is the parameter to be trained.
In addition to the Lagrangian mechanics objective, ECS needs to observe the conservation of energy that is the total energy of the system must be equal at every moment during its operation. This objective can also be expressed using the change in energy, i.e.,
In particular, the derivatives of kinetic energy with respect to time, namely, \({{{\text{d}}T} \mathord{\left/ {\vphantom {{{\text{d}}T} {{\text{d}}t}}} \right. \kern\nulldelimiterspace} {{\text{d}}t}}\), respectively, cannot be computed using automatic differentiation, as t is not an input of the network. Thus, we developed a central finite difference filterbased numerical differentiator for the calculation. Adding energy conservation to the optimization problem yields “energy loss” for the HPINN.
As a result, the proposed HPINN architecture can be trained by solving the following optimization problem through a standard training algorithm (e.g., gradient descent):
where \({\varvec{J}}\left( {\theta ,\vartheta } \right)\) is the generic total hybrid loss function composed of both data loss and domain loss, as follows:
where, α and η are userdefined hyperparameter determining the weight of each term in the objective function for convergence control (e.g., inversely proportional to the magnitude of each term, or for simplicity α = η= 1). The purpose here is to optimize the network parameters \(\left\{ {\theta ,\vartheta } \right\}\) for both the deep LSTM networks and extended DeLaN, such that the HPINN is able to accurately fit the measurement samples while meeting the domain constraints. This process improves the capabilities of the HPINN to model potential nonlinear, sequencetosequence, inputoutput relationships within a physically feasible solution space.
4 Experimental Investigation on Synthetic Dataset
To comprehensively demonstrate the applicability and superiority of proposed framework, as well as the ability of accurately model latent variables that cannot be directly measured but have specific physical meaning, a synthetic dataset, which from the ideal dynamic equation of WK55type ECS, is used to test. Latent variable values cannot be obtained from field measurements, but they have explicit values when synthesizing a dataset.
4.1 Dataset Preparation
To generate a synthetic dataset for model training, we first need to describe the digging trajectory of the ECS. As described in Section 2, the hoist motor and crowd motor cooperate to drive the dipper teeth along a predetermined trajectory and complete a digging task. Here, two sixthorder polynomials are applied to describe the digging trajectory, as given in Ref. [11]. The digging trajectory is formulated as
where \(a_{0\sim 6}\) and \(b_{0\sim 6}\) denote polynomial coefficients, \(s_{x}\) and \(s_{y}\) are the positions of the dipper tip in the horizontal and vertical directions based on the reference Cartesian coordinates.
Figure 8 shows the trajectory of a dipper tip of the excavator during the digging stage using two sixthorder polynomials. The origin of the digging cycle is at the bottom of media point B, and the end of the cycle is at point C. The ECS is located on the AB plane.
By solving inverse kinematics of the shovel, the motion information of the joint space in the digging process can be obtained, including the angle between the vertical direction and the axis of the dipper handle \(\psi\), the stretching length of the dipper handle r, the angular velocity of the dipper handle \(\dot{\psi }\), the stretching velocity of the dipper handle \(\dot{r}\), the angular acceleration of the dipper handle \(\ddot{\psi }\), and the stretching acceleration of the dipper handle \(\ddot{r}\). The digging resistive forces were calculated based on the method of trial wedges proposed by McKyes et al. [6]. Then based on the dynamic Eq. (17), the corresponding digging forces can be calculated.
where \(L_{b}\) is the length of the dipper handle, \(L_{d}\) is the length of the dipper, \(F_{r}\) is the hoist force, \(F_{h}\) is the crowd force, \(F_{n}\) is the normal excavating resistance, \(F_{t}\) is the tangential excavating resistance, ϑ is the angle between the hoist rope and dipper handle. \(m_{b}\) is the mass of dipper handle, \(m_{d}\) is the total mass of the dipper including the mass of the dipper itself m_{0} and the mass of the loaded material \(m_{t}\) (\(m_{d} = m_{0} + m_{t}\)). And the related parameter values of the shovel in simulation can be found in Ref. [11].
To generate the inputoutput pairs provided for model training and testing, the different coefficients of the polynomial in Eq. (16) are chosen to obtain different digging trajectories. The total time in digging is set to 12 s and sampling time interval is set to 0.1 s. Based on the inverse kinematics and dynamics of the shovel, 1000 time series samples can be obtained. The entire dataset is randomly divided into two parts, including the training dataset with 800 samples, and test dataset with 200 samples. In practical, due to the unevenness of trajectories in real digging, vibration and shock occur during digging, and noise interference inevitably exists in measurement data, which may have a substantial impact on performance. To further examine the noise immunity of the proposed framework, we also added random errors of ±5%, ±10%, ±15%, ±20% and ±25% in positions (ψ, r), velocity (\(\dot{\psi }\), \(\dot{r}\)), acceleration (\(\ddot{\psi }\), \(\ddot{r}\)) and forces (\(F_{r}\), \(F_{h}\)) components, respectively.
4.2 Measurement Indexes
The prediction results were evaluated based on three performance metrics, i.e., Rsquared (R^{2}), mean absolute error (MAE) and root mean square error (RMSE).
where N represents the number of samples, T represents the number of sampling steps in the time series, and \(\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{y}_{t}^{(i)}\), \(y_{t}^{(i)}\) represent the predicted value and actual value, respectively, at time step t for the nth sample, respectively. \(\overline{y}^{(n)}\) is the average value of the nth sample. The R^{2} is closer to 1, the better the performance is. MAE is used to estimate the difference between the groundtruth value and predicted value. This function has strong robustness to large errors, and RMSE also reflects the discreteness of the model. However, compared with the MAE, RMSE is more sensitive to large errors because the error is squared, and large errors are further amplified. These three indices can be utilized to evaluate the performance of this framework from different perspectives.
4.3 Experimental Results and Discussion
The experimental data modeling was performed on a computer with an Intel Xeon Silver 4114 CPU at 2.2 GHz, 64 GB RAM and an NVIDIA Tesla P100 graphics card. The framework is coded by the authors in PyTorch and set as follows: the LSTM network in the proposed HPINN architecture has two LSTM layers and one fully connected layer. The hyperbolic tangent function (tanh) is employed as the nonlinearity, and the standard automatic differentiation toolkit is used to compute the gradients of \({\varvec{H}}({\varvec{q}},d;\theta )\) with respect to \({\varvec{q}}\). The learning rate for the proposed hybrid deep neural network is set to 0.001, and the Adam optimizer is selected as the optimizer.
The prediction results of digging forces under different noise levels are shown in Tables 1 and 2. It can be seen that the performance of HPINN decreases as the noise level increases, but still has high accuracy. For intuitive comparisons, the hoist force and crowd force predicted by the HPINN and real values are also provided in Figures 9 and 10. It can be seen from Figures 9 and 10 that digging forces can be accurately predicted, which indicates that the proposed framework is still able to provide competitive prediction results even if the level of noise is relatively high at 25%.
The HPINN embedded available physics information in model, which has the advantage of clear interpretability with physics meaning. Therefore, it can not only accurately fit the visible variables, but also accurately predict hidden variables. The prediction results of the normal digging resistance under different noise levels are shown in Table 3 and Figure 11. It can be observed that the HPINN is able to provide reasonable predictions under different noise levels, which implies that the proposed framework can successful capture the latent nonlinear expressions of normal digging resistance.
5 Experimental Investigation on Realworld Dataset
For the aim of testing and evaluation, we conducted digging experiments and used sensors to record operating data. The performance of the HPINN is compared to that of the classical mechanistic model and a purely datadriven method, that are most common dynamic load prediction methods, to verify its effectiveness and superiority.
5.1 Experimental Setup
As shown in Figure 12, the experimental equipment used here is an intelligent ECS prototype, that is a 1:7 scale model of the WK55type, and the capacity of the dipper is about 0.16 m^{3}. The main geometry and physical data of the digging device are listed in Table 4, and the corresponding explanatory diagram is illustrated in Figure 13. The hardware for synchronous data acquisitions mainly consists of an industrial computer, a LiDAR for obtaining information about the material to be excavated, two crowd/hoist motor encoders for obtaining position information. And digging forces is uploaded through the drive inverter. Experiments were carried out on a test site constructed from real mine materials, and the whole digging operations are shown in Figure 14.
To obtain sufficient training data, 500 complex pile digging experiments are carried out, and the number of training samples and test samples are set to 400 and 100, respectively. To verify the effective of the HPINN, a pure datadriven method without including physics is constructed, which have two LSTM layers and one fully connected layer, and the Adam optimizer and meansquare error (MSE) are selected as the optimizer and data loss function, respectively.
5.2 Experimental Results and Discussion
This section presents experimental results of the proposed HPINN for dynamic load prediction on a realworld dataset. The prediction results were evaluated with performance metrics used in Section 3, and the digging forces prediction results are shown in Table 5. The hoist and crowd forces predicted by the HPINN, classical analytical method and pure data driven method are also provided in Figure 15 for intuitive comparisons. According to Table 5, the HPINN outperforms the classical analytical method and pure datadriven method on the realworld datasets, i.e., the RMSE and MAE are smaller and the coefficient of determination, R^{2}, is higher. More concretely, the RMSE of the proposed framework is 31.8 %, 21.7% less than that of the classical analytical method and pure datadriven method, respectively. The MAE of the proposed framework is 27.9%, 13.2% less than that of the classical analytical method and pure datadriven method, respectively. Moreover, the R^{2} of the proposed method is 8.6%, 2.3% higher than that of the classical analytical method and pure datadriven method, respectively. These results verify the applicability and superiority of the proposed framework in dynamic digging force prediction.
As shown in Figure 15, compared with the classical analytical method, the predicted curves of HPINN and pure data driven method are more consistent with the groundtruth curve. And the classical analytical method can predict the changing trend of crowd force and hoist force to some extent but cannot make accurate predictions. The main reason lies in that classical analytical approaches are mainly derived from physics and make many idealized assumptions about the system and excavated material, such as its kinematic structure, inertia properties, assumptions regarding the forces acting on the physical system, and the assumption that the excavated material is uniform, continuous and isotropic, which may fail to capture hardtomodel effects, causing inaccuracies via model bias. Moreover, the classical method assumes or simplifies the dynamic digging process as a steady process and does not consider the influence of timevarying dipper penetration and digging speed. Therefore, this approach can predict only a range of crowd force and hoist force and cannot sensitively capture small changes.
Both pure datadriven method and HPINN are able to produce reasonable predictions of digging forces, and the HPINN slightly yields better prediction on whole tendency, which implies that the proposed framework provides reasonable nonlinear expressions and can capture latent correlations for the complicated timedependent data. In many cases, purely databased methods, in particular deep learning models, which ground in the nature of databased and captures rules present in the training data, may reach their limits and produce inaccurate prediction results, due to model variance. Additional, pure datadriven method works in an endtoend manner and cannot model nonobservable variables. The HPINN combines the flexibility of deep learning with the theoretical insights of physics and retains certain physical interpretability, which explores a deeper coupling of ML methods with scientific knowledge. HPINN only restricts the system to comply with general laws of physics, such as energy conservation and force balance, and does not impose too many constraints on the system, so that the model maintains greater flexibility. Moreover, the change of some underlying variables with specific physical meanings can be obtained by using the framework proposed in this paper, such as resistive force, potential, and kinetic energies, which are not observable and hence, cannot be learned supervised. In this paper, the resistive force between the dipper and surrounding media is analyzed, and compared with traditional classical analytical method. The tangential component and normal component of the resistance are illustrated in Figure 16.
During real digging, it is difficult for the bucket to enter the material at the ideal cutting angle, and the material will inevitably be squeezed, so the normal resistance is related to both the digging operation and the medium’s hardness, and has a relatively complex expression. Therefore, the normal resistance cannot be obtained simply by multiplying the tangential resistance by the coefficient.
6 Conclusions

(1)
This paper presents a hybrid physicsinformed deep neural networks framework, named the HPINN, which combines firstprinciples method and datadriven modeling, to predict dynamic load of the ECS.

(2)
In the proposed framework, some parts of the theoretical model of the resistive force are incorporated, while capturing the difficulttomodel part by training a highly expressive LSTM neural network with data.

(3)
The HPINN can not only accurately fit the visible variables, but also accurately predict hidden variables.

(4)
The HPINN combines the flexibility of deep learning with the theoretical insights of physics and retains certain physical interpretability, achieving a deep coupling of ML methods with scientific knowledge.

(5)
Both synthetic data and actual measurement dataset are used to validate the proposed framework, which can also provide a competitive prediction performance with different noise levels.
References
Guzman M Valenzuela, M A Valenzuela. Integrated mechanicalelectrical modeling of an AC electric mining shovel and evaluation of power requirements during a truck loading cycle. IEEE Transactions on Industry Applications, 2015, 51(3): 25902599.
M Dunbabin, P Corke. Autonomous excavation using a rope shovel. Journal of Field Robotics, 2006, 23(6/7): 379394.
S Frimpong, Y Hu. Intelligent cable shovel excavation modeling and simulation. International Journal of Geomechanics, 2008, 8(1): 210.
S Blouin, A Hemami, M Lipsett. Review of resistive force models for earthmoving processes. Journal of Aerospace Engineering, 2001, 14(3): 102111.
A R Reece. Paper 2: The fundamental equation of earthmoving mechanics. Proceedings of the Institution of Mechanical Engineers, Conference Proceedings, 1964, 179(6): 1622.
E McKyes, O S Ali. The cutting of soil by narrow blades. Journal of Terramechanics, 1977, 14(2): 4358.
B Wei, F Gao, J Chen, et al. Mechanics performance of threedegreeoffreedom excavating mechanism for an electric shovel. Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, 2011, 225(6): 14431457.
K AwuahOffei, S Frimpong. Cable shovel digging optimization for energy efficiency. Mechanism and Machine Theory, 2007, 42(8): 9951006.
K AwuahOffei, S Frimpong. Efficient cable shovel excavation in surface mines. Geotechnical and Geological Engineering, 2011, 29(1): 1926.
M Stavropoulou, G Xiroudakis, G Exadaktylos. Analytical model for estimation of digging forces and specific energy of cable shovel. Coupled Systems Mechanics, 2013, 2(1): 2351.
X Wang, W Sun, E Li, et al. Energyminimum optimization of the intelligent excavating process for large cable shovel through trajectory planning. Structural and Multidisciplinary Optimization, 2018, 58(5): 22192237.
Y Li, S Frimpong. Hybrid virtual prototype for analyzing cable shovel component stress. International Journal of Advanced Manufacturing Technology, 2008, 37(56): 423430.
A Rasuli, S Tafazoli, W G Dunford. Dynamic modeling, parameter identification, and payload estimation of mining cable shovels. 2014 IEEE Industry Application Society Annual Meeting, 2014: 14795223.
S Frimpong, Y Hu, K AwuahOffei. Mechanics of cable shovelformation interactions in surface mining excavations. Journal of Terramechanics, 2005, 42(1): 1533.
S Chen, S A Billings. Neural networks for nonlinear dynamic system modelling and identification. International Journal of Control, 1992, 56(2): 319346.
C Wang, D Chen, J Chen, et al. Deep regression adaptation networks with modelbased transfer learning for dynamic load identification in the frequency domain. Engineering Applications of Artificial Intelligence, 2021, 102: 104244.
X Gao, M Shi, X Song, et al. Recurrent neural networks for realtime prediction of TBM operating parameters. Automation in Construction, 2019, 98: 225235.
K M Rashid, J Louis. Timesseries data augmentation and deep learning for construction equipment activity recognition. Advanced Engineering Informatics, 2019, 42: 100944.
M Raissi, P Perdikaris, G E Karniadakis. Physicsinformed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational Physics, 2019, 378: 686707.
B Zhao, C Cheng, G Tu, et al. An interpretable denoising layer for neural networks based on reproducing kernel hilbert space and its application in machine fault diagnosis. Chinese Journal of Mechanical Engineering, 2021, 34: 44.
M Lutter, K Listmann, J Peters. Deep Lagrangian networks for endtoend learning of energybased control for underactuated systems. IEEE International Conference on Intelligent Robots and Systems, 2019: 77187725.
X Jia, J Willard, A Karpatne, et al. Physicsguided machine learning for scientific discovery: an application in simulating lake temperature profiles. ACM/IMS Transactions on Data Science, 2021, 2(3): 126.
N Muralidhar, M R Islam, M Marwah, et al. Incorporating prior domain knowledge into deep neural networks. Proceedings  2018 IEEE International Conference on Big Data, Big Data, IEEE, 2019: 3645.
R Zhang, Y Liu, H Sun. Physicsinformed multiLSTM networks for metamodeling of nonlinear structures. Computer Methods in Applied Mechanics and Engineering, 2020, 369: 113226.
R M Murray, Z Li, S Shankar Sastry. A mathematical introduction to robotic manipulation. Boca Raton: CRC Press, 2017.
M Cranmer, S Greydanus, S Hoyer, et al. Lagrangian neural networks. arXiv:2003.04630.
R Collobert, J Weston. A unified architecture for natural language processing: Deep neural networks with multitask learning. Proceedings of the 25th International Conference on Machine Learning, 2008: 160167.
C Qin, G Shi, J Tao, et al. Precise cutterhead torque prediction for shield tunneling machines using a novel hybrid deep neural network. Mechanical Systems and Signal Processing, 2021, 151: 107386.
Y Wang, Y Zhao, S Addepalli. Practical options for adopting recurrent neural network and its variants on remaining useful life prediction. Chinese Journal of Mechanical Engineering, 2021, 34: 69.
S Hochreiter, J Schmidhuber. Long Shortterm memory. Neural Computation, 1997, 9: 17351780.
Acknowledgements
Not applicable.
Funding
Supported by National Natural Science Foundation of China (Grant No. 52075068) and Shanxi Provincial Science and Technology Major Project (Grant No. 20191101014).
Author information
Authors and Affiliations
Contributions
TF conducted most of the research work, including the literature research, coding, results analysis, and writing. TZ and YC assisted with the results analysis and manuscript revision. XS is supervisors who offered the original idea and coordinated the whole trial. All authors read and approved the final manuscript.
Authors’ Information
Tao Fu, born in 1993, is currently a PhD candidate at School of Mechanical Engineering, Dalian University of Technology, China. His research interests include datadriven modeling and intelligent mechanical equipment.
Tianci Zhang, born in 1993, is currently a PhD candidate at School of Mechanical Engineering, Dalian University of Technology, China. His research interests include autonomous decision and control for robotics, intelligent mechanical equipment, and deep learning for industrial data mining.
Yunhao Cui, born in 1986, is currently a PhD candidate at School of Mechanical Engineering, Dalian University of Technology, China. His research interests include point cloud data processing, intelligent mechanical equipment, and 3D environment perception.
Xueguan Song, born in 1982, is currently a professor at School of Mechanical Engineering, Dalian University of Technology, China. His research interests include multidisplinary design optimization, surrogate modeling, intelligent mechanical equipment, computational fluid dynamics, and digital twin.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing financial interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fu, T., Zhang, T., Cui, Y. et al. Novel Hybrid PhysicsInformed Deep Neural Network for Dynamic Load Prediction of Electric Cable Shovel. Chin. J. Mech. Eng. 35, 147 (2022). https://doi.org/10.1186/s1003302200817x
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1186/s1003302200817x
Keywords
 Hybrid physicsinformed deep learning
 Dynamic load prediction
 Electric cable shovel (ECS)
 Long shortterm memory (LSTM)