 Original Article
 Open access
 Published:
Optimization on the Crosswind Stability of Trains Using Neural Network Surrogate Model
Chinese Journal of Mechanical Engineering volume 34, Article number: 86 (2021)
Abstract
Under the influence of crosswinds, the running safety of trains will decrease sharply, so it is necessary to optimize the suspension parameters of trains. This paper studies the dynamic performance of highspeed trains under crosswind conditions, and optimizes the running safety of train. A computational fluid dynamics simulation was used to determine the aerodynamic loads and moments experienced by a train. A series of dynamic models of a train, with different dynamic parameters were constructed, and analyzed, with safety metrics for these being determined. Finally, a surrogate model was built and an optimization algorithm was used upon this surrogate model, to find the minimum possible values for: derailment coefficient, vertical wheelrail contact force, wheel load reduction ratio, wheel lateral force and overturning coefficient. There were 9 design variables, all associated with the dynamic parameters of the bogie. When the train was running with the speed of 350 km/h, under a crosswind speed of 15 m/s, the benchmark dynamic model performed poorly. The derailment coefficient was 1.31. The vertical wheelrail contact force was 133.30 kN. The wheel load reduction rate was 0.643. The wheel lateral force was 85.67 kN, and the overturning coefficient was 0.425. After optimization, under the same running conditions, the metrics of the train were 0.268, 100.44 kN, 0.474, 34.36 kN, and 0.421, respectively. This paper show that by combining train aerodynamics, vehicle system dynamics and manyobjective optimization theory, a train’s stability can be more comprehensively analyzed, with more safety metrics being considered.
1 Introduction
The advantage of highspeed trains over conventional trains is their high speed [1]. Therefore, the existence and potential problem of crosswinds questions the advantage of highspeed trains [2]. It is necessary to study the crosswind stability of highspeed trains. In fact, vehicle dynamics and optimization have always been the subject of scholars’ research. In terms of dynamics, Baker et al. [3] added wind load to the train dynamic model, and studied the relationship between a train’s derailment coefficient, wheel load reduction coefficient and wind speed. Deng et al. [4] studied the dynamic performance of highspeed trains when passing through wind barriers, and focused on the effects of crosswind based on the two metrics of wheel load reduction and derailment coefficient. Yan et al. [5] used a spectral analysis method to study the overturning risk of highspeed trains subject to strong winds. Olmos et al. [6] used the derailment coefficient and wheel load reduction ratio as metrics to study the safety of trains under the vehiclebridgewind coupling model. Wei et al. [7] used an indirect test method to obtain the wheelrail contact force of a highspeed train in a crosswind environment from an experimental point of view, and then analyzed the safety. Niu et al. [8] studied the influence of coupling regions on the aerodynamic performance of trains under different operating conditions. Yu et al. [9] studied the operational stability of highspeed trains under different wind models. They paid close attention to the differences in wheel load reduction of highspeed trains. In terms of optimization, Li et al. [10] optimized the active suspension system of electric vehicles with the three objectives of reducing the vertical component of the unbalanced electrode, eliminating the negative impact of electromechanical coupling on the vehicle system, and retaining the dynamic advantages of the vehicle’s active suspension. Fossati et al. [11] used driver’s seat vertical acceleration, root mean square of the wheels’ dynamic amplification factor, and maximum relative displacement between each wheel and car body as metrics to optimize the passive suspension vehicle’s parameters and obtained good results. By optimizing the shape of aircraft or trains to reduce the negative impact of aerodynamic loads on them, great progress has been made [12–17]. One of the most important methods is the surrogate model method.
Considering the content introduced above, we have an initial impression of research progress, as shown in Figure 1. In the past, scholars’ research work about train stability can be divided into two categories. One is to combine train aerodynamics and vehicle system dynamics to study the impact of wind on vehicle safety. Another is the combination of vehicle system dynamics and optimization theory to optimize vehicle’s safety by adjusting suspension parameters. Both types of research have made good progress and guided later scholars’ work, but some areas are still worth studying. Firstly, the deterioration of train’s safety mainly occurs in some extreme situations, such as in a crosswind environment. The primary purpose of our study is to improve the safety of trains running in these situations. Therefore, it is necessary to combine train aerodynamics, vehicle system dynamics and optimization theory. Secondly, when previous scholars researched the dynamic performance of trains, especially with regards to safety, two or three metrics were usually selected, such as derailment coefficient and wheel load reduction ratio, or derailment coefficient and overturning coefficient. But in the industry standard, the number of safety metrics is more than two. Code GB559985 [18] is one of the national standards of the People's Republic of China, which specifies the dynamic performance evaluation metric of railway vehicles. There are at least five safety metrics specified in code GB 559985, which are derailment coefficient, vertical wheelrail contact force, wheel lateral force, wheel load reduction rate, and overturning coefficient. If suspension parameters are optimized for only some of these metrics, the final optimization results may not be satisfactory. When the train is running, some metrics may be improved due to optimization, but other metrics may deteriorate due to the changes in suspension parameters. Therefore, the work of this paper is to combine the three areas mentioned above.
This paper is divided into five parts. The first part is an introduction, which mainly introduces the research progress of vehicle dynamics and optimization. The second part is the methodology, which mainly introduces the theories employed in this research. The third part is the model introduction and related verification. The fourth part is the research process and result analysis. The fifth part is the conclusions.
2 Methodology
2.1 Surrogate Model Method
The main idea of the surrogate model method is to fit a series of isolated sample points into a function, based on mapping relationships, as shown in Figure 2.
Consider a function with only a 1dimensional design variable and a 1dimensional objective output. The points in Figure 2 represent sample points, and we know the input and output values of the sample points. If there are only three sampling points, we can also fit a curve with a functional relationship. This is our surrogate model. However, the surrogate model currently is very rough. Its mapping accuracy is very poor, because there are too few sample points. With the number of sample points increasing, our surrogate model will become more accurate until it can accurately express the real characteristics of input and output. It is obvious from Figure 2 that the minimum value of the function is in the red circle. Sometimes we don’t need the surrogate model to be sufficiently accurate globally, because meeting global accuracy often requires a large number of sample points. We can build a rough surrogate model with widely distributed sample points and find the interval where the minimum point may exist, as shown by the red circle. Then continuously increasing the number of sample points for this interval, we construct the surrogate model, until the surrogate model is sufficiently accurate for this small interval. What has been described above is a case with onedimensional input and onedimensional output. The dimensions of the design variables and output can be generalized to \(m\) and \(n\), and the principle remains the same.
It has been shown that neural networks have good datafitting ability [19]. This means that even a twolayer neural network can fit almost any function as long as the parameters are appropriate. Therefore, this paper uses a BP neural network to build the surrogate model. A BP neural network uses the error back propagation method. It approximates real functions by continuously adjusting weights and offset values [20].
Suppose the input is \({\varvec{X}} = \left[ {x_{1} ,x_{2} , \cdots ,x_{n} } \right]\). Then, the output of a neuron can be expressed as y:
where w_{i} is the weight vector, σ is activation function and b is offset.
Suppose the final output of the entire neural network is o^{l}, and the real output of the system is y^{l}. Assume that the cost function is the mean square error function:
Finally, the weights and offsets are updated until the gradient is small enough and the value of the cost function no longer decreases:
where W^{k} and b^{k} are the weights and offsets of the kth layer neural network, respectively.
Under normal circumstances, the structure of a neural network is as shown in Figure 3.
Then the output of the neurons in each layer of the neural network can be expressed as:
where \(\sigma_{{h_{1} }}\), \(\sigma_{{h_{k} }}\) and \(\sigma_{o}\) are the activation functions of each layer. o is the output of the neural network. h_{k} is a hidden layer vector. \({\varvec{h}}_{{\varvec{k}}} = \left( {{\varvec{h}}_{{{\varvec{k}},{\mathbf{1}}}} ,{\varvec{h}}_{{{\varvec{k}},{\mathbf{2}}}} , \cdots ,{\varvec{h}}_{{{\varvec{k}},{\varvec{n}}}} } \right)^{{\text{T}}}\). n is the number of neurons in the hidden layer.
2.2 ManyObjective Optimization
Consider a maximization problem with two decision vectors \({\varvec{a}},{\varvec{b}} \in {\varvec{X}}\). Then, a is said to dominate b (also written as a≻b) if: \(\forall i \in \left\{ {1,2, \cdots ,n} \right\}:f_{i} \left( {\varvec{a}} \right) \ge f_{i} \left( {\varvec{b}} \right) \wedge \exists i\left\{ {1,2, \cdots ,n} \right\}:\)
Additionally, in this study a is said to cover b if a≻b or f(a)=f(b). The decision vectors that are nondominated within the entire search space are denoted as paretooptimal and are called the paretooptimal front [21].
When the number of objectives is 2 ≤ n ≤ 3, the optimization is called multiobjective optimization, and when n > 3, it is called manyobjective optimization. Deb et al. [22, 23] proposed the NSGAIII algorithm based on NSGAII, which can be used to solve manyobjective problems (n ≥ 3). Traditional algorithms, such as particle swarm multiobjective algorithms and NSGAII algorithms often fail, when the number of optimization objectives exceeds 3. This problem forces us to use a better performing algorithm. Most of the operations of the NSGAIII algorithm are the same as NSGAII, except that the necessary improvements are made to retained childhood. Assuming the number of the objectives is n and the population to be processed is S_{t}. The highest rank of elite level in S_{t} is F_{l} (the last level). Childhood is s. The objectives value of s is f_{i} (s), 1 ≤ i ≤ n.
First, we need to compute the ideal point.
Second, transform objectives.
Third, calculate the extreme values of S_{t}.
Fourth, normalize objectives (f^{n}) by
Fifth, specify or evenly arrange reference points Z^{r}.
Sixth, for each reference point \({\varvec{z}} \in Z^{r}\), we compute reference line \({\varvec{w}} = {\varvec{z}}\). Also, we calculate the distance between s and w by
Seventh, assign \(\uppi ({\varvec{s}}) = {\varvec{w}}:\arg \min_{{w \in Z^{r} }} d\) and \(d({\varvec{s}}) = d({\varvec{s}},\uppi ({\varvec{s}}))\) respectively.
Finally, select the remaining new individuals from F_{l} according to the distance and distribution.
3 Model Introduction and Related Verification
3.1 CFD Model
The 1:8 scale train model is used as the research object, as is common in the field of train aerodynamics research. The model is divided into a head car, a middle car and a tail car, as shown in Figure 4.
The characteristic height of the train h = 0.52 m, the length of the head and tail cars l_{1} = 3.45 m, and the length of the middle car l_{2} = 3.125 m. The crosssection area of train is 0.1687 m^{2}.
Li et al. [24] used this model as to study the influence of intercar gap length on aerodynamics, based on experiments in a windless environment. If you want to simulate a crosswind environment, you must make corresponding improvements to the calculation domain. Code TB/T 3503.4 [25] is one of the railway industry standards of the People’s Republic of China. It is a standard for numerical simulation of train aerodynamics. When a train is running in windless environments, it stipulates that the ratio of the crosssectional area of the train to the crosssectional area of the calculated domain should be less than 0.01. The calculation domain height L should not be less than 8 times the characteristic height h, the length of upstream L_{F} should be not less than 8 times the characteristic height h, and the length of the downstream L_{B} should be not less than 16 times the characteristic height h. When the train is running in windy conditions, the requirements for the size of the domain are similar to those in windless environments, but two additional conditions are added. The length of the upstream W_{U} should not be less than 8 times the characteristic height h, and the length of the downstream W_{D} should not be less than 16 times the characteristic height h. Therefore, to obtain the accurate aerodynamic load of the model under crosswind conditions, the calculation domain must meet both the windless requirements and the crosswind requirements. If the simulation results are not significantly different from the experimental results under windless conditions, we must continue to study the aerodynamic performance of the train under crosswind condition. In this paper the relative position of the train in the calculation domain is shown in the Figure 4(c).
The L_{F}, L_{B}, W_{U}, W_{D} and L meet the requirements of windless and crosswind conditions. In addition, in order to ensure the accuracy of aerodynamic calculations, three refine zones are set around the train, as shown by the dotted lines.
3.2 Related Settings of Aerodynamic Simulation
Aerodynamic simulations, presented here are based on the commercial software Fluent. The simulation mode is steadystate. Li et al. [26] studied the aerodynamic behavior of a train under crosswinds based on the SST kω turbulence model. Comparing 6 different turbulence models, the SST kω model is most suitable for the numerical simulation, and so, is used here. The solver is pressurebased. When the operating condition is windless, the speed of the velocity inlet v (x, y, z) = (60, 0, 0) m/s, because the wind speed in the wind tunnel test was 60 m/s. In this case, DAEH is the velocity inlet and BCGF is the pressure outlet. EABF, ABCD, HDCG and HEFG are the wall. HDCG is a sliding wall surface with a sliding speed of 60 m/s. When the train is running under the crosswind, the speed of the velocity inlet v(x, y, z) = (97.2, 15, 0) m/s, so that it can simulate the train running at 350 km/h while encountering a crosswind with a speed of 15 m/s. The boundary conditions are shown in Figure 4(c). HDCG is sliding wall surface with a sliding speed of 97.2 m/s. The number of boundary layers is 12 layers, y+ = 1. The height of the first boundary layer is 1 × 10^{−5} m. ρ = 1.225 kg/m^{3}. The train grid and the boundary layer grid are shown in Figure 5.
3.3 Aerodynamic Model Verification
This paper not only has experimental data verification, but also does grid independence testing to eliminate the difference in results caused by the number of grids. The study by Li et al. [24] provided experimental data of the model's aerodynamic load when the train is running under windless conditions. Aerodynamic coefficients are solved using definition 11 and definition 12.
where F_{d} is drag force, and F_{l} is lift force of the train. v=60 m/s, A =0.1687 m^{2} and ρ=1.225 kg/m^{3}.
A total of four sets of grids were used. As shown in the Table 1, EXP represents experimental values. Among them, the mesh amount of the mesh2 exceeds 30 million, and the simulation results are not significantly different from the mesh3, mesh4, and experimental results. The larger the number of grids, the greater computational cost. Based on the above considerations, mesh2 is selected for further research.
3.4 Vehicle Dynamic Model
The dynamic model is a single vehicle model, and both bogies are trailer bogies, as shown in Figure 6. The Dynamic model has a primarysuspension system and a secondarysuspension system. The mechanical parameters of some components have nonlinear characteristics. The letters in the Figure 6 represent the main components of train, as shown in Table 2.
The nonlinear characteristics of some components are shown in Figure 7. This paper selects the stiffness of steel spring of the primarysuspension in x, y, and z directions, the stiffness of the axle box arm positioning node in the x, y, z directions, the stiffness of the antirolling torsion bar, the damping of the shock absorber of the primarysuspension, and the damping of the yaw damper as design variables. There are nine variables. The damping force of yaw damper is velocitydependent, as shown in the Figure 7(c). It has different damping values at different speed grades. It is inappropriate to artificially choose fixed damping values for the yaw damper. But we can specify the coefficient α to change the slope of the damping force function. The new damping force function can be set as αc_{3}. Parameters of the benchmark model and the range of each design variable are shown in Table 3. We use Latin hypercube sampling technology to construct 200 initial samples in the design space. These dynamic model samples have different suspension parameters.
3.5 Related Settings of Vehicle Dynamics Simulation
Vehicle dynamic simulations, presented in this paper are based on the commercial software Simpack. JingJin track spectrum is used as the track spectrum. The wheel tread type is LM tread. The wheelrail contact force can be calculated by the simplified theory of Kalker (FASTSIM) [27]. The vehicle speed is 350 km/h. The train is run for 20 s. The number of sampling points is 5100. The wind load on the train consists of two parts: aerodynamic forces and aerodynamic moments. Lift and side forces are regarded as concentrated forces acting on the center of the car body. The roll moment, pitch moment and yaw moment obtained by CFD simulation are the moments of the aerodynamic force on the center of the car body. Therefore, in the vehicle dynamics model, they also act on the center of the car body. The wind load is applied to the centre of the car body from 0 s, and reaches full load after 2 s after which the wind load remains unchanged. When the simulation is completed, the wheelrail vertical force and wheelrail lateral force of each wheel are extracted.
3.6 Vehicle Dynamic Model Verification
The topological structure of all dynamic model samples is the same as that of the benchmark train. We verify whether this structure can respond correctly when excited. The train is only subjected to two types of loads, namely constant aerodynamic load and fluctuating track excitation. These two loads have been verified separately.
3.6.1 Aerodynamic Response Verification
For a moving train, side force has a very large impact on safety. Therefore, we use side force to verify whether the model behaves normally when encountering a constant load. When the train is running on a straight track (without the excitation effect of the track spectrum), the lateral resultant force of rail acting on the wheelset should be equal to the value of the side force of the train, but in the opposite direction. The time history curve of the side force and the resultant force of rail acting on the wheelset are shown in Figure 8. The results show that when the train runs stably, the resultant force on the wheelsets is 10 kN. This shows that the response of the model is normal.
3.6.2 Rail Excitation Verification
In this section, we only consider the influence of track excitation on the dynamic behavior of the train, that is, with the train running without crosswinds. In the research process, the track spectrum we used is the JingJin spectrum. It is measured on the railway line. In order to check whether the wheelset response is normal, we replaced the JingJin spectrum with a sine function spectrum. The waveform of the track in the z direction is determined by Eq. (13):
where z is the unevenness of the track. A=0.5 mm. x is the longitudinal distance of the track. λ is the wavelength, λ=97.2 m.
When the train is excited by the track, the wheelsets will be forced to vibrate. If the model is accurate, the vibration frequency of the wheelset should be equal to the excitation frequency, when the state of motion is stable. The excitation frequency is as follows:
where v is the train speed, v=97.2 m/s.
The verification result of the vertical displacement of each wheelset is shown in Figure 9.
Figure 9 shows that in the time interval of 10‒20 s, the vertical displacement curve of each wheelset contains 10 cycles. The vibration frequency is 1 Hz. The response of the wheelset is correct. In summary, the model can respond correctly to both constant aerodynamic loads and fluctuating track excitations.
4 Optimization Process and Results Analysis
4.1 Aerodynamic Performance of HighSpeed Train under Crosswind
The pressure distribution of the train is shown in Figure 10. More detail about the pressure distribution of head car and tail car as shown in Figure 11. The streamline distribution of the crosssection in each car as shown in Figure 12. The crosssections were located in center point of each car. Figure 13 shows the vortex structure around the train in the form of vorticity.
It can be seen from Figure 10 and Figure 11 that there is a large positive pressure zone on the windward side of the train. This is caused by the force of the crosswind. For the head car and the tail car, the streamlined area is most susceptible to crosswinds. On the windward side, there is a large area of positive pressure in the nose tip of the head car. On the leeward side, almost the entire streamlined area of the head car is a negative pressure area. And the value of negative pressure is very large. The huge difference in pressure between the two sides causes the head car to experience huge side forces. In the tail car, the situation was exactly the opposite. The area and value of negative pressure on the windward side of the streamlined area of the tail car are larger than those on the leeward side. This results in the streamlined area of the trailing car being mainly subjected to the pull by the crosswinds rather than a thrust. Figure 12 shows the streamline distribution of each crosssection. On the leeward side of the head car, the airflow has begun to form two vortexes. The cross section of the middle car shows that the vortex generated by the head car has become very large. As the vortex continues to develop, it will gradually move away from the tail car. Figure 13 shows the vortex structure in the form of vorticity. Steadystate simulation can get the main vortex structure around the train. We noticed that the area of the bogie, the leeward side of the head car and the streamline shape area of the tail car are the main areas where the vortex structure is generated. The head car produced two main vortex structures. The vortex structure located below the head car is relatively unstable, while another has been developing steadily along the train, and finally formed into the large vortex structure shown in Figure 12. Vortices may adversely affect the infrastructure around the railway, but these are not within the scope of this paper. This paper pays more attention to the aerodynamic loads on the train. When the train is running with the speed of 350 km/h, under the crosswind speed of 15 m/s, the drag force, side force, lift force and roll moment, pitching moment, and yaw moment of the scaled train are obtained. It is obviously inappropriate to apply the aerodynamic load of the scaled model directly to the fullscale dynamic model. In this paper, the Reynolds number is 3×10^{6}, and yaw angle is 8.7°. Although the Reynolds number of the scale model and the fullsize model are different, the previous work has found that there are few effects of such Reynolds number on the aerodynamic coefficients of trains under different yaw angles between 0° and 30° [28, 29]. Based on the same coefficients of the two models, the aerodynamic load of the scaled model is converted into the aerodynamic load of the fullscale model, as shown in Table 4 and Table 5. However, in the field of train system dynamics, drag force is usually not considered. Among the remaining five types of aerodynamic loads, the values of the side force, roll moment, pitching moment and yaw moment of the head car are the largest. Although the lift force of head car is not the largest in the train, it is not small. Therefore, safety of the head car is probably the worst in the whole train. It is necessary to optimize the dynamics parameters of the head car to ensure that the highspeed train runs safely under crosswind.
4.2 Safety Evaluation Metrics
We have 200 dynamic models with different suspension parameters. The aerodynamic loads shown in the Table 4 are applied to those dynamic models. The wheelrail lateral and vertical forces of the train can be obtained through simulation, and the safety metrics can be calculated. Before research, it is necessary to describe the composition of the safety metrics. The safety metrics used in this study are determined according to GB/T 5599 and EN 140676 [18, 30].

(1)
Derailment coefficient Q/P
where Q and P are the wheelrail lateral and vertical forces, respectively. α is rim angle and μ is wheelrail friction coefficient.

(2)
Wheelrail lateral force L
where P_{0} is the static axle load.

(3)
Vertical wheelrail contact force P_{lim}

(4)
Wheel load reduction rate \(\Delta P/\overline{P}\)
where ∆P is the wheel load reduction; P_{R}, P_{L} are the vertical forces on the left and right wheels, and P_{st} is the vertical static load of the wheelrail.

(5)
overturning coefficient ∆P/P_{st}
where P_{i1} represents the wheelrail vertical force of the front wheel on the load reduction side of the bogie, and P_{j1} represents the wheelrail vertical force of the rear wheel on the load reduction side of the bogie.
All metrics can be calculated from wheelrail forces. Metric values change over time. We find the maximum value of each metrics and record it as the final safety value. Applying wind load to the benchmark model can obtain the safety metrics, as shown in the Table 6. We can see that the derailment coefficient and wheelrail lateral force have exceeded the standard.
4.3 Structure of the Surrogate Model
We take the design variables of each dynamic model as input data, the safety metrics of the corresponding models as output data, and use neural networks to analyze this input and output. The neural network is our surrogate model. Before setting up a surrogate model, we must perform some processing on the data. There are nine design variables, whose range of values vary greatly. If we input them directly into the neural network, the mapping accuracy of the neural network may be poor. Therefore, it is necessary to normalize the input variables so that their values are between 0 and 1.
where x_{new} is the design variable after conversion, x_{i} is the design variable before conversion, x_{imin} is the minimum value of the x_{i}, and x_{imax} is the maximum value of x_{i}.
Similarly, there are five metrics of safety, three of which are coefficients and the other two are forces. There is a huge difference between them. In order to avoid the loss of mapping accuracy of the neural network, we need to deal with it in advance. The difficulty was that, due to the uncertainty, we did not know the precise range of the five metrics before running the simulation. Usually the ranges of overturning coefficient, wheel load reduction ratio and derailment coefficient are between 0‒1. However, the derailment coefficient is special. It is the ratio of lateral force to vertical force. During the simulation, if the value of the lateral force is extremely large and the value of the vertical force is extremely small at the same instant, the maximum value of the derailment coefficient will be very large. In this case, the maximum value of the derailment coefficient of some samples is less than 1, and the maximum value of the derailment coefficient of other samples is much larger than 1. This situation is very unfavorable for the neural network to build a surrogate model. Usually, we don't really care about the specific value of the particularly large derailment coefficient. But, one of the objectives of our optimization is to make the derailment coefficient as small as possible. Therefore, after the dynamic simulation of a certain sample, if the maximum value of the derailment coefficient is greater than 1.5, we artificially set to be equal to 1.5 to avoid excessive differences between samples. The other two metrics that need to be dealt with are the vertical wheelrail contact force and wheelrail lateral force. Their values are much larger than 1. Because of large differences in magnitude between these two metrics and other metrics, it may be difficult for the surrogate model to establish a correct mapping relationship. However, the range of wheelrail vertical force is typically within 0‒200 kN, and the range of wheelrail lateral force is typically within 0‒150 kN. Thus we can use Eq. (20) to crudely normalize these two metrics.
The structure of the surrogate model is shown in Figure 14. It is a neural network with three hidden layers. The number of neurons in each hidden layer is different. Increasing the number of hidden layers and neurons will enhance the ability of the neural network to map nonlinearly, but too many neurons and hidden layers will easily cause the surrogate model to overfit the data, which is not good for optimization. The nonlinear activation functions tanh, softplus and sigmoid are used to enhance the nonlinear fitting ability of the neural network. The optimizer is RMSprop, lr = 0.01, rho = 0.9.
4.4 Process of Optimization
The specific optimization process is shown in Figure 15. Minimizing the values of 5 objectives is the ultimate goal of optimization.
As mentioned above, we already have two hundred sets of data. They comprise the design variables and corresponding safety metrics of each model. They are used as the input and output of the neural network to build a primary surrogate model. Of course, the mapping accuracy of the first attempt at a surrogate model is not sufficient. But NSGAIII algorithms can still find a series of nondominated solutions based on this rough surrogate model. The population number of the NSGAIII algorithm is set to 300, and the evolutionary generation number is set to 300. In each cycle, we always can get a series of nondominated solutions. Partial solutions in this nondominated solution set are randomly selected. Design variables corresponding to these solutions are found and inputted into the dynamic model. Through dynamic simulation, the values of the safety metrics of each model can be obtained and compared with the predicted values obtained by the surrogate model. If the accuracy meets the requirements, it is determined that the optimization has been successful. The nondominated solution set is the paretooptimal front we need. Next, we can find the most suitable solution in the solution set according to preference. If the accuracy does not meet the requirements, the samples that have been simulated will be treated as new sample points and will be used in the expanded sample set together with the original samples in the next stage of neural network training. The accuracy of the surrogate model improves as the number of cycles increases. In terms of model structure, the greater the number of design variables and/or objectives, the more difficult it is to build a surrogate model. The train often experiences random excitations, caused by the effect of the track spectrum, which also makes it very difficult to build a very accurate surrogate model. Therefore, it is necessary to make some tradeoffs between model accuracy and calculation cost. In each cycle, 20 solutions are randomly selected from the optimized pareto solution set. If these solutions satisfy Eq. (21), the accuracy requirements are met:
where o_{i} represents the ith predicted value of the solution by the surrogate model, and y_{i} represents the ith simulation value obtained by the same solution put through the dynamic model.
In summary, \(\sum\nolimits_{i = 1}^{5} {\left {o_{i}  y_{i} } \right} /y_{i}\) represents the sum of the absolute values of the relative errors between the predicted values and the simulation values of each solution. \((1/20)\sum\nolimits_{j = 1}^{20} {\left( \bullet \right)}\) is to average the 20 individuals. Even so, we have paid a very high price. The initial 200 samples were far from enough, and a total of 670 samples were used throughout the study.
4.5 Optimization Result Analysis
A total of 210 pareto solutions were obtained. If the number of objectives is 2 or 3, a 2D plane coordinate system or a 3D space coordinate system could be used to represent the pareto solution. When the number of objectives is greater than 3, it is difficult to visualize the result by the traditional method. Parallel coordinates are introduced here to represent the pareto solution set, as shown in Figure 16. Before the pareto front is presented, it is necessary to perform numerical processing on each metric so that the five metrics having very different values can be represented on one graph, as shown in following equation.
where i=1, 2, ..., 210, i is the serial number of the pareto solution \(\left( \frac{Q}{P} \right)_{inew}\), \(H_{{i{\text{new}}}}\), \(P_{{\lim_{{i{\text{new}}}} }}\), \(\left( {\frac{\Delta P}{{\overline{P}}}} \right)_{{i{\text{new}}}}\),\(\left( {\frac{\Delta P}{{P_{{{\text{st}}}} }}} \right)_{{i{\text{new}}}}\) are the derailment coefficient, wheel lateral force, wheelrail vertical force, and wheel load reduction rate after conversion \(\left( \frac{Q}{P} \right)_{i}\), \(H_{i}\), \(P_{{\lim_{i} }}\), \(\left( {\frac{\Delta P}{{\overline{P}}}} \right)_{i}\),\(\left( {\frac{\Delta P}{{P_{{{\text{st}}}} }}} \right)_{i}\) are the derailment coefficient, wheel lateral force, wheelrail vertical force, and wheel load reduction rate before conversion.
The purple line in Figure 16 represents the paretooptimal front. All solutions are combined into a fivedimensional curved surface. The values of the safety metrics of the benchmark model are also shown on the graph. The pareto front is evenly distributed across the entire curved surface, indicating that pareto solutions are well distributed, proving the reliability of the algorithm. The result shows that four metrics are greatly improved compared to the benchmark model in derailment coefficient, vertical wheelrail contact force, wheelrail lateral force, and wheel load reduction rate. The improvement in overturning coefficient is not obvious. This shows that different objectives have different sensitivities to design variables. By changing the stiffness of the steel spring of the primarysuspension, the damping of the shock absorber of primarysuspension, the damping of the yaw damper, the stiffness of the antirolling torsion bar and the stiffness of the axle box arm positioning node, we can effectively reduce the derailment coefficient, vertical wheelrail contact force, wheelrail lateral force, and wheel load reduction rate of the train. However, it may not be effective to reduce the overturning coefficient only by changing the design variables mentioned above. If it is wished to further reduce the overturning coefficient, other methods may need to be considered. Because the overturning coefficient is quite unusual, it seems to have little correlation with the suspension parameters of the bogie. If we do not want an object to overturn, the most effective measures are to reduce the intensity of the crosswind or increase the mass of the object. By changing the shape of the car body, the effect of crosswind on trains can be reduced, thereby the overturning moment can be greatly reduced, which in turn reduces the overturning coefficient. We select a set of design variables from the pareto solution set, as shown in Table 7. The design variables are used as the input of the dynamic model to obtain the simulation values of the safety metrics. Predicted values by surrogate model and simulation values by dynamic model are shown in Table 8.
In Table 8, the difference between the predicted values and the simulation values is relatively small. Among the simulation values, the five metrics are all better than the benchmark model. The errors’ absolute values of derailment coefficient, vertical wheelrail contact force, wheel load reduction rate, wheelrail lateral force and overturning coefficient are 1.1%, 2.1%, 4.6%, 15.3% and 3.6% respectively. The total error of the 5 objectives is 26.7%. Of course, the improvement of the overturning coefficient is not obvious. The remaining four metrics are greatly improved, compared to the benchmark model. In particular, the derailment coefficient and the wheelrail lateral force are no longer in excess of the safety standards, and they are far from the warning line. Table 8 proves that our idea is feasible. When the train is running in a crosswind environment, safety will be greatly reduced. Optimizing only two of these metrics does not guarantee train safety. Only when all safety metrics are taken into account can an optimization scheme be considered reasonable. And we can do this by combining the surrogate model method and advanced optimization algorithms. None of the five metrics exceeded the safety standards. They were all well below the alert values. Figure 17 shows the wheelrail vertical and lateral forces of wheelset2 of the benchmark model and the optimized model (Wheelset2 being most representative of the differences in the forces experienced by these models). Where A is the lateral wheelrail force of the right wheel, B is the lateral wheelrail force of the left wheel, C is the vertical wheelrail force of the right wheel, and D is the vertical wheelrail force of the left wheel.)
In Figure 17, we can see that both the benchmark model and optimized model's vertical force and lateral force of wheelrail on the windward side are smaller than on the leeward side. Because the wind load is applied to the body center of the car body, the windward side wheel has reduced load and the leeward side wheel has increased load. Wheelrail forces of both models fluctuate over time. This is caused by the synergistic effect of wind loads and track spectrum. However, it is obvious that the wheelrail force fluctuations of the benchmark of the model are more severe than the optimized model. In particular, the benchmark model’s lateral and vertical forces of the wheelrail on the leeward side have many protrusions on the time course curve. This phenomenon did not appear on the optimized model’s time course curve. The peak points of the protrusions are the maximum points of wheelrail forces. In Figure 17(b), the maximum vertical force of the right wheel is 133.30 kN. This value is the P_{lim} of the benchmark model. Other safety metrics are also obtained from wheelrail forces, either directly, or indirectly. The extreme values of forces are the root cause of the train exceeding the safety standards. The optimization process is intended to adjust the dynamic parameters, to alleviate the abnormal fluctuation of wheelrail force and reduce the peak value of the wheelrail force. Reducing peaks is not only helpful for safety. We have reason to believe that this will also improve the smoothness and comfort for passengers when the train is running. But quantitative analysis needs further study.
5 Conclusions

(1)
Combining aerodynamics, the surrogate model method and the manyobjective optimization method, we can optimize train dynamic performance comprehensively, considering more safety metrics. These measures can make trains run more safely.

(2)
By changing the stiffness of the steel spring of the primarysuspension, the damping of the shock absorber of the primarysuspension, the damping of the yaw damper, the stiffness of the antirolling torsion bar and the stiffness of the axle box arm positioning node, we can effectively reduce the derailment coefficient, vertical wheelrail contact force, wheelrail lateral force, and wheel load reduction rate. However, an improvement of the overturning coefficient is not significant, and further research is needed.

(3)
When the train is running, the wheelrail force may fluctuate drastically with time. Many protrusions features will be shown on the wheelrail force timehistory curve. The extreme points are the root cause of exceeding the safety standards. The optimization process is to adjust the parameters of the design variables to suppress the occurrence of fluctuation, thereby improving the safety metrics.
References
H Q Tian. Review of research on highspeed railway aerodynamics in China. Transportation Safety and Environment, 2119, 1(1): 121.
X He, H Li. Review of aerodynamics of highspeed trainbridge system in crosswinds. Journal of Central South University, 2020, 27(4): 10541073.
C Baker, H Hemida, S Iwnick, et al. Integration of crosswind forces into train dynamic modelling. Proceedings of the Institution of Mechanical Engineers, 2011, 225(2): 154164.
E Deng, W C Yang, M F Lei, et al. Aerodynamic loads and traffic safety of highspeed trains when passing through two windproof facilities under crosswind: A comparative study. Engineering Structures, 2019, 188: 320339.
N J Yan, X Z Chen, Y L Li. Assessment of overturning risk of highspeed trains in strong crosswinds using spectral analysis approach. Journal of Wind Engineering and Industrial Aerodynamics, 2018, 174: 103118.
J M Olmos, M A Astiz. Nonlinear vehiclebridgewind interaction model for running safety assessment of highspeed trains over a highpier viaduct. Journal of Sound and Vibration, 2018, 419: 6389.
L Wei, J Zeng, P B Wu, et al. Safety analysis of high speed trains under cross winds using indirect wheelrail force measuring method. Journal of Wind Engineering and Industrial Aerodynamics, 2018, 183: 5567.
J Q Niu, D Zhou, T H Liu, et al. Numerical simulation of aerodynamic performance of a couple multiple units highspeed train. Vehicle System Dynamics, 2017, 55: 681703.
M G Yu, R C Jiang, Q Zhang, et al. Crosswind stability evaluation of highspeed train using different wind models. Chinese Journal of Mechanical Engineering, 2019, 32: 40.
Z Li, L Zheng, Y Ren, et al. Multiobjective optimization of active suspension system in electric vehicle with InWheelMotor against the negative electromechanical coupling effects. Mechanical Systems and Signal Processing, 2019, 116: 545565.
G Fossati, F Miguel, P Casas. Multiobjective optimization of the suspension system parameters of a full vehicle model. Optimization and Engineering, 2019, 20(1): 151177.
L Zhang, J Y Zhang, T Li, et al. A multiobjective aerodynamic optimization design of a highspeed train head under crosswinds. Part F: Journal of Rail and Rapid Transit, 2018, 232(3): 895912.
J Liu, W P Song, Z H Hab, et al. Efficient aerodynamic shape optimization of transonic wings using a parallel infilling strategy and surrogate models. Structural and Multidisciplinary Optimization, 2016, 55(3): 925943.
T Li, Z Y Dai, M G Yue, et al. Numerical investigation on the aerodynamic resistances of doubleunit trains with different gap lengths. Journal of Engineering Applications of Computational Fluid Mechanics, 2021, 15(1): 549–560.
D ViúdezMoreiras. Performance influences on metamodelling for aerodynamic surrogatebased optimization of an aerofoil. Engineering Optimization, 2019, 51(3): 427446.
J MunozPaniagua, J Garcia. Aerodynamic drag optimization of a highspeed train. Journal of Wind Engineering and Industrial Aerodynamics, 2020, 204.
L Zhang, T Li, J Zhang. Research on aerodynamic shape optimization of trains with different dimensional design variables. International Journal of Rail Transportation, 2020, Doi: https://doi.org/10.1080/23248378.2020.1817803.
Standardization Administration. GB55991985. Railway trains – Specification for evaluation the dynamic performance and accreditation test, Beijing: Standardization administration of P.R.C. Standards Press of China, 1985. (in Chinese)
K Funahashi. On the approximate realization of continuous mappings by neural networks. Neural Networks, 1989, 2: 183192.
D E Rumelhart, G E Hinton, R J Williams. Learning representations by backpropagating errors. Nature, 1986, 323: 533536.
M Q Zhang, L Wang, W A Guo, et al. Manyobjective evolutionary algorithm based on relative nondominance matrix. Information Sciences, 2021, 547: 963983.
K Deb, H Jain. An evolutionary manyobjective optimization algorithm using referencepointbased nondominated sorting approach, Part I: Solving problems with box constraints. IEEE Transactions on Evolutionary Computation, 2014, 18(4): 577601.
H Jain, K Deb. An evolutionary manyobjective optimization algorithm using referencepoint based nondominated sorting approach, PartII: Handling constraints and extending to an adaptive approach. IEEE Transactions on Evolutionary Computation, 2014, 18(4): 602622.
T Li, M Li, Z Wang. Effect of the intercar gap length on the aerodynamic characteristics of a highspeed train. Proceedings of the Institution of Mechanical Engineers, Part F: Journal of Rail and Rapid Transit, 2019, 233: 448465.
National Railway Administration of the People’s Republic of China, TB/T 3503.42018 Railway ApplicationsAerodynamicsPart 4: Requirement for train aerodynamic simulation. Beijing: China Railway Culture Media Company Limited, 2018. (in Chinese)
T Li, D Qin, J Y Zhang. Effect of RANS turbulence model on aerodynamic behavior of trains in crosswind. Chinese Journal of Mechanical Engineering, 2019, 32: 85
J J Kalker. A fast algorithm for the simplified theory of rolling contact. Vehicle System Dynamic, 1982, 11: 113
F Dorigatti, M Sterling, C Baker, et al. Crosswind effects on the stability of a model passenger trainA comparison of static and moving experiments. Journal of Wind Engineering and Industrial Aerodynamics, 2015, 138: 3651.
J Du, L Zhang, M Z Yang, et al. Moving model experiments on transient pressure induced by a highspeed train passing through noise barrier. Journal of Wind Engineering and Industrial Aerodynamics, 2020: 204.
European committee for standardization. EN1406762018. Railway Applicationsaerodynamicspart 6: Requirements and Test Procedures for Cross Assessment. Brussels: CENCENELEC Management Centre, 2018.
Acknowledgements
Not applicable.
Funding
Supported by The National Key Research and Development Program of China (Grant No. 2020YFA0710902), The National Natural Science Foundation of China (Grant No. 12172308), Sichuan Provincial Science and Technology Program of China (Grant No. 2019YJ0227), and State Key Laboratory of Traction Power of China (Grant No. 2019TPL_T02).
Author information
Authors and Affiliations
Contributions
LZ, TL and JZ were in charge of the whole trial; LZ wrote the manuscript; RP assisted with sampling and laboratory analyses. All authors read and approved the final manuscript.
Authors’ Information
Le Zhang, born in 1993, is currently a PhD candidate at State Key Laboratory of Traction Power, Southwest Jiaotong University, China. He received his bachelor degree from Southwest Jiaotong University, China, in 2016. His research interests include train aerodynamics and optimization.
Tian Li, born in 1984, is an associate professor at State Key Laboratory of Traction Power, Southwest Jiaotong University, China.
Jiye Zhang, born in 1965, is a professor at State Key Laboratory of Traction Power, Southwest Jiaotong University, China.
Ronghuan Piao, born in 1997, is currently a master candidate at State Key Laboratory of Traction Power, Southwest Jiaotong University, China.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing financial interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhang, L., Li, T., Zhang, J. et al. Optimization on the Crosswind Stability of Trains Using Neural Network Surrogate Model. Chin. J. Mech. Eng. 34, 86 (2021). https://doi.org/10.1186/s10033021006040
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1186/s10033021006040