 Original Article
 Open Access
 Published:
Generation and Selection of DriverBehaviorBased Transferable Motion Primitives
Chinese Journal of Mechanical Engineering volume 35, Article number: 9 (2022)
Abstract
To integrate driver experience and heterogeneous vehicle platform characteristics in a motionplanning algorithm, based on the driverbehaviorbased transferable motion primitives (MPs), a general motionplanning framework for offline generation and online selection of MPs is proposed. Optimal control theory is applied to solve the boundary value problems in the process of generating MPs, where the driver behaviors and the vehicle motion characteristics are integrated into the optimization in the form of constraints. Moreover, a layered, unequalweighted MP selection framework is proposed that utilizes a combination of environmental constraints, nonholonomic vehicle constraints, trajectory smoothness, and collision risk as the singlestep extension evaluation index. The library of MPs generated offline demonstrates that the proposed generation method realizes the effective expansion of MP types and achieves diverse generation of MPs with various velocity attributes and platform types. We also present how the MP selection algorithm utilizes a unique MP library to achieve online extension of MP sequences. The results show that the proposed motionplanning framework can not only improve the efficiency and rationality of the algorithm based on driving experience but can also transfer between heterogeneous vehicle platforms and highlight the unique motion characteristics of the platform.
Introduction
The everincreasing advancement of unmanned vehicle technology, especially multivehicle cooperative technology, will significantly promote the development of future intelligent transportation systems and unmanned combat systems and will affect future travel modes and combat patterns [1,2,3]. Motion planning is a crucial component in the unmanned vehicle system framework. Its main function is to generate a reference trajectory that satisfies the constraints of the environment and the vehicle itself [4,5,6]. Building a unified motionplanning algorithm for heterogeneous vehicle platforms and using the driving experience to guide and accelerate the completion of planning tasks is a fundamental task for the further development of unmanned vehicle motionplanning algorithms. In both the motionplanning algorithm and the driving behavior representation algorithm, the decomposition of complex motion into motion primitives (MPs) can effectively improve the efficiency of the algorithm [7,8,9,10].
The essence of MP generation is to solve a set of boundary value problems, that is, to generate a set of paths connecting different target states [11]. The difficulty in solving the above problems lies in the segmentation strategy of the state space and the form of the curve connecting the start and end states. Graph searchbased motionplanning algorithms, such as the typical A* [12, 13], D* [14, 15], and state lattice [16, 17], divide the environment space into basic grids and then generate MPs based on center points, corner points, or edges. To overcome the disadvantages of a gridbased MP generation method that cannot consider nonholonomic vehicle constraints, the Hybrid A* algorithm completes the generation of MPs based on the vehicle kinematic model [7, 18]. Each primitive used in the Hybrid A* algorithm is an arc generated with a fixed time scale and a fixed front wheel angle. The discrete optimizationbased method uses spline curves or polynomial curves to generate MP candidate sets based on the offset points of the road centerline, which effectively limits the search space and improves the overall efficiency of the algorithm [19,20,21]. In addition, some methods use numerical optimization to realize the generation of MPs while considering vehicle dynamic constraints [22,23,24]. Although the MPs generated by using the above methods can meet the needs of a complex driving environment, they fail to integrate the driver's behavior information into the MP generation algorithm. Moreover, these methods generally lack consideration of the differences and connections of the heterogeneous platform motion characteristics.
The quality of the final path generated by the motionplanning algorithm depends not only on the quality of the MP itself but also on the rationality of the MP selection and connection [25]. The selection of MPs is the process of first considering the cost function represented by different constraints, then combining the cost according to a particular weight factor, and finally selecting the MP with the minimum cost [26]. The Dijkstra algorithm takes the shortest path as the extension cost of the MP [27]. The A* algorithm and its variants add heuristics based on the target state and Voronoi diagram to the original Dijkstra algorithm [28]. The samplingbased RRT path planning algorithm [29], on the one hand, takes the target point as a guide to the selection and extension of MPs; on the other hand, it randomly extends in any direction with a certain probability to achieve a comprehensive search of accessible areas. The discrete optimizationbased method [19] utilizes the linear combination of the three extension cost values by considering static obstacles, dynamic obstacles, and path smoothness as the basis for the selection of MPs. Although the MP selection and extension method mentioned above can generate the desired driving trajectory, no attention is paid to the guiding role of the driver’s experience.
The learning and generalization of the driving experience and characteristics have attracted the attention of many researchers in recent years [30,31,32]. Specific to trajectorylevel driving behavior research, Guo et al. [33] researched the generation of humanlike trajectories with the leader vehicle as an attractive force. Although this method can cope with complex dynamic environments, the learning object of its behavior is the guide vehicle; that is, it is a followup humanlike trajectorygeneration method. Regarding the representation of humanlike trajectories that target the behavior of the ego vehicle, Schnelle et al. [34] proposed a personalized trajectory optimization generation method in two scenarios: one with a single lane change and another with a double lane change. Zhao et al. also conducted a series of studies on lanechange scenes. First, they used the support vector machine algorithm to extract lanechange primitives from the lanechange trajectory data of surrounding vehicles [35, 36]. Subsequently, using the principle of trajectory matching, the generated humanlike lanechange trajectory was integrated into a realtime motionplanning system [37, 38]. However, the aforementioned research has limitations in the application of scenes. It is often aimed at a specific scene, and there is no further study on how to combine multiple driving behaviors with the vehicle motionplanning system in a general driving environment.
In this paper, for the two heterogeneous platforms of wheeled Ackermannsteering vehicles and tracked skidsteering vehicles, a unified vehicle motion differential constraint is proposed. In addition, five types of equality constraints for typical driving behaviors and inequality constraints for vehicle platform characteristics are set, and an offline optimization method for driverbehaviorbased transferable motion primitive (DBTMP) generation is formed. Based on the unique MP library established for different vehicle platforms, research on the selection of MPs was conducted under the Hybrid A* algorithm framework. However, there are two significant differences in the construction of the selection framework owing to the large difference between the offline generated MP library and the online generated MP library used by the Hybrid A* algorithm. First, the MP library generated offline is a collection of multiple MP sets containing the entire speed interval; therefore, a hierarchical structure is applied to restrict the MP candidate set. Second, because each candidate set contains both behavior primitives and general primitives, different weight coefficients are applied to the above two types of primitives when selecting and evaluating a single extended MP. Finally, a motionplanning algorithm based on offline generation and online selection of the MP library is proposed.
The main contributions of this study are the following:

A unified motionplanning algorithm framework is proposed to complete the motionplanning tasks of heterogeneous platforms, achieving a balance between the general solution framework and the characteristics of heterogeneous vehicle platforms.

An offline MP generation method combining discrete driving behaviors and a vehicle kinematics model is proposed, and the guiding role of behavior primitives in the online selection layer of the motionplanning algorithm is highlighted.
The remainder of this paper is organized as follows. Section 1 details the problems to be solved in this study and explains the main parameters. Section 2 introduces the offline generation method for the MP library. Section 3 introduces an online selection framework for MPs. Section 4 demonstrates the unique MP libraries of two heterogeneous vehicle platforms and the results of motion planning in two typical scenarios. Finally, the conclusions and future work are presented in Section 5.
Problem Statement
The offline MP generation solves the optimization problem with driver behavior B and platform motion characteristics as constraints and trajectory smoothness as the objective function g. The selection of MPs is first to select the MP set from the MP library based on the passable area and velocity connection, and then utilize the cost function J as the evaluation index to choose the corresponding single extended MP from the MP set.
The definitions and explanations of the main parameters used in this study are as follows:

\({\varvec{s}}(t)=[x(t),y(t),\theta (t){]}^{\mathrm{T}}\in {R}^{3\times 1}\) is the unified state parameter set of each heterogeneous platform at time t, where x(t) and y(t) are the coordinate positions of the x and y axes, respectively, and \(\theta (t)\) is the course angle. The defined coordinate system is the geodetic coordinate system xoy.

\({\varvec{u}}(t)=[{v}_{x}(t),{v}_{y}(t),{\omega }_{z}(t){]}^{\mathrm{T}}\in {R}^{3\times 1}\) is the unified control variable set of each heterogeneous platform at time t, where v_{x}(t) and v_{y}(t) are the velocities of the platform along the x and y axes, respectively, and \({\omega }_{z}(t)\) is the yaw rate around the z axis. The defined coordinate system is the vehicle body coordinate system x_{a}oy_{a} or x_{t}oy_{t}.

\({{\varvec{u}}}_{\mathrm{a}}(t)=[{v}_{{w}_{x}}(t),\alpha (t){]}^{\mathrm{T}}\in {R}^{2\times 1}\) is the unique control variable set of the wheeled Ackermannsteering vehicle at time t, where \({v}_{{w}_{x}}(t)\) is the velocity of the rear axle along the x_{a} axis and \(\alpha (t)\) is the front wheel angle of the vehicle. The defined coordinate system is the vehicle body coordinate system x_{a}oy_{a}.

\({{\varvec{u}}}_{t}(t)=[{v}_{{l}_{x}}(t),{v}_{{r}_{x}}(t){]}^{\mathrm{T}}\in {R}^{2\times 1}\) is the unique control variable set of the tracked skidsteering vehicle at time t, where \({v}_{{l}_{x}}(t)\) and \({v}_{{r}_{x}}(t)\) are the velocities of the left and right tracks, respectively, along the x axis. The defined coordinate system is the vehicle body coordinate system x_{t}oy_{t}.

\({\varvec{D}}={{\varvec{D}}}_{s}\cdot {{\varvec{D}}}_{\text{a/t}}\) represents the mapping relationship between the platform state parameters and control variables, where \({{\varvec{D}}}_{s}\) is the correlation between the unified state parameter set s and control variable set u. \({{\varvec{D}}}_{\text{a/t}}\) is the correlation between u and the unique control variable set u_{a/t}.

\({\varvec{B}}=\left\{{B}_{\text{SD}},{B}_{\text{LC}},{B}_{\text{UT}},{B}_{\text{RT}},{B}_{\text{TA}}\right\}\) are the five defined types of driving behavior constraints: straight driving behavior \({B}_{\text{SD}}\), lanechanging behavior \({B}_{\text{LC}}\), Ushaped turn behavior \({B}_{\text{UT}}\), rightangle turn behavior \({B}_{\text{RT}}\), and turnaround behavior \({B}_{\text{TA}}\).

U is the inequality constraint condition set according to the motion limit of the platform. U_{a} and U_{t} are for wheeled Ackermannsteering vehicles and tracked skidsteering vehicles, respectively.

\({\varvec{J}}=\left\{{J}_{\mathrm{e}},{J}_{\mathrm{n}},{J}_{\mathrm{s}},{J}_{\mathrm{c}}\right\}\) is the cost function when selecting a single MP, where J_{e} is the environmental constraint heuristic, J_{n} is the nonholonomic vehicle constraint heuristic, J_{s} is the evaluation function considering the smoothness of the curve, and J_{c} is the evaluation function considering the collision risk.
The method proposed in this study is dedicated to solving the following two challenges: (1) determining how to generate an MP library that can be transferred between heterogeneous platforms, considering the overall versatility and highlighting the specificity of the vehicle platform, and (2) integrating the driving behavior factors into the generation and selection algorithm of the MPs and then realizing the guidance of the driver’s experience with the motionplanning algorithm.
Generation of MPs
The generation process of the MP library is shown in Figure 1. The entire processing flow can be decomposed into two main components: (1) the creation and solution of optimal control problems and (2) the rotation transformation and velocity expansion of a single MP cluster. The definition of the optimal control problem for a single MP generation lies at the core of the generation method.
The specific form of the optimal control problem is as follows:
where \(g({\varvec{s}},{\varvec{u}})\) is the objective function of the trajectory smoothness optimization based on the state and control variables; \({\varvec{s}}(t)={\varvec{B}}\) is the constraint condition of the vehicle state based on the driving behavior; \(\dot{{\varvec{s}}}(t)=f({\varvec{s}}(t),{\varvec{u}}(t))\) is the motion differential constraint describing the relationship between the control variables and the state parameters of the vehicle platform; \({\varvec{U}}(t)\in {\varvec{U}}\) is the inequality constraint condition set, and t_{1} and t_{g} are the start and end times of the MP generation, respectively.
Vehicle Motion Differential Constraint
In this study, we ignore the impact of the suspension system on the vehicle motion response and regard the vehicle platform as a rigid body moving in a twodimensional plane. The relationship between the unified state parameter set s and the unified control variable set u in a twodimensional space is
According to the characteristics of heterogeneous platform actuators, the following mapping relationship between the unique control variable set u_{a/t} and the unified control variable set u of different types of platforms can be established:
The relationship between the control variable u_{a} of a typical wheeled Ackermannsteering vehicle and the unified state parameter set s is
where L is the distance between the front and rear axles of the vehicle. The related coordinate system and simplified model of the wheeled vehicle platform are shown in Figure 2.
The relationship between the control variable u_{t} of a typical tracked vehicle and the unified state parameter set s is
where B is the distance between the two tracks. The related coordinate system and simplified model of the tracked vehicle platform are shown in Figure 3.
Equality Constraints of Driving Behavior
This study defines five typical driving behaviors, as shown in Figure 4. These five behaviors can be classified into two main types: behaviors related to typical road structures (B_{SD}, B_{UT}, and B_{RT}) and behaviors related to typical driving behavior patterns (B_{LC} and B_{TA}). The MPs related to the road structure highlight the restraint effect of typical road structure factors on behavior, and the MPs related to the driving pattern highlight the restraint effect of the driver's behavior preferences. The remaining general MPs in the library are used as a supplement to the behavior primitives to improve the planning algorithm's adaptability to different driving environments. By increasing the priority of behavior primitives in the subsequent selection logic, we hope to better adapt to road structures and driving habits.
Among the above behavior primitives, B_{SD} and B_{LC} exist throughout the velocity range, and B_{UT}, B_{RT}, and B_{TA} exist only in the lowvelocity range. Owing to the specificity of the vehicle structure, B_{TA} can be subdivided into \({{B}_{\text{TA}}}_{\mathrm{a}}\) for wheeled vehicles and \({{B}_{\text{TA}}}_{\mathrm{t}}\) for tracked vehicles.
For straight driving behavior B_{SD}, Ushaped turn behavior B_{UT}, and rightangle turn behavior B_{RT}, only the course angle change between the start state t_{1} and the end state t_{g} is constrained:
For lanechanging behavior B_{LC}, in addition to restraining the course angle change deviation, the lateral distance change deviation d should also be constrained:
For the turnaround behavior \({{B}_{\text{TA}}}_{\mathrm{a}}\) of a wheeled vehicle, it is necessary to restrict not only the beginning and end states but also the intermediate states accordingly:
Because the tracked vehicle can complete the task of turning around through the pivot turn movement, its constraint relationship \({{B}_{\text{TA}}}_{\mathrm{t}}\) is as follows:
Inequality Constraints of Vehicle Platform Characteristics
The constraints of vehicle platform characteristics mainly include four aspects (the input range of control variable u, the limitation of yaw rate \({\omega }_{z}\), the limitation of lateral acceleration a_{y}, and the limitation of the current generated MP velocity attribute v):
The inequality constraint relationship U_{a} of the wheeled Ackermannsteering vehicle is
where \({\alpha }_{\mathrm{max}}\) is the maximum front wheel angle of the vehicle platform.
The inequality constraint relationship U_{t} of the tracked skidsteering vehicle is
where \({v}_{{l}_{xmax}}\) and \({v}_{{r}_{xmax}}\) are the maximum speed limits of the left and right driving wheels, respectively.
Objective Function and Optimization Problem Solving
The objective function g mainly reflects the smoothness requirements of MP generation and includes the steering control input c_{s} and yaw rate of the vehicle platform \({\omega }_{z}\):
These two parameters should be as small as possible because they correspond to the curvature of the generated curve and its rate of change, which directly affects the smoothness of the curve.
According to the difference in platform characteristics, the objective function g_{a} for the wheeled Ackermannsteering platform is given by
and the objective function g_{t} for the tracked skidsteering vehicle platform is given by
In this study, the IPOPT [39] and CVODES [40] were applied to solve the optimal control problem.
Generation of the MP Library
Owing to the complexity of the real driving environment, having only the defined driving behavior MPs cannot fully satisfy the motionplanning problem in a complex environment. Therefore, to improve the applicability of the MP library, general MPs are generated as a supplement with the target course angle uniformly distributed on \([0, 2\uppi ]\) as the constraint.
In addition, in the online MP extension process, it is necessary to make the final heading of the MP selected in the previous MP cluster consistent with the heading of the straight driving behavior primitive in the next primitive cluster. This process can be performed either online or offline. To improve the efficiency of the algorithm, we used offline generation and online selection to complete the above process. After the MP cluster containing behavior primitives and general primitives is generated, an MP set covering the entire annular space with 36 evenly distributed MP clusters is generated by rotation transformation. The alignment between the MP clusters is completed by selecting the most appropriate angle from the MP set during online extension.
A simplified schematic of the MP set generation is shown in Figure 5; only a portion of the typical MPs and the rotation transformation at three positions are shown in the figure. The resulting MP library is a combination of multiple MP sets under different MP velocity attribute settings.
Selection of MPs
The MP selection process is shown in Figure 6. The entire processing flow can be decomposed into the selection of the MP sets, calculation of heuristics and costs, selection of a single MP, and extension of the MP sequence. The selection of MP sets refers to selecting the appropriate MP sets from the MP library based on passable areas in the environment map and the velocity attribute of the last extended periodic MP set. The heuristics include both the environmental constraint heuristic and the nonholonomic vehicle constraint heuristic. The environmental constraint heuristic is calculated from the starting and target poses in the grid map. The nonholonomic vehicle constraint heuristic is calculated based on the final state of every MP in the MP set and the target pose set by the planning system. The extension costs include the trajectory smoothing cost represented by the curve energy and collision risk cost, which is characterized by the distance from the obstacle. Finally, the linear combination of heuristics and costs is selected as the basis for singlestep MP extension, and the selected multiple MPs are combined into an MP sequence to complete the motionplanning task.
Selection of MP Sets
The extraction of the singlestep extended passable area is shown in Figure 7. The radius of the passable area, r_{area}, was calculated using the closest distance d_{min}=min{d_{obs_1}, d_{obs_2},…, d_{obs_n}} between the current position and the obstacle. The candidate set of MPs must satisfy the essential condition that the final state of every MP is within the passable area.
In addition, by considering the velocity attribute association of two adjacent periodic MPs, the MP set with a large velocity gap from the previous periodic is eliminated from the candidate set. Finally, further narrowing of the candidate set was achieved.
Heuristics Considering Environmental and Nonholonomic Constraints
The calculation of the environmental constraint heuristic J_{e} takes the target position as the initial point and utilizes the breadthfirstsearch algorithm to realize the iterative expansion of the distance cost until it expands to the starting position. The specific algorithm flow is presented in Algorithm 1. \(\Delta J\) is the distance measurement value during the iterative process. According to the difference between the adjacent positions of the current grid, \(\Delta J\) has two values: \(\left\{d,\sqrt{2}d\right\}\), where d is the grid width.
Figure 8 shows the comparison results of the environmental constraint heuristics calculated based on the Euclidean distance and the breadthfirst algorithm used in this study. It can be observed from Figure 8(a) that the calculated heuristics will first guide the MP to extend to the obstacle area, as shown by the dotted blue arrow in the figure. In contrast, in Figure 8(b), the MP is guided to the passable area, as indicated by the solid blue arrow in the figure. Therefore, the environmental constraint heuristic proposed in this study can avoid invalid extended searches and effectively improve the efficiency of the algorithm.
We also introduce the vehicle nonholonomic constraint heuristic J_{n}, taking the length of the ReedsShepp curve as the heuristic value:
The ReedsShepp curve is composed of a fixed curvature arc and a straight line. The curve can be defined as a set of poses \(\Pi =\left({x}_{r}\left(t\right),{y}_{r}\left(t\right),{\theta }_{r}\left(t\right)\right)\subset {R}^{2}\times \left.\mathrm{0,2\uppi }\right), t\in \left[{t}_{\text{start}},{t}_{\text{goal}}\right]\). It can quickly generate a curve connecting the start and end points while meeting the position and course constraints of the two points.
The nonholonomic constraint heuristic introduces the evaluation of the target point’s course reachability under the premise of considering the vehicle’s motion characteristics, which makes up for the defect in which the environmental constraint heuristic only considers the target position’s reachability.
Finally, a larger value of the environmental constraint heuristic and nonholonomic constraint heuristic is selected as the final heuristic value of the grid, that is, \({J}_{1}=\mathrm{max}\{{J}_{\mathrm{e}},{J}_{\mathrm{n}}\}\).
Extension Costs Considering Smoothness and Collision
The extension cost proposed in this study mainly includes two aspects: trajectory smoothness and possible collision risk.
The extension cost value of smoothness, J_{s}, is calculated using the curve energy function as follows:
where N is the number of discrete points of the curve, \({\kappa }_{i}\) is the corresponding curvature of the point, and \(\Delta {s}_{i}\) is the distance between two adjacent points. The essence of this function is the discrete representation of the curvature integral on the curve.
The extension cost value of collision risk, J_{c}, is calculated using the function
where d_{obs_i} is the distance from the center of each coverage circle to the nearest obstacle and r_{i} is the radius of each coverage circle. The vehicle platform is approximated by the six coverage circles shown in Figure 9 during the calculation process.
The above extension costs J_{s} and J_{c} are given different weights \({\omega }_{\mathrm{s}}\) and \({\omega }_{\mathrm{c}}\). The linear combination after weighting is the final extension value:
The weighting coefficient \({\omega }_{\mathrm{s}}\) is differentiated according to the type of MP as follows:
In principle, the relationships among the weight values of behavior MPs \({\omega }_{{\mathrm{s}}_{\mathrm{B}}}\), general MPs \({\omega }_{{\mathrm{s}}_{\mathrm{G}}}\), and MPs in reverse direction, \({\omega }_{{\mathrm{s}}_{\mathrm{D}}}\), is \({\omega }_{{\mathrm{s}}_{\mathrm{B}}}<{\omega }_{{\mathrm{s}}_{\mathrm{G}}}\ll {\omega }_{{\mathrm{s}}_{\mathrm{D}}}\).
Evaluation Function of MP Selection
The total cost of selecting a single MP mp_{i} in the MP candidate set should be the sum of the corresponding heuristic and extension costs:
where N_{p} is the number of independent MPs in the MP candidate set. In the MP extension process at each step, the MP with the lowest total cost in the MP candidate set is selected as an extension. Finally, the desired trajectory generation from the start position to the end position in the environment map can be realized.
Experimental Results and Discussion
To verify the effect of the DBTMP A* motionplanning algorithm proposed in this study, the wheeled Ackermannsteering vehicle and tracked skidsteering vehicle were chosen, and the corresponding tests were conducted in both the simulation environment and the real environment. In the simulation environment, we conducted an indepth comparative analysis of the algorithm performance in lowspeed and highspeed scenes. In the real environment, we mainly focused on the applicability of the algorithm in real situations, especially the compatibility with the platform and the corresponding autonomous driving module. In addition, the classic Hybrid A* method was selected for comparison. The specific platform parameters are listed in Table 1, where P_{a} represents the wheeled Ackermansteering platform and P_{t} represents the tracked skidsteering platform.
MP Library Offline Generation
The MP libraries for both platforms used a unified optimization framework during the generation process, and only the details of the vehicle characteristic constraints were changed during the platform transformation process.
The overall situation of the two platform MP libraries is given in Table 2, where v_{m} (m/s) is the velocity attribute of the MP cluster, d_{m} (m/s) is the longest distance covered by the MP cluster, N_{B} is the number of behavioral primitives in the MP cluster, N_{m} is the number of general primitives in the MP cluster, and \(\overline{J}_{{\text{s}}}\) is the average curve energy value of each MP in the cluster. At the end of Table 2, the overall situation of the MP library used by the Hybrid A* method, which does not distinguish between vehicle platforms and velocity attributes, is presented. In Table 2, listed are the composition structures of the MP library generated offline for the wheeled Ackermannsteering platform and tracked skidsteering platform. The structure of the online generated MP library used by the Hybrid A* algorithm is also introduced as a comparison reference.
To show the generated MP clusters more intuitively, Figure 10 presents lowspeed and highspeed typical MP clusters with wheeled platform velocity attributes of 5 and 18 m/s and tracked platform velocity attributes of 5 and 16 m/s. It also includes the MP cluster used by the Hybrid A* method.
The offline generation result of the MP library demonstrated that, regardless of the type of vehicle platform, the number of MPs gradually decreased with an increase in velocity. This is because the vehicle can achieve more diversified steering movements at low speeds, but the number of MPs at high speeds is greatly reduced owing to the weakening of the steering ability. Because the MPs are defined in discrete velocity ranges, an additional velocityplanning algorithm needs to be introduced in the actual application process to achieve a smooth velocity level transition.
In addition, when the velocity is 5 m/s, the MP cluster composition is significantly different from the composition at other velocities. The number of behavior primitives, N_{B}, is reduced, while the number of general primitives, N_{m}, is significantly increased (including reverse primitives). When the vehicle speed is low, it is often necessary to address a more complicated environment, and general primitives can effectively improve the success rate of the planning algorithm in the corresponding situation.
Table 2 indicates that the average energy of the wheeled platform MPs in the lowspeed range was lower than that of the tracked platform. Although the trajectory of the wheeled platform is smoother, and the adjustment of the course is gentle, the tracked platform has stronger and more aggressive steering adjustment ability at low speeds. In the highspeed range, the average energy of the MPs of the two platforms was basically the same, and the steering adjustment tended to be the same.
Hybrid A* utilized a circular arc generated by a fixed curvature as the basic MP. It realized neither the differentiation of the characteristics of heterogeneous platforms nor the separation of different speed intervals. In general, the types of MPs are relatively limited and do not have the diversity of MPs in the MP library proposed in this study.
Comparative Analysis of Online Selection of MPs in the Simulation Environment
To demonstrate the performance of the MP online selection algorithm proposed in this study, a lowspeed scene and a highspeed scene were designed to evaluate the motionplanning algorithm. In each scene, the motionplanning results of Hybrid A*, DBTMP A*P_{a}, and DBTMP A*P_{t} were compared. The planning results were solved using a laptop Lenovo IdeaPad Y70015ISK with 16 gigabytes of memory and an eightcore Intel Core i76700 operating at 2.6 GHz, running under 64bit Ubuntu 16.04. The entire motionplanning algorithm was written in C++ in the ROS system.
Figure 11 shows the results of motion planning in lowspeed scenes. Table 3 presents the comparison results of the corresponding evaluation indicators, including the average curve energy \({\bar{J}}_{s}\), total number of MP extensions, N_{e}, number of behavior primitives during extension, N_{be}, and the solution time of the corresponding motionplanning algorithm, T (ms).
From the perspective of the smoothness of the motionplanning results, regardless of the platform used, the trajectory generated by the DBTMP A* algorithm is smoother. One of the reasons for this is that the overall MP library used by the DBTMP A* algorithm is smoother than that of the Hybrid A* method. Another reason is that the straight driving behavior primitives and general primitives use different weighting coefficients in the evaluation (as shown in Eq. (22)); therefore, the straight driving behavior primitives occupy a larger proportion of the entire trajectory, which makes the evaluation index of smoothness significantly lower than that of the Hybrid A* algorithm.
From the perspective of the extension times of MPs, the extension times of DBTMP A*P_{a} and DBTMP A*P_{t} were only 69.56% and 78.26%, respectively, of that of the Hybrid A* method. This is mainly because the introduction of behavior primitives increases the extension category of a single MP and improves its adaptability to the environment so that it can complete the trajectoryplanning task with fewer MP combinations. This is most evident in the local scene of the parking scene. Regardless of the vehicle platform, the DBTMP A* algorithm used only one MP to complete the planning task, whereas Hybrid A* achieved the task by joining five independent MPs and did not highlight the characteristics of the two platforms.
From the perspective of the solution time of motionplanning tasks, the DBTMP A* algorithm only accounted for 35% of the overall solution time of the Hybrid A* algorithm. The main reason for this difference is that the MP sets used by the Hybrid A* algorithm are generated online based on the passable radius, while the MP sets selected by the DBTMP A* algorithm are generated offline.
The comparison results of motion planning in highspeed scenes are shown in Figure 12, and Table 4 presents the corresponding evaluation indicators.
From the motionplanning results of Figure 12 and Table 4, one can observe that the algorithm proposed in this study took advantage of the DBTMP library to achieve lower curve energy, fewer extension times, shorter solution time, and more reasonable planning results. This is mainly because the DBTMP library restricts and distinguishes the MPs that can be selected according to the velocity attribute and introduces the standard lanechange driving behavior primitives in highspeed scenes as supplements. This enables the driving experience to be successfully integrated into the MP generation and selection process proposed in this study. Although the Hybrid A* algorithm can also plan the corresponding collisionfree trajectory, it does not conform to the driver’s driving experience in highspeed scenes. The several imperfections of the Hybrid A* algorithm in highspeed scenes are mainly due to the composition of primitive clusters, which do not distinguish between highspeed and lowspeed scenes.
Applicability of the Algorithm in a Real Environment
The performance of the proposed algorithm in a real environment was verified by using two vehicle platforms the wheeled Ackermannsteering platform shown in Figure 13(a) and the tracked skidsteering platform shown in Figure 13(b). The two platforms are equipped with the same equipment, including a RoboSense 32lidar system and two RoboSense 16lidar systems, a Simpak982 GNSS receiver, an FOSN2 inertial navigation system, an AVT1290c camera, and two industrial personal computers.
Before the experiment, we obtained the environmental map required for the entire planning task using a multisource simultaneous localization and mapping algorithm. In the experiment, the fusion positioning system provided centimeterlevel realtime positioning accuracy at a frequency of 50 Hz. Based on an a priori map, the environment perception equipment carried by the vehicle added realtime environmental information to the map at a frequency of 5 Hz. Model predictive control was selected as the vehicle motion control algorithm, and the comprehensive tracking error in the offroad environment was < 0.5 m.
The purpose of the experiment in real scenes is to test the performance and efficiency of the algorithm under complex environmental conditions. Figure 14 shows the comparison results of the motionplanning algorithm at two time points during unmanned driving. These two moments were the starting moment of planning and the trigger moment of replanning after turning. Because the Hybrid A* algorithm only made extremely limited adaptability adjustments for the two platforms, only the wheeled Ackermannsteering platform was selected as the verification platform. Ti represents the initial moment of the planning, and Tr represents the moment of replanning. Figure 15 shows the efficiency indices of the three motionplanning algorithms. Where the singlestep extension time refers to the time required to select the most appropriate MP from the MP library at each MP extension step.
Judging from the experimental results in real scenes, the algorithm proposed in this study can not only be applied to unmanned systems in real time but can also achieve higher efficiency and more reasonable planning results. The experimental conclusions in the real environment are consistent with the simulation results.
Conclusions
A unified motionplanning algorithm framework based on MP generation and selection for heterogeneous vehicle platforms is proposed.

(1)
An optimized generation method is applied to realize the offline construction of a heterogeneous platform MP library. The generated MP library not only distinguishes MPs according to the velocity attributes and platform characteristics, but it also introduces five types of behavior primitives to expand the types of MPs.

(2)
The layered unequalweighted MP online selection framework makes full use of the characteristics of the platformspecific MP library generated offline. The velocity connection relationship of the MP set is limited, and the selection weight of the single extended behavior primitive and general primitive is distinguished.

(3)
The DBTMP A* motionplanning algorithm proposed in this study not only retains the strong environmental adaptability of the original Hybrid A* algorithms but also highlights the characteristic differences between heterogeneous vehicle platforms. Moreover, the proposed method effectively utilizes the driving experience to complete the reasonable guidance of the planning results, improving the trajectory smoothness and significantly reducing the required solution time.
References
S W Loke. Cooperative automated vehicles: A review of opportunities and challenges in socially intelligent vehicles beyond networking. IEEE Transactions on Intelligent Vehicles, 2019, 4(4): 509–518.
S Wang, Y Zhang, Z Liao. Building domainspecific knowledge graph for unmanned combat vehicle decision making under uncertainty. 2019 Chinese Automation Congress (CAC). IEEE, 2019: 4718–4721.
F Lin, Y Zhang, Y Zhao, et al. Trajectory tracking of autonomous vehicle with the fusion of DYC and longitudinallateral control. Chinese Journal of Mechanical Engineering, 2019, 32(1): 1–16.
L Chen, X Hu, W Tian, et al. Parallel planning: A new motion planning framework for autonomous driving. IEEE/CAA Journal of Automatica Sinica, 2018, 6(1): 236–246.
L Claussmann, M Revilloud, D Gruyer, et al. A review of motion planning for highway autonomous driving. IEEE Transactions on Intelligent Transportation Systems, 2019, 21(5): 1826–1848.
C Katrakazas, M Quddus, W H Chen, et al. Realtime motion planning methods for autonomous onroad driving: Stateoftheart and future research directions. Transportation Research Part C: Emerging Technologies, 2015, 60: 416–442.
D Dolgov, S Thrun, M Montemerlo, et al. Path planning for autonomous vehicles in unknown semistructured environments. The International Journal of Robotics Research, 2010, 29(5): 485–501.
B Wang, J Gong, H Chen. Motion primitives representation, extraction and connection for automated vehicle motion planning applications. IEEE Transactions on Intelligent Transportation Systems, 2020, 21(9): 3931–3945.
W Wang, W Han, X Na, et al. A probabilistic approach to measuring driving behavior similarity with driving primitives. IEEE Transactions on Intelligent Vehicles, 2019, 5(1): 127–138.
T Taniguchi, S Nagasaka, K Hitomi, et al. Sequence prediction of driving behavior using double articulation analyzer. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2015, 46(9): 1300–1313.
K Bergman, O Ljungqvist, D Axehill. Improved path planning by tightly combining latticebased path planning and optimal control. IEEE Transactions on Intelligent Vehicles, 2020, 6(1): 57–66.
S Kammel, J Ziegler, B Pitzer, et al. Team AnnieWAY's autonomous system for the 2007 DARPA Urban Challenge. Journal of Field Robotics, 2008, 25(9): 615–639.
Y L Chen, V Sundareswaran, C Anderson, et al. Terramax™: Team oshkosh urban robot. Journal of Field Robotics, 2008, 25(10): 841–860.
C Urmson, J Anhalt, D Bagnell, et al. Autonomous driving in urban environments: Boss and the urban challenge. Journal of Field Robotics, 2008, 25(8): 425–466.
D Ferguson, T M Howard, M Likhachev. Motion planning in urban environments. Journal of Field Robotics, 2008, 25(11–12): 939–960.
M McNaughton, C Urmson, J M Dolan, et al. Motion planning for autonomous driving with a conformal spatiotemporal lattice. 2011 IEEE International Conference on Robotics and Automation. IEEE, 2011: 4889–4895.
T M Howard, C J Green, A Kelly, et al. State space sampling of feasible motions for high‐performance mobile robot navigation in complex environments. Journal of Field Robotics, 2008, 25(6–7): 325–345.
M Montemerlo, J Becker, S Bhat, et al. Junior: The stanford entry in the urban challenge. Journal of field Robotics, 2008, 25(9): 569–597.
X Hu, L Chen, B Tang, et al. Dynamic path planning for autonomous driving on various roads with avoidance of static and moving obstacles. Mechanical Systems and Signal Processing, 2018, 100: 482–500.
Y Jiang, J Gong, G Xiong, et al. Research on differential constraintsbased planning algorithm for autonomousdriving vehicles. Acta Automatica Sinica, 2013, 39(12): 2012–2020.
J Jiang, Q Wang, J Gong, et al. Research on temporal consistency and robustness in local planning of intelligent vehicles. Acta Automatica Sinica, 2015, 41(1): 518–527.
X Yang, LiPing L, DuanFeng C, et al. Unified modeling of trajectory planning and tracking for unmanned vehicle. Acta Automatica Sinica, 2019, 45(4): 799–806.
L B Cremean, T B Foote, J H Gillula, et al. Alice: An information‐rich autonomous vehicle for high‐speed desert navigation. Journal of Field Robotics, 2006, 23(9): 777–810.
J Ziegler, P Bender, T Dang, et al. Trajectory planning for Bertha—A local, continuous method. 2014 IEEE Intelligent Vehicles Symposium Proceedings. IEEE, 2014: 450–457
A Desai, M Collins, N Michael. Efficient kinodynamic multirobot replanning in known workspaces. 2019 International Conference on Robotics and Automation. IEEE, 2019: 1021–1027.
C Chamzas, A Shrivastava, L E Kavraki. Using local experiences for global motion planning. 2019 International Conference on Robotics and Automation (ICRA). IEEE, 2019: 8606–8612.
S J Anderson, S B Karumanchi, K Iagnemma. Constraintbased planning and control for safe, semiautonomous operation of vehicles. 2012 IEEE Intelligent Vehicles Symposium. IEEE, 2012: 383–388.
Q Wang, C S Wieghardt, Y Jiang, et al. Generalized path corridorbased local path planning for nonholonomic mobile robot. 2015 IEEE 18th International Conference on Intelligent Transportation Systems. IEEE, 2015: 1922–1927.
S Karaman, M R Walter, A Perez, et al. Anytime motion planning using the RRT. 2011 IEEE International Conference on Robotics and Automation. IEEE, 2011: 1478–1483.
C Lu, F Hu, D Cao, et al. Transfer learning for driver model adaptation in lanechanging scenarios using manifold alignment. IEEE Transactions on Intelligent Transportation Systems, 2019, 21(8): 3281–3293.
H WANG, Z XIA, W CHEN. Lane departure assistance control based on extension combination of steering and braking systems considering humanmachine coordination. Journal of Mechanical Engineering, 2019, 55(4): 135–147.
C Sentouh, A T Nguyen, J J Rath, et al. Human–machine shared control for vehicle lane keeping systems: A Lyapunovbased approach. IET Intelligent Transport Systems, 2019, 13(1): 63–71.
C Guo, K Kidono, M Ogawa. Learningbased trajectory generation for intelligent vehicles in urban environment. 2016 IEEE Intelligent Vehicles Symposium (IV). IEEE, 2016: 1236–1241.
S Schnelle, J Wang, H Su, et al. A driver steering model with personalized desired path generation. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2016, 47(1): 111–120.
H Zhao, C Wang, Y Lin, et al. Onroad vehicle trajectory collection and scenebased lane change analysis: Part I. IEEE Transactions on Intelligent Transportation Systems, 2016, 18(1): 192–205.
W Yao, Q Zeng, Y Lin, et al. Onroad vehicle trajectory collection and scenebased lane change analysis: Part II. IEEE Transactions on Intelligent Transportation Systems, 2016, 18(1): 206–220.
D Xu, Z Ding, H Zhao, et al. Naturalistic lane change analysis for humanlike trajectory generation. 2018 IEEE Intelligent Vehicles Symposium (IV). IEEE, 2018: 1393–1399.
X He, D Xu, H Zhao, et al. A humanlike trajectory planning method by learning from naturalistic driving data. 2018 IEEE Intelligent Vehicles Symposium (IV). IEEE, 2018: 339–346.
A Wächter, L T Biegler. On the implementation of an interiorpoint filter linesearch algorithm for largescale nonlinear programming. Mathematical Programming, 2006, 106(1): 25–57.
R Serban, A C Hindmarsh. CVODES: An ODE solver with sensitivity analysis capabilities. Technical Report UCRLJP200039, Lawrence Livermore National Laboratory, 2003.
Acknowledgements
Not applicable.
Funding
Supported by National Natural Science Foundation of China (Grant Nos. 91420203 and 61703041).
Author information
Authors and Affiliations
Contributions
BW was in charge of the whole trial; HC and JG guided the writing of the manuscript; HG and BW wrote the manuscript; JW and YL assisted with the simulation analyses. All authors read and approved the final manuscript.
Author Information
Haijie Guan, born in 1995, is currently a Ph.D. candidate at School of Mechanical Engineering, Beijing Institute of Technology, China. He received his master degree from Beijing Institute of Technology, China, in 2019.
Boyang Wang, born in 1991, is currently a postdoctoral researcher at Peking University, China. He received his Ph.D. degree from Beijing Institute of Technology, China, in 2020.
Jiaming Wei, born in 1996, is currently an engineer at Aviation Industry Corporation of China, China. He received his master degree from Beijing Institute of Technology, China, in 2021.
Yaomin Lu, born in 1996, is currently a master candidate at School of Mechanical Engineering, Beijing Institute of Technology, China. She received her bachelor degree from Hunan University, China, in 2019.
Huiyan Chen, born in 1961, is currently a professor at School of Mechanical Engineering, Beijing Institute of Technology, China. He received his Ph.D. degree from Beijing Institute of Technology, China, in 2004.
Jianwei Gong, born in 1969, is currently a professor at School of Mechanical Engineering, Beijing Institute of Technology, China. He received his Ph.D. degree from Beijing Institute of Technology, China, in 2002.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing financial interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Guan, H., Wang, B., Wei, J. et al. Generation and Selection of DriverBehaviorBased Transferable Motion Primitives. Chin. J. Mech. Eng. 35, 9 (2022). https://doi.org/10.1186/s10033022006766
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1186/s10033022006766
Keywords
 Autonomous vehicle
 Motion planning
 Motion primitives
 Driver behavior
 Heterogeneous vehicle platform