Vehicular Electronic Image Stabilization System Based on a Gasoline Model Car Platform

Zhang, Ning; Yang, Yuan; Wu, Jianhua; Zhao, Ziqian; Yin, Guodong

doi:10.1186/s10033-022-00805-1

Original Article
Open access
Published: 04 November 2022

Vehicular Electronic Image Stabilization System Based on a Gasoline Model Car Platform

Chinese Journal of Mechanical Engineering volume 35, Article number: 134 (2022) Cite this article

1983 Accesses
Metrics details

Abstract

Noise, vibration and harshness (NVH) problems in vehicle engineering are always challenging in both traditional vehicles and intelligent vehicles. Although high accuracy manufacturing, modern structural roads and advanced suspension technology have already significantly reduced NVH problems and their impacts; off-road condition, obstacles and extreme operating condition could still trigger NVH problems unexpectedly. This paper proposes a vehicular electronic image stabilization (EIS) system to solve the vibration problem of the camera and ensure the environment perceptive function of vehicles. Firstly, feature point detection and matching based on an oriented FAST and rotated BRIEF (ORB) algorithm are implemented to match images in the process of EIS. Furthermore, a novel improved random sampling consensus algorithm (i-RANSAC) is proposed to eliminate mismatched feature points and increase the matching accuracy significantly. And an adaptive Kalman filter (AKF) is applied to improve the adaptability of the vehicular EIS. Finally, an experimental platform based on a gasoline model car was established to validate its performance. The experimental results show that the proposed EIS system can satisfy vehicular performance requirements even under off-road condition with obvious obstacles.

1 Introduction

NVH problems are increasingly important issues in the automobile industry, for implications on environmental noise pollution, comfort perceived by passengers and vehicle performance. Although high accuracy manufacturing, modern structural roads and advanced suspension technology have already significantly reduced NVH problems and their impacts, off-road condition, obstacles and extreme operating condition could still trigger NVH problems unexpectedly. Specific to the visual environment perception function of a running vehicle, the inevitable bumping and shaking induce jitter of the image sequences captured by the vehicular camera, which goes against the subsequent observation and interpretation of information in the images. The induced jitter can be mitigated by introducing a mechanical damping structure or EIS. Based on the image processing method that costs less than additional mechanical structures, EIS has become a common solution. Dated back to the 1980s, Jean et al. [1] developed an EIS system for the reconnaissance vehicle at the resolution of 640*480. At present, EIS has been widely used in the industry. The real-time EIS technology launched by AMD in 2017 can process online video in real-time and can be compatible with any rendering mode [2]. Huawei combined artificial intelligence algorithm with EIS and named it AIS [3], which first appeared on its P20 series mobile phones. With this technology, the viewfinder frame can be static under the premise of small vibration amplitude, allowing for multi- frame synthesis. Most of the achievements in the industry focus on improving the whole system, while most of the attention in academia is on enhancing the performance of a certain part of the EIS system.

Ignoring the preprocessing such as graying in EIS system, EIS mainly consists of three parts: ① estimating the image transformation matrix of the current frame with respect to the reference frame; ② filtering the state variables derived by transformation matrix; ③ inverse compensation and output of the current frame [4].

Estimation of the image transformation matrix is the most important step. When calculating the transformation matrix of the current frame with respect to the reference frame, the relevant parameters are often used to derive a vector known as global motion vector. The process of calculating global motion vector is motion estimation. Motion estimation methods mainly include block matching method, gray projection method, phase correlation matching method, bit plane matching method, gray projection method, feature matching method, optical flow method and so on [5]. Block matching method and feature matching method are considered to have higher matching accuracy. However, when using block-matching method, a large matching block search area is needed to prevent the search results from falling into the local optimum, which sacrifices the image processing speed. Therefore, this paper focuses on EIS based on feature matching method. Image features include point feature, line feature, edge feature and so on [6]. Point feature has become a widely used feature description method because of its easy subsequent matching process. The Harris feature point detection algorithm was first proposed by Harris et al. [7] in 1988. It performs convolution calculation on the image through the derivative of the Gaussian function. Harris algorithm is relatively stable in dealing with rotation and brightness changes, but it does not have scale invariance. Lowe [8] proposed scale-invariant feature transform (SIFT), which has excellent scale invariance and has been widely used in related fields. Based on the SIFT, Bay et al. [9] proposed the speeded up robust features algorithm (SURF). This algorithm uses a fast Hessian matrix to detect feature points, and uses an integral image method to reduce the calculation time. In this way, the efficiency of the algorithm is improved greatly. Features from accelerated segment test (FAST) algorithm was proposed by Edward et al. [10] in 2006. FAST determines feature points by detecting the pixel values around the image. These four feature point extraction algorithms are the most widely used methods. Based on these four methods, various EIS algorithms have been developed [11,12,13,14].

The process of filtering the state variables derived from the transformation matrix is motion filtering. Its purpose is to distinguish the subjective motion from the non-subjective jitter, so as to compensate for the non-subjective jitter in the subsequent image processing. Commonly used filter algorithms in EIS include mean filter, least square fitting filter, B-spline curve fitting filter, Kalman filter (KF). Mean filtering and least square filtering cannot be operated in real-time. The method based on B-spline curve relies on kinematic model [15]. The method based on KF has become the mainstream method in various pieces of research [16,17,18]. However, the effect of KF is sensitive to the noise parameter settings of system, the frequency and amplitude of random motion, etc. [19]. Researchers have done a lot of work on these problems. Park et al. [20] proposed a new image stabilization method based on finite impulse response filter, which is more robust against mistuning on the model parameters than the KF. Choi et al. [21] used extend Kalman filter in aerial airborne imaging to remove the jitter of the camera and retain scanning motion. Yang et al. [22] proposed a novel stabilization algorithm based on particle filter in EIS. Zhu et al. [23] made further improvements based on his research. Besides, Ioannidis et al. [24] proposed the basic features of the Hilbert-Huang transform in order to separate the local motion signals, which is a novel method in the field of EIS.

The last step of EIS is to compensate and output the current frame inversely. After the image is inversely compensated according to the filtered global motion vector, there is a blank part in the image. The blank part needs to be cut before output of the current frame. Although this step is an indispensable step in the EIS system, due to its relatively low importance, this paper does not review it too much. From the above studies, the following research trends and shortcomings can be summarized: ① the existing EIS systems are mainly used for handheld recording devices and are rarely constructed from the perspective of vehicles; ② the filtering has a great impact on the performance of EIS. Irregular road excitation is easy to lead to filter divergence, especially for the KF with fixed system noise, which is rarely mentioned in the existing studies; ③ the real-time performance of EIS has become a concern of many researchers, especially in the feature detection and filtering. However, the real-time performance of the matching process has not been paid close attention to.

The image sequence captured by the vehicle camera vibrates due to vehicle vibration or harsh road conditions. This paper mainly studies the use of EIS technology to solve inter-frame blur, a form of video blur. In Section 2, the image stabilization technology based on feature point detection and matching is selected, and ORB algorithm is used to meet the real-time requirements. Furthermore, in order to enhance the instantaneity and accuracy of the elimination of mismatched point pairs, the random sampling consistency algorithm (RANSAC) is improved in Section 3. In Section 4, aiming to adapt to various dithering conditions in-vehicle scene, this paper adopts AKF algorithm to solve the problem that the classical KF is sensitive to initial values. To verify the performance of EIS, a gasoline model vehicle with remarkable vibration characteristics is refitted for experiment in Section 5. After the process of the EIS proposed in this paper, the average peak signal to noise ratio (PSNR) of the video is improved by 1.26 dB as shown in Section 6, which proves the proposed EIS system can satisfy vehicular performance requirements even under extreme conditions. Finally, conclusions are provided in Section 7.

2 Feature Point Detection Based on ORB Algorithm

2.1 Selection of Image Matching Algorithm

Image matching is the first and the most important step in estimating image transformation matrix, for its implications on the accuracy and instantaneity of the derivation of global motion vectors. At present, a large number of algorithms for image matching have been proposed, including block matching algorithm, gray projection algorithm, optical flow algorithm and feature matching algorithm. Optical flow method and gray projection method cannot adapt to complex scene changes. Block matching algorithm and feature matching method are better choices for the vehicle demands. For block matching algorithm, to satisfy the unique needs of the vehicle, a large search area for the matching blocks is required. Otherwise, the search results may fall into the local optimum easily. However, the search area size for the matching blocks positively correlates with search time. Under the real-time requirements of the vehicle, the accuracy of the block matching algorithm is limited. Based on the above discussion, feature matching algorithm is the most suitable for vehicle demands.

It has been mentioned in Section 1 that Harris, SIFT, SURF and ORB are the most widely used feature point extraction algorithms. The results for the processing time of one frame at the resolution of 480*480 pixels using these algorithms are shown as Table 1.

Table 1 Comparison of the performance of different algorithms

Full size table

ORB algorithm has significant advantages in processing time. Besides, the number of extracted points is about the same as that of other algorithms.

2.2 ORB Algorithm Principle

ORB algorithm, used to detect and describe the feature points, is the combination of FAST and improved binary robust independent elementary features algorithm (BRIEF) [25].

FAST algorithm finds key points in the image, such as corner points. Generally, feature points possess the characteristic of sharply varying pixel values among the surrounding pixels. As shown in Figure 1, by comparing the gray value of point P with the gray values of 16 surrounding points, whether P is a corner point is determined.

The output of FAST corner detection algorithm is the coordinates of corner points. In order to match the corners detected in the current frame and the reference frame, it is necessary to determine a descriptor to describe the nature of the corner points. ORB algorithm uses BRIEF algorithm to describe feature points, and BRIEF algorithm utilizes a feature descriptor of binary string. $n$ pairs of pixels ${p}_{i}$, ${q}_{i}(i=\mathrm{1,2},..., n)$ are selected in the neighborhood of a feature point P. Generally, n is 128, 256 or 512, which is set to 256 in this paper. The size of the neighborhood is $S\times S$. ${p}_{i}$ and ${q}_{i}$ obey the Gaussian distribution of $N(0, {S}^{2}/25)$. Then the gray values of each point pair are compared. If $I({p}_{i})>I({q}_{i})$, the $i$ th bit in the binary string is 1, otherwise it is 0, i.e. [25]

$$\text{descriptor}\left(X,Y,i\right)=\left\{\begin{array}{ll}1 & I\left(X\right)\ge I\left(Y\right), \\ 0 & I\left(X\right)<I\left(Y\right),\end{array}\right.$$

(1)

where $X$ denotes the feature point detected; $Y$ denotes the points to be compared. $I$ denotes the gray value of the point. $i$ represents the ith bit in the BRIEF descriptor. By connecting the bits of the $N$ pixels, a bit string is obtained. To solve the problem that the BRIEF does not define the main direction, BRIEF is improved by gray centroid method.

$${m}_{pq}=\sum_{x,y}{x}^{p}{y}^{q}I\left(x,y\right),$$

(2)

where $x$, $y$ are the pixel coordinates in the neighborhood around the feature point. The feature of grayscale centroid can be determined by

$$\theta =\mathrm{arctan}\left(\frac{{m}_{01}}{{m}_{10}}\right).$$

(3)

Therefore, ORB has rotation invariance, which is essential for the demands of on-board working condition.

2.3 Results of Feature Points Detection and Matching

It is found that the average processing time of each frame is 6.01 ms, and the average number of detected points in each frame is 483.7. The average amount of matched feature point pairs is 223.4. It should be noted that there exist some mismatched points as shown in Figure 2.

3 Elimination of Mismatched Feature Points

3.1 Image Transformation Matrix

The essence of judging whether a point pair is mismatched is to judge whether this point pair obeys the image transformation matrix ${\varvec{H}}$. Therefore, ${\varvec{H}}$ needs to be determined before the mismatched points are eliminated.

An appropriate motion model, which calculates the global motion vector, is essential for a good image stabilization effect. Although this kind of motion model desires high calculation capability, a relatively complex model is still necessary to describe the motion captured by the image. The motion model adopted should be able to adapt to possible working conditions of vehicle body shaking with large scale. The 4-parameter similarity transformation model is used to describe the rotation and translation motion with good prediction accuracy provided and takes the form

$$\left(\genfrac{}{}{0pt}{}{{x}_{2}}{{y}_{2}}\right)=s\left(\begin{array}{cc}\mathrm{cos}\Delta \theta & -\mathrm{sin}\Delta \theta \\ \mathrm{sin}\Delta \theta & \mathrm{cos}\Delta \theta \end{array}\right)\left(\genfrac{}{}{0pt}{}{{x}_{1}}{{y}_{1}}\right)+\left(\genfrac{}{}{0pt}{}{\Delta x}{\Delta y}\right),$$

(4)

where $({x}_{1},{y}_{1})$ and $({x}_{2},{y}_{2})$ represent the coordinates of the reference frame and the current frame, respectively. $\Delta \theta$ represents the roll angle between two frames. $\Delta x$ and $\Delta y$ denote the lateral displacement and the vertical displacement of the current frame with respect to the reference frame, respectively. $s$ represents the scaling factor.

3.2 Improved RANSAC Algorithm

The image transformation matrix of the current frame with respect to the reference frame can be derived by fitting method like least squares fitting, utilizing the matched feature points. However, the accuracy of transformation matrix is affected by the mismatched feature points. The RANSAC algorithm divides all points into two types: inliers and outliers. Inliers refer to points which can satisfy the model, while the outliers refer to the interference points. In this way, it prevents the calculation results from being affected by outliers.

The specific implementation process of RANSAC algorithm is shown as follows [26].

Step 1: Set the minimum number of point pairs $s$ which can be used to derive ${\varvec{H}}$ between two frames. Select $s$ pairs of points without repeating to form a point set ${S}_{r}$.

Step 2: Set the number of iterations $k$. Suppose the amount of point pairs is $n$, and the amount of inliers is $m$. Obviously, the probability of all points in set ${S}_{r}$ being inliers is

$$w=\frac{m}{n}\cdot \frac{m-1}{n-1} \cdots \frac{m-s+1}{n-s+1}.$$

(5)

Within $k$ iterations, the probability of at least one ${S}_{r}$ only containing inliers is $p$. $k$ and $p$ satisfy

$${(1-{w})}^{k}\le 1-p.$$

(6)

Then k can be derived from the inequality above

$$k\ge \frac{\mathrm{lg}\left(1-p\right)}{\mathrm{lg}\left(1-w\right)}.$$

(7)

Step 3: Determine the number of inliers that satisfies ${\varvec{H}}$, with the judgment criteria given by

$$\Vert {{{\varvec{p}}}_{{\varvec{i}}}}^{\mathrm{^{\prime}}}-{\varvec{H}}{{\varvec{p}}}_{{\varvec{i}}}\Vert <e.$$

(8)

where ${{{\varvec{p}}}_{{\varvec{i}}}}^{\mathrm{^{\prime}}}$ and ${{\varvec{p}}}_{{\varvec{i}}}$ represent the coordinates of the current frame and the reference frame, respectively. $e$ refers to the error threshold to distinguish inliers and outliers. The total number of liners is counted as $M$.

Step 4: The transformation matrix ${\varvec{H}}$ corresponding to the maximum $M$ is the best matrix to be found.

Although RANSAC algorithm has certain robustness, few defects exist in practical engineering: ① If the matching accuracy is not high enough, a large number of outliers lead to an increase in the number of iterations. ② If the random points are too concentrated, the transformation matrix's accuracy is seriously affected. ③ If the selected feature points contain outliners, the entire iteration is also performed once, which significantly wastes calculation time.

The RANSAC algorithm is improved in this paper, in consideration of the shortcomings above. The specific implementation steps of i-RANSAC are as follows.

Step 1: Rank the point pairs by Hamming distance ${D}_{H}$. Remove this pair of points, if

$${D}_{H}>E+K*\sigma \, \mathrm{or} \, {D}_{H}<E-K*\sigma ,$$

(9)

where $E$ denotes the mean Hamming distance of point pairs and $\upsigma$ denotes their variance. $K$ can be used to adjust the amount of removed points.

Step 2: Set the minimum number of point pairs $s$ which can be used to derive ${\varvec{H}}$. Select $s$ pairs of points without repeating in different grids to form a set ${S}_{r}$.

Step 3: Calculate the number of iterations.

Step 4: Select 3 pairs of points. Determine the amount of inliers. If the amount is less than 2, jump out of this iteration and proceed to the next iteration.

Step 5: The transformation matrix ${\varvec{H}}$ corresponding to the maximum $M$ is the best matrix.

3.3 Results of Eliminating Mismatched Feature Points

The image transformation matrix between two frames should be an identity matrix when the vehicle is static. However, the running cars, pedestrians or even swinging leaves exert some mismatch points, which changes the transformation matrix from identity matrix. Exploiting this feature, the performance of i-RANSAC can be detected (Figure 3).

We propose a novel definition, feature points matching accuracy (FPMA) of a specific segment of a processed video, to assess the performance of the mismatched points elimination algorithms. Denoted by ${A}_{m}$, FPMA is defined as Eq. (10).

$${A}_{m}=\frac{\left|\Delta {y}_{k}\right|}{{k}_{\mathrm{max}}} k\in \left[\mathrm{1,200}\right],$$

(10)

where ${k}_{\mathrm{max}}$ denotes the total number of frames of the processed video segment.

Obviously, $\Delta y$ should be 0 theoretically. It indicates that the ${A}_{m}$ closer to 0 corresponds to an algorithm with better performance. Since the averages of FPMA after the process of RANSAC and i-RANSAC are 0.068 and 0.020 respectively, the improved RANSAC proposed in this paper has better performance. In addition, the average of processed frames per second of i-RANSAC increases by 32.4% compared with that of RANSAC, which demonstrates better real-time performance of i-RANSAC.

4 Filter of the Image Transformation Matrix

4.1 Kalman Filter

The purpose of filtering the image transformation matrix is to distinguish the subjective motion from the non-subjective jitter, so as to compensate the non-subjective jitter in the subsequent image processing. Commonly used filters in EIS include mean filtering, least-square fitting filtering, B-spline curve fitting filtering, and KF. Mean filtering and least-squares filtering require image observation states from multiple frames. Therefore, their implementation has a lag, making it difficult to meet vehicle requirements. The method based on the B-spline curve relies on the kinematics model. The method based on KF has become the mainstream method in this research field.

Both process noise variance $Q$ and observation noise variance $R$ need to be set in advance in classical KF. As shown in Figure 4, the filter effect is completely different when the noise variance settings are $Q=0.01, R=0.1$ and $Q=0.1, R=0.1$, respectively. Therefore, considering vehicle conditions, the fixed noise variance matrix cannot adapt to various vibration conditions, especially concerning extreme dynamics.

4.2 Adaptive Kalman Filter

AKF is applied to improve the adaptability of vehicular EIS. Correcting the parameters of the model and the noise covariance in real-time, AKF reduces the influence of model error during the prediction of state variables. This paper mainly introduces the Sage-Husa AKF algorithm [27] into the proposed EIS and makes certain improvements.

Generally, the process noise variance $Q$ and the observation noise variance $R$ vary widely in operation and cannot be determined accurately. If the pre-defined $Q$ and $R$ are less than the actual noise variance, the resulting small uncertainty range of the true value leads to biased estimation and filtering divergence. Conversely, if the pre-defined $Q$ and $R$ are larger than the actual noise variance, the state estimation error increases and the filtering divergence is caused statistically. Therefore, the construction of AKF and online adjustment of $Q$ and $R$ are of great significance in improving the accuracy and stability of the filter. On this basis, the forgetting factor is introduced to endow Sage-Husa AKF with the ability to estimate the unknown time-varying noise in real-time. Using the measurement data for recursive filtering, the adaptive filtering algorithm estimates and corrects the statistical characteristics of process noise and measurement noise in real-time. Sage-Husa AKF method is simple in principle and good in real-time, so it has been widely used in engineering fields.

In KF, the state equation and observation equation are given by

$${{\varvec{X}}}_{{\varvec{k}}}={\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}{{\varvec{X}}}_{{\varvec{k}}-1}+{{\varvec{W}}}_{{\varvec{k}}-1},$$

(11)

$${{\varvec{Z}}}_{{\varvec{k}}}={{\varvec{H}}}_{{\varvec{k}}}{{\varvec{X}}}_{{\varvec{k}}}+{{\varvec{V}}}_{{\varvec{k}}},$$

(12)

where ${{\varvec{X}}}_{{\varvec{k}}}$ is the state vector, and ${\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}$ is the state transition matrix. ${{\varvec{Z}}}_{{\varvec{k}}}$ is the observation vector. ${{\varvec{H}}}_{{\varvec{k}}}$ is the observation matrix. ${{\varvec{W}}}_{{\varvec{k}}-1}$ and ${{\varvec{V}}}_{{\varvec{k}}}$ denote the system noise and the observation noise, respectively. ${{\varvec{W}}}_{{\varvec{k}}-1}$ obeys $N\left(0,{Q}_{k-1}\right)$ distribution and ${{\varvec{V}}}_{{\varvec{k}}}$ obeys $N\left(0,{R}_{k}\right)$.

With the state vector ${{\varvec{X}}}_{{\varvec{k}}}={[x, \dot{x},y, \dot{y},\theta , \dot{\theta }]}^{\mathrm{T}}$, $x$, y denote the lateral and vertical displacements of the current frame concerning the reference frame, respectively. $\theta$ denotes the roll angle of the current frame with respect to the reference frame. ${\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}$ and ${{\varvec{W}}}_{{\varvec{k}}-1}$ are given by

$$\left\{\begin{array}{c}{\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}=\left[\begin{array}{c}\begin{array}{cc}\begin{array}{ccc}1& 1& 0\end{array}& \begin{array}{ccc}0& 0& 0\end{array}\end{array}\\ \begin{array}{cc}\begin{array}{ccc}0& 1& 0\end{array}& \begin{array}{ccc}0& 0& 0\end{array}\end{array}\\ \begin{array}{cc}\begin{array}{ccc}0& 0& 1\end{array}& \begin{array}{ccc}1& 0& 0\end{array}\end{array}\\ \begin{array}{cc}\begin{array}{ccc}0& 0& 0\end{array}& \begin{array}{ccc}1& 0& 0\end{array}\end{array}\\ \begin{array}{cc}\begin{array}{ccc}0& 0& 0\end{array}& \begin{array}{ccc}0& 1& 1\end{array}\end{array}\\ \begin{array}{cc}\begin{array}{ccc}0& 0& 0\end{array}& \begin{array}{ccc}0& 0& 1\end{array}\end{array}\end{array}\right] ,\\ {{\varvec{W}}}_{{\varvec{k}}-1}=\left[\begin{array}{c}0\\ N(0,{\sigma }_{x})\\ 0\\ N(0,{\sigma }_{y})\\ 0\\ N(0,{\sigma }_{\theta })\end{array}\right],\end{array}\right.$$

(13)

where $x$,$y$ and $\theta$ are predicted states.

The observation state vector is ${{\varvec{Z}}}_{{\varvec{k}}}={[x,y,\theta ]}^{\mathrm{T}}$. ${{\varvec{H}}}_{{\varvec{k}}}$ and ${{\varvec{V}}}_{{\varvec{k}}}$ are given by

$$\left\{\begin{array}{c}{{\varvec{H}}}_{{\varvec{k}}}=\left[\begin{array}{c}\begin{array}{cc}\begin{array}{ccc}1& 0& 0\end{array}& \begin{array}{ccc}0& 0& 0\end{array}\end{array}\\ \begin{array}{cc}\begin{array}{ccc}0& 0& 1\end{array}& \begin{array}{ccc}0& 0& 0\end{array}\end{array}\\ \begin{array}{cc}\begin{array}{ccc}0& 0& 0\end{array}& \begin{array}{ccc}0& 1& 0\end{array}\end{array}\end{array}\right],\\ {{\varvec{V}}}_{{\varvec{k}}}=\left[\begin{array}{c}N(0,{{\sigma }_{x}}^{\mathrm{obs}})\\ N(0,{{\sigma }_{y}}^{\mathrm{obs}})\\ N(0,{{\sigma }_{\theta }}^{\mathrm{obs}})\end{array}\right].\end{array}\right.\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }\boldsymbol{ }$$

(14)

In AKF, the averages of observation noise and prediction noise are not considered as 0, but as ${\widehat{{\varvec{q}}}}_{\mathbf{k}}$ and ${\widehat{{\varvec{r}}}}_{{\varvec{k}}}$. The superscript $\widehat{}$ is adopted to distinguish the prediction state from the observation state. Then, the state Eq. (11) and observation Eq. (12) are given by

$${{\varvec{X}}}_{{\varvec{k}}}={\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}{{\varvec{X}}}_{{\varvec{k}}-1}+{{\varvec{W}}}_{{\varvec{k}}-1}+{\widehat{{\varvec{q}}}}_{{\varvec{k}}-1},$$

(15)

$${{\varvec{Z}}}_{{\varvec{k}}}={{\varvec{H}}}_{{\varvec{k}}}{{\varvec{X}}}_{{\varvec{k}}}+{{\varvec{V}}}_{{\varvec{k}}}+{\widehat{{\varvec{r}}}}_{{\varvec{k}}},$$

(16)

where $E\left[{\widehat{{\varvec{q}}}}_{{\varvec{k}}-1}\right]={{\varvec{q}}}_{{\varvec{k}}}$, $E\left[{\widehat{{\varvec{r}}}}_{k}\right]={{\varvec{r}}}_{{\varvec{k}}}$. The other parameters correspond with the Eqs. (11) and (12). Then, the processes of AKF are given as follows:

Step 1: Calculate one-step prediction state ${\widehat{{\varvec{X}}}}_{{\varvec{k}},{\varvec{k}}-1}$ and noise covariance matrix ${{\varvec{P}}}_{{\varvec{k}},{\varvec{k}}-1}$.

$${\widehat{{\varvec{X}}}}_{{\varvec{k}},{\varvec{k}}-1}={\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}{\widehat{{\varvec{X}}}}_{{\varvec{k}}-1}+{\widehat{{\varvec{q}}}}_{{\varvec{k}}-1},$$

(17)

$${{\varvec{P}}}_{{\varvec{k}},{\varvec{k}}-1}={\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}{{\varvec{P}}}_{{\varvec{k}}-1}{{\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}}^{\mathbf{T}}+{\widehat{{\varvec{Q}}}}_{{\varvec{k}}-1}.$$

(18)

Step 2: Update filter gain ${K}_{k}$:

$${{\varvec{K}}}_{{\varvec{k}}}={{\varvec{P}}}_{{\varvec{k}},{\varvec{k}}-1}{{{\varvec{H}}}_{{\varvec{k}}}}^{\mathbf{T}}\left[{{{\varvec{H}}}_{{\varvec{k}}}{\varvec{P}}}_{{\varvec{k}},{\varvec{k}}-1}{{{\varvec{H}}}_{{\varvec{k}}}}^{\mathbf{T}}\right]+{\widehat{{\varvec{R}}}}_{{\varvec{k}}-1}.$$

(19)

Step 3: Calculate the residual ${{\varvec{\varepsilon}}}_{{\varvec{k}}}$

$${{\varvec{\varepsilon}}}_{{\varvec{k}}}={{\varvec{Z}}}_{{\varvec{k}}}-{{\varvec{H}}}_{{\varvec{k}}}{\widehat{{\varvec{X}}}}_{{\varvec{k}},{\varvec{k}}-1}-{\widehat{{\varvec{r}}}}_{{\varvec{k}}-1}.$$

(20)

Step 4: Update the state vector and the noise covariance matrix

$${\widehat{{\varvec{X}}}}_{{\varvec{k}}}={\widehat{{\varvec{X}}}}_{{\varvec{k}},{\varvec{k}}-1}+{{\varvec{K}}}_{{\varvec{k}}}{{\varvec{\varepsilon}}}_{{\varvec{k}}},$$

(21)

$${{\varvec{P}}}_{{\varvec{k}}}=\left(\mathbf{I}-{{\varvec{K}}}_{{\varvec{k}}}{{\varvec{H}}}_{{\varvec{k}}}\right){{\varvec{P}}}_{{\varvec{k}},{\varvec{k}}-1}.$$

(22)

Step 5: Calculate the weigh factor ${d}_{k}$

$${d}_{k}=\frac{1-b}{1-{b}^{k+1}}$$

(23)

In Eq. (23), $b$ is the relaxation factor, $b\in (\mathrm{0,1})$.

Step 6: Update ${\widehat{{\varvec{q}}}}_{\mathbf{k}}$, ${\widehat{{\varvec{Q}}}}_{{\varvec{k}}}$, ${\widehat{{\varvec{r}}}}_{{\varvec{k}}}$, ${\widehat{{\varvec{R}}}}_{{\varvec{k}}}$

$${\widehat{{\varvec{q}}}}_{{\varvec{k}}}=\left(1-{d}_{k}\right){\widehat{{\varvec{q}}}}_{{\varvec{k}}-1}+{{\varvec{d}}}_{{\varvec{k}}}\left({\widehat{{\varvec{X}}}}_{{\varvec{k}}}-{\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}{\widehat{{\varvec{X}}}}_{{\varvec{k}}-1}\right),$$

(24)

$${\widehat{{\varvec{Q}}}}_{{\varvec{k}}}=\left(1-{d}_{k}\right){\widehat{{\varvec{Q}}}}_{{\varvec{k}}-1}+{d}_{k}\left({{{\varvec{K}}}_{{\varvec{k}}}{\varvec{\varepsilon}}}_{{\varvec{k}}}{{{\varvec{\varepsilon}}}_{{\varvec{k}}}}^{\mathbf{T}}{{\varvec{K}}}_{{\varvec{k}}}+{{\varvec{P}}}_{{\varvec{k}}}-{\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}{{\varvec{P}}}_{{\varvec{k}}-1}{{\boldsymbol{\varnothing }}_{{\varvec{k}},{\varvec{k}}-1}}^{\mathbf{T}}\right),$$

(25)

$${\widehat{{\varvec{r}}}}_{{\varvec{k}}}=\left(1-{d}_{k}\right){\widehat{{\varvec{r}}}}_{{\varvec{k}}-1}+{d}_{k}\left({{\varvec{Z}}}_{{\varvec{k}}}-{{\varvec{H}}}_{{\varvec{k}}}{\widehat{{\varvec{X}}}}_{{\varvec{k}},{\varvec{k}}-1}\right),$$

(26)

$${\widehat{{\varvec{R}}}}_{{\varvec{k}}}=\left(1-{d}_{k}\right){\widehat{{\varvec{R}}}}_{{\varvec{k}}-1}+{d}_{k}\left({{\varvec{\varepsilon}}}_{{\varvec{k}}}{{{\varvec{\varepsilon}}}_{{\varvec{k}}}}^{\mathbf{T}}-{{\varvec{H}}}_{{\varvec{k}}}{{\varvec{P}}}_{{\varvec{k}},{\varvec{k}}-1}{{{\varvec{H}}}_{{\varvec{k}}}}^{\mathbf{T}}\right).$$

(27)

Considering that the measurement accuracy has a positive correlation with the number of points, ${\widehat{{\varvec{R}}}}_{{\varvec{k}}}$ is updated as

$${\widehat{{\varvec{R}}}}_{{\varvec{k}}}=\left[\left(1-{d}_{k}\right){\widehat{{\varvec{R}}}}_{{\varvec{k}}-1}+{d}_{k}\left({{\varvec{\varepsilon}}}_{{\varvec{k}}}{{{\varvec{\varepsilon}}}_{{\varvec{k}}}}^{\mathbf{T}}-{{\varvec{H}}}_{{\varvec{k}}}{{\varvec{P}}}_{{\varvec{k}},{\varvec{k}}-1}{{{\varvec{H}}}_{{\varvec{k}}}}^{\mathbf{T}}\right)\right]*{\left[\frac{E\left({n}_{k}\right)}{{n}_{k}}\right]}^{r},$$

(28)

where $r$ is a parameter to indicate the influence of the number of points on the measurement accuracy.

The larger $r$ indicates the greater impact of the number of points on measurement accuracy. Through this method, this paper improves AKF under the scenarios of vehicular EIS system.

As shown in Figure 5, AKF retains the subjective motion vector and filters out vibration. More importantly, the initial noise setting has almost no effect on the filtering results, which means that AKF has the ability to adapt to various types of system noise.

5 Experimental Platform

5.1 Framework of the Model Car

In order to verify the adaptability of the developed EIS system to various conditions, an experimental platform able to provide extreme scenarios is desired. Accordingly, a gasoline model car possessing abundant NVH characteristics was established, with which off-road condition, obstacles and extreme operating condition can be easily implemented. The framework of the gasoline model car is shown in Figure 6. The model car is controlled by two control servos, namely the steering servo and the throttle servo. The control algorithm is programmed in the STM32 microcomputer placed in the front of the model car. An encoder is installed under the chassis to collect the speed signals.

5.2 EE Architecture of the Platform

Figure 7 shows the EE architecture of the platform. Concerning the computing platform, the upper computer uses Raspberry Pi 3B+ for capturing video. The photo sensitive chip of the camera used in this experiment is Sony IMX219, which is a CMOS chip. The camera captures video at the resolution of 480*480. The STM32 microcomputer is used to control the model gasoline car through two PWM signals. Due to the load limitation of the model car, an offline calculation method is adopted for EIS. The calculation platform is a quad-core CPU, and the basic frequency is 3.2 GHz.

5.3 Experimental Road Conditions

The platform includes an encoder to collect speed signals, and it is difficult for the encoder to work stably under harsh road conditions. Therefore, we installed bumps on both front wheels during the experiment to simulate the high unevenness road, which is shown in Figure 8(a) and (b). From the perspective of captured videos, this is equivalent to conducting experiments directly under harsh road conditions.

6 Analysis of the Effect of Electronic Image Stabilization Algorithm

The effect of EIS is often measured by PSNR,

$$PSNR=10\times \mathrm{lg}\left(\frac{{\left({2}^{n}-1\right)}^{2}}{MSE}\right).$$

(29)

In Eq. (29), $MSE$ is given by

$$MSE=\frac{1}{MN}\sum_{m=1}^{M}\sum_{n=1}^{N}{({I}_{o}\left(m,n\right)-{I}_{c}(m,n))}^{2},$$

(30)

where $M$ and $N$ denote the length and the width of the frame, respectively. ${I}_{o}\left(m,n\right)$ denotes the pixel value of the coordinate point (m, n) in the original figure, while ${I}_{c}(m,n)$ denotes that of the compensated figure. Figure 9 shows the comparison of PSNR between the original video and processed video.

The average PSNR increases by 1.26 dB, which proves the positive image stabilization effect of the proposed EIS system. It is worth noting that, a 1.20 dB increment seems to be less than the results in other papers. Actually, it is meaningless to compare PSNR of different EIS, because if the filter parameters are set to make the filter results smooth enough, PSNR can be significantly improved. However, if doing so, the purpose of preserving the subjective motion vector of the car is lost.

7 Conclusions

NVH-related problems induced by off-road condition, obstacles and extreme operating condition still deserve more attention in modern intelligent vehicles. They deteriorate not only the driving performance but also the function of advanced driver assistance system (ADAS) in vehicles. For instance, different levels of jitter in the image sequences captured by a vehicular camera might happen and affect subsequent observation and interpretation of information in the images. Aiming at this problem, a vehicular EIS system based on a gasoline model car platform is proposed. The gasoline model car exhibits abundant NVH characteristics, and is consequently a very convenient and appropriate experimental platform with which off-road condition, obstacles and extreme operating condition can be easily implemented. The conclusions on the proposed vehicular EIS and corresponding experimental results are summarized as follows.

(1)
Feature point detection and matching based on an ORB algorithm are implemented to match images in the process of EIS. It shows that the average processing time of each frame is 6.0 ms. The average amounts of the detected points and the matched feature point pairs in each frame are 483.7 and 223.4 respectively. It is proved that the ORB algorithm satisfies the real-time processing requirements for vehicular application.
(2)
A new i-RANSAC algorithm is proposed to eliminate the mismatched feature points, and improve its instantaneity and accuracy under certain circumstances. And a novel definition, FPMA, is proposed to quantify the performance of the algorithm proposed in a video. The i-RANSAC algorithm shows a significantly improved performance from the FPMA. Besides, the average of frames processed per second in the i-RANSAC algorithm has increased by 32.4% compared with that number in the traditional RANSAC algorithm.
(3)
A Sage-Husa-based AKF is applied to improve the adaptability of the vehicular EIS. Considering that the measurement accuracy has a positive correlation with the number of feature points, the update of observation noise variance is improved. The average PSNR of the video processed with the EIS system has increased by 1.26 dB compared with that of the original video. The results show that the proposed EIS system can satisfy vehicular performance requirements even under off-road condition with obvious obstacles.

References

M Jean, M James, M Cruickshank, et al. Video-rate image stabilization system. Proceedings of Spie the International Society for Optical Engineering, 1998, 62(6): 313-321.
Google Scholar
N Sebastien. Hot Chips 2012 AMD “Trinity” APU. 2012 IEEE Hot Chips 24 Symposium, Cupertino, USA, August 27–29, 2012: 1-40.
How strong is Huawei P20 Pro night scene shooting? Uncover the legend of "eye of night God". Beijing, China: Sina Tech, 2018[2021-05-31]. http://tech.sina.com.cn/roll/2018-04-06/doc-ifyteqtq4753502.shtml.
E Mingkhwan, W Khawsuk. Digital image stabilization technique for fixed camera on small size drone. 2017 Third Asian Conference on Defence Technology, Phuket, Thailand, January 1-20, 2017: 12-19.
R Raj, P Rajiv, P Kumar, et al. Feature based video stabilization based on boosted HAAR Cascade and representative point matching algorithm. Image and Vision Computing, 2020, 101: 1-8.
Article Google Scholar
D Yang, X Jiao, K Jiang, et al. Driving space for autonomous vehicles. Automotive Innovation, 2019, 2(4): 241-253.
Article Google Scholar
C Harris, M Stephens. A combined corner and edge detector. Proceedings of 4th Alvey Vision Conference, Manchester, England, August 31- September 21, 1988: 147-151.
D G Lowe. Distinctive image features from scale-invariant key points. International Journal of Computer Vision, 2004, 2(60): 91-110.
Article Google Scholar
H Bay, E Andreas, T Tinne, et al. SURF: Speeded up robust features. Computer Vision and Image Understanding, 2008, 110(3): 346-359.
Article Google Scholar
R Edward, D Tom. Machine learning for high-speed corner detection. European Conference on Computer Vision, Graz, Austria, May 01-13, 2006: 430-443.
B Kir, M Kurt, O Urhan. Local binary pattern based fast digital image stabilization. IEEE Signal Processing Letters, 2014, 22(3): 341-345.
Article Google Scholar
S L Al-Khafaji, Z Jun, A Zia, et al. Spectral-spatial scale invariant feature transform for hyperspectral images. IEEE Transactions on Image Process, 2018, 27(2): 837-850.
Article MathSciNet MATH Google Scholar
R Wang, Y Shi, W Cao. GA-SURF: A new speeded-up robust feature extraction algorithm for multispectral images based on geometric algebra. Pattern Recognition Letters, 2019, 127: 11-17.
Article Google Scholar
S Gupta, M Kumar, A Garg. Improved object recognition results using SIFT and ORB feature detector. Multimedia Tools and Applications, 2019, 78(23): 34157-34171.
Article Google Scholar
Y Wang, R Chang, T W Chua, et al. Video stabilization based on high degree b-spline smoothing. Proceedings of the 21st International Conference on Pattern Recognition, Tsukuba, Japan, November 11-15, 2012: 3152-3155.
Z Ren, C Chen, M Fang. Electronic image stabilization algorithm based on smoothing 3D rotation matrix. 2017 3rd IEEE International Conference on Computer and Communications, Chengdu, China, December 13-16, 2017: 2752-2755.
X Cheng, Q Hao, M Xie. A comprehensive motion estimation technique for the improvement of EIS Methods based on the SURF algorithm and Kalman filter. Sensors, 2016, 16(4): 486-500.
Article Google Scholar
K Lakshya, S Indu. A hybrid filtering approach of digital video stabilization for UAV using Kalman and low pass filter. Procedia Computer Science, 2016, 93: 359-366.
Article Google Scholar
H Yi, S Yang, S Zhou, et al. An innovative state-of-charge estimation method of Lithium-ion battery based on 5th-order cubature Kalman filter. Automotive Innovation, 2021, 4(4): 448-458.
Article Google Scholar
R Y Park, J M Pak, C K Ahn, et al. Image stabilization using FIR filters. 2015 15th International Conference on Control, Automation and Systems, Busan, Korea, October 13-16, 2015: 1234-1237.
Y W Choi, T H Kang, S G Lee. Development of image stabilization system using extended kalman filter for a mobile robot. International Conference in Swarm Intelligence, Beijing, China, June 12-15,2010: 675-682.
J Yang, S Dan, M Mohamed. Robust video stabilization based on particle filter tracking of projected camera motion. IEEE Transactions on Circuits & Systems for Video Technology, 2009, 19(7): 945-954.
Article Google Scholar
J Zhu, C Li, J Xu. Digital image stabilization for cameras on moving platform. 2015 International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Adelaide, Australia, September 23-25, 2015: 255-258.
K Ioannidis, I Andreadis. A digital image stabilization method based on the Hilbert-Huang transform. IEEE Transaction on Instrumentation and Measurement, 2012, 61(9): 2446-2457.
Article Google Scholar
J Wang, S Zheng, Y Du, et al. Study on the ORB algorithm in the application of Monocular SLAM. Nanotechnology, 2015, 2(3): 186-189.
Google Scholar
E Montijano, S Martinez, C Sagues. Distributed robust consensus using ransac and dynamic opinions. IEEE Transactions on Control Systems Technology, 2014, 23(1): 150-163.
Article Google Scholar
Y Wang, Y Sun, V Dinavahi. Robust forecasting-aided state estimation for power system against uncertainties. IEEE Transactions on Power Systems, 2020, 35(1): 691-702.
Article Google Scholar

Download references

Acknowledgements

The authors sincerely appreciate the contributions from Mr. Tian Li and Mr. Haobin Zhang on their discussion and critical suggestions during manuscript preparation and revision.

Funding

Supported by National Natural Science Foundation of China (Grant Nos. 52072072, 52025121 and 51605087).

Author information

Authors and Affiliations

School of Mechanical Engineering, Southeast University, Nanjing, 211189, China
Ning Zhang, Jianhua Wu, Ziqian Zhao & Guodong Yin
School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
Yuan Yang

Authors

Ning Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jianhua Wu
View author publications
You can also search for this author in PubMed Google Scholar
Ziqian Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Guodong Yin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Ning Zhang, born in 1985, is currently a research staff and associate professor in School of Mechanical Engineering, Southeast University, China. He received his doctoral degree in Mechanical Engineering from Darmstadt University of Technology, Germany, in 2015. His research interests include vehicle system dynamics and control, intelligent and connected vehicles, sound and vibration.

Yuan Yang, born in 1984, is currently a research staff and associate professor in School of Instrument Science and Engineering, Southeast University, China. She received her doctoral degree in Informatics from Free University of Berlin, Germany, in 2015. Her research interests include integrated indoor positioning, location-based service, robot autonomous navigation.

Jianhua Wu, born in 1996, is currently a master candidate in School of Mechanical Engineering, Southeast University, China.

Ziqian Zhao, born in 1994, is currently a PhD candidate in School of Mechanical Engineering, Southeast University, China.

Guodong Yin, born in 1976, is currently a Professor in School of Mechanical Engineering, Southeast University, China. He received his doctoral degree in School of Mechanical Engineering, Southeast University, China, in 2007. His current research interests include vehicle system dynamics and control, intelligent and connected vehicles, and multi-agent control.

Corresponding authors

Correspondence to Ning Zhang or Guodong Yin.

Ethics declarations

Competing Interests

The authors declare no competing financial interests.

Author contributions

NZ proposed the idea, professional solutions, and provided project support for the work. YY proposed the requirements of EIS, and provided professional guidance for the framework of EIS. JW established the experimental platform, carried out research and wrote the manuscript draft. ZZ participated in the establishment of the experimental platform. GY provided professional suggestions and project support for the work. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, N., Yang, Y., Wu, J. et al. Vehicular Electronic Image Stabilization System Based on a Gasoline Model Car Platform. Chin. J. Mech. Eng. 35, 134 (2022). https://doi.org/10.1186/s10033-022-00805-1

Download citation

Received: 28 September 2022
Revised: 28 September 2022
Accepted: 29 September 2022
Published: 04 November 2022
DOI: https://doi.org/10.1186/s10033-022-00805-1

Vehicular Electronic Image Stabilization System Based on a Gasoline Model Car Platform

Abstract

1 Introduction

2 Feature Point Detection Based on ORB Algorithm

2.1 Selection of Image Matching Algorithm

2.2 ORB Algorithm Principle

2.3 Results of Feature Points Detection and Matching

3 Elimination of Mismatched Feature Points

3.1 Image Transformation Matrix

3.2 Improved RANSAC Algorithm

3.3 Results of Eliminating Mismatched Feature Points

4 Filter of the Image Transformation Matrix

4.1 Kalman Filter

4.2 Adaptive Kalman Filter

5 Experimental Platform

5.1 Framework of the Model Car

5.2 EE Architecture of the Platform

5.3 Experimental Road Conditions

6 Analysis of the Effect of Electronic Image Stabilization Algorithm

7 Conclusions

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Author contributions

Rights and permissions

About this article

Cite this article

Share this article

Keywords