Swing-up control of double-inverted pendulum systems

Al Juboori, Ameen M.; Hussein, Mustafa Turki; Qanber, Ali Sadiq Gafer

doi:https://doi.org/10.5194/ms-15-47-2024

Articles | Volume 15, issue 1

https://doi.org/10.5194/ms-15-47-2024

Articles | Volume 15, issue 1

Research article

29 Jan 2024

Research article |

| 29 Jan 2024

Swing-up control of double-inverted pendulum systems

Ameen M. Al Juboori, Mustafa Turki Hussein, and Ali Sadiq Gafer Qanber

Abstract

This article deals with presenting a new swing-up control approach of a double-inverted pendulum on a trolley. The dynamic model of the double-inverted pendulum is derived and linearized. Two different linearization approaches are used: first, the traditional Taylor's series approach and, second, using partial linearization. A state feedback control algorithm has been implemented based on the linearized model from Taylor's series. Furthermore, a method for swinging up the pendulum to the inversion position from rest (swing-up) has been presented. The design and implementation of the swing-up function of the pendulum are implemented using the partial linearized model. The swing-up control procedure depends on using the feedforward–feedback controllers' combination to transfer the pendulums from the downward to the upward position. The time-variant controller gain is used for the sake of the swing-up control procedure. The performances of these algorithms are shown in this paper through simulations.

Download & links

Article (PDF, 2649 KB)

Download & links

How to cite.

Received: 25 Nov 2022 – Revised: 04 May 2023 – Accepted: 09 Nov 2023 – Published: 29 Jan 2024

1 Introduction

Non-linear, unstable, or underactuated systems are generally very difficult to control. Therefore, they are considered to be a challenge and are the subject of many technical reports, student projects, and academic papers. Their goal is not always to build a practically useful device but rather to develop or improve existing control algorithms, either by simulation or even experimentally to verify their applicability in general. The double-inverted pendulum model consists of several parts. A cart moves along a linear track, and two pendulums are connected to the carriage and each other by a rotational linkage (Fig. 1). The input to the system is the torque of a DC motor, which is transmitted by a toothed belt as a force to the carriage. The system has one challenging equilibrium position in which the system can be stabilized, the upper (inverse) (see Fig. 1a–c). Furthermore, three configurations of the double-inverted pendulum, which are upper-lower, lower-upper, and lower, are also considered as equilibrium positions. the lower equilibrium is always reached by the system (with the damping element) in finite time; control is needed to maintain the upper-lower, lower-upper, and inverse configurations in real conditions. The stabilization of the double-inverse pendulum is further understood in this paper as a regulation to maintain its inverse position.

The next task of the control program is to be the so-called swing-up. This term refers to the realization of such a movement of the carriage that brings the system from the lower equilibrium position to the inverse position. The situation is indicated in Fig. 1c. The control requirement is primarily for the robustness and repeatability of the swing-up. There are several published articles with different methods addressing the swing-up problem of a double-inverted pendulum. However, in the vast majority of cases, these are only simulations that have not been experimentally verified on a real system.

A mathematical model of this system was derived based on the Lagrangian mechanics; the dynamic model is discretized, and then the Laguerre series is implemented in model predictive control technology to trace the control signal for the system (Qian et al., 2011). An RNA genetic algorithm with fuzzy logic is used to control the pendulum system, where the fuzzy logic controller can improve the performance of the controller by using the RNA genetic algorithm to find certain optimal membership functions (Sun et al., 2015). The technical report compares the linear quadratic regulator, the state-dependent Riccati equation (SDRE), and the use of neural networks (NNs) and concludes that the NNs have a limited capability to improve the SDRE performance (Bogdanov et al., 2004). Adaptive sliding-mode control in combination with a fuzzy neural network is used to control a double-inverted pendulum. The fuzzy neural network is designed as a system controller, and the adaptive sliding mode is designed to carry out the disturbance problem (Mon and Lin, 2014).

In the previous literature, the dynamic model of the double-inverted pendulum system is linearized around an operating point to design a linear controller. An alternative to the above dynamic linearization is partial feedback linearization – splitting the generalized coordinates into a regularization whose dimensions are given by the number of inputs, and variables consider only zero dynamics (Hedrick and Girard, 2010; Neusser and Valášek, 2013). This path of linearization is used to design the swing-up function of the double-inverted pendulum (Hedrick and Girard, 2010). A more detailed description of the method can be found in the swing-up control practical example (Neusser and Valášek, 2013). Swing-up control of the double-inverted system was proposed to separate pendulums and to control each one distinctly (Henmi et al., 2014). A non-linear model predictive control is used to build up a control algorithm for swing-up motion (Jaiwat and Ohtsuka, 2014). A method for controlling the energy of the system with partial linearization is presented, and passivity-based control is utilized in the work (Zhong and Rock, 2001). The solution of the boundary value problem with free parameters is used to generate the control approach (Graichen et al., 2007)

https://ms.copernicus.org/articles/15/47/2024/ms-15-47-2024-f01

Figure 1(a) Lower equilibrium position, (b) swing-up, and (c) upper equilibrium position.

Download

The motivation of this work was mainly to demonstrate the theoretical swing-up control procedures on a model of a double-inverted pendulum. Indeed, there are not many publications that deal with the application of the proposed algorithms on a real mechanism. The paper focuses on the theoretical basis for the following part. It consists of an explanation of the terms used and a search of the studied area in terms of stabilization and swing-up. It also includes the derivation of the equations of motion and the presentation of the existing double-pendulum model. Furthermore, this study serves to apply a linear quadratic controller and a Kalman filter and lastly to implement the swing-up function. The boundary value problem is solved to generate the trajectory for the swing-up motion of the pendulums. The partial linear realization method is used to linearize the system dynamically during the swing-up process. Dynamic input–output decoupling is used to keep the system in a stable position after the pendulums reach the upper unstable position (Qian et al., 2011).

The rest of the paper is organized as follows: in the following section, the mathematical model is derived; next, the stabilization control is presented; after that, the swing-up procedure is detailed; later, the simulation of the work is addressed, and the article conclusions are shown in the last section.

2 Mathematical modeling

Figure 2 schematically shows the double-inverted pendulum model. The meaning of the individual variables is evident from the figure. The system has 3 degrees of freedom – the sliding motion of the carriage and the rotational motions of the two pendulums. A vector of state variables is constructed as the displacement of the carriage or the rotation of the pendulums and the velocity of the carriage or the angular velocity of the pendulums:

\begin{matrix} (1) & θ = {[θ_{0}, θ_{1}, θ_{2}, {\dot{θ}}_{0}, {\dot{θ}}_{1}, {\dot{θ}}_{2}]}^{T} . \end{matrix}

For input u, the problem of obtaining a mathematical model is a search for an equation of motion of the following form:

\begin{matrix} (2) & \dot{θ} = f (θ) + g (θ u) . \end{matrix}

Lagrange's equations of the second kind were used for this purpose. The mathematical model of the double-inverse pendulum can be expressed as the sum of or difference between the partial derivatives of the mechanical energies according to the respective independent variables and their derivatives:

\begin{matrix} (3) & \frac{d}{d t} (\frac{\partial E_{K}}{{\dot{θ}}_{i}^{.}}) - \frac{\partial E_{K}}{θ_{i}} + \frac{\partial D}{θ_{i}} + \frac{\partial E_{P}}{θ_{i}} = \frac{\partial W_{e}}{θ_{i}} = \frac{\partial P_{e}}{{\dot{θ_{i}}}^{.}}, \end{matrix}

where E_K and E_P are the kinetic and the potential energy. D denotes the dissipative component, W_e denotes the work of external forces, and P_e denotes their power. For individual bodies, the kinetic, potential, and dissipative energies and external forces can be expressed (Bogdanov, 2004).

https://ms.copernicus.org/articles/15/47/2024/ms-15-47-2024-f02

Figure 2Schematic of the double-inverted pendulum model.

Download

The velocity energies of the centers of gravity of the pendulums can be obtained from the time derivative of their position vector.

\begin{matrix} (4) & E_{K_{0}} = \frac{1}{2} m_{0} {\dot{θ}}_{0}^{2} + \frac{1}{2} I_{0} ω_{0}^{2} = \frac{1}{2} m_{0} {\dot{θ}}_{0}^{2} + \frac{1}{2} \frac{I_{0}}{R_{0}^{2}} {\dot{θ}}_{0}^{2} \end{matrix}

\begin{matrix} (5) & E_{K_{1}} = \frac{1}{2} m_{1} [{({\dot{θ}}_{0} + l_{1} {\dot{θ}}_{1} \cos θ_{1})}^{2} + {(l_{1} {\dot{θ}}_{1} \sin θ_{1})}^{2}] + \frac{1}{2} l_{1} {\dot{θ}}_{1}^{2} \end{matrix}

\begin{matrix} (6) & \begin{aligned} E_{K_{2}} & = \frac{1}{2} m_{2} [{({\dot{θ}}_{0} + L_{1} {\dot{θ}}_{1} \cos θ_{1} + l_{2} {\dot{θ}}_{2} \cos θ_{2})}^{2} \\ + {(L_{1} {\dot{θ}}_{1} \sin θ_{1} + l_{2} {\dot{θ}}_{2} \sin θ_{2})}^{2}] + \frac{1}{2} I_{2} {\dot{θ}}_{2}^{2} \end{aligned} \end{matrix}

Potential energy is as follows:

\begin{matrix} (7) & \begin{aligned} E_{P_{0}} & = 0, E_{P_{1}} = m_{1} g l_{1} \cos θ_{1}, E_{P_{2}} \\ = m_{2} g (L_{1} \cos θ_{1} + l_{2} \cos θ_{2}) . \end{aligned} \end{matrix}

The dissipative component is as follows:

\begin{matrix} (8) & \begin{aligned} D_{0} = \frac{1}{2} b_{0} {\dot{θ}}_{0}^{2}, D_{1} = \frac{1}{2} b_{1} {\dot{θ}}_{1}^{2} + \frac{1}{2} b_{2} {({\dot{θ}}_{2} - {\dot{θ}}_{1})}^{2}, \\ D_{2} = \frac{1}{2} b_{2} {({\dot{θ}}_{2} - {\dot{θ}}_{1})}^{2} . \end{aligned} \end{matrix}

In Eqs. (4)–(8), the θ_j represents the motion variable; m_j is the mass; b_j is the viscous friction constant; and $j = 0, 1, 2$ stands for cart and first and second pendulum, respectively. I₀ is the inertia of the pulley, and l_1,2 are the lengths of the pendulums.

The only input to the system is the moment μ on the motor shaft. However, for better understanding, the equations were derived for the force on the carriage F. There is a simple relationship of direct proportionality between these two quantities:

\begin{matrix} (9) & μ = F R_{0}, \end{matrix}

where R₀ is the radius of the pulley. Then the external force output is given by

\begin{matrix} (10) & P_{e} = F {\dot{θ}}_{0} . \end{matrix}

Substituting the above equations into Eq. (2) gives the resulting equation of motion, which can be written in standard matrix form:

\begin{matrix} (11) & M (q) \ddot{q} + B (q, \dot{q}) \dot{q} + K (q) = H_{u}, \end{matrix}

where the vector $q = [θ_{0}, θ_{1}, θ_{2}]$ ; the introduction of matrices is not listed here for the sake of abbreviation. The equation of motion in the form of Eq. (5) can be expressed as follows:

\begin{matrix} (12) & \begin{aligned} \dot{θ} & = [\begin{array}{cc} 0 & I \\ 0 & - M^{- 1} B \end{array}] θ + [\begin{array}{c} 0 \\ - M^{- 1} K \end{array}] \\ + [\begin{array}{c} 0 \\ M^{- 1} H \end{array}] u, \end{aligned} \end{matrix}

where 0 represents the zero matrix, and I represents the unit matrix of the corresponding dimensions.

To design a linear controller, the non-linear equation of motion must first be linearized. It is common to approximate around the nominal operating point (θ_n, u_n) by the first terms of the Taylor series (Franklin et al., 2002). The resulting equation of motion represented by appropriate state vector and state space matrices is as follow:

\begin{matrix} (13) & Δ \dot{θ} (t) = A Δ θ (t) + B Δ u (t) . \end{matrix}

3 Stabilization control of the system

In this section, attention is paid to the possibilities of stabilizing the double-inverted pendulum. The LQR and feedback linearization as the theoretical basis for the following system of the swing-up search have achieved detailed description.

3.1 Linear quadratic regulator (LQR)

The objective of this method is to design the optimal control of a linear system given the magnitude of the active intervention u and the deviation of the states x from the zero (desired) value in time. Practically, the objective is to minimize the cost function J, which is given by

\begin{matrix} (14) & J = \int_{0}^{T} (x^{T} (t) Q_{c} x (t) + u^{T} (t) R_{c} u (t)) d t . \end{matrix}

For the state feedback law, we have the following:

\begin{matrix} (15) & u (t) = - K_{c} x (t) . \end{matrix}

For discrete systems, the integral in Eq. (6) is replaced by the summation. The matrices Q_c and R_c correspond to the weights of the states and inputs, respectively (Franklin et al., 2002). K_c is the control matrix that can be subsequently used for stabilization and is given by the solution of the associated Riccati equation. LQR is a widely used method, and there is a wealth of documentation on it. Implementation-wise, the simplest modification of LQR for non-linear systems is the state-dependent Riccati equation (SDRE). The principle of the method is to linearize the model around the current state for each time instant and then compute the optimal control matrix. However, this method places greater demands on computational power.

3.2 Feedback linearization (FBL)

In contrast to the usual linear function approximation, the method used does not neglect the non-linear terms and works even outside the vicinity of the working point. There are two principles utilized in combination in this work's input state and input–output linearization.

The necessary algebraic operations are generally not trivial. For the sake of scope, the system is restricted to the single input, single output (SISO) described by the following:

\begin{matrix} (16) & \dot{x} = [\begin{array}{l} \begin{array}{c} {\dot{x}}_{1} \\ ⋱ \\ {\dot{x}}_{n - 1} \end{array} \\ {\dot{x}}_{n} \end{array}] = [\begin{array}{l} \begin{array}{c} x_{2} \\ ⋱ \\ x_{n} \end{array} \\ f (x) + g (x) u \end{array}] . \end{matrix}

The input state linearization consists of finding a suitable transformation of the states T and input u such that

\begin{matrix} (17) & z = T (x), u = u (x, v) . \end{matrix}

The role of regulation is transferred to system control:

\begin{matrix} (18) & \dot{z} = A_{c} x + B_{c} v . \end{matrix}

The linearization conditions, detailed description, and examples are exhaustively explained in Hedrick and Girard (2010) and Graichen et al. (2007).

We assume that the outputs of the considered system y are given by the following function:

\begin{matrix} (19) & y = h (x) . \end{matrix}

The principle of the input–output linearization is to find the direct dependence of the outputs on the input by successively deriving the function h with regard to time until the following dependence appears:

\begin{matrix} (20) & \begin{aligned} y^{(1)} & = \frac{\partial h}{\partial x} \frac{d x}{d t} = \frac{\partial h}{\partial x} [f (x) + g (x) u] = L_{f} (h) \\ + L_{g} (h) u, y^{(2)} = L_{f}^{2} (h) + L_{g} (L_{f} (h)) u, \end{aligned} \end{matrix}

and so on until, for the kth derivative, the input term is non-zero (Aguiar, 2011). The prescription of the new input u is given by

\begin{matrix} (21) & u = \frac{1}{L_{g} (L_{f}^{k - 1} (h))} (- L_{f}^{k} (h) + v), \end{matrix}

where v is equal to y(k). The transformed systems prescribed by Eqs. (22, 23) are already completely linear – they can therefore be controlled by the linear method (LQR, array placement).

\begin{matrix} (22) & \dot{z} = (\begin{array}{l} \begin{array}{c} y^{(1)} \\ ⋱ \\ y^{(k - 1)} \end{array} \\ y^{(k)} \end{array}) = A_{c} z + B_{c} v \end{matrix}

\begin{matrix} (23) & \begin{aligned} A_{c} = [\begin{array}{l} 0 1 0 ⋱ 0 \\ 0 0 1 0 ⋱ \\ ⋱ ⋱ ⋱ ⋱ ⋱ \\ 0 0 ⋱ 0 1 \\ 0 0 ⋱ 0 0 \end{array}], B_{c} = [\begin{array}{l} \begin{array}{c} 0 \\ 0 \\ ⋱ \end{array} \\ 0 \\ 1 \end{array}], \\ v = - K_{c} z \end{aligned} \end{matrix}

A schematic of the closed-loop control is shown in Fig. 3. For systems with n inputs and m outputs (MIMO), where n=m, the procedure is more complicated (we refer to static input–output decoupling), but the basic principles remain the same (Hedrick and Girard, 2010; Henson and Seborg, 1997) and lead to time-invariant feedback control.

https://ms.copernicus.org/articles/15/47/2024/ms-15-47-2024-f03

Figure 3Block diagram of regulation with input–output feedback linearization.

Download

Also, A Kalman filter, which is a very useful tool that extracts the best possible estimate of all the states of the system (even if not all of them are measured) from imperfect knowledge of the model and inaccurate measurements (Nise, 2020), is used in this work. This is a stochastic observer – it assumes that the quantities are random with a Gaussian distribution and works with their mean and variance (uncertainty). Tuning of the algorithm consists of an appropriate choice of the matrices Q_o and R_o, which introduce the covariance of process and measurement noise into the calculation. At each stage, the state estimation and covariance matrix P_o are adjusted, which carries information about the uncertainty of the states and their correlation with each other.

4 Swing-up control

The key to planning a swing-up is finding a suitable trajectory for the carriage and pendulums so that the positions and velocities of the carriage and pendulums are zero when the swing-up is complete. However, whatever the method of obtaining the swing-up trajectory (or the necessary input for swing-up), in terms of structure, it is possible to divide the control into forward or backward. Feedforward control can only be used if the behavior of the double-inverted pendulum is well known. A previously calculated input is applied to the system, and a certain output is expected. However, the double-inverse pendulum is extremely sensitive to initial conditions. Even a small deviation from the calculated trajectory can cause a failed swing-up. A possible solution is to add feedback control. If we denote by u_f the input given by as the forward control and that given by u_b as the backward control, then the resultant is given by their sum (Fig. 4).

https://ms.copernicus.org/articles/15/47/2024/ms-15-47-2024-f04

Figure 4Block diagram of feedforward and feedback control for swing-up.

Download

4.1 Boundary value problem (BVP) with free parameters

Finding a trajectory for the swing-up function is a case of the boundary value problem. The equation of motion (1) is a system of ordinary differential equations (ODEs), and the boundary conditions are as follows:

\begin{matrix} (24) & \begin{aligned} θ_{0} (t = 0) & = {[0, π, π, 0, 0, 0]}^{T}, θ_{T} (t = T) \\ = {[0, 0, 0, 0, 0, 0]}^{T}, \end{aligned} \end{matrix}

where t is the time, and T is the duration of the swing. For the 6 equations and 12 conditions, the associated boundary value problem is overdetermined. In order to solve it, Qian et al. (2011) propose defining the cart trajectory Y(t) by the cosine series (13) and thus adding the necessary number of free p=[p₁ p₂ p₃ p₄] to the equation of motion.

\begin{matrix} (25) & Y (t, p) = a_{0} + a_{1} \cos (\frac{π t}{T}) + \sum_{i = 2}^{5} p_{i - 1} \cos (\frac{i π t}{T}) \end{matrix}

The terms $a_{0} = - (p_{1} + p_{3})$ and $a_{1} = - (p_{2} + p_{4})$ are obtained by setting $Y (0, p) = Y (T, p) = 0$ , which appears by deriving Eq. (13) by time. The boundary value problem can be solved by a suitable numerical method. If the deviation of the actual state values from the nominal trajectory θ^∗ and input u^∗ is sufficiently small, the system can be described by a linear time-dependent state equation (Zhong and Rock, 2001):

\begin{matrix} (26) & Δ \dot{θ} (t) = A (t) Δ θ (t) + B (t) Δ u (t), \end{matrix}

where the matrices A and B are given by the linearization of Eq. (12) (see “Feedback linearization” section) along the nominal trajectory and input. And for such a system, we design a linear controller at each time instant:

\begin{matrix} (27) & Δ u (t) = - K_{c} (t) Δ θ (t) = - K_{c} (t) (θ (t) - θ^{*} (t)), \end{matrix}

where the control matrix K is computed forward. Its calculation therefore does not burden the processor on which the swing-up and subsequent stabilization program runs. The resulting forward and backward control actions are given by the following:

\begin{matrix} (28) & u (t) = u_{a} (t) + K_{c} (t) (θ^{*} (t) - θ (t)) . \end{matrix}

4.2 Partial linearization and feedforward

By studying the natural motion of a system (without control), some useful knowledge can be gained. In order to calculate a suitable elevation trajectory, the natural frequencies of the system must be found. This is based on the findings of the feedback linearization (FBL).

For the sake of partial linearization, the generalized coordinates of the double-inverse pendulum and the variable vector q are divided into two parts; q_x denotes the controlled part (the carriage), and q_θ denotes the rest with zero dynamics (the pendulum). With the proposed decomposition, Eq. (5) is decomposed into the following:

\begin{matrix} (29) & M_{x x} {\ddot{q}}_{x} + M_{x θ} {\ddot{q}}_{θ} + B_{x x} {\dot{q}}_{x} + B_{x θ} {\dot{q}}_{θ} = u, \end{matrix}

\begin{matrix} (30) & M_{θ x} {\ddot{q}}_{x} + M_{θ θ} {\ddot{q}}_{θ} + B_{θ θ} {\dot{q}}_{θ} + K_{θ} = 0, \end{matrix}

and the matrices M, B, and K become

\begin{aligned} M (q) = [\begin{array}{cc} M_{x x} (1, 1) & M_{x θ} (1, 2) \\ M_{θ x} (2, 1) & M_{θ θ} (2, 2) \end{array}], B (q, \dot{q}) \\ = [\begin{array}{cc} B_{x x} (1, 1) & B_{x θ} (1, 2) \\ 0 (2, 1) & B_{θ θ} (2, 2) \end{array}], K (q, \dot{q}) \\ = [\begin{array}{c} 0 \\ K_{θ} (2, 1) \end{array}], \end{aligned}

where 0 represents the zero vector, and the numbers in parentheses indicate the dimension of the matrix – so, for example, M_xθ=[d₂cos θ₁ d₃cos θ₂]. From Eq. (30), ${\ddot{q}}_{θ}$ is expressed and substituted into Eq. (29). If the output of the system is $y = q_{x},$ then the feedback linearization of the obtained equation gives the following:

\begin{matrix} (31) & \ddot{y} = {\ddot{q}}_{x} = v, \\ (32) & \begin{aligned} u & = B_{x x} {\dot{q}}_{x} + (B_{x θ} + {WB}_{θ θ}) {\dot{q}}_{θ} + {WK}_{θ} \\ + ({WM}_{θ x} + M_{x x}) v, \end{aligned} \end{matrix}

where, to simplify the notation, $W = - M_{x θ} M_{θ θ}^{- 1}$ .

The principle further consists of finding an input v for which the boundary conditions in Eq. (24) hold. In general, it is given by the sum of two parts. The first makes the cart move a certain distance from the initial state at time T. However, since the end position of the cart is supposed to be identical to the initial position, i.e., $θ_{0} (0) = θ_{0} (T) = 0$ , the mentioned component will be zero for this case. The second component does not change the final position of the cart, but due to the entanglement via zero dynamics, it will cause the pendulums to swing. It is proposed in the following form (Hedrick and Girard, 2010):

\begin{matrix} (33) & v = \sum_{i} A_{i} \sin (2 π f_{i} t + φ_{i}) . \end{matrix}

The amplitudes A_i are relative to the change in the potential energy of the system:

\begin{matrix} (34) & Δ E_{p} = m_{1} l_{1} g + m_{2} (L_{1} + l_{2}) g . \end{matrix}

However, the exact numbers, as well as the phase shifts φ_i, are the result of an optimization process to obtain boundary conditions for the q_θ coordinates as well. If there are natural numbers K_i, then the following holds (Neusser and Valášek, 2013):

\begin{matrix} (35) & 2 π f_{i} T = 2 π K_{i} . \end{matrix}

5 Simulations

The calculations of the linearized model and the gains of the controller and observer are shown in Fig. 5. In order to avoid the errors as much as possible, after deriving Eq. (5), all further modifications were performed using the Symbolic Math Toolbox in MATLAB. In this way, the equation of motion (12) and its linearized form in the form of state space matrices A and B were obtained. These are complex relations with a considerable number of terms. The system was further simulated in Simulink environment with an ode4 (Runge–Kutta) solver with a constant time step of 0.001 s. The verification of the equation of motion (12) was successful. It faithfully describes the model under consideration. This is one of the basic prerequisites for the design of a successful control algorithm. The parameters used in the simulation are shown in Table 1.

https://ms.copernicus.org/articles/15/47/2024/ms-15-47-2024-f05

Figure 5Simulation procedure in MATLAB and Simulink.

Download

Table 1Parameters values used in the simulation.

Download Print Version | Download XLSX

5.1 Stabilization control simulation

In Sect. 3.1 in this article, the method of obtaining the linear equations of the system described by the matrices A and B was explained. The optimal control multipliers K were calculated based on the model and the given weight matrices Q_c and R_c. An extended Kalman filter was used as an observer to perform the correction based on the measurement of the carriage position and the pendulum rotation. The Q_c and R_c for the LQR controller and Q_Kal and R_Kal for the Kalman filter are chosen as follows: $Q_{c} = diag [\begin{array}{cccccc} 10 & 350 & 350 & 0 & 0 & 0 \end{array}]$ , R_c=1 , $Q_{Kal} = diag [\begin{array}{cccccc} 0.1 & 0.03 & 0.03 & 0.15 & 0.4 & 0.4 \end{array}]$ , and $R_{Kal} = diag [\begin{array}{ccc} 0.3 & 0.7 & 0.7 \end{array}]$ . Sensor noise was simulated at this stage by random numbers with a Gaussian distribution around zero mean. Nevertheless, the stabilization task was very easy compared to the subsequent tuning on the real system. The main objective of the simulation was to verify the correctness of the programming and the functionality of the algorithm. Figure 6 shows the stabilization waveform for initial conditions θ₀=0.2 m, θ₁=10^∘, and $θ_{2} = - 10$ ^∘. The desired carriage position is indicated in red in the figure. This is because the positioning of the carriage is relatively easy to achieve since the stability of the system does not depend on θ₀ as the only one of the states. The control law was given as $u (t) = N r (t) - K x (t)$ ; r(t) is the desired cart position, $N = - B^{- 1} (A - B K) C^{T}$ , and for this case C=[0.1 0 0 0 0 0].

https://ms.copernicus.org/articles/15/47/2024/ms-15-47-2024-f06

Figure 6Stabilization control of the system using LQR controller.

Download

5.2 Swing-up control simulation

In this section, the simulation results of swing-up trajectories and the subsequent application of forward and reverse control are addressed. The MATLAB numerical solver bvp4c was used to find suitable swing-up trajectories according to the method described in Sect. 4.1. For the double-pendulum model, where the input is the acceleration of the trolley, boundary conditions (24) were considered, except for defining the end position of the cart and its velocity, for which the boundary value is guaranteed by choosing the input as a cosine series (13). A time-dependent linear controller was designed along the nominal swing-up trajectory (with a time step of 10 ms and subsequent linear interpolation between the calculated points). It turned out that, over a certain time interval, the control multiples undergo abrupt changes in magnitude and sign. Since the linear controller was derived from a double-pendulum model where the input is directly the acceleration of the carriage, its proposed intervention also has an acceleration dimension. Therefore, it has to be converted to a force based on the current states according to Eq. (32). In view of this and the fact that the system is often at the limit of drivability or loses it completely at these moments, it was decided to take the reverse steering out of action for a while.

https://ms.copernicus.org/articles/15/47/2024/ms-15-47-2024-f07

Figure 7Feedforward and LQR feedback swing-up and stabilization.

Download

In Fig. 7, the swing-up state behavior is shown. It is clear that the control approach successfully followed the derived trajectory and moved the system from the downward to upward position in a short amount of time. All the states of the system are stable and follow the desired values. The actual and the estimated states of the system converge perfectly, which proves the effectiveness of the dynamic input–output decoupling approach. The abrupt disengagement and engagement of the reverse control caused a disproportionately large impulse response with the eventual consequence of loss of stability and a failed swing-up. Therefore, around the critical interval, the multiples K_c are linearly decreased or increased, as presented in Fig. 8. For the simulation, it is useful to set the duration of the linear region to 0.1 s. If the pendulum rotation approaches the inverse position during the swing-up, it is switched to LQG stabilization.

https://ms.copernicus.org/articles/15/47/2024/ms-15-47-2024-f08

Figure 8Controller gain for swing-up control action.

Download

Trajectories with a duration of 1.85 s were selected for simulation purposes. The progress of such a swing-up is illustrated by the video sequence at the following link: https://youtu.be/zr0fzmVc9Ao (last access: 18 January 2024). In Fig. 9, the process of swing-up control is illustrated by following the pendulums during the motion. The animation shows that the swing-up process effectively finished based on the designed controller.

https://ms.copernicus.org/articles/15/47/2024/ms-15-47-2024-f09

Figure 9Simplified graphic display of the swing-up process.

Download

6 Conclusions

A model of the double-inverted pendulum was obtained, including the assumption of all necessary parameters. Also, thanks to this, a robust and long-lasting stabilization of both pendulums in the inverted position was implemented. This is implemented by the LQR method. The system states are estimated by a non-linear extended Kalman filter.

By solving the boundary value problem with the numerical proper solver, nominal trajectories for the swing-up function have been obtained. The required cart acceleration was implemented by a controller with 2 degrees of freedom – a forward PID control of the cart speed and a feedback LQR controller, whose controller gains were calculated off-line along the nominal trajectories and input. Fortunately, the planned swing-up was achieved despite the chaotic behavior of the real system. The algorithm proved its worth in simulation; a complete swing-up was still realized several times.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Author contributions

AMAJ conceived the idea and mathematical modeling. MTH was responsible the programming and finding the results. ASG Qanber was responsible for the proofreading and financial support.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Review statement

This paper was edited by Daniel Condurache and reviewed by Petr Chalupa and one anonymous referee.

References

Aguiar, A. P.: Nonlinear Control Systems, IST-DEEC PhD Course, Institute for Systems and Robotics, Lisboa, Portugal, http://users.isr.ist.utl.pt/~pedro/NCS2012/ (last access: 29 December 2023), 2011.

Bogdanov, A.: Optimal control of a double inverted pendulum on a cart, Oregon Health and Science University, Tech. Rep. CSE-04-006, OGI School of Science and Engineering, Beaverton, OR, Corpus ID: 18478090, 2004.

Franklin, G. F., Powell, J. D., Emami-Naeini, A., and Powell, J. D.: Feedback control of dynamic systems, Vol. 4. Prentice hall Upper Saddle River, ISBN 978-0-13-349659-8, 2002.

Graichen, K., Treuer, M., and Zeitz, M.: “Swing-up of the double pendulum on a cart by feedforward and feedback control with experimental validation,” Automatica, 43, 63–71, https://doi.org/10.1016/j.automatica.2006.07.023, 2007.

Hedrick, J. K. and Girard, A.: Control of nonlinear dynamic systems: Theory and applications, Controllability and observability of Nonlinear Systems, Berkeley: University of California, 151–181, 2010.

Henmi, T., Deng, M., and Inoue, A.: “Unified method for swing-up control of double inverted pendulum systems,” in: Proceedings of the 2014 International Conference on Advanced Mechatronic Systems, 572–577, https://doi.org/10.1109/ICAMechS.2014.6911611, 2014.

Henson, M. A. and Seborg, D. E.: Nonlinear process control, Prentice Hall PTR Upper Saddle River, New Jersey, ISBN 013625179X, 9780136251798, 1997.

Jaiwat, P. and Ohtsuka, T.: Real-time swing-up of double inverted pendulum by nonlinear model predictive control, in: 5th International Symposium on Advanced Control of Industrial Processes, 290–295, Corpus ID: 19016427, 2014.

Mon, Y.-J. and Lin, C.-M.: “Double inverted pendulum decoupling control by adaptive terminal sliding-mode recurrent fuzzy neural network”, J. Intell. Fuzzy Syst., 26, 1723–1729, https://doi.org/10.3233/IFS-130851, 2014.

Neusser, Z. and Valášek, M.: “Control of the double inverted pendulum on a cart using the natural motion”, Acta Polytech., 53, 883–889, https://doi.org/10.14311/AP.2013.53.0883, 2013.

Nise, N. S.: Control systems engineering, Chapter 6, 240–268, John Wiley & Sons, ISBN 978-1-119-47422-7, 2020.

Qian, Q., Dongmei, D., Feng, L., and Yongchuan, T.: “Stabilization of the double inverted pendulum based on discrete-time model predictive control,” in 2011 IEEE International Conference on Automation and Logistics (ICAL), 243–247, https://doi.org/10.1109/ICAL.2011.6024721, 2011.

Sun, Z., Wang, N., and Bi, Y.: “Type-1/type-2 fuzzy logic systems optimization with RNA genetic algorithm for double inverted pendulum”, Appl. Math. Model, 39, 70–85, https://doi.org/10.1016/j.apm.2014.04.035, 2015.

Zhong, W. and Rock, H.: “Energy and passivity based control of the double inverted pendulum on a cart,” in: Proceedings of the 2001 IEEE International Conference on Control Applications (CCA'01) (Cat. No. 01CH37204), 896–901, https://doi.org/10.1109/CCA.2001.973983, 2001.