Nonlinear control of a swinging pendulum on a wheeled mobile robot with nonholonomic constraints Nikhil Potu Surya Prakash Department of Mechanical Engineering University of Michigan, Ann Arbor, MI-48105, Email:nikhilps@umich.edu Abstract In this paper, we propose a nonlinear control strategy for swinging up a pendulum to its upright equilibrium position by shaping its swinging energy along with regulating the cart to a desired location. While the base of a usual cart-pole system is restricted to move in a straight line, the present system is allowed to move in the x-y plane with a nonholonomic consraint that its allowable velocity is only along its orientation. A simple time invariant control law has been presented and its effectiveness has been demonstrated using numerical experiments. 1 Introduction Most of the controllers that are derived for nonholonomic systems are either time dependent or at kinematic level. But using kinematics alone, the dynamic structure and properties of the system are often left out.In a case like the present system where the dynamics of the pendulum are coupled with the dynamics of the wheeled mobile robot upon which it is appended, kinematic level models and control strategies prove to be futile and there arises a need to depend on the dynamics of the complete system.In an attempt to exploit the rich dynamics of underactuated systems with nonholonomic constraints, such a system has been modelled. Energy shaping is a technique that has been proven to be very effective and has been extensively studied through implementations on various underactuated systems with pendulums. In energy shaping, first a desired energy of the whole system or a part of the system is formulated to which the system’s energy or a part of it is converged. This ensures bringing the states of the system close to the desire states from almost all initial conditions. Once the system is put in the orbit (of that particular energy level), a linear controller is implemented once the states of the system come close to the desired states. In the pendulum’s energy shaping control [4], the pendulum’s energy at the upright equilibrium position is considered and the energy of the whole system is converged to this energy level, which basically is same as bringing the states of the system onto a homoclinic orbit which starts and ends at the same unstable equilibrium point.A lyapunov function can be constructed by taking the square of the error in the desired energy and by deriving a control law based on the error in the energy, it can be shown that the error asymptotically converges to zero. It must be noted that just bringing the states of the pendulum to the homoclinic orbit does not stabilize the system about its upright equilibrium point, since any small disturbance would make the control law to converge the system back to the homoclinic orbit which might require the pendulum to complete one full revolution and then come back to the upright equilibrium point. Therefore, a need to use a linear controller near the unstable equilibrium point arises. Other energy shaping controllers include swinging up an Acrobot to its upright equilibrium position [6], where first a partial feedback linearization of the system is conducted and then an energy shaping controller is implemented. Another controller for an Acrobot [5] uses a slightly modified lyapunov function which along with the square of energy includes a term quadratic in the angular position and angular velocity of the bottom link. The idea there was to make the Acrobot behave like a single pendulum and swing it to the upright position. Although, our present system is closer to a cart-pole [7] where the pendulum’s energy alone is considered and a pd controller is added to regulate the position of the cart to a desired position. Other notable systems include a Pendubot [8], Furuta pendulum [9,10], inertia wheel pendulum [11] etc. 1 arXiv:1811.07323v1 [cs.RO] 18 Nov 2018 Figure 1: Schematic of a pendulum on a wheeled mobile robot 2 Model Description of the system The system in consideration (see fig [1]) has a wheeled mobile robot (WMR) carrying a pendulum mounted on top which is allowed to swing in the plane formed by the unit vector normal to the plane in which the WMR moves and the unit vector along the direction of motion of the WMR. the WMR is a differntial drive robot with two wheels to which torques can be applied to accelarate and rotate the robot. A smooth caster is introduced to make the WMR a stable system. The assumptions on the system are that the wheels are only allowed to roll without slipping and the system cannot move in a direction perpendicular to its orientation which form the nonholonomic constraints on the system. The degrees of freedom of the system are the cartesian coordinates of the positions of the cart (x and y), the orientation or the yaw angle ( ψ ) of the WMR (with positive X axis of the inertial frame), the orientations of the wheels ( φ l - left wheel and φ r - right wheel) and the angle made by the pendulum with the vertical ( θ ). Therefore the system evolves on a manifold whose configuration space is SE (2) × S 1 × S 1 × S 1 the state vector of the system is [ x, y, ψ, φ r , φ l , θ, ̇ x, ̇ y, ̇ ψ, ̇ φ r , ̇ φ − l, ̇ θ ] . We assume that the motion of the skate is restricted to a plane with the same gravitational potential throughout and hence there will not be any change in the potential energy of the WMR as the it moves. The nonholonomic constraints on the motion are given by the conditions that all the velocity of the WMR is along the direction in which it is oriented (no lateral slippage) and both the wheels roll without slipping.We now show how one can arrive at the nonholonomic constraints and derive the equations of motion of the system. The position vectors of different parts of the system expressed in the inertial frame are given by r center of mass =( x, y, − h − l ) (1) r right wheel =( x + dsinψ, y + dcosψ, − h − l ) (2) r lef t wheel =( x − dsinψ, y − dcosψ, − h − l ) (3) r pendulum bob =( x + lsinθcosψ, y + lsinθsinψ, l − lcosθ ) (4) Here h always remains constant and does not appear in the equations of motion. l is the length of the pendulum. Also since the potential energy of the WMR always remains constant, this can be omitted from the formulation. Differentiating the position vectors, we obtain the velocities of the parts which are also expressed in the inertial frame. v center of mass =( ̇ x, ̇ y, 0) (5) v right wheel =( ̇ x + d ̇ ψcosψ, y − d ̇ ψsinψ, 0) (6) v lef t wheel =( ̇ x − d ̇ ψcosψ, y + d ̇ ψsinψ, 0) (7) v pendulum bob =( x + l ̇ θcosθcosψ − l ̇ ψsinθsinψ, ̇ y + l ̇ θcosθsinψ + l ̇ ψsinθcosψ, l ̇ θsinθ ) (8) 2 Therefore, the constraints can now be given by the following equations − ̇ xsinψ + ̇ ycosψ = 0 (9) ̇ xcosψ + ̇ ysinψ + d ̇ ψ − R ̇ φ r = 0 (10) ̇ xcosψ + ̇ ysinψ − d ̇ ψ − R ̇ φ l = 0 (11) where 2d is the distance between the wheels assuming that the WMR is symmetric and R is the radius of the wheels. With the positions,velocities and the angular velocities set up, its quite straight forward to find the kinetic and potential energies of the system and thereby the Lagrangian of the system. The total kinetic energy (T) of the system can therefore be expressed as T = 1 2 J w 1 ̇ φ 2 r + 1 2 J w 2 ̇ ψ 2 + 1 2 m w (( ̇ x + d ̇ ψcosψ ) 2 + ( ̇ y − d ̇ ψsinψ ) 2 ) + 1 2 m w (( ̇ x − d ̇ ψcosψ ) 2 + ( ̇ y + d ̇ ψsinψ ) 2 )+ 1 2 m c ( ̇ x 2 + ̇ y 2 ) + 1 2 J c ̇ ψ 2 + 1 2 m p (( l ̇ θcosθ − l ̇ ψsinθsinψ ) 2 + ( l ̇ θcosθsinψ + l ̇ ψsinθcosψ ) 2 + ( − l ̇ θsinθ ) 2 ) (12) and the potential energy(V) of the system by taking the datum at the pendulum bob when it is vertical is given by V = − m p g ( l − lcosθ ) − ( m c + 2 m w ) g ( l + h ) (13) where m p is the mass of the pendulum bob, m c is the mass of the WMR without the wheels, m w is the mas of each wheel, J w 1 and J w 2 are the mass moments of inertia of each wheel about the major and minor axes respectively and J c is the mass moment of inertia of the WMR without the wheels about the vertical axis passing through its center of gravity. Hence the lagrangian (L) is given by L = T − V The present system is subject to three nonholonomic constraints which can be written in the form A ∗ ( q ) ̇ q = 0 (14) Here A ∗ ( q ) is a 3 × 6 matrix and q is a 6 × 1 column vector. At any configuration q, the set of all possible virtual displacements is defined to be the subspace of the tangent space to the configuration manifold at q consisting of vectors δq that satisfy the constraints, i.e., the subspace D q defined by D q = { δq ∈ T q Q | A ∗ ( q ) .δq = 0 } The system is controlled by two inputs i.e., torques on each wheel ( τ l on the left wheel and τ r on the right wheel). Choosing q = ( r, s ) where r and s are local coordinates and the forces enter only through the r coordinates such that equation (14) can be rewritten as δs a + A a α δr = 0 The Lagrange–d’Alembert equations of motion for the system are those determined by δ ∫ b a L ( q i , ̇ q i ) dt = 0 where we choose variations δq ( t ) of the curve q(t) that satisfy δq ( t ) ∈ D q for each t, a ≤ t ≤ b , and δq ( a ) = δq ( b ) = 0 . This principle is supplemented by the condition that the curve q(t) itself satisfy the constraints. we take the variation δq before imposing the constraints; that is, we do not impose the constraints on the family of curves defining the variation. The usual arguments in the calculus of variations show that this constrained variational principle is equivalent to the equations ∫ b a ( d dt ∂L ∂ ̇ q − ∂L ∂q − f ) δqdt = 0 (15) substituting the variations in local coordinates we obtain the equations of motion to be d dt ∂L ∂ ̇ r − ∂L ∂r − f = A ( r, s )( d dt ∂L ∂ ̇ s − ∂L ∂s ) where f = ( τ l , τ r ) Using the nonholonomic constraints (9-11) to eliminate ̇ φ r and ̇ φ l , the Lagrangian can be reduced and the resulting system is found analogous to a pendulum on a knife edge whose configu- rati0on manifold is SE (2) × S 1 The equations of motion for the reduced system can be established 3 as follows ( M + m ) ̈ x + ml ̈ θcosθcosψ − ml 2 ̇ θ 2 sinθcosψ − ml 2 ̇ θ ̇ ψcosθsinψ + λsinψ = F cosψ (16) ( M + m ) ̈ y + ml ̈ θcosθsinψ − ml 2 ̇ θ 2 sinθsinψ + ml 2 ̇ θ ̇ ψcosθcosψ − λcosψ = F sinψ (17) ml 2 ̈ θ + mlcosθ ( ̈ xcosψ + ̈ ysinψ ) − ml 2 ̇ ψ 2 sinθcosθ − mglsinθ = 0 (18) ( J + ml 2 sin 2 θ ) ̈ ψ + ml 2 ̇ θ ̇ ψsin 2 θ = τ (19) where M is the total mass of the WMR, J is the effective mass moment of inertia of the Here λ = M [2 ̇ ψ ( ̇ xcosψ + ̇ ysinψ ) + l ̇ θ ̇ ψcosθ ] is the lateral frictional force acting on the WMR which can easily be computed using Newton’s laws. F = ( τ l + τ r ) / 2 d is the effective force on the WMR and τ = τ r − τ l is the effective moment on the system. 3 Control design In this section we present a control strategy to bring the pendulum to its upright position along with regulating the WMR to a desired position. Without loss of generality we assume the desired position to be the origin in our further discussions. The swing up energy of the pendulum is defined as follows E = 1 2 ml 2 ̇ θ 2 − mgl (1 − cosθ ) (20) It can easily be seen that the swing up energy when the pendulum is stationary with respect to the WMR at its upright position E = 0 . Once the states of the pendulum are such that they are on this swing up energy level E = 0 , the pendulum reaches its upright position with with zero velocity in a finite time. taking the derivative of the swing up energy we obtain ̇ E = ml 2 ̇ θ ̈ θ − mglsinθ ̇ θ = ( ml 2 ̈ θ − mglsinθ ) ̇ θ (21) (16) × cosψ + (17) × sinψ gives us ( M + m )( ̈ xcosψ + ̈ ysinψ ) + ml ̈ θcosθ − ml ̇ θ 2 sinθ = F (22) Substituting (18) and (22) in (20) gives us ̇ E = [ mlcosθ ( ̈ xcosψ + ̈ ysinψ ) − ml 2 ̇ ψ 2 sinθcosθ ] ̇ θ = ml [( ̈ xcosψ + ̈ ysinψ ) − l ̇ ψ 2 sinθ ] cosθ ̇ θ (23) It is easy to see that ̈ xcosψ + ̈ ysinψ is the acceleration of the WMR along the direction of its orientation ( a l ). F can be applied in such a way that a l = l ̇ ψ 2 sinθcosθ − k E ̇ θcosθE (24) for any positive value k E . Such a value ensures that the swing up energy exponentially converges to the orbit E = 0 for almost all initial conditions except the ones starting at the downright position with zero velocity with respect to the WMR, since now ̇ E = − mlk E ̇ θ 2 cos 2 θ (25) Constructing a lyapunov function in the following way proves that the swing up energy converges to zero. V E = 1 2 E 2 (26) The lyapunov function here is a simple quadratic function of the energy which is a positive definite function.The time rate of change of the lyapunov function is given by ̇ V E = E ̇ E = − mlk E E 2 ̇ θ 2 cos 2 θ (27) 4 Let D be the set of feasible initial conditions for the above control law to work.Then D can be defined as D = { ( θ, ̇ θ ) | ̇ θ 6 = 0 and θ 6 = (2 n + 1) π, n ∈ Z } (28) Define S ⊆ D such that S = { ( θ, ̇ θ ) | ̇ V E = 0 } = { ( θ, ̇ θ ) | ̇ θ = 0 or θ = (2 p + 1) π 2 , p ∈ Z } (29) There are no trajectories in S that also lie in D apart from the trivial trajectory E = 0 , therefore by LaSalle’s invariance principle, the swing up energy asymptotically converges to the desired energy level for all conditions in D. An F defined in the following way makes a l equal to (24) F = ( M + msin 2 θ ) a d + ν 1 a d = l ̇ ψ 2 sinθcosθ − k E ̇ θcosθE ν 1 = mgsinθcosθ + ml ̇ ψ 2 sinθcos 2 θ − ml ̇ θ 2 sinθ (30) We can now add a PD term to F defined above to regulate the cart to the origin. Since the nonholonomic constraints allow F to be only in the direction of orientation of the WMR, modifying F alone does not regulate the system to the origin.Therefore we design τ in such a way that the orientation of WMR is always aiming away from the origin. So the desired orientation at any instant can be defined as follows tan ( ψ d ) = y/x (31) We can then obtain the derivatives to be ̇ ψ d = x ̇ y − y ̇ x x 2 + y 2 (32) ̈ ψ d = ( x 2 + y 2 )( x ̈ y − y ̈ x ) − ( x ̇ y − y ̇ x )(2 x ̇ x + 2 y ̇ y ) ( x 2 + y 2 ) 2 (33) The errors in orientation and its derivatives can be defined as e ψ = ψ − ψ d (34) e ̇ ψ = ̇ ψ − ̇ ψ d (35) We now define a lyapunov function to arrive at a controller that can asymptotically converge the orientation to the desired orientation V ψ = 1 2 k ψ ( ψ − ψ d ) 2 + 1 2 k ̇ ψ ( ̇ ψ − ̇ ψ d ) 2 (36) for some positive k ψ and k ̇ ψ ̇ V ψ = k ψ ( ̇ ψ − ̇ ψ d )( ψ − ψ d ) + ( ̈ ψ − ̈ ψ d )( ̇ ψ − ̇ ψ d ) (37) To make ̇ V ψ to be negative semi definite, we define τ = ml 2 ̇ θ ̇ ψsin 2 θ + ( J + ml 2 sin 2 θ )( − k ψ e ψ − k ̇ ψ e ̇ ψ ) (38) Define a set S 1 as follows S 1 = { ( ψ, ̇ ψ ) | ̇ V ψ = 0 } = { ( ψ, ̇ ψ ) | ̇ ψ = ̇ ψ d } (39) there are no trajectories in S 1 apart from the trivial trajectory ψ = ψ d and hence by LaSalle’s invariance principle ψ converges to ψ d asymptotically. We now define the PD terms to be added 5 to F so that the system is regulated to the origin. We restrict the PD gains to be in such a way that the dynamics is critically/over damped to prevent ̇ ψ d from becoming infinity when the WMR approaches the origin with nonzero velocity. F = ( M + msin 2 θ ) a d + ν 1 − k v e v − k p e p (40) for some k v and k p satisfying the above mentioned conditions. The velocity and position like error terms are defined as follows in a slightly unconventional way, the relevance of which will be understood in the next section. e v = ̇ xcosψ + ̇ ysinψ (41) e p = xcosψ + ysinψ (42) 4 Numerical simulations In this section we present numerical simulations of the system with initial conditions [ x, y, ψ, ̇ x, ̇ y, ̇ ψ.θ, ̇ θ ] = [20 , 30 , π, 0 . 5 , 0 , − 1 . 5 , π 4 , 0] , gains [ k E , k p , k v , k ψ , k ̇ ψ ] = [1 , 0 . 8 , 0 . 16 , 1 , 2] and system parameters [ M, m, J, l, g ] = [1 , 0 . 1 , 0 . 01 , 1 , 9 . 81] Figure 2: Evolution of states with time 5 Stability Analysis Since modifying F in (40) also modifies the way E evolves with time, it is not trivial to show that the desired task is achieved using Lyapunov analysis. From the nonholonomic constraint that the WMR cannot move in the direction perpendicular to its orientation, we can solve for ψ explicitly as tanψ = ̇ y ̇ x (43) In section 3, we have also shown that the actual orientation asymptotically converges to the desired orientation. Once the orientation converges tanψ = tanψ d = y x (44) 6 Figure 3: Plots showing the path, phase portrait, total system energy versus time and the swing up energy versus time Therefore after the convergence y x = ̇ y ̇ x (45) On rearranging, we obtain that ̇ xy − ̇ yx = 0 (46) Substituting (46) in (32), we obtain that ̇ ψ d and ̈ ψ d both converge to zero, implying that, ψ d and therefore ψ tend to a constant value. Assuming that at this constant value, tanpsi is finite, it can be seen from (44) that y = kx (47) ̇ y = k ̇ x (48) ̈ y = k ̈ x (49) k = tanψ (50) Substituting (47) through (50) in the control law, at steady state we obtain ( ̈ x + k v ̇ x + k p x ) √ k 2 + 1 = − E ̇ θcosθ M + msin 2 θ (51) which is an equation in one dimension and is analogous to the motion of cart pole. The proof of stability of this system has been presented using Floquet theory in [7] which the reader is encouraged to refer. [1]Tang C, Miller PT, Krovi VN, Ryu J, Agrawal SK. Kinematic Control of Nonholonomic Wheeled Mobile Manipulator: A Differential Flatness Approach. ASME. Dynamic Systems and Control Conference, ASME 2008 Dynamic Systems and Control Conference, Parts A and B ():1117- 1124. doi:10.1115/DSCC2008-2253. [2]Araki, N.; Okada, M.; Konishi, Y.; Ishigaki, and H.;. Parameter identification and swing-up control of an acrobot system. International Conference on International Technology, 2005. 7 [3]Isabelle Fantoni and Rogelio Lozano. Non-linear Control for Underactuated Mechanical Sys- tems. Communications and Control Engineering Series. Springer-Verlag, 2002. [4]K.J.Astrom, K.Furuta: Swinging up a pendulum by energy control,Automatica Volume 36, Issue 2, February 2000, Pages 287-295 [5]Arun D. Mahindrakar and Ravi N. Banavar. A swing-up of the acrobot based on a simple pen- dulum strategy. International Journal of Control, 78(6):424–429, April 2005. [6]Mark Spong. The swingup control problem for the acrobot. IEEE Control Systems Magazine, 15(1):49–55, February 1995. [7]Chung Choo Chung and John Hauser. Nonlinear control of a swinging pendulum. Automatica, 31(6):851–862, June 1995. [8]I.Fantoni, R.Lozano, and Mark W. Spong: Energy Based Control of the Pendubot,IEEE Trans- actions on Automatic Control, VOL. 45, NO. 4, APRIL 2000, Pages 725-729. [9]Johan Åkesson, Karl Johan Åström, "Safe Manual Control of the Furuta Pendulum," Proceed- ings of the 2001 IEEE international Conference on Control Applications, pp.499-502, 2001 [10]M. Iwase, K. J. Åström, K. Furuta and J. Åkesson "Analysis of safe manual control by using furuta pendulum," Proceedings of the 2006 IEEE International Conference on Control Applica- tions, pp.568-572, 2006 [11]Block D.J., Åström K.J., Spong M.W.: The reaction wheel pendulum. Synth. Lect. Control Mechatron. 1(1), 1–105 (2007) [12]R. M. Murray,M. Rathinam,W. sluis.Differential flatness of mechanical systems:A catalog of prototype systems.ASME [13]Bloch A.M. Nonholonomic Mechanics and Control, 2015 [14]M. van Nieuwstadt, M. Rathinam, R.M. Murray. Differential flatness and absolute equivalence [15]G. Walsh, D. Tilbury, S. Sastry, R. Murray,P. Laumond.Stabilization of systems with nonholo- nomic constraints. [16]J. Imura, K. Kobayashi, T. Yoshikawa, "Exponential stabilization problem of nonholonomic chained systems with specified transient response", Decision and Control 1996. Proceedings of the 35th IEEE Conference on, vol. 4, pp. 4733-4738 vol.4, 1996, ISSN 0191-2216. [17]A. M. Bloch, P. S. Krishnaprasad, J. E. Marsden, and R. M. Murray. Nonholonomic mechanical systems with symmetry.Technical Report CIT/CDS 94-013,California Institute of Technology,1994. [18]L.Bushnell, D. Tilbury and S. Sastry.Steering Three-Input Chained Form Nonholonomic Sys- tems Using Sinusoids:The Fire Truck Example.Proceedings of the European Control Conference Groningen, The Netherlands June 28 - July 1, 1993 [19]J.Hauser,S.Sastry,and G.Meyer.Nonlinear control design for slightly nonminimum phase sys- tems|application to V/STOL aircraft.Automatica, 28(4):665679, 1992. [20]I. Kanellakopoulos, P. V. Kokotovic, and A. S. Morse. Systematic design of adaptive controllers for feedback linearizable systems.IEEE Transactions on Automatic Control, 36(11):12411253, 1991. [21]J-C. Latombe.Robot Motion Planning.Kluwer Academic Publishers, Boston, 1991. [22]P.Martin.Contribution a l’Etude des Systems Differentiel lement Plats.PhD thesis,L’Ecole Na- tionale Superieure des Mines de Paris, 1992. [23]P. Martin.Endogenous feedbacks and equivalence.In Mathematical Theory of Networks and Systems,Regensburg, Germany, August 1993. [24]P. Martin.Personal communication, 1994. [25]P. Martin, S. Devasia, and B. Paden. A dierent look at output tracking:Control of a VTOL aircraft.In Proc. IEEE Control and Decision Conference,pages 2376 2381, 1994. [26]P. Martin and P. Rouchon.Feedback linearization and driftless systems.Mathematics of Control, Signals, and Systems, 7(3):235254, 1994. [27]M Fliess J Levine P Martin and P Rouchon On differentially flat nonlinear systems Comptes Rendus des Seances de lAcademie des Sciences "315:619-624 1992, Serie I. [28]M. Yamakita, M.Iwashiro, Y.Sugahara, K. Furuta, "Robust Swing Up Control of Double Pen- dulum," Proceedings of the American Control Conference, pp.290-295, vol.1, 1995 8