This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 1 (1) Université de Nantes, IRCCyN, 1, rue de la Noë - BP 92 101 - 44321 Nantes Cedex 03, France (2) HAPTION S.A, Atelier Relais de Soulgé Route de Laval, 53210 Soulgé sur Ouette, France (3) Laboratoire Central des Ponts et Chaussées, Route de Bouaye BP 4129, 44341Bouguenais Cedex, France Abstract—Off-line robot dynamic identification methods are mostly based on the use of the inverse dynamic model, which is linear with respect to the dynamic parameters. This model is sampled while the robot is tracking reference trajectories that excite the system dynamics. This allows using linear least-squares techniques to estimate the parameters. The efficiency of this method has been proved through the experimental identification of many prototypes and industrial robots. However, this method requires the joint force/torque and position measurements and the estimate of the joint velocity and acceleration, through the bandpass filtering of the joint position at high sampling rates. The proposed new method requires only the joint force/torque measurement. It is a closed-loop output error method where the usual joint position output is replaced by the joint force/torque. It is based on a closed-loop simulation of the robot using the direct dynamic model, the same structure of the control law, and the same reference trajectory for both the actual and the simulated robot. The optimal parameters minimize the 2-norm of the error between the actual force/torque and the simulated force/torque. This is a non-linear least-squares problem which is dramatically simplified using the inverse dynamic model to obtain an analytical expression of the simulated force/torque, linear in the parameters. A validation experiment on a 2 degree-of-freedom direct drive robot shows that the new method is efficient. Keywords — Identification, closed-loop output error, least-squares methods, , robot dynamics. I. INTRODUCTION HE usual identification method based on the inverse dynamic identification model (IDIM) and least- squares (LS) technique has been successfully applied to identify inertial and friction parameters of several robotic prototypes and industrial robots [1], [2], [3], [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], amongst others. Good results can be obtained provided a well-tuned derivative bandpass filtering of joint position to calculate the joint velocities and accelerations is used. Another approach is to minimize a quadratic error between an actual output and a simulated output of the system, assuming both the actual and simulated systems have the same input. This is known as an output error (OE) identification method [16], [17]. The optimal values of the parameters are calculated using non-linear programming algorithms to solve a non-linear least-squares problem. The output is given by a state-space model output equation, which is typically the joint position for mechanical systems. Difficulties arise from the choice of initial conditions, resulting in multiple, local solutions [18]. The OE method has been used to identify electrical parameters of a synchronous machine, and a A new closed-loop output error method for parameter identification of robot dynamics M. Gautier (1), A. Janot (2) and P-O Vandanjon (3) T This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 2 comparison with the IDIM-LS method showed very similar results [19]. Both IDIM and OE methods require the joint position and the joint force/torque measurements. The proposed new identification method needs only the joint force/torque measurements. It is based on a closed-loop simulation using the direct dynamic model while the optimal parameters minimize the 2- norm of the error between the actual force/torque and the simulated force/torque, assuming the same control law and the same reference trajectory. This non-linear least-squares problem is dramatically simplified using the inverse dynamic model to formulate the simulated force/torque as an algebraic function linear in relation to the parameters. This paper describes the new identification method and experimental results obtained using a 2 DOF robot. A condensed version of this work has been presented in [20]. This paper contains detailed proofs to enlighten the theoretical understanding of the method and gives additional experimental results to show the practical efficiency of the method. The paper is organized as follows: section II reviews the usual identification technique of the dynamic parameters of the robot. Section III presents the output error method. The new identification method is presented in section IV. The modeling of the SCARA prototype robot is presented in section V. This direct drive prototype is very well suitable for the study of the method because it emphasizes non linear coupling while it is divided by the squared high gear ratio for industrial robots. The experimental results are given in section VI. Finally, section VII is the conclusion. II. IDIM: INVERSE DYNAMIC IDENTIFICATION MODEL TECHNIQUE The inverse dynamic model (IDM) of a rigid robot composed of n moving links calculates the motor torque vector idm τ , as a function of the generalized coordinates and their derivatives. It can be obtained from the Newton-Euler or the Lagrangian equations [13], [21]. It is given by the following relation: = ( ) + ( , ) idm τ M q q N q q && & (1) Where q , q& and q&& are respectively the ( ) x n 1 vectors of generalized joint positions, velocities and accelerations, ( ) M q is the ( ) x n n robot inertia matrix, and ( , ) N q q& is the ( ) x n 1 vector of centrifugal, Coriolis, gravitational and friction forces/torques. The choice of the modified Denavit and Hartenberg frames attached to each link allows a dynamic model that is linear in relation to a set of standard dynamic parameters, st χ [3], [22]: ( ) idm st st τ IDM q,q,q χ = & && (2) Where ( ) st IDM q,q,q & && is the ( ) x s n N jacobian matrix of idm τ , with respect to the ( ) x1 s N vector st χ of the This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 3 standard parameters given by: T T T T ... 1 2 n st st st st χ χ χ χ   =   with: T j j st j j j j j j j j j j j j j off XX XY XZ YY YZ ZZ MX MY MZ M Ia Fv Fc χ τ   =   (3) where: • j j j j j j XX , XY , XZ , YY , YZ , ZZ are the six components of the inertia matrix, j j J , of link j at the origin of frame j, • j j j MX , MY , MZ are the components of the first moments, j j MS , of link j, • j M is the mass of link j, • j Ia is a total inertia moment for rotor and gears of actuator j. • j Fv , j Fc are the viscous and Coulomb friction parameters of joint j. • j off Fsj tj τ Of Of = + is an offset parameter where Fsj Of is the dissymmetry of the Coulomb friction with respect to the sign of the velocity and tj Of is due to the current amplifier offset which supplies the motor. • 14 s N * n = is the number of standard parameters. The base parameters are the minimum number of dynamic parameters from which the dynamic model can be calculated. They are obtained from the standard inertial parameters by eliminating those which have no effect on the dynamic model, and by regrouping some others by means of linear relations. They can be determined using simple closed-form rules [22] or a numerical method based on the QR decomposition [23]. The minimal inverse dynamic model can be written as: ( ) idm τ IDM q,q,q χ = & && (4) Where: ( ) IDM q,q,q & && is the ( ) x n b matrix of the minimal set of basis functions of the rigid body dynamics, (5) χ is the ( ) x b 1 vector of the b base parameters. (6) Because of perturbations due to noise measurement and modeling errors, the actual force/torque τ differs from idm τ by an error, e , such that: ( ) idm τ e IDM q,q,q χ e τ = + = + & && (7) Equation (7) represents the Inverse Dynamic Identification Model (IDIM). We consider the off-line identification of the base dynamic parameters χ , given measured or estimated off-line data for τ and ( ) q, q, q & && , collected while the robot is tracking some planned This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 4 trajectories. Usually, the signals available from the robot controller are the joint position measurement and the ( ) x n 1 control signal vector vτ , calculated according to the control law. Then ( ) q, q, q & && in (7) are estimated with ( ) ˆ ˆ ˆq, q, q & && , respectively, obtained by bandpass filtering the measure of q [9]. The derivatives are calculated off-line without phase shift, using a central difference algorithm of the lowpass filtered position ˆq . The filtered position ˆq is calculated off-line with a non causal zero-phase digital filter by processing the input data, q , through a lowpass Butterworth filter in both the forward and reverse direction using the filtfilt procedure from Matlab. The control signal, vτ , is connected to the input current reference of the current closed-loop of the amplifiers which supplies the motors. Assuming that the current closed-loop has a bandwidth greater than 500Hz, then its transfer function is equal to its static gain, c K , in the frequency range (less than 10Hz) of the rigid robot dynamics. Then, the actual force/torque, τ , is calculated with the relation: g v τ τ τ = (8) where: gτ , is the ( ) xn n diagonal matrix of the drive gains, with: r c g K K K τ τ = (9) where: r K , is the ( ) xn n gear ratios diagonal matrix of the joint drive chains ( m r q K q = & & , with m q& , the velocity on the motor side), c K , is the ( ) xn n static gains diagonal matrix of the current amplifiers, Kτ , is the ( ) xn n diagonal matrix of the electromagnetic motor torque constants. Those parameters have a priori values, given by manufacturers, which can be checked with special tests [24]. The inverse dynamic identification model (IDIM) (7) is calculated at a frequency measurement mf , using samples of ( ) ˆ ˆ ˆq, q, q & && to calculate ( ) ˆ ˆ ˆ IDM q,q,q & && and samples of vτ to calculate τ with (8), at different times kt , m k 1,...,n = , while the robot is tracking a reference trajectory ( ) r r r q ,q ,q & && , during the time length obs T , of the trajectory. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 5 The equations of each joint are regrouped together on all the trajectory to get an over-determined linear system such that: ( ) ( ) fm fm fm ˆ ˆ ˆ Y τ W q,q,q χ ρ = + & && (10) With: ( ) m 1 fm j 1 j fm fm n fm j n Y (t ) Y τ ... , Y ... Y (t ) τ τ         = =             (11) ( ) ( ( ) ( ) ( )) ( ( ) ( ) ( ) m m m 1 j fm 1 1 1 j fm fm n j fm n n n ˆ ˆ ˆ W IDM q t ,q t ,q t ˆ ˆ ˆ W q,q,q ... , W ... ˆ ˆ W ˆ IDM q t ,q t ,q t         = =             & && & && & && (12) where: ( ( ) ( ) ( )) j k k k ˆ ˆ ˆ IDM q t ,q t ,q t & && is the jth row of the ( ) x n b matrix of the basis functions, ( ( ) ( ) ( )) k k k ˆ ˆ ˆ IDM q t ,q t ,q t & && , (5), j fm Y and j fm W represent the m n equations of joint j , * m obs m n T f = is the number of sample measurements. The notation ( ) ( ) ( ) fm fm ˆ ˆ ˆ ˆ ˆ ˆ W IDM q,q,q W q,q,q = & && & && , will be used to recall that fm W , is calculated with a sampling of ( ) ˆ ˆ ˆ IDM q,q,q & && . In order to eliminate high frequency force/torque ripple in τ , and to window the identification frequency range into the model dynamics, a parallel decimation procedure lowpass filters in parallel fm Y and each column of fm W and resamples them at a lower rate, keeping one sample over dn . This parallel decimation can be carried out with the Matlab decimate function, where the lowpass filter cut-off frequency is equal to 0.8* /(2* ) m d f n . After the data acquisition procedure and the parallel decimation of (10), we obtain the over- determined linear system: ( ) ( ) ˆ ˆ ˆ Y τ W q,q,q χ ρ = + & && (13) where: • ( ) Y τ is the x1 ( r ) vector of measurements, built from the actual force/torque τ , • ( ) ˆ ˆ ˆ W q,q,q & && is the x ( r b ) observation matrix, built from the estimated values ( ) ˆ ˆ ˆq,q,q & && of ( ) q, q, q & && . This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 6 • ρ is the x1 ( r ) vector of errors. • m d r n* n / n = is the number of rows in (13). In Y and W , the equations of each joint are grouped together such that: , 1 1 n n Y W Y ... W ... Y W         = =             (14) where j Y and j W represent the m d n / n equations of joint j . The ordinary LS (OLS) solution ˆχ minimizes the squared 2-norm 2 ρ of the vector of errors. Using the base parameters and tracking “exciting” reference trajectories [25], we get a full rank and well conditioned matrix W . The LS solution ˆχ is given by: ( ) ( ) 1 T T ˆχ W W W Y W Y − + = = (15) It is computed using the QR factorization of W . Standard deviations iˆχ σ , are estimated using classical results from statistics under the assumptions that W is a deterministic matrix, according to the data filtering procedure described above, and ρ , is a zero-mean additive independent Gaussian noise, with a covariance matrix ρρ C , such that: T 2 ρρ ρ r ( ) σ C E ρρ I = = (16) where E is the expectation operator and rI , the ( ) x r r identity matrix. An unbiased estimation of the standard deviation ρ σ is: 2 2 ρ σ (r b) ˆ ˆ Y Wχ = − − (17) The covariance matrix of the estimation error is given by: T 2 T 1 χχ ρ [( )( ) ] σ ( ) ˆˆ ˆ ˆ ˆ C E χ χ χ χ W W − = − − = (18) i 2 χ χχ σ C ( ) ˆ ˆˆ i,i = is the ith diagonal coefficient of χχ ˆˆ C . The relative standard deviation ri χ %σˆ is given by: ri i χ χ i %σ 100σ χ ˆ ˆ ˆ = , for iχˆ ≠ 0 (19) The OLS can be improved by taking into account different standard deviations on joint j equations errors [9]. Each equation of joint j in (13), (14), is weighted with the inverse of the standard deviation of the error calculated from OLS solution of the equations of joint j , given by: ( ) ( ) ( ) j j j j j ˆ ˆ ˆ Y τ W IDM q,q,q χ ρ = + & && (20) This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 7 This weighting operation normalises the errors in (13) and gives the weighted LS (WLS) estimation of the parameters. This identification method is illustrated in Fig. 1. Compared with the OE method described in the following section III, the use of IDIM, which is an analytical function of ( ) q,q,q & && , is particularly interesting because it does not require the integration of the direct dynamic model (21). Moreover, ˆχ is a one step linear LS solution which does not need initial conditions. However, the calculation of the velocities and accelerations are required using well-tuned bandpass filtering of the joint position [9]. Robot ( ) Inverse Dynam ic Identification M odel ID IM ˆ ˆ ˆ IDM q ,q ,q & && ˆ ˆ ˆ, ,& && q q q ( ) ( ) fm ˆ ˆ ˆ W IDM q,q,q & && Linear LS 2 ˆ min - Y W χ χ χ = ( ) ( ) ˆ ˆ ˆ ( ), , , Y W IDM q q q τ & && ( ) q t ( )t τ Control law ˆχ ( ) r r r q ,q ,q & && obs T sampling ( ) bandpass filtering fm + lowpass filtering + downsampling sampling(fm ) ( ) fm Y τ Fig. 1. IDIM LS identification scheme. III. THE OUTPUT ERROR METHOD (OE) The OE identification methods minimize a quadratic error between an actual output y , and a simulated output sy , of the system, assuming both the actual and the simulated systems have the same input. This approach can be implemented in an open-loop form, [17], [26], or in a closed-loop form, [27], [28]. Considering a closed-loop controlled robot, the input, in the open loop scheme shown in Fig. 2, is the actual force/torque τ , and the input, in the closed-loop scheme shown in Fig. 3, is the reference trajectory ( ) r r r q ,q ,q & && . Because the open loop simulation of unstable robotic systems is very sensitive to the initial state conditions and to the errors in numerical algorithms which solve the differential equations, it is more suitable to choose the closed-loop form. In both cases, the output is given by a state-space model output equation. Considering a robot and This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 8 taking the measured joint position as the output, the actual output is, y q = , and the simulated output is, s ddm y q = , as shown in Fig. 2 and Fig. 3, where ddm q (t ), is the solution of the differential equation given by the Direct Dynamic Model (DDM). The DDM can be obtained by writing the IDM equation (1), as following: ( , ) = - ( , , ) ddm ddm ddm ddm ddm M q q τ N q q χ χ && & (21) where: ( , ) ddm M q χ and ( , , ) ddm ddm N q q χ & depend on an estimation of the base parameters χ , ddm τ , is the force/torque input of the DDM. The function ( ) ddm q t,χ , is the result of the integration of the linear implicit differential equation (21) which can be written as a non-linear state-space model: s s s s G( x )x f ( x ,u ) = & (22) where: ddm s ddm q x q   =     & , is the ( ) 2 x1 * n state-space vector, s ddm u τ = , is the ( x1) n control input, x x G ( , ) - ( , , ) n n n ddm s s s n n ddm ddm ddm ddm I 0 q ( x ) , f ( x ,u ) 0 M q τ N q q χ χ     = =         & & (23) where, x n n 0 , is a ( x ) n n , matrix of zeros. The linear output equation is given by: s s s s s y C x D u = + (24) Taking the measure of joint position as the output, s ddm y q = , we get: [ ] x2 s n n *n C I 0 = , is the, ( x2* ) n n , output matrix, (25) x s n n D 0 = , is the, ( x ) n n , direct feedthrough matrix. (26) Hence, for robotic systems, an OE identification method is based on the integration of the Direct Dynamic Model. The optimal solution ˆχ , minimizes the quadratic criterion ( ) J χ , given by: ( ) ( ) ( ) T 2 s s s J Y Y Y Y Y Y χ = − = − − (27) where: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 9 Y and sY , are vectors obtained by filtering the vectors of samples fm Y and Sfm Y , respectively, where the equations of each joint are grouped together, with: m 1 fm j 1 j fm fm n fm j n Y q (t ) Y ... , Y ... Y q ( t )         = =         =     , 1 Sfm ddm 1 j Sfm Sfm n Sfm ddm k Y q ( t ) Y ... , Y ... Y q (t )         = =             (28) The minimization of ( ) J χ , (27), is a non-linear least-squares problem. The estimation of the parameters can be computed using algorithms such as the gradient method, the Newton methods or the Levenberg Marquardt method. These methods are based on a first or second order Taylor’s expansion of ( ) J χ . Actual Robot Direct Dynamic Model (DDM) ddm ddm ddm ddm ddm M(q ) q N(q , q ) τ = − && & Non Linear LS 1 min 2 2 s ˆ Y Y χ χ = − ( )t τ Control law r r r q q q          & && ˆχ Sampling and filtering ( ), ( ) s ddm Y q Y q ( ) s ddm y q t = obs T ( ) y q t = ( )t τ Fig. 2. Open-loop OE identification scheme. Actual Robot Direct Dynamic Model (DDM) ddm ddm ddm ddm ddm M(q ) q N(q , q ) τ = − && & Non Linear LS 1 min 2 2 s ˆ Y Y χ χ = − ( )t τ Control law r r r q q q          & && ˆχ Sampling and filtering ( ), ( ) s ddm Y q Y q ( ) s ddm y q t = obs T ddm τ ( ) y q t = Control law This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 10 Fig. 3. Closed-loop OE identification scheme. In [20], we used the Gauss-Newton method to calculate the optimal solution. It is a Newton method where approximations of the gradient and the hessian of ( ) J χ are calculated with the jacobian matrix of sy with respect to χ . The Gauss-Newton regression is a simpler way to calculate the optimal solution [29]. It is based on a Taylor series expansion of sy , at a current estimate k ˆχ , of the parameters at iteration k : ( ) ( ) +1 +1 k k k k k S S S ˆχ y ( χ ) ˆ ˆ y ( χ ) y ( χ ) χ χ o χ  ∂  = + − +   ∂   (29) where: ( ) sy / k S χ ˆχ y ( χ ) δ χ  ∂  =   ∂   (30) sy /χ δ is the ( ) x n b , jacobian matrix of sy , with respect to χ , evaluated at k ˆχ . Each coefficient of sy /χ δ , defines a sensitivity function. These sensitivity functions characterize the variation of the output function Sy , with respect to a variation of the parameter χ . The sensitivity functions are the solutions of a differential system calculated from (21). However, this technique is more time-consuming compared to the IDIM method. Indeed, the DDM and the sensitivity functions must be integrated many times at each step of the iterative non-linear optimization method. Moreover, it is necessary to have good initial conditions in order to avoid multiple and local solutions. Let us define: +1 y y k S( χ ) e = + (31) From (29), it becomes: ( ) ( ) +1 / s k k k s y χ ˆ ˆ y y ( χ ) δ χ χ o e − = − + + (32) An over-determined linear system is obtained by filtering and sampling (32) over the time window obs T : +1 k δ Y W χ ∆ ∆ ρ = + (33) with: ( ) +1 +1 k k k ˆ χ χ χ ∆ = − This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 11 Y ∆ , δ W , and ρ are, respectively, the sampling and filtering of ( ) k s ˆ y y ( χ ) − , / sy χ δ , and of ( ) o e + . +1 k ˆχ ∆ is the LS solution of (33). This process is iterated with a new estimate, +1 +1 k k k ˆ ˆ ˆ χ χ χ ∆ = + , until: k 1 k k 1 tol ρ ρ ρ + − ≤ , and, +1 k k i i 2 k i 1,...,b i ˆ ˆ χ χ max tol ˆχ = − ≤ (34) where, 1 tol and 2 tol , are values ideally chosen to be small numbers to get fast convergence with good accuracy. IV. DIDIM: DIRECT AND INVERSE DYNAMIC IDENTIFICATION MODEL TECHNIQUE A. Theoretical approach In the OE method as shown in Fig. 3, the actual output is the measured joint position, y q = . We propose to change the output, y , from the actual joint position q , to the actual joint force/torque τ , and the simulated output sy , from the simulated joint position, ddm q , to the simulated joint force/torque, ddm τ . Then, we take y τ = , and s ddm y τ = , according to Fig. 4. Actual Robot Direct Dynamic Model ddm ddm ddm ddm ddm M(q , ) q N(q , q , ) χ τ χ = − && & Non Linear LS min 2 s ˆ Y Y χ χ = − ( )t τ Control law r r r q q q          & && ˆχ Sampling and filtering ( ), ( ) s ddm Y Y τ τ obs T ddm τ ( ) q t Control law ddm sy τ = ( )t y τ = ddm q Actual closed loop robot Simulated closed loop robot Fig. 4. DIDIM identification scheme. This means that the output equation (24) of the state-space model (22) reduces to a direct feedthrough equation such as, s s ddm y u τ = = . Then we have x2* s n n C 0 = , and s n D I = , in the output equation (24). This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 12 The optimal solution, ˆχ , minimizes the quadratic criterion, ( ) J χ , (27), where, Y , and sY , are vectors obtained by filtering the vectors of samples, fm Y and Sfm Y , respectively, where the equations of each joint are grouped, with: ( ) m 1 fm j 1 j fm fm n fm j n Y ( t ) Y τ ... , Y ... Y ( t ) τ τ         = =             , j j m 1 ddm 1 Sfm j Sfm Sfm n Sfm ddm n (t ) Y Y ... , Y ... Y (t ) τ τ         = =             (35) This non-linear LS problem is solved by the Gauss-Newton regression as explained in section III. The input force/torque of the DDM, ddm τ , can be calculated with the analytical expression of the inverse dynamic model (4), such as: ( ) ( ) ( ) ( ) ( ) ( ) ( ) s ddm idm ddm ddm ddm y χ τ χ τ χ IDM q χ ,q χ ,q χ χ = = = & && (36) The Taylor series expansion (29), with s ddm y τ = , at a current estimate, k ˆχ , of the parameters χ , at iteration k , is calculated with the jacobian matrix of ( ) ddm τ χ , given by: ( ) ( ) ( ) ( ) sy / k k k k k k ddm idm χ ddm ddm ddm ˆ ˆ χ χ ˆ ˆ ˆ ˆ δ IDM q ( χ ),q ( χ ),q ( χ ) χ χ χ χ τ τ   ∂ ∂   ∂ = = =     ∂ ∂ ∂     & && (37) Then, it becomes: ( ) ( ) ( ) ( ) ( ) k k k k k k k ddm ddm ddm ddm ddm ddm k k k k ddm ddm ddm ˆ ˆ ˆ ˆ ˆ ˆ ˆ IDM q ( χ ),q ( χ ),q ( χ ) χ IDM q ( χ ),q ( χ ),q ( χ ) ... χ ˆ ˆ ˆ ˆ IDM q ( χ ),q ( χ ),q ( χ ) χ χ ∂ = + ∂ ∂ ∂ & && & && & && (38) The calculation of the second term on the right side of (38) needs to calculate the expression: ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) k k k k k k ddm ddm ddm ddm ddm ddm ddm ddm k k k ddm ddm ddm ddm ddm k k k ddm ddm ddm ddm ddm q ˆ ˆ ˆ ˆ ˆ ˆ IDM q ( χ ),q ( χ ),q ( χ ) IDM q ( χ ),q ( χ ),q ( χ ) ... χ q χ q ˆ ˆ ˆ IDM q ( χ ),q ( χ ),q ( χ ) ... q χ q ˆ ˆ ˆ IDM q ( χ ),q ( χ ),q ( χ ) q χ ∂ ∂ ∂ = + ∂ ∂ ∂ ∂ ∂ + ∂ ∂ ∂ ∂ ∂ ∂ & && & && & & && & && & && && (39) Let us recall that the joint force/torque y τ = , is obtained while the robot is tracking a reference trajectory, ( ) r r r q ,q ,q & && , with a closed-loop control law. The closed-loop simulation uses the direct dynamic model, the same control law and the same reference trajectory ( ) r r r q ,q ,q & && , as the actual one, to calculate Sy . In the following section IV.B, we show how to tune the control law of the closed-loop simulation in This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 13 order to keep the same bandwidth and stability margin as the actual closed-loop for any k ˆχ , obtained at iteration k. This assumes for the simulated tracking error to keep close to the actual one for any k ˆχ , that is to say: ( ) ( ) k k k ddm ddm ddm ˆ ˆ ˆ q ( χ ),q ( χ ),q ( χ ) q,q,q & && & &&  , for any k ˆχ (40) This means that ( ) ddm ddm ddm q ( χ ),q ( χ ),q ( χ ) & && , have little dependence on χ , such that: ddm ddm ddm q q q 0 χ χ χ ∂ ∂ ∂ ∂ ∂ ∂ & &&    Then (39) is simplified as: ( ) ( ) 0 k k k ddm ddm ddm ˆ ˆ ˆ IDM q ( χ ),q ( χ ),q ( χ ) χ ∂ ∂ & &&  Taking into account this simplification, we have in (38): ( ) ( ) ( ) k k k k k k ddm ddm ddm k ddm ddm ddm ˆ ˆ ˆ ˆ ˆ ˆ ˆ IDM q ( χ ),q ( χ ),q ( χ ) χ IDM q ( χ ),q ( χ ),q ( χ ) χ ∂ ∂ & && & &&  As a result, the jacobian matrix (37) can be approximated by: ( ) ( ) ( ) sy / k k k k k k k χ ddm ddm ddm ddm ddm ddm ˆ ˆ ˆ ˆ ˆ ˆ ˆ δ IDM q ( χ ),q ( χ ),q ( χ ) χ IDM q ( χ ),q ( χ ),q ( χ ) χ ∂ = ∂ & && & &&  (41) Each sensitivity function in the jacobian matrix is approximated by an algebraic equation. This is much more simpler than for usual OE method where the sensitivity functions are the solutions of complicated differential equations. This is the reason why it is much more simpler to minimize the error between the measured force/torque and the simulated force/torque than to minimize the error between the actual position and the simulated position. Taking the approximation (41) of the jacobian matrix into the Taylor series expansion (32), it becomes: ( )( ) ( ) +1 k k k k k k s ddm ddm ddm ˆ ˆ ˆ ˆ ˆ y y ( χ ) IDM q ( χ ),q ( χ ),q ( χ ) χ χ o e τ = = + − + + & && (42) From (36), it becomes: ( ) k k k k k k s idm ddm ddm ddm ˆ ˆ ˆ ˆ ˆ ˆ y ( χ ) ( χ ) IDM q ( χ ),q ( χ ),q ( χ ) χ τ = = & && (43) Taking (43) in (42), it becomes: ( ) ( ) +1 k k k k ddm ddm ddm ˆ ˆ ˆ y IDM q ( χ ),q ( χ ),q ( χ ) χ o e τ = = + + & && (44) This is the Inverse Dynamic Identification Model, IDIM, (7), where ( ) q, q, q & && are estimated with ( ) ddm ddm ddm q ,q ,q & && , simulated with k ˆ DDM( χ )(21). At each iteration k , the IDIM method is applied as This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 14 described in section II. The sampling of (44) at a sampling rate mf , gives an over-determined linear system such as: ( ) ( ) , k fm fm ddm ddm ddm fm ˆ Y τ W q ,q ,q χ ρ δ χ = + & && (45) With: ( ) m 1 fm j 1 j fm fm n fm j n Y (t ) Y τ ... , Y ... Y (t ) τ τ         = =             (46) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) 1 1 1 , , , m m m j k 1 ddm ddm ddm fm k j fm ddm ddm ddm fm n j k fm ddm n ddm n ddm n ˆ IDM q t ,q t ,q t χ W ˆ W q ,q ,q χ ... , W ... W ˆ IDM q t ,q t ,q t χ δ δ δ δ           = =             & && & && & && (47) The parallel decimation of (45) gives: ( ) ( ) , k ddm ddm ddm ˆ Y τ W q ,q ,q χ χ ρ δ = + & && (48) The LS solution of (48) gives k 1 ˆχ + , at iteration + k 1. This process is iterated until: k 1 k k tol1 ρ ρ ρ + − ≤ , and, +1 k k i i 2 k i 1,...,b i ˆ ˆ χ χ max tol ˆχ = − ≤ , where, 1 tol and 2 tol are values ideally chosen to be small numbers to get fast convergence with good accuracy. This new identification method is based on a closed-loop simulation using the direct dynamic model (DDM) while the optimal parameters minimize the 2-norm of the error between the actual force/torque τ , and the simulated force/torque ddm τ , over an observation window time obs T . This new technique overcomes the problems of non-linear optimization in OE method, section III, using the IDIM to calculate the simulated force/torque vector, s ddm idm y τ τ = = . Because this method uses both models DDM and IDIM, it is named the DIDIM method: Direct and Inverse Dynamic Identification Models technique. The DIDIM method with the Gauss-Newton regression is illustrated Fig. 5. This approach is particularly interesting thanks to the following reasons: • It needs only the actuator force/torque measurement or estimation, • It avoids tuning the bandpass filter in the IDIM method by using the integration of the DDM in a closed-loop simulation where the tuning of the bandwidth automatically defines the same frequency This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 15 range for the dynamics of the actual and of the model to be identified. • It combines the inverse and the direct dynamic model and validates, in the same identification procedure, both models for computed torque control and for simulation. • It dramatically simplifies the computation of the matrix of the sensitivity functions which is given by an algebraic equation (the inverse dynamic identification model) whereas it is given by the resolution of a complicated system of differential equations in the usual OE method. The drawback is that the structure and the tuning of the actual closed-loop control law must be known to be implemented in the closed-loop simulation of the robot. Most often, this is not a real problem, because working on identification for simulation or control of the robot, needs a minimal knowledge on the robot controller. r r r q q q          & && Actual Robot Direct Dynamic Model k ddm ddm k ddm ddm ddm ˆ M(q , ) q ˆ N(q , q , ) χ τ χ = − && & Linear LS +1 min 2 k ˆ Y Wδ χ χ χ = − ( )t τ Control law ˆχ ( ) ( ) ˆ ( ), W , , , k ddm ddm ddm Y IDM q q q δ τ χ & && obs T ddm τ ( ) q t Control law ( ) Inverse Dynamic Identification M odel k ddm ddm ddm ˆ IDM q ,q ,q ,χ & && ( ) k ddm ddm ddm ˆ IDM q ,q ,q ,χ & && ( )t y τ = ddm q Actual closed loop robot Simulated closed loop robot sampling ( ) lowpass filtering downsampling fm Fig. 5. DIDIM with the Gauss-Newton regression, identification scheme. B. Initialization of the algorithm A problem is how to choose the initial values 0 ˆχ . We can use CAD values, or identified values with the IDIM method, but we show that there is no need at all of a priori values. We propose an algorithm not sensitive to the initial conditions, which assumes that the condition ( ) ( ) ddm k ddm k ddm k ˆ ˆ ˆ q ( χ ),q ( χ ),q ( χ ) q,q,q & && & &&  , is satisfied at any iteration k , and especially for k =0. This is possible by taking the same control law structure for the actual robot and for the simulated one with the same performances given by the bandwidth, the stability margin or the closed-loop poles. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 16 Because the simulated robot parameters k ˆχ , change at each iteration k , the gains of the simulated control law must be updated according to kˆχ . For example, let us consider a PD control law for each joint j . The inverse dynamic model IDM (1) for the joint j , can be written as a decoupled double integrator perturbed by a coupling force/torque, such that: = ( ) + ( , )= ( ) ( ) + ( , )= ( ) j n n j idm j ,i i j j ,j j j ,i i j j ,j j j i 1 i j τ τ M q q N q q M q q M q q N q q M q q p = ≠ = + − ∑ ∑ && & && && & && (49) where jp is considered as a perturbation given by: ( ) ( , ) n j j ,i i j i j p M q q N q q ≠ = − − ∑ && & (50) ( ) j ,i M q which depends on q , is approximated by a constant inertia moment j J , given by: ( ) ( ) j j j j a j ,j j a q J ZZ I max M q ZZ I = + + − − (51) j J , is the maximum value, with respect to q , of the inertia moment around joint jz axis. This gives the smallest damping value and the smallest stability margin of the closed-loop second order transfer function (55), while q varies. It can be calculated from a priori CAD values of inertial parameters and must be taken at least as j j a ZZ I + . The joint j dynamic model is approximated by a double integrator, where jp , is a perturbation, as following: ( ) ( ) ( ) j j j j j j ,j j 1 1 q τ p τ p M q J = + + &&  (52) Let us consider the joint j PD control of the actual robot which is illustrated Fig. 6: + - + - j agτ jrq j a vk 1 a jJ 1 s 1 s + + j a pk jp j vτ jq& jq jτ jq&& Fig. 6. Joint PD control of the actual robot. The control input calculated by the robot controller is given by: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 17 ( ) j j j j j a a a p v r j v j v k k q q k q τ = − − & (53) j vτ is the current reference of the current amplifiers which supplies the motor. The joint j , force/torque is given by: j j a j g v τ τ τ = (54) where: j agτ is the actual drive gain, calculated with the actual parameters in (9). a j J is the actual value of j J . In order to tune the tracking performances of the reference position jrq , the transfer function jr j q q is calculated with =0 jp : =0 j j j j j j j j a 2 a 2 j j r p a a a a a 2 a v p p nj nj q 1 1 H J s 2 q 1 s s 1 s 1 g k k k τ ζ ω ω    = = =     + + + + (55) where: a nj ω is the actual natural frequency which characterizes the closed-loop bandwidth, a j ζ is the actual damping coefficient which characterizes the closed-loop stability margin, with: j i i a a a a nj p v a j g k k J τ ω = , 1 j i i a a v a j a a p j g k 2 k J τ ζ = (56) Then it becomes: 2 j a nj a p a j k ω ζ = , j j a j a a a v j nj a J k 2 gτ ζ ω = (57) The closed-loop performances are chosen with the desired 2 poles of a second order transfer function characterized by, d nj ω , d j ζ , where: d nj ω is the desired natural frequency, d j ζ is the desired damping coefficient. Because the actual values are unknown, the gains are calculated from (57), where the unknown actual values, a nj ω , a j ζ , a j J , j a ig , are replaced respectively by their desired values, d nj ω , d j ζ , and by their a priori values, ap j J , j apgτ : This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 18 2 j d nj a p d j k ω ζ = , j j ap j a d d v j nj ap J k 2 gτ ζ ω = (58) where: and j ap ap j J gτ are a priori values of the actual unknown values and j a a j J gτ , respectively. Now, let us consider the joint j PD control of the simulated robot which is illustrated Fig. 7. + - + - j apgτ jrq j s vk 1 k jˆJ 1 s 1 s + + j s pk j ddm p j ddm vτ j ddm q& j ddm q j ddm τ j ddm q&& Fig. 7. Joint PD control of the simulated robot. The variables ( ) , j j j j j ddm ddm ddm ddm ddm v , q , q , q τ τ & && , in Fig. 7, are computed by numerical integration of k ˆ DDM( χ ), (21). The control law of the simulated robot has the same structure as the actual one, Fig. 6, where we take: j a ig = j ap ig , the a priori value of j a ig , a j J = k jˆJ , the value of j J , (51), calculated with the estimation kˆχ , at iteration k. j s p k , j s vk are the gains of the simulated control law. They are calculated in order to keep the same performances for the simulated closed-loop and for the actual closed-loop, that is to say to keep the same desired values, d nj ω and d j ζ , for the closed-loop poles. Then, it becomes: , 2 2 j j j j d k nj j s a s d d p p v j nj d ap j ˆJ k k k gτ ω ζ ω ζ = = = (59) The proportional gain, j s p k , does not depend at all on the parameters values, but the derivative gain in the simulator , j s vk , must be updated with k jˆJ , at each iteration k . It is important to note that only the gain in the simulated closed-loop, j s vk , is modified during the iterative procedure. The actual gain of the robot control law, j a vk , is not modified. The simulated closed-loop tuning given by, d nj ω , d j ζ , differs from the actual one, a nj ω , a j ζ , with the This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 19 following ratio, calculated by taking (58) into (56): j j a a a ap nj j j d d a ap nj j j g J J g τ τ ω ζ ω ζ = = (60) Usually this ratio is between 0.8 and 1.2. The actual values, a nj ω , a j ζ , can be estimated from step response or frequency analysis of the actual closed-loop. But this is not necessary, because there is little effect on the identification accuracy, assuming, d nj ω , is regularly chosen more than 10 times greater than the frequency range of the robot dynamics. This allows to keep ( ) ( ) ddm k ddm k ddm k ˆ ˆ ˆ q ( χ ),q ( χ ),q ( χ ) q,q,q & && & &&  , at each iteration k. We propose to take a regular inertia matrix 0 ( , ) ddm ˆ M q χ , in order to have a good initialization for the numerical integration of the DDM (21) . This is named the "regular initialization". It can be obtained with: 0 0 ˆχ = , except for, 0 1, j Ia j 1,n = = (61) The inertia of the rotor and gear of actuator j is generally taken into account in the IDM model (1) as: τ jr j j Ia q = && Then, the initial inertia matrix becomes the identity matrix, which is the best regular matrix: 0 ( , ) = ddm n ˆ M q I χ (62) Another simple regular initialization is to take: 0 0 ˆχ = , except for, 0 1, j ZZ j 1,n = = (63) The initial inertia matrix, 0 ( , ) ddm ˆ M q χ , is no more the identity matrix, but remains regular. Another point is to choose the state initial condition of the state vector, ( ) (0) (0) ddm ddm q ,q& , in order to integrate the DDM (21). Because DIDIM doesn't need the joint position measurement, the actual values ( ) (0) (0) q ,q& , are supposed to be unknown and we choose, ( ) ( ) (0) (0) (0) (0) ddm ddm r r q ,q q ,q = & & , which is close to ( ) (0) (0) q ,q& . Because the closed-loop transient response due to different initial conditions differs between the actual and the simulated signals during a transient period of approximately, 5 d n / ω , the corresponding joint force/torque samples are eliminated from the identification data in (45). V. CASE STUDY: MODELING OF THE SCARA ROBOT The identification method is carried out on a 2 degree-of-freedom planar direct drive prototype robot This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 20 without gravity effect, shown in Fig. 8. This direct drive prototype is very suitable for the study of DIDIM because it emphasizes non linear coupling torques while this non-linear effect is divided by at least 2500 for industrial robots with gear ratio greater than 50. Moreover, the dynamic model of this robot depends on eight parameters only, which facilitates the study of the identification efficiency with respect to several conditions. At last, this robot and its real parameters, called the nominal parameters, are well known. Thus, we can check the physical meaning of the identified parameters. The description of the geometry of the robot uses the modified Denavit and Hartenberg (DHM) notations [30] which are illustrated in Fig.9. The robot is direct driven by 2 DC permanent magnet motors supplied by PWM amplifiers. x0 q1 L O , O 0 1 x1 x 2 O 2 y2 y0 y1 y2 q 2 Fig. 8. The scara robot prototype. Fig. 9. DHM frames of the scara robot. The dynamic model depends on 8 minimal dynamic parameters, considering 4 friction parameters: [ ] T 1R 1 1 2R 2 2 2 2 ZZ Fv Fc ZZ LMX LMY Fv Fc χ = (64) 2 1R 1 1 2 ZZ ZZ Ia M L = + + 2R 2 2 ZZ ZZ Ia = + L=0.5m, is the length of the first link. In the case of the SCARA robot, the parameters, 2 LMX , and 2 LMY , are identified instead of, 2 MX , and 2 MY , respectively. The 8 columns, , 1,8 :,k IDM k = , of ( ) IDM q,q,q & && , in IDIM (7), are the following: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 21 0 0 0 1R 1 1 2 R 2 1 1 :,1 ZZ :,2 Fv 1 2 1 :,3 Fc :,4 ZZ 1 2 ) 1 2 2 2 1 2 2 :,5 LMX 2 1 2 1 q q IDM IDM , IDM IDM , q q sign( q ) IDM IDM , IDM IDM , q q ( 2q q ) cosq - q ( 2q q sinq IDM IDM q cosq q sin +     = = = =         +     = = = =     +     + = = + && & && && & && && && && & & & && & 0 0 2 2 2 2 ) 1 2 2 2 1 2 2 :,6 LMY 2 1 2 1 2 :,7 Fv :,8 Fc 2 2 , q ( 2q q ) sinq q ( 2q q cosq IDM IDM , q cosq q sinq IDM IDM , IDM IDM q sign( q ) +       − + −   = =   −       = = = =         && && & & & & && & & (65) The closed-loop control is a PD control law (53) , according to Fig. 6, with: 2 1 1R 2R 2 J ZZ ZZ LMX = + + , and 2 2R J ZZ = . The actual gains are calculated with (58), taking a desired damping, d j ζ =1, for joint 1 and joint 2. The desired natural frequency, d nj ω , is chosen according to the driving capacity without saturation of the joint drive. For this robot we obtain a full bandwidth with, 1 / 1 d f n rd s ω = , and 10 / 2 d f n rd s ω = . The sample rates of the control and of the measurement are equal to, mf =200Hz. Torque data are obtained from (54), and from the current reference data vτ . The simulation of the robot is carried out with the same reference trajectory and with the same control law structure as the actual robot. The gains in the simulator are calculated with (59) and with the same values, d j ζ =1, 1 / 1 d n rd s ω = , and 10 / 2 d n rd s ω = . VI. EXPERIMENTAL IDENTIFICATION RESULTS The new identification process is performed in different cases in order to compare the previous IDIM technique to the new DIDIM technique and to investigate the robustness of DIDIM with respect to the initialization, to the acquisition sampling rate, to the data filtering and to the closed-loop tuning. All the results are given in SI units, on the joint side. A. Comparison of IDIM and DIDIM with good initial values, 0 IDIM ˆ ˆ χ χ = . At first, the algorithm is initialized with, IDIM ˆχ , the vector of parameters identified with the IDIM LS estimator. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 22 The IDIM LS off-line estimation is carried out with a filtered position ˆq , calculated with a 20Hz cut- off frequency forward and reverse Butterworth filter, and with the velocities ˆq& , and the accelerations, ˆq&& , calculated with a central difference algorithm of ˆq . The parallel decimation of fm Y and fm W , in (10), is carried out with a sample rate divided by a factor, dn =20, and a lowpass filter cut-off frequency equal to, 0.8* /(2* )=4Hz m d f n . The results are given in Table 1. It needs only 2 steps to obtain the optimal solution which is very close to the IDIM solution. Hence, the DIDIM method does not improve the IDIM solution calculated with good bandpass filtered data. A validation is plotted on Fig. 10, at the frequency measurement, mf =200Hz. It shows that the actual joint torques, ( ) fm Y τ , and the torques estimated with the identified model, ( ) 2 , 2 e fm ddm ddm ddm ˆ ˆ Y W q ,q ,q δ χ χ = & && , as defined in (45), (46), (47), are very close. Both methods give a small relative norm error, ˆ Y W / Y χ − <3%, which shows a good accuracy for the model and for the identified values. It can be seen that the parameters, 1 Fv , and 2 Fv , have no significant estimations because of their large relative standard deviation (>30%). They have no significant contribution in the joint torques and they can be cancelled to keep a set of essential parameters of a simplified dynamic model, without loss of accuracy [31]. However, we prefer to keep all the parameters in the following, for a better comparison of IDIM and DIDIM identification methods. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 23 TABLE 1: COMPARISON OF IDIM AND DIDIM METHODS IDIM DIDIM Parameter IDIM ˆχ χ 2 σˆ rχ %σˆ 0 IDIM ˆ ˆ χ χ = 2 ˆ χ χ 2 σˆ rχ %σˆ ZZ1R 3.44 0.034 0.50 3.44 3.45 0.036 0.52 1 Fv 0.03 0.031 52. 0 0.03 0.04 0.032 40.0 1 Fc 0.82 0.1 6.0 0.82 0.82 0.05 3.0 2 ZZ 0.062 0.0006 0.51 0.062 0.061 0.0006 0.49 2 LMX 0.121 0.0014 0.56 0.121 0.124 0.0013 0.52 2 LMY 0.007 0.0007 5.0 0.007 0.007 0.0005 3.5 2 Fv 0.013 0.006 23.0 0.013 0.014 0.0084 30.0 2 Fc 0.137 0.006 2.30 0.137 0.133 0.0080 3.0 IDIM ˆ Y W / Y χ − =0.024 2 ˆ Y W / Y χ − =0.021 4000 4500 5000 5500 6000 -15 -10 -5 0 5 10 15 Joint 1 Motor torque (Nm) Sample number Measurement: Yfm Estimation: Ye Error = Yfm - Ye 4000 4500 5000 5500 6000 -1 -0.5 0 0.5 1 Joint 2 Motor torque (Nm) Sample number Measurement: Yfm Estimation: Ye Error = Yfm - Ye Fig. 10. DIDIM, validation, ( ) 2 , 2 e fm ddm ddm ddm ˆ ˆ Y W q ,q ,q δ χ χ = & && . B. DIDIM, validation of the regular initialization, 0 2 ( , )= ddm ˆ M q I χ The robustness of DIDIM with respect to a wrong initialization, such as the regular initialization (62), is investigated. The initial values of the dynamic parameters are given by (61), with: [ ] T 1 0 0 1 0 0 0 0 0 ˆ χ = The identified values given in Table 2, are very close to those given in Table 1. This result validates the regular initialization procedure , described in section IV.B. Moreover the algorithm converges in only 3 steps and is not time consuming. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 24 TABLE 2: DIDIM WITH THE REGULAR INITIALIZATION Parameter 0 ˆ χ 3 ˆ χ χ 2 σˆ rχ %σˆ ZZ1R 1 3.45 0.014 0.2 1 Fv 0 0.02 0.012 15 1 Fc 0 0.85 0.016 1.0 2 ZZ 1 0.061 0.0001 0.1 2 LMX 0 0.124 0.0002 0.1 2 LMY 0 0.007 0.0003 2.0 2 Fv 0 0.01 0.003 10 2 Fc 0 0.132 0.0008 0.3 The relative norm errors on joint position, velocity and acceleration are plotted in Fig.11 ,with the following legend: • norm error relative to the actual filtered joint position, j ddm j j ˆ ˆ q q / q − , velocity j ddm j j ˆ ˆ q q / q − , and acceleration, j ddm j j ˆ ˆ q q / q − && && && , where ( ) ˆ ˆ ˆq,q,q & && , are calculated as given in section VI.A. * norm error relative to the reference joint position, j j j ddm r r q q / q − , velocity, j j j ddm r r q q / q − & & & , and acceleration, j j j ddm r r q q / q − && && && . The assumption (40), made in section IV.B, ( ) ( ) ddm k ddm k ddm k ˆ ˆ ˆ ˆ ˆ ˆ q ( χ ),q ( χ ),q ( χ ) q,q,q & && & &&  , at each iteration k, is confirmed on Fig.11. , with a constant relative norm error close to 0.5% for the position, 5%, for the velocity and 10%, for the acceleration. These results validate the updating procedure (59), of the simulated PD control law gains. It can be seen also on Fig.11. , that the simulated trajectory, ( ) ddm k ddm k ddm k ˆ ˆ ˆ q ( χ ),q ( χ ),q ( χ ) & && , is 3 to 5 times closer to the actual one, ( ) ˆ ˆ ˆq,q,q & && , than to the reference one, ( ) r r r q ,q ,q & && , with a relative norm error close to 1.5% for the position, 15%, for the velocity and 30%, for the acceleration. Moreover, this error depends on the closed-loop bandwidth. It means that computing the observation matrix in (13) with the reference trajectory, ( ) r r r q ,q ,q & && , leads to a bad identification of the dynamic parameters. Then, the right assumption is, ( ) ( ) ddm k ddm k ddm k ˆ ˆ ˆ ˆ ˆ ˆ q ( χ ),q ( χ ),q ( χ ) q,q,q & && & &&  , (40), made in section IV.B. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 25 0 0.5 1 1.5 2 2.5 3 0 0.005 0.01 0.015 0.02 Relative errors: Joint 1 Positions 0 0.5 1 1.5 2 2.5 3 0.02 0.04 0.06 0.08 Velocities 0 0.5 1 1.5 2 2.5 3 0.05 0.1 0.15 0.2 0.25 Accelerations Iterations 0 0.5 1 1.5 2 2.5 3 0 0.005 0.01 0.015 0.02 Relative errors: Joint 2 Positions 0 0.5 1 1.5 2 2.5 3 0 0.05 0.1 0.15 0.2 Velocities 0 0.5 1 1.5 2 2.5 3 0 0.1 0.2 0.3 0.4 Accelerations Iterations Fig.11. • norm error relative to the filtered actual position, velocity, acceleration. * norm error relative to the reference position, velocity, acceleration. We have seen that ( ) ( ) ddm k ddm k ddm k ˆ ˆ ˆ ˆ ˆ ˆ q ( χ ),q ( χ ),q ( χ ) q,q,q & && & &&  , at each iteration k, with a constant small error. On the contrary, the relative torque norm error, given in Table 3, and plotted on Fig. 12, dramatically decreases in only 3 steps. This shows the fast algorithm convergence. TABLE 3: RELATIVE NORM ERROR OF JOINT TORQUE: j j k j ˆ Y W / Y χ − Iteration k 0 1 2 3 Joint j=1 0.42 0.036 0.02 0.018 Joint j=2 3.20 0.110 0.02 0.022 0 0.5 1 1.5 2 2.5 3 0 0.1 0.2 0.3 0.4 Relative error: Joint torque 1 Iterations 0 0.5 1 1.5 2 2.5 3 0 0.5 1 1.5 2 2.5 3 Relative error: Joint torque 2 Iterations Fig. 12. DIDIM, convergence of the joint torque error, j j k j ˆ Y W / Y χ − This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 26 The fast convergence of each parameter is shown in Table 4 , and is plotted on Fig. 13. TABLE 4: PARAMETERS CONVERGENCE Parameters 0 ˆ χ 1 ˆ χ 2 ˆ χ 3 ˆ χ ZZ1R 1 3.46 3.45 3.45 1 Fv 0 0.04 0.02 0.02 1 Fc 0 0.86 0.85 0.85 2 ZZ 1 0.06 0.061 0.061 2 LMX 0 0.122 0.124 0.124 2 LMY 0 0.05 0.07 0.07 2 Fv 0 0.005 0.01 0.01 2 Fc 0 0.130 0.132 0.132 0 1 2 3 1 2 3 4 ZZ1R 0 1 2 3 0 0.02 0.04 FV1 0 1 2 3 0 0.5 1 FC1 0 1 2 3 0 0.5 1 ZZ2 0 1 2 3 0 0.1 0.2 MX2 0 1 2 3 0 0.05 0.1 MY2 0 1 2 3 0 0.005 0.01 FV2 Iterations 0 1 2 3 0 0.1 0.2 FC2 Iterations Fig. 13. DIDIM, parameters convergence C. Comparison of IDIM and DIDIM robustness with respect to a low sample rate. The actual torque and the simulated data are resampled to obtain a low frequency measurement mf = 0.5Hz. This is a downsample procedure without lowpass antialiasing filtering which investigates a real problem on industrial robots where the available sample rate measurement given by the controller may be much lower than the control sample rate. All the actual and simulated data are sampled at mf = 0.5Hz. The IDIM LS estimation is carried out with the measured joint position q , and with ( ) ˆ ˆ q,q & && , calculated by a central difference algorithm of q , without lowpass Butterworth filtering. There is no parallel This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 27 decimation. DIDIM starts with the regular initialization. Results are given in Table 5 . TABLE 5: IDIM AND DIDIM, LOW SAMPLING RATE, mf = 0.5HZ. IDIM DIDIM Parameter IDIM ˆχ χ 2 σˆ rχ %σˆ 0 ˆ χ 3 ˆ χ χ 2 σˆ rχ %σˆ ZZ1R 3.10 0.03 0.3 1.0 3.45 0.04 0.5 1 Fv 0.9 1.8 100 0 0.04 0.02 30 1 Fc 1.0 0.1 5 0 0.81 0.05 3 2 ZZ 0.025 0.003 5.5 1.0 0.061 0.0006 0.5 2 LMX 0.075 0.008 5.3 0 0.124 0.001 0.5 2 LMY -0.02 0.01 250 0 0.008 0.0006 4.0 2 Fv 0.35 5.6 800 0 0.01 0.005 25 2 Fc 0.19 0.087 23 0 0.13 0.008 3.0 IDIM ˆ Y W / Y χ − =0.5 3 ˆ Y W / Y χ − =0.04 The identified values with IDIM are not good while the identified values with DIDIM are still good. This shows the robustness of DIDIM with respect to the sampling rate measurement. IDIM fails because there is an amplitude distortion in the estimation of ( ) ˆ ˆ q, q & && , with a central difference of q , sampled at a too low frequency mf . This point is illustrated in Table 6, which gives the relative norm errors on velocity (80%) and acceleration (80%). ( ) (200Hz) (200Hz) ˆ ˆ q , q & && , is calculated from q , sampled at 200Hz and lowpass filtered at a 0.5Hz cut- off frequency, and derived with a central difference algorithm. ( ) (0.5Hz) (0.5Hz) ˆ ˆ q , q & && , is calculated from q , sampled at 0.5Hz and derived with a central difference algorithm. DIDIM succeeds because, ( ) ddm ddm ddm q ,q ,q & && , is computed with accuracy by the integration of the DDM with a well-tuned variable step solver, and it can be sampled without error at any frequency mf . TABLE 6: IDIM, JOINT DATA ERRORS AT 0.5Hz mf = (200Hz) (0.5Hz) (200Hz) 1 1 1 ˆ ˆ ˆ q q / q − & & & 0.39 (200Hz) (0.5Hz) (200Hz) 1 1 1 ˆ ˆ ˆ q q / q − && && && 0.73 (200Hz) (0.5Hz) (200Hz) 2 2 2 ˆ ˆ ˆ q q / q − & & & 0.80 (200Hz) (0.5Hz) (200Hz) 2 2 2 ˆ ˆ ˆ q q / q − && && && 0.81 This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 28 D. Comparison of IDIM and DIDIM, without data filtering. All the actual and simulated data are sampled at mf = 200Hz. The IDIM LS estimation is carried out with the measured joint position q , and with ( ) ˆ ˆ q,q & && , calculated by a central difference algorithm of q , without lowpass Butterworth filtering. There is no parallel decimation. DIDIM starts with the regular initialization. Results are given in Table 7. TABLE 7: IDIM AND DIDIM ESTIMATION WITHOUT DATA FILTERING IDIM DIDIM Parameter IDIM ˆχ χ 2 σˆ rχ %σˆ 0 ˆχ 2 ˆ χ χ 2 σˆ rχ %σˆ ZZ1R 1.50 0.05 1.60 1.0 3.45 0.007 0.1 1 Fv 0.095 0.15 80. 0 0 0.05 0.023 21 1 Fc 0.55 0.26 23.3 0 0.81 0.004 0.24 2 ZZ 0.14 0.018 6.7 1.0 0.061 0.0004 0.3 2 LMX 0.63 0.035 2.7 0 0.124 0.0015 0.3 2 LMY 0.1 0.023 11.8 0 0.008 0.0009 5.6 2 Fv 0.001 0.143 700.0 0 0.023 0.0022 48 2 Fc 0.19 0.244 68.40 0 0.13 0.0038 1.5 IDIM ˆ Y W / Y χ − =0.8 2 ˆ Y W / Y χ − =0.08 The identified values with IDIM are not good while the identified values with DIDIM are still good. IDIM fails because of the too large noise in the observation matrix, ( ) fm ˆ ˆ W q,q,q & && , coming from the derivation of q , without lowpass filtering. Then the LS estimation is biased. DIDIM succeeds because the observation matrix, ( ) , k fm ddm ddm ddm ˆ W q ,q ,q δ χ & && , is calculated without noise with the simulated values ( ) ddm ddm ddm q ,q ,q & && . This validation shows that DIDIM cancels the bias of IDIM estimation, coming from a noisy estimation of ( ) ˆ ˆ ˆq,q,q & && , which gives a too noisy observation matrix ( ) fm ˆ ˆ W q,q,q & && . E. DIDIM robustness with respect to error in the simulated closed-loop tuning, d n ω This section investigates the effect of an error between the actual value, a n ω , and the simulated value d n ω , of the natural frequency which represents the closed-loop bandwidth. The DIDIM identification is performed taking half the values of the full ones given in section V, /2=1/2 (rd/s) 1 1 d d f n n ω ω = and 10/2 (rd/s) 2 2 d d f n n / 2 ω ω = = , and the same procedure used to obtain results This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 29 shown in Table 2, that is to say a frequency measurement, mf =200Hz, and a parallel decimation with a factor, dn =20, and a lowpass filter cut-off frequency equal to 4Hz . The parameters, given in Table 8, converge in only 6 steps to values which are very close to those obtained in Table 2, with a full closed-loop bandwidth. TABLE 8: DIDIM, WITH SIMULATED HALF FULL BANDWIDTH , /2 d d f n n ω ω = Parameter 0 ˆ χ 6 ˆ χ χ 2 σˆ rχ %σˆ ZZ1R 1 3.44 0.014 0.2 1 Fv 0 0.02 0.012 15 1 Fc 0 0.86 0.016 1.0 2 ZZ 1 0.060 0.0001 0.1 2 LMX 0 0.124 0.0002 0.1 2 LMY 0 0.007 0.0003 2.0 2 Fv 0 0.01 0.003 10 2 Fc 0 0.13 0.0008 0.3 The relative norm errors on joint position, velocity and acceleration are plotted on Fig. 14, with the same legend as those given for Fig.11, section VI.B. It can be seen that, ( ) ( ) ddm k ddm k ddm k ˆ ˆ ˆ ˆ ˆ ˆ q ( χ ),q ( χ ),q ( χ ) q,q,q & && & &&  , at each iteration k, with a constant norm error larger but close to the value obtained with the full bandwidth, Fig.11, close to, 0.5% for the position, 5%, for the velocity and 10%, for the acceleration. 0 1 2 3 4 5 6 0.005 0.01 0.015 0.02 0.025 Relative errors: Joint 1 Positions 0 1 2 3 4 5 6 0 0.05 0.1 0.15 0.2 Velocities 0 1 2 3 4 5 6 0.1 0.2 0.3 0.4 0.5 Accelerations Iterations 0 1 2 3 4 5 6 0 0.01 0.02 0.03 Relative errors: Joint 2 Positions 0 1 2 3 4 5 6 0 0.05 0.1 0.15 0.2 Velocities 0 1 2 3 4 5 6 0.1 0.2 0.3 0.4 0.5 Accelerations Iterations Fig. 14. • norm error relative to the filtered actual position, velocity, acceleration. * norm error relative to the reference position, velocity, acceleration. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 30 The relative torque norm error, given in Table 9, and plotted in Fig. 15, decreases in 6 steps, only twice more than with the full bandwidth, Table 3, Fig. 12. This shows that DIDIM is not very sensitive to error in the simulated closed-loop bandwidth, provided the control law structure is known. However, DIDIM fails beyond 1/3 of the full bandwidth, with 3 d d f n n / ω ω ≤ . TABLE 9: RELATIVE NORM ERROR OF JOINT TORQUE, j j k j ˆ Y W / Y χ − , FULL BANDWIDTH/2 Iteration k 0 1 2 3 4 5 6 Joint j=1 0.6 0.05 0.06 0.04 0.025 0.02 0.02 Joint j=2 3.0 0.11 0.05 0.02 0.025 0.02 0.02 0 1 2 3 4 5 6 0 0.2 0.4 0.6 0.8 Relative error: Joint torque 1 Iterations 0 1 2 3 4 5 6 0 1 2 3 4 Relative error: Joint torque 2 Iterations Fig. 15. DIDIM, convergence of the joint torque error, j j k j ˆ Y W / Y χ − VII. CONCLUSION This paper deals with a new off-line identification technique of robot dynamic parameters, called DIDIM for Direct and Inverse Dynamic Identification Models technique. This method is a closed-loop Output Error approach, but considering the output is no more the joint position but the joint force/torque. The optimal parameters are the solution of a non-linear least-squares problem which is solved with a Gauss-Newton method. Each step of the iterative procedure of the Gauss-Newton regression is dramatically simplified to a linear regression which is solved with the Inverse Dynamic Identification Model technique (IDIM). Then, DIDIM mixes the closed-loop OE technique and the IDIM technique. DIDIM needs a closed-loop simulation of the robot using the direct dynamic model (DDM) and assuming the same structure of the control law and the same reference trajectory for both the actual and the simulated robot. Then, it needs to initialize the parameters and the state vector of the DDM. The difficulties for the choice of the initial conditions for non-linear LS problem are overcome with a "regular initialization" of the parameters and an updating of the control law gains at each step of the iterative procedure. The initial state is given by the initial values of the reference trajectory. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 31 An experimental validation is carried out on a 2 dof robot. The following points were checked: • DIDIM gives the same results as IDIM, provided well-tuned data filtering for IDIM, adapted to the system dynamics, • DIDIM is robust to the initialization of both parameters and state, • DIDIM is robust to the closed-loop performances tuning errors between the simulated and the actual closed-loop robot, provided the same control law structure. Compared to IDIM, DIDIM technique is particularly attractive thanks to the following reasons: • It needs only the actuator force/torque measurement or estimation, • It avoids tuning the bandpass filter in the IDIM method by using the integration of the DDM in a closed-loop simulation where the tuning of the bandwidth automatically defines the same frequency range for the dynamics of the actual system and of the model to be identified, • It cancels bias in IDIM due to errors in bandpass filtering data, or no filtering at all, or too low frequency measurement, • It combines the inverse and the direct dynamic model and validates, in the same identification procedure, both models for computed torque control and for simulation. Up to now, the DDM was validated a posteriori in simulation. Future work concerns the validation of DIDIM on a 6 dof industrial robot. VIII. REFERENCES [1] P. Khosla and T. Kanade, “Parameter Identification of Robot Dynamics,” Proc. of 24th IEEE Conference on Decision and Control, Fort-Lauderdale, USA: 1985, pp. 1754-1760. [2] C. Atkeson, C. An, and J. Hollerbach, “Estimation of Inertial Parameters of Manipulator Loads and Links,” Int. J. of Robotics Research, vol. 5, 1986, pp. 101-119. [3] M. Gautier, “Identification of Robot Dynamics,” Proc. of IFAC Symposium on Theory of Robots, Vienne, Austria: 1986, pp. 351-356. [4] H. Kawasaki and K. Nishimura, “Terminal-Link Parameter Estimation and Trajectory Contro,” IEEE Transactions on Robotics and Automation, vol. 4, Oct. 1988, pp. 485-490. [5] I. Ha, Ko, and Kwon, “An efficient estimation algorithm for the model parameters of robotic manipulators,” IEEE Trans. on Robotics and Automation, vol. 5, 1989, pp. 386-394. [6] B. Raucent, G. Bastin, G. Campion, J. Samin, and P. Willems, “Identification of Barycentric Parameters of Robotic Manipulators from External Measurements,” Automatica, vol. 28, 1992, pp. 1011-1016. [7] M. Prüfer, C. Schmidt, and F. Wahl, “Identification of Robot Dynamics with Differential and Integral Models : a Comparison,” Proc. of IEEE International Conference on Robotics and Automation, San Diego, California, USA: 1994, pp. 340-345. [8] M. Gautier, W. Khalil, and P. Restrepo, “Identification of the Dynamic Parameters of a Closed- Loop Robot,” Proc. of IEEE International Conference on Robotics and Automation, Nagoya, Japan: 1995, pp. 3045-3050. [9] M. Gautier, “Dynamic Identification of Robots with Power Model,” Proc. of IEEE International This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 32 Conference on Robotics and Automation, Albuquerque, USA: 1997, pp. 1922-1927. [10] J. Swevers, C. Ganseman, D. Tukel, J. DeSchutter, and H. VanBrussel, “Optimal Robot Excitation and Identification,” IEEE Trans. on Robotics and Automation, vol. 13, Oct. 1997, pp. 730-740. [11] J. Swevers, W. Verdonck, and J. De Schutter, “Dynamic model identification for industrial robots - Integrated experiment design and parameter estimation,” IEEE control systems magazine, vol. 27, 2007, pp. 58-71. [12] M. Gautier and P. Poignet, “Extended Kalman Filtering and Weighted Least-squares Dynamic Identification of Robot,” Control Engineering Practice, vol. 9, 2001, pp. 1361-1372. [13] W. Khalil and E. Dombre, Modeling identification and control of robots, Taylor and Francis, 2002. [14] P. Vandanjon, A. Janot, M. Gautier, and F. Khatounian, “Comparison of two identification techniques: Theory and application,” Proc. of the 4th International Conference on Informatics in Control, Automation and Robotics; SS on Fractional Order Systems (ICINCO), Angers: 2007, pp. 341-347. [15] J. Hollerbach, W. Khalil, and M. Gautier, “Model Identification,” Springer Handbook of Robotics, Springer, 2008. [16] J. Richalet and P. Fiani, “The global approach in identification protocol optimization,” Proc. of the 4th IEEE Conference on Control Applications, 1995, pp. 423-431. [17] E. Walter and L. Pronzato, Identification of parametric models from experimental data, Springer, 1997. [18] E. Tohme, R. Ouvrard, J. Abche, J. Trigeassou, T. Poinot, and G. Mercère, “A Methodology to Enhance the Convergence of Output Error Identification Algorithms,” Proceedings of the European Control Conference, Kos, Greece: 2007, pp. 5721-5728. [19] F. Khatounian, S. Moreau, E. Monmasson, A. Janot, and F. Louveau, “Parameters Estimation of the Actuator Used in Haptic Interfaces: Comparison of two Identification Methods,” Proc. of IEEE International Symposium on Industrial Electronics, Montreal (Canada): 2006, pp. 211-216. [20] M. Gautier, A. Janot, and P. Vandanjon, “DIDIM: A new method for the dynamic identification of robots from only torque data,” Proc. of IEEE International Conference on Robotics and Automation, Pasadena, California, USA: 2008, pp. 2122-2127. [21] R. Featherstone and D. Orin, “Dynamics,” Springer Handbook of Robotics, Springer, 2008. [22] M. Gautier and W. Khalil, “Direct calculation of minimum set of inertial parameters of serial robots,” IEEE Transactions on Robotics and Automation, vol. 6, Jun. 1990, pp. 368-372. [23] M. Gautier, “Numerical calculation of the base inertial parameters,” Journal of Robotics Systems, vol. 8, 1991, pp. 485-506. [24] P. Restrepo and M. Gautier, “Calibration of drive chain of robot joints,” Proc. of 4th IEEE Conf. on Control Applications, Albany, NY, USA: 1995, pp. 526-531. [25] M. Gautier and W. Khalil, “Exciting trajectories for the identification of the inertial parameters of robots,” International Journal of Robotics Research, vol. 11, Aug. 1992, pp. 362-375. [26] H. Garnier and L. Wang (Eds), Identification of Continuous-time Models from Sampled Data, Springer, 2008. [27] F. Carrillo, A. Baysse, and A. Habbadi, “Output error identification algorithms for continuous-time systems operating in closed-loop,” Preprints of the 15th IFAC Symposium on System Identification, Saint-Malo (France): 2009, pp. 408-413. [28] I. Landau, B. Anderson, and F. De Bruyne, “Closed-Loop Output Error Identification Algorithms for Nonlinear Plants,” Proceedings of the 38th IEEE Conference on Decision and Control, Phoenix, Arizona, USA: 1999, pp. 606-611. [29] R. Davidson and J. Mackinnon, Estimation and Inference in Econometrics, Oxford University Press Inc, 1993. [30] W. Khalil and J. Kleinfinger, “A new geometric notation for open and closed loop robots,” Proc. of This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible p 33 IEEE International Conference On Robotics And Automation, San Francisco, CA, USA: 1986, pp. 1147-1180. [31] W. Khalil, M. Gautier, and P. Lemoine, “Identification of the payload inertial parameters of industrial manipulators,” Proc. 2007 IEEE International Conference on Robotics and Automation, Roma, Italy: 2007, pp. 4943-4948. Maxime Gautier received the “Doctorat d’Etat” degree in robotics and control engineering from the University of Nantes, Nantes, France, in 1990. Since 1991, he has been a Professor in automatic control with the Université de Nantes. He is carrying out his research with the Robotics Team, “Institut de Recherche en Communications et Cybernétique de Nantes” (IRCCyN). His research topics include modeling, identification, and control of robots. Alexandre Janot was born in France on July 22, 1980. He received the B.Sc. and the M. Sc. in electrical and automation engineering from the University of Nantes and Ecole Centrale de Nantes, France, in 2002 and 2004, respectively. He received the Ph. D. degree from both the University of Nantes and the French Atomic Energy Commission (CEA) in 2007. He joined HAPTION S.A. in 2008, and is now working on the identification and the control of haptic devices. Pierre-Olivier Vandanjon was born in France on November the 4th, 1969. He received the B. Sc and the M. Sc in applied mathematics from the University Paris-Dauphine. He graduated, as a civil engineer, from Ecole des Mines de Paris. He received the Ph. D degree from both the Ecole des Mines de Paris and the French Atomic Energy Commission (CEA) in 1995. He joined the Laboratoire Central des Ponts et Chaussées (LCPC) in 1997, as a junior researcher. He is providing robotics techniques on projects concerning road construction and maintenance.