arXiv:1404.3221v1 [cs.SY] 11 Apr 2014 1 UAV Circumnavigating an Unknown Target Under a GPS-denied Environment with Range-only Measurements Yongcan Cao Abstract—One typical application of unmanned aerial vehicles is the intelligence, surveillance, and reconnais- sance mission, where the objective is to improve situation awareness through information acquisition. For examples, an efficient way to gather information regarding a target is to deploy UAV in such a way that it orbits around this target at a desired distance. Such a UAV motion is called circumnavigation. The objective of the paper is to design a UAV control algorithm such that this circumnavigation mission is achieved under a GPS-denied environment using range-only measurement. The control algorithm is constructed in two steps. The first step is to design a UAV control algorithm by assuming the availability of both range and range rate measurements, where the associated control input is always bounded. The second step is to further eliminate the use of range rate measurement by using an estimated range rate, obtained via a sliding-mode estimator using range measurement, to replace actual range rate measurement. Such a controller design technique is applicable in the control design of other UAV navigation and control missions under a GPS-denied environment. Index Terms—UAV, Autonomy, Joint estimation and control, Sliding-mode estimator, GPS-denied environment I. INTRODUCTION As Unmanned Aerial Vehicles (UAVs) gain more and more favor in both military and civilian applications due to its advantages over manned aircraft, it is now a demanding technology that UAVs can be used to accomplish missions with minimum human supervision. In many cases, complete autonomy is often desired in UAV operations unless human supervision is necessary. Technologies are thus needed to increase autonomy for UAV operations [1]. The need of autonomy for UAV operations comes from two perspectives: performance and cost. From The author is with the Control Science Center of Excel- lence, Aerospace Systems Directorate, Air Force Research Lab- oratory, Wright-Patterson AFB, OH 45433, USA (email: yong- can.cao@gmail.com). Approved for public release; distribution un- limited, 88ABW-2014-0095. the performance’s point of view, UAV with autonomy capability is more reliable than manned aircraft. Human operators are one of the most common sources of errors in complex systems. In many situations, human operators are more likely to make mistakes. It is also well-known that efficiency of human operators is expected to de- crease after a certain period of time. All these drawbacks can be addressed if autonomy becomes available to replace human operators. From the cost’s point of view, UAV with autonomy is less expensive than manned aircraft as the cost of training and/or replacing a pilot is high. Another factor of cost is that manned aircraft are generally larger in size and more complex in capabilities than UAVs due to the existence of human operators onboard. Due to FAA regulations, UAVs are now mainly used in military and security applications, such as border pa- trol [2], mapping [3], and surveillance [4]. For instance, UAVs can be used to gather information of a target by orbiting around it at some desired distance. Such a UAV mission is often called circumnavigation. If GPS is available, both target location and UAV location can be obtained and this circumnavigation mission can be accomplished by using existing orbiting algorithms [5]. However, the vulnerability of UAVs to GPS jamming and spoofing poses a significant threat to the safety of UAV operation. Recent tests confirm that GPS can be denied due to jamming [6] or spoofing [7]. In military operations, GPS is more often jammed due to contested environment under which UAVs are operated. Refs. [8]– [12] were devoted to achieve circumnavigation under a GPS-denied environment. A localization-and-control framework was first proposed in [8], [9] to solve the circumnavigation problem. The main idea is to first estimate the location of the target based on the location of the UAV under some local coordinate frame and then design controllers based on estimated target loca- tion and additional bearing/range measurement. Under a GPS-denied environment, the location of the UAV can be tracked down using inertial sensors at the cost of integration drift. An increasing drift means perfor- 2 mance degradation. It is thus desired that the location of the UAV is not used in the controller design. A different control strategy was developed in [10], [11] where both range and range rate measurements are used to design control algorithms. Rigorous analysis was provided in [10], [11] to demonstrate the efficacy of the proposed control law when the UAV does not stay close to the target initially. Due to the nature of local stability, it is necessary to design a controller that guarantees global convergence. To overcome the local stability issue, a “aiming” controller using both range and range rate measurements was developed in [12]. The aiming controller is to control the heading of the UAV such that the UAV moves towards a well-designed circle. Global stability was shown in [12] regardless of the initial state of the UAV. Although range measurement is possible under a GPS-denied environment [13], the need of range rate measurement in the controller design in [12] is impractical as this measurement is typical unavailable or otherwise suffers from large errors. To further remove range rate measurement required in the controller design, the objective of this paper is to develop a new controller using range-only measurement such that circumnavigation is achieved under a GPS-denied environment by expanding on the preliminary work reported in [14]. The objective of the paper is fulfilled via a two-step analysis. First, a control algorithm based on range and range rate measurements is proposed to accomplish the circumnavigation mission. One promising feature of the control algorithm is that the associated control input is always bounded. Second, a sliding-mode estimator using range measurement is designed to accurately estimate range rate in finite time when applying the proposed control algorithm with range rate measurement being replaced by the estimated value. The design of this control algorithm is motivated by the study on sliding- mode observer in [15]–[17], where numerous velocity observers were designed based on position information. By combining the two steps, the circumnavigation mis- sion can be accomplished using the proposed control al- gorithm when actual range rate measurement is replaced by its estimated value obtained from the sliding-mode estimator. To our best knowledge, this is the first paper that solves the circumnavigation problem using range- only measurement. The rest of the paper is structured as follows. Sec- tion II introduces the circumnavigation problem. In Section III, a control algorithm based on range and range rate measurements is proposed to solve the cir- cumnavigation problem, where the associated control input is always bounded. It is shown that UAV can always circumnavigate the target at the desired radius regardless of its initial state. In Section IV, range rate measurement used in the control algorithm in Section III is replaced by its estimated value obtained via a sliding- mode estimator. Such a replacement is valid because the estimated value and the actual value become identical after a finite period of time. In Section V, two illustrative simulation examples are provided as a proof of concept. Finally, Section VI is given to conclude the paper. II. PROBLEM STATEMENT Circumnavigation concerns with the behavior that a UAV orbits around an unknown target at some desired distance. For instance, in Fig. 1, let T denote the un- known target whose location is [xT , yT ]T and the blue triangle denote the UAV. The objective is to have the UAV orbit around T at some desired distance rd. In other words, the desired UAV trajectory is the black solid circle regardless of a counter clockwise or clockwise motion. Assuming that UAV can maintain its altitude, we consider the following UAV dynamics given by ˙x = V cos(ψ) ˙y = V sin(ψ) (1) ˙ψ = ω, where [x, y]T denotes the 2D location of the UAV, ψ denotes the heading of the UAV, ω is the control input to be designed, and V is the (constant) veloc- ity of the UAV. Although this is a simplified model, it serves as a good approximation of practical UAV dynamics. Let the range measurement be denoted by r △= p (x −xT )2 + (y −yT )2. The objective is to design control input ω such that r(t) →rd as t →∞. Such a motion that UAV orbits around a target at a fixed radius is referred to as a stable circular motion, which is defined as follows. Definition 2.1: A stable circular motion refers to the behavior that the UAV, with dynamics (1), moves around a target with a constant speed and a constant radius. Notice that (1) characterizes the relationship between the controller ω and the state of the UAV [x, y, ψ]. Since the objective is to control r by designing an appropriate controller ω, it is thus desirable that (1) can be rewritten into a new form, in which the relationship between ω and r can be explicitly identified. Clearly, r needs to be a state variable in the new form. Another variable used in the new form is the bearing angle, defined as follows. Definition 2.2: Denote the reference vector as the vector from the current location of the UAV to T. The bearing angle θ(t) ∈[0, 2π) at time t is defined as the 3 rd ra T r(t) θ V Fig. 1. An illustrative example of some variables used in this paper. The blue triangle denotes the UAV. T denotes the target. The blue arrow denotes the heading of the UAV. r(t) denotes the range between the UAV and the target. V denotes the (constant) velocity of the UAV. θ denotes the bearing angle. rd denotes the desired radius. ra is some design parameter in (3). angle from the reference vector to the current heading of the UAV measured counterclockwise. As seen in Fig. 1, if the bearing angle is π/2 or 3π/2, the heading of UAV is perpendicular to the vector from the UAV to the target. Physically, this means that the UAV cannot get closer to or farther away from the target. Indeed, if r and θ are chosen as the state variables, the dynamics (1) can be rewritten as ˙r = −V cos(θ) ˙θ = ω + V sin(θ) r . (2) The first equation in (2), illustrating the relationship between the bearing angle and the rate of r, matches the analyzed physical property. To distinguish (1) and (2), we call (1) “Cartesian dynamics” and (2) “Polar dynamics”. Considering the objective of the paper, i.e., to design ω based on r such that r(t) →rd as t →∞, (2) is used when referring to UAV dynamics although the two dynamics (Cartesian dynamics and Polar dynamics) are physically equivalent. As can be noticed from (2), the rate of r is controlled by the bearing angle θ, which can then be controlled by ω. Intuitively, it is possible to design a controller ω to meet the desired property. However, it remains unclear if such a controller exists when only range measurement is available. III. A CONTROLLER BASED ON RANGE AND RANGE RATE MEASUREMENTS In this section, a control algorithm based on range and range rate measurements is proposed to accomplish the circumnavigation mission. We show that the proposed control algorithm is able to guarantee globally asymp- totic convergence and the equilibrium is exponentially stable. Globally asymptotic convergence means that the desired orbit motion is always guaranteed for an arbi- trary initial state. System with exponentially convergence property has the benefit of improved robustness against disturbances. Following the idea behind the control algorithm design in [12], a new control algorithm is proposed as ω =  k[V cos(π −sin−1( ra r(t))) −˙r(t)], r(t) ≥ra, 0, otherwise, (3) where k is a constant gain and ra is a parameter to be determined later. Notice that (3) is a switching control law. To understand how the controller works, let’s refer to Fig. 1 for an illustrative situation. When r(t) ≥ra, i.e., the UAV is on or outside the red dashed circle, the control input is determined by the difference between V cos(π−sin−1( ra r(t))) and ˙r(t). One may consider V cos(π −sin−1( ra r(t))) as a reference for ˙r(t) to track. In fact, V cos(π −sin−1( ra r(t))) refers to the change rate of r(t) when the UAV moves towards one of the two tangent points on the red dashed circle. It should be emphasized that the two tangent points are not static as the UAV moves with a nonzero velocity. When r(t) < ra, i.e., the UAV is inside the red dashed circle, the control input is zero, meaning that the UAV will move forward along its current heading. Because the UAV has a nonzero (constant) velocity, it takes a finite period of time before it exits the red dashed circle. Zero control when r(t) < ra is used to drive the UAV outside the undesirable zone while the feedback control k[V cos(π −sin−1( ra r(t))) −˙r(t)] when r(t) ≥ra is to drive the UAV in such a way that the desired stable circular motion is reached eventually. For sake of conciseness, we adopt the following defi- nition even if it is a slight abuse of notation. Definition 3.1: The circle centered at T with a radius ra is defined as Ca. The UAV is inside (resp., outside) Ca if r(t) < ra (resp., r(t) ≥ra). According to Definition 3.1, the red dashed circle is Ca in Fig. 1. As mentioned earlier, the radius of Ca, i.e., ra, is to be designed. Before choosing a proper ra, let’s first analyze the radius of the stable circular motion under the proposed control algorithm (3) assuming that such a stable circular motion does exist. Lemma 3.1: Consider system dynamics (1) subject to control input (3). If a stable circular motion exists, the radius is given by r⋆= q r2a + 1 k2 . In addition, the UAV rotates clockwise (resp., counter clockwise) when k > 0 (resp. k < 0). Proof: When a stable circular motion exists, let the radius of the stable circular motion be given by r⋆. Then 4 the magnitude of the nominal angular velocity is given by V r⋆. Because the angular velocity of the UAV is equal to the control input (3), the magnitude of the nominal angular velocity is also equal to |ω|. For a stable circular motion, θ = π 2 or θ = 3π 2 , which implies that (i) The UAV cannot be inside Ca because otherwise the UAV moves along a straight line; (ii) ˙r = 0 based on (2). Then |ω| becomes kV cos(π −sin−1(ra r⋆)) for the stable circular motion. It then follows that V r⋆ and kV cos(π −sin−1(ra r⋆)) should be identical, which hap- pens if and only if r⋆= q r2a + 1 k2. When k > 0 and r(t) = r⋆, it can be computed that ω < 0, indicating that the UAV rotates clockwise. When k < 0, one can obtain that ω > 0, indicating that the UAV rotates counter clockwise. Lemma 3.1 illustrates the relationship between the radius of the stable circular motion, if existing, and the parameter ra in (3). By choosing ra = r r2 d −1 k2 , (4) one can obtain that r⋆= rd according to Lemma 3.1. One may observe that ra is strictly smaller than rd, meaning that the UAV should aim towards a circle with a smaller radius in order to establish the desired circular motion. This is due to the nonlinear dynamics (2). For a nonnegative ra, one can obtain that k ≥1 rd . The validity of Lemma 3.1 is based on the assumption that a stable circular motion does exist. It remains an open question whether this assumption is true. Our effort next is to show that the assumption is indeed true. For the simplicity of presentation, we only consider the case when k > 0. A similar analysis can be applied to the case when k < 0. Our focus next is to show that a stable circular motion is guaranteed using the control algorithm (3) with k > 0. As shown in Fig. 2, we arrange the proof based on three different phases. Phase 1 is to show that UAV moves inside Ca at most once. The detailed analysis is shown in Lemma 3.3 based on Lemma 3.2. Phase 2 is to show that θ always stays in the set [0, π] after a finite period of time. The detailed analysis is shown in Lemma 3.4. Phase 3 is to show that [r, θ] converges to [rd, π 2 ] asymptotically. The detailed analysis is shown in Theorem 3.2. Following this logic, we next present these three lemmas as well as the main theorem. The proofs of these lemmas and theorem are similar to those in [12] with some variations. To make the paper self- contained, complete proofs of these lemmas and theorem are provided next. UAV moves inside Ca at most once (shown in Lemma 3.3) Lemma 3.2 θ always stays in the set [0, π] after a finite period of time (shown in Lemma 3.4) [r, θ] converges to [rd, π 2] asymp- totically (shown in Theorem 3.2) Fig. 2. Three phases to prove that a stable circular motion exists using the control algorithm (3). Lemma 3.2: Consider the UAV dynamics in (1) sub- ject to the control policy in (3). Let there be t0 ≥0 such that r(t0) ≥ra and θ(t0) ∈(sin−1( ra r(t0)), 2π − sin−1( ra r(t0))), then r(t) ≥ra, ∀t ≥t0. Proof: The proof of the lemma can be divided into the following two steps: Step 1: θ(t) ∈[sin−1( ra r(t)), 2π −sin−1( ra r(t))) holds for all t ≥t0. Based on the control algorithm (3), ω < 0 at t = t0 since ˙r(t) > V cos(π −sin−1( ra r(t))) when t = t0. As a consequence, the UAV will rotate clockwise initially. Note that both ˙r(t) and V cos(π − sin−1( ra r(t))) are continuous with respect to t. Therefore, ˙r(t) > V cos(π −sin−1( ra r(t))) always holds before ˙r(t) = V cos(π −sin−1( ra r(t))) happens. Consequently, the UAV will stop rotating clockwise once ˙r(t) = V cos(π −sin−1( ra r(t))). Indeed, ˙r(t) = V cos(π − sin−1( ra r(t))) if and only if θ(t) = sin−1( ra r(t)) or θ(t) = 2π −sin−1( ra r(t)). Thus, θ(t) ∈[sin−1( ra r(t)), 2π − sin−1( ra r(t))]. To prove Step 1, it suffices to show that θ(t) = 2π −sin−1( ra r(t)) cannot hold. When θ(t0) ∈ (sin−1( ra r(t0)), 2π −sin−1( ra r(t0))) and ω = 0, θ(t) = 2π −sin−1( ra r(t)) never holds for all t ≥t0 because the UAV moves along a straight line and the straight line is always outside the circle Ca. As a consequence, θ(t) = 2π −sin−1( ra r(t)) never holds for all t ≥t0 when θ(t0) ∈[sin−1( ra r(t0)), 2π −sin−1( ra r(t0))) and ω ≤0. Combining the previous arguments completes the proof of Step 1. Step 2: r(t) ≥ra for all t ≥t0. When r(t) = ra, it follows that sin−1( ra r(t)) = π 2 . From Step 1, it is known that θ(t) ∈[sin−1( ra r(t)), 2π −sin−1( ra r(t))). This means that θ(t) ∈(π 2 , 3π 2 ] at the time when r(t) = ra. By recalling the second equation in (2), one can obtain that ˙r ≥0 at the time when r(t) = ra. This implies that the UAV cannot get any closer to the target if r(t) = 5 ra. At the time t⋆when r(t⋆) becomes larger than ra, the bearing angle has to be in the set (π 2 , 3π 2 ), which satisfies the condition that θ(t⋆) ∈[sin−1( ra r(t⋆)), 2π − sin−1( ra r(t⋆))). By repeating the analysis in Steps 1 and 2, it is clear that the UAV will never move inside Ca. Lemma 3.3: Consider the UAV dynamics in (1) sub- ject to the control policy in (3). The UAV can only move inside Ca at most once. Proof: When the UAV enters Ca at some time te, i.e., r(te) = ra, θ(te) ∈[0, π 2 ) ∪(3π 2 , 2π) holds true in order to guarantee that the UAV enters Ca. Recall that no control input is imposed on the UAV when it is inside Ca. Then at the time tx when it exits Ca, θ(tx) =  π −θ(te), θ(te) ∈[0, π 2 ), 3π −θ(te), θ(te) ∈(3π 2 , 2π), because the UAV moves along a straight line due to the fact ω = 0. As a consequence, θ(tx) ∈(π 2 , 3π 2 ). When r(tx) = ra, it follows that sin−1( ra r(tx)) = π 2 . Therefore, θ(tx) ∈(sin−1( ra r(tx)), 2π −sin−1( ra r(tx))) when r(tx) = ra. By considering the current time tx be the time t0 in Lemma 3.2, it follows from Lemma 3.2 that the UAV will never move inside Ca again. Therefore, the UAV can only move inside Ca at most once. Lemma 3.4: Consider the UAV dynamics in (1) sub- ject to the control policy in (3). For any θ(0), there exists t⋆≥0 such that θ(t) ∈[0, π] for any t ≥t⋆. Proof: By Lemma 3.3, the UAV can move inside Ca at most once. When the UAV never moves inside Ca, let t1 = 0. When the UAV moves inside Ca once, let t1 be the time when the UAV moves from inside Ca to outside Ca. It is clear that t1 is finite. The lemma is proved if the following two statements are valid: (1) For any θ(0), there exists t⋆≥t1 such that θ(t⋆) ∈ [0, π]; and (2) Once θ(t⋆) ∈[0, π] for some t⋆≥t1, θ(t) ∈[0, π] for any t ≥t⋆. The first statement is proved by considering the fol- lowing three cases: (i) θ(t1) ∈ (π, 2π −sin−1( ra r(t1))): From (2), one can obtain that ˙θ(t) < 0, i.e., θ(t) will decrease, whenever θ(t) ∈(π, 2π −sin−1( ra r(t))) because ω < 0 while −V sin(θ(t)) r(t) > 0. When ω is (approx- imately) zero, −V sin(θ(t)) r(t) is (approximately) V ra r2(t). Because both ω and −V sin(θ(t)) r(t) are continuous with respect to t, ˙θ(t) is always upper bounded by some negative constant. Similarly, when −V sin(θ(t)) r(t) is (approximately) zero, ω is upper bounded by some negative constant, indicating that ˙θ(t) is always upper bounded by some negative constant as well. Therefore, θ(t) ≤π in finite time. Noting that θ(t1) ∈(π, 2π−sin−1( ra r(t1))) ∈(sin−1( ra r(t1)), 2π− sin−1( ra r(t1))), it follows from Step 1 in the proof of Lemma 3.2 that θ(t) cannot get smaller than sin−1( ra r(t)), which implies that θ(t) ≥sin−1( ra r(t)). (ii) θ(t1) = 2π −sin−1( ra r(t1)): Under this case, the UAV is heading towards the tangent point such that the bearing is 2π −sin−1( ra r(t1)). By computation, ω = 0, which implies that the UAV will move along a straight line towards the tangent point. As a consequence, the UAV will move towards the tangent point whenever r(t) > ra. Given a constant nonzero velocity, it takes a finite period of time before r(t) = ra happens. Notice that r(t) = ra cannot hold for an arbitrary period of time because otherwise a contradiction happens by noting that (i) ω = 0 based on (3) during that period of time, indicating that the UAV cannot rotate; and (ii) the UAV has to rotate such that r(t) = ra holds for that period of time. For t ≥t1, the UAV cannot move inside Ca, as discussed in the first paragraph of the proof. This implies that r(t) will increase to be greater than ra as soon as r(t) = ra happens. The bearing angle θ(tf) must be in the interval (π 2 , 3π 2 ) at the time when r(t) increases to be greater than ra. When θ(tf) ∈(π, 3π 2 ), it follows from Case (i) that θ(t) will be in the set [0, π] after a finite period of time. When θ(tf) ∈(π 2 , π], it is already in the set [0, π]. (iii) θ(t1) = (2π −sin−1( ra r(t1)), 2π): If θ(t) = (2π − sin−1( ra r(t)), 2π) always holds, it takes a finite period of time before r(t) ≤ra happens. Since the UAV never moves inside Ca for t ≥t1, either Case (i) or Case (ii) will happen after a finite period of time. By following the analysis in Cases (i) and (ii), θ(t) will be in the set [0, π] after a finite period of time. To prove the second statement, it is essential to study ˙θ(t) when θ(t) = 0 or θ(t) = π. Because the UAV is outside Ca, it can be computed that ω = k[V cos(π −sin−1( ra r(t))) + V ] > 0, where the first equation in (2) was used to derive the equality. This indicates that θ(t) will increase as soon as θ(t) = 0 happens. Similarly, when θ(t) = π, one can obtain that ω = k[V cos(π −sin−1( ra r(t))) −V ] < 0, which indicates that θ(t) will decrease as soon as θ(t) = π happens. Therefore the second statement holds as well. 6 It can be noted that the proof of Lemma 3.3 depends on Lemma 3.2 and the proof of Lemma 3.4 depends on Lemma 3.3. With these three lemmas, we now present the main result in this section. Theorem 3.2: Consider the UAV dynamics in (1) sub- ject to the control policy in (3). If k > 1 rd and ra is chosen satisfying (4), then r(t) →rd and θ(t) →π 2 as t →∞. Proof: According to Lemmas 3.3 and 3.4, there ex- ists a time instant t⋆such that r(t) ≥ra and θ(t) ∈[0, π]. Then the control input can be simplified as ω = k[V cos(π −sin−1( ra r(t))) + V cos(θ)], t ≥t⋆. For t ≥t⋆, consider a Lyapunov function candidate given by V = 1 −sin(θ) + ϕ, where ϕ = R r rd( 1 rd − 1 z + k cos sin−1(ra z ) − k cos sin−1(ra rd ))dz ≥0. Because 1 rd −1 z    > 0, z > rd, = 0, z = rd, < 0, z < rd, and k cos sin−1(ra z ) −k cos sin−1(ra rd ) satisfies such a property as well, combining with the property of inte- gration shows that ϕ ≥0, indicating that V ≥0 as 1−sin(θ) ≥0. When ra is chosen satisfying (4), one can obtain that 1 rd = k cos sin−1(ra rd ) by computation. Then ϕ can be simplified as R r rd(−1 z +k cos sin−1(ra z ))dz. Taking derivative of V yields that ˙V = −cos(θ) ˙θ + ˙ϕ = −cos(θ) " kV  −cos sin−1(ra r ) + cos(θ)  + V sin(θ) r # −  k cos sin−1(ra r ) −1 r  V cos(θ) = V cos(θ)  −k cos(θ) −sin(θ) r + 1 r  , where (2) was used to derive the second equality. When θ ∈[π 2 , π], ˙V ≤0 because cos(θ) ≤0 and −k cos(θ) − sin(θ) r + 1 r ≥0. When θ ∈[0, π 2 ), ˙V ≤0 because cos(θ) > 0 and −k cos(θ) −sin(θ) r + 1 r ≤−cos(θ) r −sin(θ) r + 1 r ≤0. Therefore, ˙V ≤0. Note that ˙V is uniformly continuous when r ≥ra and θ ∈[0, π], it follows from Lemma 4.3 in [18] that ˙V →0 as t →∞. When k > 1 rd , ˙V = 0 implies that θ = π 2 . It then follows from (2) that when θ(t) = π 2 , r(t) is constant. Therefore, a stable circular motion does exist. It then follows from the analysis in the paragraph right after Lemma 3.2 that r(t) = rd if ra is chosen satisfying (4). Therefore, θ(t) → π 2 and r(t) →rd as t →∞. Remark 3.3: Although it is assumed that the velocity of the UAV, V , is constant, Lemmas 3.1, 3.2, 3.3, 3.4, and Theorem 3.2 are still valid if V is varying within a set [V , V ], where V and V are two positive constant. In other words, the assumed constant velocity for the UAV is not required as long as the velocity is both lower bounded and upper bounded. Lemma 3.1 also shows that the final stable radius remains unchanged even if the velocity of the UAV changes. In the previous part of this section, the proposed controller (3) was shown to guarantee global asymptotic stability regardless of the initial state. By observation, one can find that the control input associated with (3) is always bounded by 2kV because cos(π −sin−1( rd r(t))) is bounded by 1 and | ˙r| ≤V due to (2). Although this controller (3) uses both range and range rate mea- surements, we will show in the next section that the requirement of range rate measurement can be eliminated thanks to the boundedness of the control input. Before proceeding to the next section, we show that the closed-loop system of (2) subject to (3) is expo- nentially stable at its equilibrium. If the equilibrium is exponentially stable, the associated closed-loop system not only converges, but in fact converges at a rate faster or at least as fast as some know rate near the equilibrium. One benefit of such a system is its robustness against disturbances around the equilibrium. Theorem 3.4: Consider the closed-loop system given by (2) subject to (3). If k > 1 rd and ra is chosen satisfying (4), then [rd, π 2 ]T is a locally exponentially stable equilibrium. Proof: For the UAV dynamics in (2) subject to the control policy in (3), the corresponding closed-loop system can be written as ˙r(t) = −V cos(θ(t)) (5) ˙θ(t) =k[V cos(π −sin−1( ra r(t))) + V cos(θ(t))] + V sin(θ(t)) r(t) , (6) where (5) was substituted into the first equation in (2) to obtain (6). Let’s define f1(r(t), θ(t)) △= −V cos(θ(t)) and f2(r(t), θ(t)) △=k[V cos(π −sin−1( ra r(t))) + V cos(θ(t))] + V sin(θ(t)) r(t) . 7 The linearization of (5) and (6) around the equilibrium [rd, π 2 ]T is given by ˙x(t) = A(t)x(t), (7) where x(t) = [r(t), θ(t)]T and A(t) △= [Aij] ∈R2×2 with A11 = ∂f1(r(t), θ(t)) ∂r(t) |r(t)=rd,θ(t)= π 2 = 0 A12 = ∂f1(r(t), θ(t)) ∂θ(t) |r(t)=rd,θ(t)= π 2 = V A21 =∂f2(r(t), θ(t)) ∂r(t) |r(t)=rd,θ(t)= π 2 = − kV r2 a r3 d q 1 −r2 a r2 d −V r2 d and A22 = ∂f2(r(t), θ(t)) ∂θ(t) |r(t)=rd,θ(t)= π 2 = −kV. Then the eigenvalues of A(t) are given by −kV ± s (kV )2 −V ( kV r2 a r3 d r 1− r2a r2 d + V r2 d ) 2 . Clearly, A(t) is Hurwitz. It then follows from [19] that the equilibrium [rd, π 2 ] is exponentially stable. IV. A REVISED CONTROLLER BASED ON RANGE MEASUREMENT AND ESTIMATED RANGE RATE In Section III, a controller based on range and range rate measurements was presented to guarantee the de- sired circular motion for the UAV. Direct range rate measurement is typically unavailable or suffers from significant uncertainties. The purpose of this section is to remove range rate measurement needed in the control algorithm (3). In particular, an estimated range rate, obtained via a sliding-mode estimator using range mea- surement, is used to replace the range rate measurement used in (3). Here control algorithm (3) is revised as bω =  k[V cos(π −sin−1( ra r(t))) −ˆx2], r(t) ≥ra, 0, otherwise, (8) where ˆx2 is an estimate of ˙r. In particular, ˆx2 is obtained via the following sliding-mode estimator as ˙ˆx1 = ˆx2 + k1 |r −ˆx1| 1 2 sgn(r −ˆx1), ˙ˆx2 = k2sgn(r −ˆx1) + k3(r −ˆx1), (9) if r ≥ra and ˙ˆx1 = 0, ˆx1(tx) = 2rd −ˆx1(te) ˙ˆx2 = 0, ˆx2(tx) = −ˆx2(te), (10) if r < ra, where sgn(·) is the sign function, te is the time when UAV moves inside Ca from outside Ca1 and tx denotes the first subsequent time when the UAV moves outside Ca, and ki, i = 1, 2, 3, are positive constants. Because the UAV could move inside and outside Ca multiple times, multiple te and tx may be expected. In fact, the number of te and the number of tx should be exactly the same since the UAV cannot stabilize inside Ca due to the zero control applied when the UAV is inside Ca. Here ˆx1 can be considered an estimate of r. Briefly speaking, the main idea behind the estimator is that (i) ˆx1 and ˆx2 satisfy (9) if the UAV is outside Ca; (ii) ˆx1 and ˆx2 remain unchanged if the UAV is inside Ca; and (iii) once the UAV moves outside Ca, ˆx2 is reset as its negate while ˆx1 is reset as 2rd plus its negate. As shown in the proof of the following Theorem 4.3, the reset of ˆx1 and ˆx2 at the time when the UAV moves from inside Ca to outside Ca is crucial in establishing a finite-time convergence of (ˆx1, ˆx2) to (r, ˙r). Remark 4.1: Instead of using actual range rate in con- troller (3), estimated range rate is used in controller (8). Because the designed sliding mode estimator guarantees that the estimation error converges to zero in finite time, the wellknown separation principle can be applied in the controller design. That is, the design of range rate estimator and the design of a feedback controller based on estimated range rate can be decoupled into two separated problems. Before presenting the main result in this section, the following lemma is needed. Lemma 4.1: Consider the differential equation given by ˙p = q −k1 |p| 1 2 sgn(p), ˙q = −k2sgn(p) −k3p + f(t, p, q), (11) where |f(t, p, q)| < δ1 + δ2 |q| with δi > 0, i = 1, 2. If k1 > 0, k2 > max{1 + δ2 1 k1 , 1 2δ2 2 + 2δ2}, and k3 > 0, (p, q) approaches (0, 0) in finite time. Proof: Let ξ △= [|p| 1 2 sgn(p), p, q]T and consider the following Lyapunov function candidate given by V = ξT Pξ, (12) where P = 1 2   4k2 + k2 1 0 −k1 0 2k3 0 −k1 0 2  . 1If the UAV is initially inside Ca, the initial time is one te. 8 Note that V(x) is positive-definite and is differentiable almost everywhere except at p = 0. When p ̸= 0, the derivative of V is given by ˙V = ξT P ˙ξ. Notice that the derivative of ξ is given by [1 2 |p|−1 2 ˙p, ˙p, ˙q]T . By recalling (11), ˙V can be rewritten as ˙V = −|p|−1 2 ξT Q1ξ −ξT Q2ξ −k1f(t, p, q) |p| 1 2 sgn(p) + 2qf(t, p, q), (13) where Q1 = k1 2   2k2 + k2 1 0 −k1 0 2k3 0 −k1 0 1   and Q2 = k2   k2 + 2k2 1 0 0 0 k4 0 0 0 1  . Because |f(t, p, q)| ≤δ1 + δ2q under the assumption of the lemma, it can be further obtained that k1f(t, p, q) |p| 1 2 sgn(p) ≤δ1k1 |p| 1 2 + k1δ2 |q| |p| 1 2 =δ1k1 |p|−1 2 [|p| 1 2 sgn(p)]2 + 1 2{δ2 2q2 + k2 1[|p| 1 2 sgn(p)]2} and |2qf(t, p, q)| ≤2δ1 |q| + 2δ2q2 =2δ1 |p|−1 2 |p| 1 2 sgn(p) |q| + 2δ2q2 ≤δ1 |p|−1 2 {[|p| 1 2 sgn(p)]2 + q2} + 2δ2q2. Then it follows from (13) that ˙V ≤−|p|−1 2 ξT Q1ξ −ξT Q2ξ + |p|−1 2 ξT Q3ξ + ξT Q4ξ, where Q3 =   δ1(1 + k1) 0 0 0 0 0 0 0 δ1   and Q4 =   1 2k2 1 0 0 0 0 0 0 0 1 2δ2 2 + 2δ2  . When ki, i = 1, 2, 3, satisfy the conditions in the lemma, Q1 −Q3 and Q2 −Q4 are positive definite. It then can be obtained that ˙V ≤−|p|−1 2 ξT (Q1 −Q3)ξ ≤−∥ξ∥−1 2 ξT (Q1 −Q3)ξ ≤−∥ξ∥−1 2 λmin(Q1 −Q3) ∥ξ∥2 = −λmin(Q1 −Q3) ∥ξ∥, where λmin(·) denotes the minimum eigenvalue of a symmetric positive-definite matrix, |p| ≤∥ξ∥is used to derive the second inequality, ξT (Q1 −Q3)ξ ≥ λmin(Q1−Q3) ∥ξ∥2 is used to derive the third inequality. Because V ≤λmax(P) ∥ξ∥2, where λmax(·) denotes the maximum eigenvalue of a symmetric positive-definite matrix, it follows that ∥ξ∥≥ 1 √ λmax(P )V 1 2 . It can then be obtained that ˙V ≤−λmin(Q1 −Q3) p λmax(P) V 1 2 . After some manipulation, one can get that 2 p V(t) ≤2 p V(0) −λmin(Q1 −Q3) p λmax(P) t. (14) Because V is differentiable almost everywhere except at p = 0, (14) is valid almost everywhere except at p = 0. Note that V = 0 if and only if p = 0 and q = 0. Therefore, V(t) →0 in finite time, which implies that (p, q) approaches (0, 0) in finite time. Remark 4.2: In [16], finite-time convergence of (11) with f(t, p, q) ≤δ1 + δ2 |p| was studied. Lemma 4.1 further analyzed the case when f(t, p, q) ≤δ1 + δ2 |q|. With Lemma 4.1, we are now ready to present the main result in this section. Theorem 4.3: Consider the UAV dynamics in (1) sub- ject to the control policy in (8). If k > 1 ra , k1 > 0, k2 > max{1 + V 4(2k+ 1 ra )2 k1 , 1 2k2V 2 + 2kV } and k3 > 0, then r(t) →rd and θ(t) → π 2 as t →∞, where ra = q r2 d −1 k2 . Proof: Notice from Theorem 3.2 that r(t) →rd and θ(t) →π 2 as t →∞by using (3), where ra = q r2 d −1 k2 . Then it is sufficient to prove this theorem if (8) and (3) become identical after a finite period of time because one can simply consider the initial time be the time after which (8) and (3) are always identical. Therefore, our focus next is to show (8) and (3) become identical in finite time under the condition of the theorem. Define x1 △= r and x2 △= ˙r. By recalling the polar dynamics in (2) and the control policy (8), the derivatives of x1 and x2 are given by ˙x1 = x2, ˙x2 = V sin(θ)  bω + V sin(θ) x1  . (15) Let p △= x1 −ˆx1 and q △= x2 −ˆx2. When r ≥ra, i.e., 9 x1 ≥ra, it follows from (15) and (9) that ˙p =q −k1 |p| 1 2 sgn(p), ˙q =V sin(θ)  bω + V sin(θ) x1  −k2sgn(p) −k3p = V sin(θ)  −kV cos sin−1(ra r ) −kx2 −kq + V sin(θ) x1  | {z } f(t,p,q) −k2sgn(p) −k3p. Because x2 = ˙r = −V cos(θ), one can obtain that |f(t, p, q)| ≤V (2kV + k |q| + V ra ). By letting δ1 = V 2(2k + 1 ra ) and δ2 = kV , |f(t, p, q)| ≤ δ1 + δ2 |q|. The property of f(t, p, q) in Lemma 4.1 is fulfilled under the condition of the theorem. By considering the Lyapunov function V defined in (12), it follows from the analysis in Lemma 4.1 that ˙V ≤−ηV 1 2 (16) holds almost everywhere for some positive η that is determined by k, k1, k2, and k3 under the condition of the theorem. Now let’s consider the case when r < ra, i.e., x1 < ra. Because bω = 0, function f(t, p, q) becomes [V sin(θ)]2 x1 , which is not necessarily bounded since x1 might ap- proach zero. The property |f(t, p, q)| ≤δ1 + δ2 |q| for some positive δ1 and δ2 is not necessarily satisfied. To overcome this issue, let’s drop the time when the UAV is inside Ca. Because zero control input is imposed when the UAV is inside Ca and V > 0, it takes a finite period of time before the UAV moves outside Ca. Without loss of generality, let te be the time when the UAV moves inside Ca and tx be the first subsequent time when the UAV moves outside Ca. Clearly, tx −te is bounded (by 2ra V ). By only considering the case when the UAV is outside Ca, the time interval [te, tx) is dropped. One consequence is that the state (p, q) might be different at time instants te and tx. The change of (p, q) at the two time instants could result in a jump of V from te to tx. If such a jump exists, it is desired that V becomes smaller or at least remains unchanged. Then the Lyapunov function still satisfies (16) almost everywhere and jumps to be smaller at some distinct instants if the time interval [te, tx) is dropped. When the UAV is initially outside Ca, let the initial time be unchanged. When the UAV is initially inside Cd, let the initial time be the time when the UAV moves outside Cd for the first time. Notice that V in (12) can be written as V =2k2 |p| + k3p2 + 1 2q2 + 1 2 h k1 |p| 1 2 sgn(p) −q i2 . (17) When p(tx) = 2rd −ˆx1(te) and q(tx) = −ˆx2(te), it can be computed that p(tx) =x1(tx) −ˆx1(tx) =x1(tx) −[2rd −ˆx1(te)] =rd −[2rd −ˆx1(te)] =ˆx1(te) −rd = −p(te). Recall from (1) that ˙r = −V cos(θ) and from the proof of Lemma 3.3 that θ(tx) = π −θ(te) or θ(tx) = 3π − θ(te). One can obtain that ˙r(tx) = −˙r(te). Equivalently, x2(tx) = −x2(te). It thus follows that q(tx) =x2(tx) −ˆx2(tx) = −x2(te) −[−ˆx2(tx)] = −q(te). From (17), it can be observed that V remains unchanged if both p and q are negated. Thus, V remains unchanged when the state (p(te), q(te)) changes to (p(tx), q(tx)) under the proposed control algorithm (8). If the finite time interval [te, tx) is dropped, V is continuous and satisfies (16) almost everywhere. By following the anal- ysis in Lemma 4.1, V approaches zero in finite time if [te, tx) is excluded. This implies that ˆx1 −r and ˆx2 −˙r go to zero in finite time if [te, tx) is excluded. In other words, there exists a time instant t⋆such that r(t⋆) ≥ra, ˆx1(t) −r(t) = 0, and ˆx2(t) −˙r(t) = 0 for any t ≥t⋆if [te, tx) is excluded. That is, (8) and (3) are identical for t ≥t⋆when [te, tx) is excluded. For t ∈[te, tx), (8) and (3) are identical because both of them are zero. Therefore, (8) and (3) are always identical for all t ≥t⋆ if we consider t⋆as the initial time. By recalling the analysis in the first paragraph of the proof of the theorem, it can be obtained that r(t) →rd and θ(t) →π 2 as t →∞. Remark 4.4: One unique feature of the proposed con- trol algorithm (8) is that only range measurement is needed in its implementation. Considering the limited sensing capabilities for small UAVs due to their payload restrictions, the control algorithm (8) and the concept behind it are more applicable in small UAV operations under a GPS-denied environment. V. SIMULATION EXAMPLES In this section, two simulation examples are provided to demonstrate the effectiveness of the two control 10 −10 −5 0 5 10 −20 −18 −16 −14 −12 −10 −8 −6 −4 −2 x y Fig. 3. The trajectory of the UAV under (3). The red triangle repre- sents the starting position of the UAV. The dashed circle represents Ca. algorithms proposed in Sections III and IV. For both control algorithms (3) and (8), the parameters are chosen as: rd = 10, [xT , yT ] = [0, −10], k = 0.2, V = 1. The initial state of the UAV is given by [13, −2, 5π/4]. By computation, ra = q r2 d −1 k2 = 8.6603. For control algorithm (8), we further let k1 = 2, k2 = 1.2, k3 = 0.1 and the sliding-mode estimator be initialized as [10, 0]. These parameters are chosen in such a way that those conditions in Theorems 3.2 and 4.3 are satisfied. Figs. 3 and 4 show, respectively, the trajectory of the UAV and the distance between the UAV and the target using control algorithm (3). Both figures show such a fact that the desired distance can be reached eventually. The fact that ra = 8.6603 means that the UAV aims towards a circle centered at the target’s location with radius 8.6603 in order to stabilize at the desired radius 10. Figs. 5, 6, and 7 are the simulation results using control algorithm (8). Fig. 5 shows the trajectory of the UAV under the control algorithm (8), where the blue line represents the actual trajectory. Again, the desired distance between the UAV and the target can be reached eventually. Fig. 6 shows the estimated range ˆx1 and the actual range r under the control algorithm (8). It can be observed that the estimated range approaches the actual range in finite time. Fig. 7 shows the estimated range rate ˆx2 and the actual range rate ˙r under the control algorithm (8). Again, the estimated range rate approaches the actual range rate in finite time if the time interval during which the UAV is inside Cd is excluded. One can also see from Fig. 7 that the reset mechanism in (10) is 0 50 100 150 7 8 9 10 11 12 13 14 15 16 time r Fig. 4. The distance between the UAV and the target under (3), i.e., r(t). −10 −5 0 5 10 15 −20 −15 −10 −5 0 x y Fig. 5. The trajectory of the UAV under (8). The red triangle repre- sents the starting position of the UAV. The dashed circle represents Ca. necessary to guarantee the accurate estimate of ˙r because otherwise the estimated range rate will be significantly different from the actual one at the time when the UAV exits Cd. VI. CONCLUSION AND FUTURE WORK In this paper, we proposed a control algorithm based on range-only measurement such that a UAV can cir- cumnavigate an unknown target at the desired distance under a GPS-denied environment. The design of such a control algorithm can be divided into two steps. First, a control algorithm based on range and range rate mea- surements was proposed to solve the circumnavigation 11 0 50 100 150 4 6 8 10 12 14 16 time estimated range actural range Fig. 6. Estimated range ˆx1 vs actual range r. 0 50 100 150 −1.5 −1 −0.5 0 0.5 1 1.5 2 time estimated range rate actural range rate Fig. 7. Estimated range rate ˆx2 vs actual range rate ˙r. problem. Second, a sliding-mode estimator was designed to replace the range rate measurement used in the control algorithm proposed in the first step. By choosing several parameters carefully, the sliding-mode estimator is able to accurately estimate the range rate in finite time using range measurement. Thus, the circumnavigation mission can be accomplished if the control algorithm in the first step is applied with range rate measurement being replaced by its estimated value obtained in the second step. Because the proposed control algorithm based on range-only measurement requires a minimum number of measurement for implementation, it is very suitable for UAV operations under a GPS-denied environment when limited measurements are available. One future research direction is to study how mea- surement noises and wind affect the proposed control algorithm and test the control algorithm in real flights. Another interesting topic is to consider control input saturation since the allowable control input is normally limited for physical UAVs. ACKNOWLEDGEMENT The author would like to thank Prof. Brian Anderson for his insightful suggestions and comments. REFERENCES [1] W. J. Dahm, “Technology horizons: A vision for air force science & technology during 2010-2030,” USAF, Tech. Rep., 2010. [2] C. Bolkcom, “Homeland security: Unmanned aerial vehicles and border surveillance,” CRS Report for Congress, Tech. Rep., 2004. [3] M. Nagai, T. Chen, R. Shibasaki, H. Kumagai, and A. Ahmed, “UAV-borne 3-D mapping system by multisensor integration,” IEEE Transactions on Geoscience and Remote Sensing, vol. 47, no. 3, pp. 701–708, March 2009. [4] M. Quigley, M. A. Goodrich, S. Griffiths, A. Eldredge, and R. W. Beard, “Target acquisition, localization, and surveil- lance using a fixed-wing mini-UAV and gimbaled camera,” in IEEE International Conference on Robotics and Automation, Barcelona, Spain, April 2005, pp. 2600–2605. [5] R. W. Beard and T. W. McLain, Small Unmanned Aircraft: Theory and Practice. Princeton, NJ: Princeton University Press, 2012. [6] G. Warwick, “Lightsquared tests confirm GPS jamming,” Aviation Week, June 2011. [Online]. Available: http://www.aviationweek.com/aw/generic/story.jsp?id=news/awx/2011/06/09/a [7] D. Shepard, J. Bhatti, and T. Humphreys, “Drone hack: Spoofing attack demonstration on a civilian unmanned aerial vehicle,” GPS World, 2012. [Online]. Available: http://www.gpsworld.com/drone-hack/ [8] I. Shames, S. Dasgupta, B. Fidan, and B. D. O. Anderson, “Circumnavigation using distance measurements under slow drift,” IEEE Transactions on Automatic Control, vol. 57, no. 4, pp. 889–903, 2012. [9] M. Deghat, I. Shames, B. D. O. Anderson, and C. Yu, “Lo- calization and circumnavigation of a slowly moving target using bearing measurements,” IEEE Transactions on Automatic Control, 2014. [10] A. S. Matveev, H. Teimoori, and A. V. Savkin, “The problem of target following based on range-only measurements for car- like robots,” in Joint IEEE Conference on Decision and Control and Chinese Control Conference, Shanghai, China, December 2009, pp. 8537–8542. [11] ——, “Range-only measurements based target following for wheeled mobile robots,” Automatica, vol. 47, no. 1, pp. 177– 184, 2011. [12] Y. Cao, J. Muse, D. Casbeer, and D. Kingston, “Circumnaviga- tion of an unknown target using UAVs with range and range rate measurements,” in Proceedings of the 52nd IEEE Conference on Decision and Control, Firenze, Italy, December 2013, pp. 3617–3622. [13] Z. Sahinoglu and S. Genzici, “Ranging in the IEEE 802.15.4a standard,” in IEEE Wireless and Microwave Technology Con- ference, 2006. [14] Y. Cao, “UAV circumnavigating an unknown target using range measurement and estimated range rate,” in American Control Conference, Portland, OR, June 2014, accepted. 12 [15] J. Davila, L. Fridman, and A. Levant, “Second-order sliding mode observer for mechanical systems,” IEEE Transactions on Automatic Control, vol. 50, no. 11, pp. 1785–1789, November 2005. [16] J. A. Moreno and M. Osorio, “A Lyapunov approach to second- order sliding mode controllers and observers,” in Proceedings of the IEEE Conference on Decision and Control, Cancun, Mexico, December 2008, pp. 2856–2861. [17] ——, “Strict Lyapunov functions for the super-twisting algo- rithm,” IEEE Transactions on Automatic Control, vol. 57, no. 4, pp. 1035–1040, April 2012. [18] J.-J. E. Slotine and W. Li, Applied Nonlinear Control. Engle- wood Cliffs, NJ: Prentice Hall, 1991. [19] H. K. Khalil, Nonlinear Systems, 3rd ed. Upper Saddle River, NJ: Prentice Hall, 2002.