arXiv:1606.08164v2 [cs.RO] 13 Jul 2016 Online Informative Path Planning for Active Classification on UAVs Marija Popovi´c, Gregory Hitz, Juan Nieto, Roland Siegwart, and Enric Galceran ETH Z¨urich, Autonomous Systems Lab 0 0.2 0.4 0.6 0.8 1 Weed probability 50m 0m 35m 0m 50m Past Future 0m 50m 50m 0m 35m 0 10 20 30 40 Height (m) 0 0.2 0.4 0.6 0.8 Weed classif. probability Weeds Nonweeds Fig. 1: By planning adaptively, our IPP approach (center) produces a weed map for precision agriculture with over half the entropy of a “lawnmower” coverage path (left) in the same time period. Our sensor model (right) allows better weed classifier performance with images taken at lower altitudes. The pyramid shows the camera footprint. Abstract. We propose an informative path planning (IPP) algorithm for active classification using an unmanned aerial vehicle (UAV), focus- ing on weed detection in precision agriculture. We model the presence of weeds on farmland using an occupancy grid and generate plans accord- ing to information-theoretic objectives, enabling the UAV to gather data efficiently. We use a combination of global viewpoint selection and evolu- tionary optimization to refine the UAV’s trajectory in continuous space while satisfying dynamic constraints. We validate our approach in simu- lation by comparing against standard “lawnmower” coverage, and study the effects of varying objectives and optimization strategies. We plan to evaluate our algorithm on a real platform in the immediate future. 1 Introduction Autonomous robots are increasingly used to gather information about the Earth’s ecosystems [6]. In agricultural monitoring, unmanned aerial vehicles (UAVs) are capable of providing high-resolution data in a flexible, cost-efficient manner [5]. Using sensors, UAVs can survey crops to find precision treatment targets, improv- ing yield and leading to sustainability and economic gain [2]. Unfortunately, UAVs are often constrained by limited battery and computational capacities. Therefore, planning for efficient data collection is key in enabling robotics in this field. We address the problem by proposing an informative path planning (IPP) algorithm for active classification on a UAV equipped with an image-based weed classifier. We model the presence of weed on farmland using an occupancy grid. We continuously plan paths online through a combination of global viewpoint selection and evolutionary optimization, which refines the UAV’s trajectory in continuous 3D space while satisfying dynamic constraints. The resulting infor- mative paths abide by a limited time budget and address the key challenge of trading offsensor resolution against coverage when flying at variable altitudes. 2 Marija Popovi´c et al. Our contributions are: • An IPP algorithm with the following properties: – generates dynamically feasible trajectories in continuous space, – obeys budget and sensing constraints, – trades offsensor resolution against coverage in a principled manner by incorporating a height-dependent sensor noise model. • An evolutionary strategy to optimize continuous UAV paths for maximum informativeness. • Validation of our approach in simulation against a coverage planner. We plan to evaluate our algorithm in field experiments in the immediate future. 2 Related Work Most previous IPP approaches seek to minimize map uncertainty using objec- tives derived from Shannon’s entropy [3, 13]. To exploit new data, adaptive ap- proaches [8, 11, 14] replan paths based on specific interests. IPP can be per- formed using combinatorial optimization over a discrete grid [1, 4, 12]. However, the drawbacks of this representation are its limited scalability and resolution. Alternatively, some planners work in continuous space by leveraging sampling- based methods [13] or splines [3, 11, 15]. Similarly to Charrow et al. [3], we use global viewpoint selection to escape local minima and optimization to refine our trajectory. IPP addressing UAV imaging is a relatively unexplored area. A set-up similar to ours has been studied recently by Sadat et al. [18]. However, their method as- sumes discrete viewpoints and prior knowledge of target regions, neglecting sensor noise. In contrast, our approach considers a height-dependent sensor model and incrementally replans as data are collected. Moreover, we use smooth polynomial trajectories which guarantee feasibility of the UAV’s dynamic constraints. 3 Problem Definition We define the general IPP problem as follows. We seek a continuous path P in the space of all possible paths Ψ for maximum gain in some information measure: P ∗= argmax P ∈Ψ I[measure(P)] time(P) , s.t. time(P) ≤B, (1) where B denotes a time budget and I quantifies the objective, discussed in §4.3 for our application. The function measure(·) obtains measurements and time(·) provides the travel time along the path. Maximizing information gain rate, as opposed to maximizing only information, enables comparing the values of paths over different time scales. 4 Technical Approach In this section, we present our IPP algorithm. The main idea is to create fixed- horizon plans maximizing an informative objective. To do this efficiently, we first select global viewpoints and then optimize the path in continuous space using an evolutionary method. In §4.1 and §4.2, we introduce our approaches to modeling and path parametrization. In §4.3, we detail our planning routine, shown in Alg. 1. Informative Path Planning for Active Classification on UAVs 3 Algorithm 1 replan path procedure 1: X g, X i ←∅ ⊲Initialize global and intermediate viewpoints. 2: while H ≥|X g ∪X i| do 3: if t/B < rand() then ⊲Tradeoffglobal selection objectives based on time. 4: x∗←Select viewpoint in L using Eq. 3 5: else 6: x∗←Select viewpoint in L using Eq. 4 7: M ←simulate measurement(M, x) ⊲Simulate using ML. 8: t ←t + time(x∗) 9: X g ←X g ∪x∗; X i ←X i ∪add intermediate points(x∗) 10: X ←X g ∪X i; X ←cmaes(X , M) ⊲Optimize polynomial path. 4.1 Environment and Measurement Models We represent the environment (a farmland above which the UAV flies) using a 2D occupancy grid map M [7], where each cell is associated with a Bernoulli random variable representing the probability of weed occupancy. For our measurement model, we assume a square footprint for a down-looking camera providing input to a weed classifier. The classifier provides probabilistic weed occupancy for cells within field of view (FoV) from a UAV configuration x. For each observed cell mi ∈M at time t, we perform a log-likelihood update given an observation z: logit(p(mi|z1:t, x1:t)) = logit(p(mi|z1:t−1, x1:t−1)) + logit(p(mi|zt, xt)), (2) where the second term denotes the height-dependent sensor model capturing the weed classifier output. Our sensor model (Fig. 1, right) matches real datasets at low altitudes [10] and accounts for poorer classification performance with lower- resolution images taken at higher altitudes. 4.2 Path Parametrization To create paths abiding by the dynamic constraints of the UAV, we connect viewpoints x ∈X using the method of Richter et al. [17]. As in their work, we express a 12-degree polynomial trajectory in terms of end-point derivatives, allowing for efficient optimization in an unconstrained quadratic program. 4.3 Planning Algorithm We use a fixed-horizon approach to plan adaptively. During the mission, we main- tain viewpoints X within a horizon H. We alternate plan execution and replan- ning, stopping when a time budget B is exceeded. For replanning, we adopt a two-stage approach consisting of global viewpoint selection and optimization. This procedure is described in Alg. 1 and illustrated in Fig. 2. The following sub- sections detail the key steps of Alg. 1. Global Viewpoint Selection In the first step (Lines 3-9), we sequentially se- lect global measurement sites X g (Fig. 2b). Unlike in frontier-based exploration, common for indoor mapping [3], choosing viewpoints using map boundaries is not applicable in our set-up. Instead, we apply Eq. 1 over the horizon H (Line 2) 4 Marija Popovi´c et al. 0m 50m 50m 40m 0m (a) (b) (c) (d) Fig. 2: Our planner uses a lattice (a) for selecting global viewpoints (b). The trajectory is then refined globally (c) or locally (d). The orange and maroon curves show paths before and after optimization, respectively. to find most informative measurement sites. To find the next viewpoint x∗effi- ciently, we evaluate the objective over a multiresolution lattice L (Fig. 2a). To encourage exploration, we maximize entropy reduction in M: I[t + 1|t] = H(Mt) −H(Mt+1). (3) To encourage classification, we divide M into “weed” and “non-weed” cells using thresholds δw and δnw, leaving an unclassified subset U = {mi ∈M | δnw < p(mi) < δw}. This is similar to finding unknown space in conventional occupancy mapping. We maximize the reduction of U between time-steps: I[t + 1|t] = |Ut| −|Ut+1|. (4) We use an optional time-varying parameter (Line 3) to gradually bias viewpoint selection towards Eq. 4 from Eq. 3, focusing on weed identification over time. We then simulate a maximum likelihood (ML) measurement at x∗(Line 7) and interpolate intermediate viewpoints X i to add degrees of freedom to the path (Line 9). Optimization In the second step (Line 10), we optimize the polynomial path by solving Eq. 1 in §3 using the Covariance Matrix Adaptation Evolution Strategy (CMA-ES). CMA-ES is a gradient-free optimizer suitable for continuous shape fitting with our discrete measurement model [9]. As evaluated in §5, we consider (i) globally optimizing X (Fig. 2c) and (ii) optimizing X i only (Fig. 2d). 5 Results Our algorithm is validated in simulation on 260 50 × 50m environments with 120 Poisson-distributed weeds. We use thresholds of δnw = 0.25 and δw = 0.75, and use a replanning horizon H of 7 viewpoints to limit optimization complexity. We initialize the UAV position as the map center with maximum altitude (45m). Our methods are evaluated against traditional “lawnmower” coverage with height fixed (8.66 m) for the same 300 s budget B. We consider map entropy, classification rate, and mean F1-score as metrics, common for classification tasks. We simulate sensor noise using our model in §4.1. Following a similar approach to Pomerleau et al. [16], we compute the cumulative distribution function (CDF) of entropy over a time histogram to summarize variability among trajectories. Informative Path Planning for Active Classification on UAVs 5 In our experiments, we study varying: • Global viewpoint objectives: information only (Eq. 3), classification only (Eq. 4), time-varying (§4.3) • Optimization methods: no CMA-ES, local CMA-ES, global CMA-ES As described in §4.3, for local CMA-ES, we consider optimizing X i to reduce inter-segment entropy. For global CMA-ES, we optimize X, allowing points in X g to vote on the optimization objective for the entire trajectory. Fig. 3 compares the global viewpoint selection objectives with local CMA-ES optimization. Our methods outperform na¨ıve coverage as they permit variable- altitude flight for wider FoVs. As shown in Fig. 1, our planners usually produce paths similar to spirals, starting with descent to the unknown map center. The curves illustrate the coverage-resolution trade-off: for the classification objective (red curve), flying at low altitudes quickly provides accurate classification, as shown by the rises in classification rate and F1-score. However, entropy reduction is limited. By considering elapsed time when selecting global viewpoints (yellow curve), we obtain high certainty with efficient classification. Fig. 4 compares the CMA-ES optimization methods for the time-varying objective. The global method likely performs best due to the highest number of optimized variables. 0 5000 10000 Entropy (bits) 0 0.2 0.4 0.6 0.8 1 CDF 0 100 200 300 Time (s) 0 20 40 60 80 100 Classification rate (%) 0 100 200 300 Time (s) 0 0.2 0.4 0.6 0.8 1 Mean F1-score Information Classification Time-varying Coverage Fig. 3: Comparison of global viewpoint selection objectives (§4.3) with local CMA-ES optimization. The solid lines indicate means over 260 trials. The thin shaded regions de- pict 95% confidence bounds. Using our methods, the metrics improve quickly as the UAV flies at variable altitudes. Accounting for spent budget (time) trades offthe objectives. 0 5000 10000 Entropy (bits) 0 0.2 0.4 0.6 0.8 1 CDF 0 100 200 300 Time (s) 0 20 40 60 80 100 Classification rate (%) 0 100 200 300 Time (s) 0 0.2 0.4 0.6 0.8 1 Mean F1-score No CMA-ES Local CMA-ES Global CMA-ES Coverage Fig. 4: Comparison of optimizers for the time-varying objective. The effect of local op- timization is marginal due to the small number of refined points. Overall, global opti- mization performs best as the entire path can be varied. 6 Marija Popovi´c et al. 6 Conclusion and Scheduled Experiments We presented an adaptive IPP strategy for active weed classification on UAVs. Our algorithm combines global viewpoint selection with evolutionary optimiza- tion to generate dynamically feasible paths with informative objectives. We val- idated our strategy against a “lawnmower” coverage pattern and demonstrated the effects of using different objectives and optimization strategies. We aim to implement our algorithm on an AscTec Neo UAV platform. Our experiments will take place at the ETH Lindau-Eschikon Research Station for Plant Sciences in Switzerland. We will consider active classification of crop-weed distributions on a 20-plot 40 × 100m sugarbeet field. Acknowledgments. This work was funded by the European Community’s Hori- zon 2020 programme under grant agreement no 644227-Flourish and from the Swiss State Secretariat for Education, Research and Innovation (SERI) under contract number 15.0029. We would like to thank the ETH Crop Science Group for providing the testing facilities. Bibliography [1] Binney, J., Krause, A., Sukhatme, G.S.: Optimizing waypoints for monitoring spa- tiotemporal phenomena. IJRR 32(8), 873–888 (2013) [2] Cardina, J., Johnson, G.A., Sparrow, D.H.: The nature and consequence of weed spatial distribution. Weed Science 45(3), 364–373 (1997) [3] Charrow, B., Kahn, G., Patil, S., Liu, S., Goldberg, K., Abbeel, P., Michael, N., Ku- mar, V.: Information-Theoretic Planning with Trajectory Optimization for Dense 3D Mapping. In: Proc. RSS. pp. 3–12. Rome (2015) [4] Chekuri, C., P´al, M.: A Recursive Greedy Algorithm for Walks in Directed Graphs. In: Proc. IEEE FOCS. pp. 245–253 (2005) [5] Detweiler, C., Ore, J.P., Anthony, D., Elbaum, S., Burgin, A., Lorenz, A.: Bringing Unmanned Aerial Systems Closer to the Environment. Environmental Practice 17(3), 188–200 (2015) [6] Dunbabin, M., Marques, L.: Robots for environmental monitoring: Significant ad- vancements and applications. IEEE Life Sciences 19(1), 24–39 (2012) [7] Elfes, A.: Using occupancy grids for mobile robot perception and navigation. Com- puter 22(6), 46–57 (1989) [8] Girdhar, Y., Dudek, G.: Modeling Curiosity in a Mobile Robot for Long-Term Autonomous Exploration and Monitoring. AURO 33(4), 645–657 (2015) [9] Hansen, N.: The CMA evolution strategy: A comparing review. Studies in Fuzziness and Soft Computing 192(2006), 75–102 (2006) [10] Haug, S., Ostermann, J.: A Crop / Weed Field Image Dataset for the Evaluation of Computer Vision Based Precision Agriculture Tasks. In: Proc. ECCV. pp. 105–116. Springer, Z¨urich (2014) [11] Hitz, G., Galceran, E., Garneau, M.`E., Pomerleau, F., Siegwart, R.: Adaptive Continuous-Space Informative Path Planning for Online Environmental Monitor- ing. JFR (2016), under review [12] Hollinger, G., Singh, S., Djugash, J., Kehagias, A.: Efficient Multi-robot Search for a Moving Target. IJRR 28(2), 201–219 (2009) [13] Hollinger, G.A., Sukhatme, G.S.: Sampling-based robotic information gathering algorithms. IJRR 33(9), 1271–1287 (2014) [14] Lim, Z.W., Hsu, D., Lee, W.S.: Adaptive informative path planning in metric spaces. IJRR pp. 1–14 (2015) Informative Path Planning for Active Classification on UAVs 7 [15] Marchant, R., Ramos, F.: Bayesian Optimisation for Informative Continuous Path Planning. In: Proc. IEEE ICRA. pp. 6136–6143. Hong Kong (2014) [16] Pomerleau, F., Colas, F., Siegwart, R., Magnenat, S.: Comparing icp variants on real-world data sets. AURO 34(3), 133–148 (April 2013) [17] Richter, C., Bry, A., Roy, N.: Polynomial Trajectory Planning for Aggressive Quadrotor Flight in Dense Indoor Environments. In: Proc. ISRR. Springer, Singa- pore (2013) [18] Sadat, S.A., Wawerla, J., Vaughan, R.: Fractal Trajectories for Online Non-Uniform Aerial Coverage. In: Proc. IEEE ICRA. pp. 2971–2976. Seattle, WA (2015)