UAI2003 ROSENCRANTZ ET AL. 493 Decentralized Sensor Fusion with Distributed Particle Filters Matt Rosencrantz, Geoffrey Gordon, and Sebastian Thrun School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 Abstract This paper presents a scalable Bayesian tech­ nique for decentralized state estimation from multiple platforms in dynamic environments. As has long been recognized, centralized architec­ tures impose severe scaling limitations for dis­ tributed systems due to the enormous commu­ nication overheads. We propose a strictly de­ centralized approach in which only nearby plat­ forms exchange information. They do so through an interactive communication protocol aimed at maximizing information flow. Our approach is evaluated in the context of a distributed surveil­ lance scenario that arises in a robotic system for playing the game of laser tag. Our results, both from simulation and using physical robots, illus­ trate an unprecedented scaling capability to large teams of vehicles. 1 Introduction In recent years, probabilistic tracking techniques have found widespread application in computer vision (Isard & Blake, 1998), diagnostics (Lerner et a!., 2002; Verma et al., 2003), modeling (Rusinkiewicz & Levoy, 2001), intelligent environments (Pasula et al., 1999), and robotics (Thrun, 2002). Virtually all state-of-the-art approaches are based on the basic Bayes filter (Jazwinsky, 1970; Maybeck, 1990). While Bayes filters (and derivatives) are easily de­ fined for centralized architectures, in which a single com­ putational node receives all sensor data, they have proven difficult to extend to distributed systems. In distributed systems sensor data is acquired through mul­ tiple platforms and the communication between these plat­ forms is the major bottleneck. Examples of distributed sys­ tems include agents on the Internet, environments instru­ mented with many sensors, multi-camera surveillance sys­ tems, and teams of mobile robots pursuing a joint goal. The simplest kind of architecture for state estimation is cen­ tralized. In centralized architectures, all platforms com­ municate all sensor data to a single special agent, which processes it centrally and broadcasts the resulting state es­ timate back to the individual platforms. Such an approach suffers from two problems: First, it is brittle because it pos­ sesses a single point of failure. Second, the enormous com­ munication overhead often prohibits its use in situations with more than a few dozen platforms. Both limitations have long been recognized in the decentralized tracking lit­ erature (Rao et al., 1993). Decentralized architectures, by contrast, rely on communi­ cation between nearby platforms. By doing so, the num­ ber of messages that each platform sends or receives is independent of the total number of platforms in the sys­ tem. This property ensures scalability to distributed sys­ tems with (almost) any number of platforms. In a recent result (Nettleton et al., 2000), it was shown that scalable decentralized state estimation can be achieved for static environments with Gaussian error-that is, environ­ ments where the state does not change over time and ob­ servations are (approximately) linear functions of the state with Gaussian error. The key insight in (Nettleton et al., 2000) is that in static worlds, information can be accumu­ lated at any time and in any order, regardless of the time at which it was acquired. Furthermore, because of the simple form of the observations, there is a simple algorithm for accumulating local evidence in the form of additive infor­ mation matrices. Unfortunately, the "trick " of additive in­ formation works only for distributions (such as Gaussians) which are in the exponential family. Attempts to apply sim­ ilar ideas to more general observations tend to lead to over­ confident state estimates (Fox et al., 2000). More impor­ tantly, such approaches are inapplicable to dynamic envi­ ronments, since we are no longer free to incorporate infor­ mation in any order. This paper extends previous research in two critical di­ rections: our decentralized filter can cope with dynamic environments and it can handle non-Gaussian posteriors. As in (Nettleton et al., 2000), each platform maintains 494 ROSENCRANTZ ET AL. UAI2003 Figure 1: The CMU laser tag system consists of six robots based on Pioneer and Scout platforms, of which five are shown here. The robots are equipped with in­ frared emitters and detectors. Navigation is achieved using the CARMEN software system (Montemerlo et al., 2002). its own Bayesian state estimate, computed from its local sensor measurements and information received from other (nearby) platforms. In contrast to (Nettleton et a!., 2000), however, each platform maintains a belief over state histo­ ries instead of just single states. As we shall see below, this extension enables our approach to cope with dynamic en­ vironments. However, keeping this extra information intro­ duces a new problem: because the space of possible state histories is so large, it is no longer feasible to represent, communicate, or incorporate evidence about full histories directly. The key innovation of our approach is a selective communi­ cation scheme which never attempts to send evidence about full histories around the network. Our algorithm takes the form of a query-response protocol. Platforms query their neighbors by sending compact summaries of their poste­ rior beliefs. The neighbors keep databases of past sensor measurements, and respond by sending the most informa­ tive piece of evidence in their local memory. So, informa­ tion exchange is efficient, and our communication overhead can be kept to a minimum. Our approach has been implemented for a distributed surveillance scenario (Dolan et a!., 1999) motivated by on­ going research on a robotic laser tag system (Rosencrantz et a!., 2003). This domain involves two teams of robots pursuing each other in an indoor environment, tagging each other using infrared emitters (see Figure 1). The domain is characterized by massive occlusion, as opponents are usu­ ally hidden behind obstacles. This naturally leads to non­ Gaussian, multi-modal beliefs, as illustrated in Figure 2. So, our implementation uses particle filters (Doucet et a!., 2001; Liu & Chen, 1998) to estimate non-Gaussian poste­ riors over state sequences. Recently there has been some other efforts to design dis­ tributed Bayes filters in the sensor networks community (Chu et a!., 2002). However, this work focuses on the Figure 2: A typical belief in robot laser tag. The small circles are robots. This belief is from the perspective of robot A who is sharing information with its team­ mate B. Together, they are tracking opponents C and D. The belief D's position is highly non-Gaussian, as illustrated by the panicles. problem of querying a network of sensors to provide an answer for an external party, whereas we are interested in scenarios where the sensors are also the consumers of in­ formation, in particular where each needs to maintain an accurate posterior. Because of this their technique does not provide a mechanism to disseminate relevant informa­ tion to all sensor nodes. Nor does it allow new sensors to join the network and contribute observations that were col­ lected at some point in the past, whereas our algorithm is specifically designed for such situations. There has also been some work toward deriving general minimum dis­ tortion bounds for certain topologies of sensor networks (Chen et al., 2001 ), in contrast we will exploit features of our domain to simplify our problem. Our approach extends previous work on multi-robot track­ ing (Rosencrantz et a!., 2003) (which required a centralized tracker) to a fully decentralized system. As a result, our approach performs well using 50 robots (in simulation); in fact, it consistently outperforms alternative decentralized algorithms. To the best of our knowledge, such scalabil­ ity is unprecedented in a dynamic environment. We also provide results obtained using our physical robot system, albeit with many fewer robots per team. 2 The Decentralized Bayes Filter 2.1 Static Environments To goal of Bayesian state estimation is to calculate a pos­ terior distribution over the state. Let us denote state by x. Naturally, the state is not directly observable but has to be inferred from data. Let us denote the number of platforms UAI2003 ROSENCRANTZ ET AL. 495 (robots) by N. The data acquired by the i-th platform at time t will be denoted dj; here the subscript refers to the time and the superscript to the platform identity. The set of all data items collected between the initial time, denoted 0, and timet, will be collectively denoted by dh:r· The desired posterior is given by p(x I dJ" ... ,d<1) (I) The posterior can be conveniently calculated by combin­ ing local posteriors, obtained by integrating the sensor data acquired by a local platform. To see, we note that p(x I dJn . . . ,d<1) oc p(x) p(dJ:r, ... ,d<1 I x) oc N p(x) IJp(db,, I x) i=i p(x) TI p(x I d0,) i=I p(x) (2) Here p(x I db,,) is the local posterior of the i-th platform, and p(x) is simply the prior over the state. The ratio of the two is the evidence accumulated by the ith platform about the state x. For beliefs from the exponential family (e.g., Gaussians), each agent's evidence can be calculated in closed form and can be represented by a message of fixed size no matter how many observations the agent has seen. Furthermore, the product in Eq. (2) can be calculated in closed form and in arbitrary order: when we find out the evidence from each agent we can just multiply it into a running total (or add it in if we are accumulating on a log scale). These insights make possible several important computa­ tional simplifications: if an agent updates its evidence all we need to do is divide out the old evidence and multiply in the new; and if we have evidence available from sev­ eral platforms we can combine it all into a single message before passing it to our neighbors. In this way, a globally consistent result can be achieved using a communication structure where only neighbors communicate, and where the number of messages per platform is independent of the number of platforms. This observation forms the core of a literature on decentralized sensor fusion, of which (Nettle­ ton et al., 2000) is an example. 2.2 Dynamic Environments Unfortunately, extending this approach to dynamic envi­ ronments is not straightforward. In dynamic environments, the state can change over time; so, we must annotate the state with a time index. We will write x1 for the state at time t, and Xo:r for the sequence of all states from time 0 to t. Unfortunately, the factorization in Eq. (2) does not ap­ ply if the static state xis replaced by the dynamic state x1• To see why, consider the example in Figure 3. This figure (a) (b) (c) (d) Figure 3: Illustration of why combining posterior states of individual platforms yields false results !n dy\ namic environments. See text for details. illustrates a distributed surveillance problem, with two plat­ forms indicated by circles with direction markers. Suppose that the opponent is known to start off in the shaded region of Figure 3a. Because the opponent can move, at some point in timet the local posterior p(x I dh:t) of the top robot is illustrated by the shaded region in Figure 3b. The pos­ terior of the other robot is symmetrical (not shown here). Combining both posteriors using the rule (2) leads to a dis­ tribution shown in Figure 3c. This distribution is incorrect: it suggests the opponent could have slipped through the robots' perceptual fields. The correct posterior (for robots with perfect detection) is shown in Figure 3d. Suoh a poste­ rior cannot be obtained by combining the two robots' mo­ mentary beliefs. This example illustrates why the simple form (2) is inapplicable to dynamic environments. The obvious solution to this problem is to estimate the pos­ terior over the entire state sequence xo:r, not just the mo­ mentary state x1• In particular, the following factorization applies (with essentially the same derivation as in Eq. 2): oc N ( I j ( ) II p XO:t d0,1) P XO:t i=I p(xo:r) Unfortunately, the dimension of the evidence p(x("ld ,) P XO:t grows over time. So, we can no longer communicate all of our evidence in a fixed-size message. 3 Approximating the Bayes Filter The above argument demonstrates that, in general, we can­ not combine two pieces of evidence into a compact repre­ sentation unless they refer to the same state of the world. In a dynamic environment, that means we can only combine two observations if they were gathered simultaneously; for example, we could combine sensor readings taken by two different agents on the same time step. 496 ROSENCRANTZ ET AL. UA12003 Depending on the details of our observation model, com­ bining simultaneously-gathered evidence may or may not save any bandwidth. For example, in a d-dimensional linear-Gaussian model such as the one studied by (Net­ tleton et al., 2000), each observation takes O(d) space to store. We can combine arbitrarily many observations into a d x d evidence matrix, but doing so only saves space if we have O.(d) separate observations. Usually sensor plat­ forms are scarce compared to landmarks, so it is not worth combining evidence. In our target application of laser tag, it is even harder to take advantage of the ability to combine simultaneous ob­ servations: our observations are laser range scans of a small portion of a map. We can combine arbitrarily many obser­ vations into a data structure the size of a map of the world, but this process will not save us any space unless we have enough agents that their visible regions overlap and tile the entire map. Since we cannot profitably combine evidence, and since we cannot afford to communicate all evidence, our algorithm must instead decide which evidence is worth sending. The remainder of this section describes how we do so. 3.1 Selective communication The key idea of our approach is to selectively communi­ cate only the most informative sensor measurements. More specifically, our approach approximates the desired poste­ rior as follows: p(xo:r I dJ,,, . . . ,d< ,) 7 p(xo:r I dJ,,o6:r , ... ,8=,) (3) Here we assume, for notational convenience, that the plat­ form of interest has index i = 1; the data dJ,, in (3) is hence the local data acquired by platform i = 1. Each ob,, with j i- i, however, is a subset of the data acquired by platform j: (4) Thus, (3) is an approximation. This approximation is jus­ tified when sensor data is locally redundant, as is the case in our surveillance application. In this and many other sce­ narios, a small number of measurements suffices to obtain an accurate approximation. The crux to the efficiency of our approach, thus, lies in the appropriate choice of the set 8b,,· A simple choice would be to communicate every k-th measurement, for an appro­ priate value of k. However, such an approach would not be very specific; it would risk communicating uninformative sensor measurements while omitting ones that are more in­ formative. In what follows, we will describe a communi­ cation scheme that leads to the exchange of approximately maximally informative sensor measurements. 3.2 Selecting Informative Measurements for Communication The key contribution of our approach is a protocol that en­ ables platforms to selectively communicate measurements that are maximally informative. The information of a mea­ surement is given by its relative entropy. Let d{ be a measurement in robot's j local database, and write D1 = (dJ,,86," ... ,8=1) for the data available to robot I. Then the information of d4 relative to the 1st robot's local belief is given by Here DKL denotes the KL divergence. The most informa­ tive measurement is obtained by maximizing this expres­ sion: d4 = argmallr q(d4). For the j-th robot to identify the most informative measurement requires knowledge of an­ other robot's posterior belief. Thus, our approach requires the communication of the belief of a robot. This belief is a "query," and the "answer" is the most informative sensor measurement in the data base of another entity. Unfortunately, communicating the belief can be pro­ hibitively expensive. In particular, the belief comprises all past states, not just the most recent one. On this basis alone, it might appear that our selective information scheme is in­ efficient. It merely shifts communication overhead from the exchange of information to the exchange of queries. In the next section, however, we will describe an efficient im­ plementation that communicates highly compact approxi­ mate posterior beliefs. 4 The Decentralized Particle Filter 4.1 Basic Algorithm Our approach represents each platform's posterior belief by a local particle filter (Doucet et al., 2001; Liu & Chen, 1998). The motivation for using particle filters is two-fold. For one, particle filters can represent almost arbitrary pos­ terior distributions; they are certainly well-suited to accom­ modate the types of uncertainty that arise in the distributed surveillance scenario. More importantly, particle filters es­ timate posteriors over entire paths, not just the current state. Put differently, each particle can be thought of as an en­ tire history or trajectory, and the set of all particles repre­ sents an approximation of the posterior over trajectories. This well-known property of particle filters (which is not shared by Kalman filters) makes them well-suited for the type of posteriors required by our selective communication scheme. In the particle filter, an agent's posterior p(xo:r I D1) is rep­ resented by a set of weighted samples or particles. We will write xiJ,, for the ith particle in this set, and w! for its weight. UAI2003 ROSENCRANTZ ET AL. 497 We need to implement two different operations for our par­ ticle filter: incorporation of new evidence, and propagation from time t to timet+ I. If agent I finds out a new measurement d4 taken by agent j at time 1:, we can incorporate this evidence by setting +- w: p(d41 xb:tl w; p(d41 x) (6) for all i. This is exactly the standard particle filter observa­ tion update, with a simplification because x, d-separates d4 from all other variables. To propagate forward a time step, we pick a particle x0,, at random according to the weights w: /:L w:. Then we sam­ ple a new particle x'0,, : 1 for timet+ I according to 4.2 Communication if Xo:t = XO:t otherwise (7) To implement the query step of our communication algo­ rithm, we could send the set of particles that characterizes a robot's local belief. Since each particle contains a sequence of past states, this message may still be prohibitively ex­ pensive. Thus, our approach employs a final approxima­ tion step: instead of communicating the full particle set, we only send a very small subset of particles. In partic­ ular, a query is composed of M randomly selected parti­ cles, where each particle contains an entire state trajectory (within a fixed history window). The robot that is being queried then searches its local database for the sensor measurement that is most informa­ tive, measured by the KL divergence criterion described above. It may choose one of its own measurements, or it may pass on a measurement which it has received from a third robot. The only restriction we impose is that it must send a relatively current observation, that is, one from some time 1: ?: t- Ll. (This condition is not overly restrictive: old measurements are frequently uninformative anyway. And, it saves storage and computation, since states and observa­ tions older than t- Ll may be discarded.) While our selection algorithm is somewhat sensitive to out­ liers (because they affect the posterior more than other measurements do), we found it to be extremely effective in our surveillance application. 4.3 Re-simulation Unfortunately, the procedure as described so far some­ times results in large variance in the particles' importance weights. This variance can cause some importance weights to drop to zero, reducing the effective number of particles in our filter and thereby wasting computation. There are two reasons for this behavior: first, because of communication between agents our particle filter receives more observation information than it would with just one agent's sensors. It is well known that particle filters behave poorly when ob­ servations are very informative compared to the proposal distribution (Thrun & Fox, 2000). Second, if our observa­ tion comes from several steps in the past (1: « t), many of our particles will have overlapping histories at time 1:. That means that we will have fewer distinct importance weights, and so the normalization step will introduce extra variance. In order to reduce the variance of our importance weights, our approach periodically re-simulates each agent's parti­ cle filter. That is, it erases part of the filter's history and then runs it forward again to the current time incorporating all collected observations at the appropriate times. Our cur­ rent ln1plen1entation re-sin1ulates fron1 tiine .. t whenever we receive an observation for some time 1: < t. Re-simulation is related to well-known techniques for improving sample diversity in particle filters; for more detail see, e.g., (van der Merwe et al., 200 I). 5 Experimental Results A systematic series of experiments was performed to eval­ uate the scaling properties of our approach. Experiments were performed both in simulation and using physical robots. Simulation enabled us to test the ability of our al­ gorithm to scale to large numbers of robots, whereas the physical testbed provided us with insights into the practi­ cal effectiveness of our approach in the context of an actual robot system. In both cases, the scenario involved a dis­ tributed tracking task motivated by the laser tag problem. A team of robots was tasked to estimate a posterior distribu­ tion over an opponent's location from laser range data. In contrast to classical distributed tracking problems (Dolan et al., 1999), the laser tag domain is characterized by the pervasive occlusion of opponents. As a result, posterior distributions are highly non-Gaussian and multi-modal. 5.1 Physical Robot Results In a first series of experiments, we collected data from a team of four robots in an indoor arena of size 25 by 20 meters, shown in Figure 4(a). The arena was outfit­ ted with a number of obstacles, causing significant occlu­ sion problems. A belief tracker for these robots was de­ scribed in (Rosencrantz et al., 2003); however, the filter described in that publication is centralized. To evaluate the effectiveness of our communication scheme, we also im­ plemented an alternative non-selective algorithm of com­ municating every k-th sensor measurement, for an appro­ priate value of k. This algorithm is a non-selective coun­ terpart to our approach; it does not respond to queries, and it does not tailor its communication to the particular un­ certainty faced by neighboring robots. The lack of a query 498 ROSENCRANTZ ET AL. UAI2003 i .i .i I (a) (b) (c) Figure 4: Illustration of the algorithm: Panel (a) shows the belief of the bottom robot with regards to the possible location of an opponent. Panel (b) illustrates a query communicated by the bottom robot. In response, the top robot determines that the measurement represented by the shaded area is most informative. After communicating this measurement back to the bottom robot, its posterior becomes the one shown in Panel (c). Note that (a) and (c) in this case are similar, this is because a relatively old observation was incorporated and re-simulation has allowed the the bottom robots distribution to spread again after that old observation was incorporated. phase reduces the total communication overhead, but at the expense of communicating measurements of inferior infor­ mation value. Thus, by comparing our approach to this straw man approach we can evaluate the practical utility of selective information exchange. A simple example of our protocol in this domain is given in Figure 4. Two robots, illustrated by the two circular ob­ jects, are attempting to track an opponent (not shown here). Initially, the opponent is known to be in the top left are of the map. The local posterior of the bottom robot is shown in Figure 4a; it confines the opponent to the occluded are in the top left of the map. Unfortunately, the bottom robot does not survey the target area. A query, selected using our approach by the bottom robot, is shown in Figure 4b (some­ what idealized for visual clarity). Notice that this query consists of two paths, chosen from the bottom robot's set of particles. The shaded region represents the observation the top robot has chosen as most informative and the small squares show the time along each trajectory that the obser­ vation applies to. After communicating this measurement back to the bottom robot, the bottom robot's belief evolves to the one shown in Figure 4c. Notice that this new belief is similar to the one it started with in Figure 4a, this is be­ cause re-simulation has allowed the bottom robots posterior to spread after this new observation was incorporated. The reflects the fact that a given distribution can have a large number of potential pasts. In general we will need many observations to rule out all impossible histories. To evaluate the performance of our approach systemati­ cally, we varied the rate at which the team members ex­ change information, from three times a second to once every three seconds. Each rate induces a different total communication bandwidth: in the non-selective scheme, the bandwidth is determined by the sensor measurements that are being broadcast through the system. In the selec­ tive scheme, the queries add additional overhead to the to­ tal bandwidth. Figure 5 shows the results of the compari­ son, for different values of the communication bandwidth. This graph is obtained by measuring the KL-divergence of the resulting posterior over x1 from the posterior achieved by a particle filter with full communication (using Laplace smoothing over a fine-grained grid to interpolate between different discrete measurements). Each point is averaged over 6 runs and the error bars indicate the standard devi­ ation at that point. This result shows that our algorithm outperforms the non-selective alternative as bandwidth be­ comes restricted. Put differently, our selective communi­ cation scheme leads to improved posterior estimates when compared to a non-selective alternative. All of these results were obtained using physical robot data, and they reflect the performance one could expect in an actual robotic laser tag setting. 5.2 Simulation Results The physical testbed prohibits us from conducting experi­ ments with larger number of platforms. This is because the number of robots available to us is limited. To investigate the scaling abilities of our approach, we developed a multi­ robot simulator capable of handling very large numbers of robots. Figure 6 shows the performance obtained with a simula­ tion of 50 robots. Shown there are two graphs, one for our approach and one for the non-selective counterpart.Each graph is accompanied by curves characterizing 95% con­ fidence intervals. As is easily seen, our algorithm outper- UAI2003 ROSENCRANTZ ET AL. 499 6 c Tracking Performance at Varying Bandwidth Levels 0.9,----- - -  ---, 0.8  0.7 - Latest Observation Passing - - Intelligent Filtering w - 0.6 c_  0.5 i N 0.4 c • e> [ 0.3  02 o.h.L,((---:-((-:',:-, -)-,::---(,-,::.,((--::*+----;',5 Jog{Messages/s) Figure 5: A comparison of our method with an al­ ternative where robots send their most recent observa­ tion at varying frequencies. Our method outperforms this alternative, maintaining distributions closer to the truth. forms the naive algorithm. It maintains a satisfactorily low divergence at lower bandwidths. The margin of improve­ ment is more significant than in the physical robot experi­ ment, which we attribute to the superior scaling ability to large number of robots. In conclusion, our approach con­ sistently generates excellent state estimation results in dy­ namic environments using a large number of robots. 6 Discussion We have presented a scalable and robust distributed parti­ cle filter for dynamic environments. In contrast to previous research on decentralized filtering, our approach can cope with non-Gaussian posteriors and with dynamic environ­ ments. The key technical innovation is a selective commu­ nication scheme that enable individual platforms to com­ municate the most informative piece of information to other entities. Our approach utilizes particle filters, for which this idea can be implemented efficiently, and which are capable of representing non-Gaussian posteriors. Systematic experimental results were conducted in the con­ text of a distributed surveillance scenario, motivated by a robotic laser tag testbed presently under development at CMU. The results, using both simulation and physical robots, illustrate the effectiveness of our approach when compared to a straw-man approach that communicates in­ formation in a non-selective way. Simulations with up to 50 robots show superior performance of our approach. There remain several open questions that warrant future re­ search. Chief of those open questions is a more informed strategy for composing queries than random selection of particles. Clearly, certain particles are more suited for Bandwidth vs. Divergence with 50 Robots - - Selective Communication ---e- Naive Communication 05 ,.l 5((--(-:',:-5-. 5,+*/)065 Log1 O{Messages) Figure 6: A comparison of our method with an alter­ native where robots send their most recent observation. Our method outperforms this alternative, maintaining distributions closer to the truth for a range of band­ widths. querying neighbors than others, and by taking advantage of this the overall performance may improve. The present algorithm for generating answers is also limited, in that it may favor outlier measurements on the sheer basis that they surprise. We are presently seeking ways to assess the information of measurements relative to the true (but un­ known) posterior, to avoid communicating high-noise mea­ surements. However, in our implementation this is not a problem, largely because of the low degree of noise in laser range measurements. Another direction for future work is the intelligent selec­ tion of which agent to query. In our implementation robots chose which peer to communicate with at random from their k nearest neighbors. This is a natural choice because many networks have the property that only nearby agents can communicate. However, this is not the only possibility, and choosing carefully when the underlying network can offer more flexibility deserves attention. Another open question is how to improve re-simulation. The re-simulation step is the most computation-intensive part of our algorithm; so, in future work we plan to ex­ periment with ways to avoid re-simulating as often. Pos­ sibilities include re-simulating only when too many impor­ tance weights get small, or re-simulating only some of the particles in our filter. A final opportunity is to extend our approach to one in which entities consider their control ob­ jective in their composition of queries. For example, in the laser tag domain, robots seek opponents, and the choice of particles in a query could incorporate the location of al­ leged opponents. To that end, we believe that our decentral­ ized information fusion approach opens new opportunities for the decentralized control of teams of mobile robots. 500 ROSENCRANTZ ET AL. UAI2003 Regardless of these limitations, we believe our that our approach is unique in its ability to accommodate non­ Gaussian posteriors in a strictly decentralized fashion. Fur­ thermore, we believe that it is applicable to a number of distributed state estimation problems in the context of ubiq­ uitous computing. References Chen, B., Draper, S., & Wornell, G. (2001). Information embedding and related problems: Recent results and ap­ plications. Proceedings of Allerton Conf on Comm., Control, and Computing. Monticello, IL. Chu, M., Haussecker, H., & Zhao, F. (2002). Information­ driven dynamic sensor collaboration. IEEE Signal Pro­ cessing Magazine, I 9, 61-72. Dolan, J., Hampshire, J., & Khosla, P. (1999). Collabo­ ratively tracking and surveillence of targets in dynamic environments. Proceedings of SPIE. Orlando, FL. Doucet, A., de Freitas, J., & Gordon, N. (Eds.). (2001). Sequential monte carlo methods in practice. New York: Springer Verlag. Fox, D., Burgard, W., Kruppa, H., & Thrun, S. (2000). A probabilistic approach to collaborative multi-robot local­ ization. Autonomous Robots, 8. Isard, M., & Blake, A. (1998). CONDENSATION: condi­ tional density propagation for visual tracking. Interna­ tional Journal of Computer Vision, 29, S-28. Jazwinsky, A. (1970). Stochastic processes and filtering theory. New York: Academic. Lerner, U., Moses, B., Scott, M., Mcllraith, S., & Koller, D. (2002). Monitoring a complex physical system using a hybrid dynamic hayes net. Proceedings of the I 8th Annual Conference on Uncertainty in AI ( UAJ). Liu, J., & Chen, R. (1998). Sequential monte carlo methods for dynamic systems. Journal of the American Statistical Association, 93, 1032-1044. Maybeck, P. (1990). The Kalman filter: An introduction to concepts. In I. Cox and G. Wilfong (Eds.), Autonomous robot vehicles. Springer verlag. Montemerlo, M., Roy, N., & Thrun, S. (2002). Carnegie mellon robot navigation toolkit. Software package for download at www.cs.cmu.edu/Ňcarmen. Nettleton, E., Gibbens, P., & Durrant-Whyte, H. (2000). Closed form solutions to the multiple platform simul­ taneous localisation and map building (slam) problem. Sensor Fusion: Architectures, Algorithms, and Applica­ tions IV (pp. 428-437). Bellingham. Pasula, H., Russell, S., Ostland, M., & Ritov, Y. (1999). Tracking many objects with many sensors. Proceedings of the Sixteenth International Joint Conference on Artifi­ cial Intelligence ( /JCAJ). Stockholm, Sweden. Rao, B. S. Y., Durrant-Whyte, H. F., & Sheen, J. A. (1993). A fully decentralized multi-sensor system for tracking and surveillance. International Journal of Robotics Re­ search, 12,20-44. Rosencrantz, M., Gordon, G., & Thrun, S. (2003). Locating moving entities in dynamic indoor environments with teams of mobile robots. Proceedings of Autonomous Agents and Multi-Agent Systems. Melbourne, Australia. Rusinkiewicz, S., & Levoy, M. (2001). Efficient variants of the ICP algorithm. Proc. Third International Conference on 3D Digital Imaging and Modeling ( 3DJM). Quebec City, Canada: IEEEComputer Society. Thrun, S. (2002). Particle filters in robotics. Proceedings of the 17th Annual Conference on Uncertainty in AI (UAJ). Thrun, S., & Fox, D. (2000). Monte carlo localization with mixture proposal distribution. Proceedings of the AAAJ National Conference on Artificial Intelligence. Austin, TX: AAAI. van der Merwe, R., de Freitas, N., Doucet, A., & Wan, E. (2001). The unscented particel filter. Advances in Neural Information Processing Systems 13. Verma, V., Simmons, R., & Thrun, S. (2003). Variable resolution particle filter. Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (/JCAJ). Acapulco, Mexico.