Project Objectives
Develop complete, effective and scalable software for autonomous robot teams. Demonstrate robot teams with integrated perception, reasoning, learning, communication and cooperative strategies that solve complex multi-agent tasks.
Project Website
Approach
Under the MARS program, we continue to focus on the key challenges for building individually skilled autonomous robots and successful multi-robot teams to operate in uncertain, adversarial domains. These challenges range from single robot issues of localization, real-time visual sensing, robust tracking, high-speed navigation control, and automatic segmentation and recognition of environmental regimes through to multi-robot team issues of adaptive team strategy, robust cooperation through communication in the face of uncertainty, and team learning. All of these issues are important to the successful operation of multi-robot teams.
We recently competed at the RoboCup-2002 world robot soccer championships, winning first prize in the legged league competition. Additionally, we demonstrated our research during the DarpaTech Symposium held during July. More details on these events are listed in the recent accomplishments. Our recent research focuses on:
Adaptive Team Strategy
For multi-robot teams in adversarial settings, good individual skills
must be employed within the team's high-level strategy to be
effective. This strategy should emphasize the team's strengths while
simultaneously exploiting the opponent's weaknesses. We have developed
an adaptive architecture that customizes the team strategy on-line and in
real-time to enhance the overall performance of the team.
Our playbook strategy engine fulfills three goals concurrently. To coordinate team behavior, to provide an easy human and/or automated augmentation of the team strategy and to facilitate on-line adaptation to an opponent. The playbook engine achieves each of these goals through the use of plays. Each play contains a set of roles for each member of the team, and a set of predicates that determine when that particular play is applicable. Roles are assigned dynamically to team members based on evaluation heuristics e.g. closest to the ball. Each role contains a simple temporal plan; a series of high-level parametrized actions that player should perform. Thus each play concisely encodes a short sequence of coordinated team activity for a particular situational context. Plays are stored as human readable text files with a format that is easy to read, create, and modify as required.
The most interesting innovation, is that play selection is adaptive. A play is selected to be the active play based in part on its applicability and upon a weight that translates into its probability of selection. Our playbook engine adaptively adjusts the play selection probabilities based upon the performance of the play against the given opponent as measured by a small set of reward criteria. The reward criteria includes both positive and negative reward conditions such as a goal scored for or against us. With this approach, play selection adapts over the duration of a game (ie within minutes) to select plays that work the best against the given opponent. (IROS workshop paper, see list below, further papers forthcoming)
Teamwork communication and collaboration
Since the recent availability of wavelan to support wireless
communication for our Sony AIBOs, we have been investigating
algorithms and representations for collaborative behavior between
autonomous legged robots. This work represents a unification and
extension of our prior work with the Sony AIBO's and our minnow team
for distributed cooperation via communication.
We have developed algorithms for building a global world model that is jointly maintained across the team. Each robot shares with the team their own high-level local perception output along with the associated measures of uncertainty. Building on our prior work in distributed sensing, our algorithms effectively fuse the information reliably while handling uncertainty and conflict resolution in a principled manner. The resulting global map enables us to estimate the confidence of our model, to predict future states of the world, and extend each robot's effective perception range beyond the capabilities of any single robot. (paper forthcoming)
Based on the fused global model, we have build team coordination algorithms. Our robots position themselves by a gradient ascent function in a field of constraints and objectives; they spread out on the field assuming roles of attacker, supporter, and defender in positions that are suitable to carry out tasks in line with that role. (paper forthcoming)
Localization
In our previous work under the MARS program, we developed Sensor
Resetting Localization (SRL), an extension to Monte-Carlo probablistic
localization techniques. Because effective robots may be small, they
can be moved by external sources beyond the robot's own control. The
SRL algorithm that we devised allows robots to use their sensor input
to rapidly adapt to changes in their position. The robots effectively
localize in real-time with an accuracy of 2-5 cm. Although effective,
SRL still requires heavy computational resources given the position
belief computation for its large number of sample points.
Continuing our commitment to effective real-time localization, we have investigated new localization algorithms that attempt to address the multi-sample problem. We developed a localization algorithm, Constraint Based Localization (CBL), which maintains a single belief of the robot's location updated by its sensor input and movement. CBL is computationally more efficient than SRL and is more robust through its use of multiple landmarks. Our latest algorithms have been implemented on different robot platforms. (Various publications, see list below)
High-speed navigation and control
High-speed navigation is a poorly explored area of robotics research
and yet there are many robotics problems that require high-speed
navigation for them to be effective. To date, it is a commonly
accepted belief that high-speed navigation can only be addressed by a
combination of low-level reactive navigation algorithms driven by
high-level planning algorithms. Our recent work has proven this belief
to be incorrect.
We have extended the Rapidly-Exploring Random Trees (RRT) planning algorithm and applied it to controlling our team of five small-size robots (average computation time is 2ms per robot) moving at speeds of up to 1.7m/s through cluttered, moving obstacles fields. The algorithm works by progressively building a search tree through a combination of random exploration and biased motion towards the goal state. Our extensions to the basic RRT algorithm include:
Preliminary comparisons against conventional reactive navigation schemes indicate that it can control the robot at significantly higher speeds without requiring significant computational resources. (IROS'02 publication, see list below)
Generalized vision-based obstacle avoidance
We have developed a new vision processing algorithm, built upon our
CMVision library (see
http://www.cs.cmu.edu/~jbruce/cmvision), to perform general
(ie. any shape or color) obstacle avoidance. The algorithm operates on
our Sony AIBO's at the full frame-rate of 25Hz on a 200MHz MIPs
processor. The algorithm, which we call visual sonar,
calculates the radial distances to objects around the robot regardless
of the robot's head or body orientation. (paper forthcoming)
Real-time visual sensing and tracking
In order for our
robots to react sensibly to changes in the real world, they must be
able to quickly sense objects in their environment. We have extended
our previously developed color vision library, called CMVision (see
http://www.cs.cmu.edu/~jbruce/cmvision for details) to process
images at the full available frame rate with low CPU usage. Moreover,
CMVision is cross platform and operates on our Sony AIBO's as well.
Additionally, we have developed new techniques to improve the
robustness of multi-agent tracking in the face of false-positive
recognitions. This technique has applications in systems where
false-positives must be dealt with using a minimal amount of
computational resources. (ICRA'02 publication, see list below)
Automatic segmentation and recognition of environmental regimes
Robot environments are typically quasi-stationary systems. Being able
to identify and recognize the different environmental regimes or
contexts offers the ability to extend the use and dynamic range of
many current algorithms significantly. We are developing algorithms
to automatically segment and recognize different sensory inputs using
on-line, non-parametric statistical tests. Our current work focuses on
learning to distinguish lighting level regimes in an office
environment on a Sony Aibo. Once the lighting level is recognized, the
appropriate thresholds for color image segmentation can be chosen
thereby increasing the dynamic range of the color vision routines used
by the robot. (paper forthcoming)
In order to create our teams of autonomous robots, we have investigated a number of issues that apply to any multi-robot team operating in an adversarial environment. In doing so, we have produced a number of publications reporting on our contributions which are listed at the end of this page.
Our specific accomplishments during the last reporting period are:
Plan
Technology Transition
The following robot designs and software funded under the MARS program have been developed by us and released to the community:
Relevant Publications