Article

Free access

Integrating planning and execution in stochastic domains

Authors:

Richard Dearden,

Craig BoutilierAuthors Info & Claims

UAI'94: Proceedings of the Tenth international conference on Uncertainty in artificial intelligence

Pages 162 - 169

Published: 29 July 1994 Publication History

PDF eReader Publisher Site

Abstract

We investigate planning in time-critical domains represented as Markov Decision Processes, showing that search based techniques can be a very powerful method for finding close to optimal plans. To reduce the computational cost of planning in these domains, we execute actions as we construct the plan, and sacrifice optimality by searching to a fixed depth and using a heuristic function to estimate the value of states. Although this paper concentrates on the search algorithm, we also discuss ways of constructing heuristic functions suitable for this approach. Our results show that by interleaving search and execution, close to optimal policies can be found without the computational requirements of other approaches.

References

[1]

Ballard, B. W. 1983. The *-minimax search procedure for trees containing chance nodes. Artificial Intelligence, 21:327-350.

Digital Library

Google Scholar

[2]

Boutilier, C. and Dearden, R. 1994. Using abstractions for decision-theoretic planning with time constraints. In Proceedings of the Twelfth National Conference on Artificial Intelligence, Seattle. (to appear).

Digital Library

Google Scholar

[3]

Dean, T., Kaelbling, L. P., Kirman, J., and Nicholson, A. 1993a. Deliberation scheduling for time-critical decision making. In Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence, pages 309- 316, Washington, D.C.

Google Scholar

[4]

Dean, T., Kaelbling, L. P., Kirman, J., and Nicholson, A. 1993b. Planning with deadlines in stochastic domains. In Proceedings of the Eleventh National Conference on Artificial Intelligence, pages 574-579, Washington, D.C.

Digital Library

Google Scholar

[5]

Fikes, R. E. and Nilsson, N. J. 1971. Strips: A new approach to the application of theorem proving to problem solving. Artificial Intelligence, 2:189-208.

Crossref

Google Scholar

[6]

Howard, R. A. 1971. Dynamic Probabilistic Systems. Wiley, New York.

Google Scholar

[7]

Korf, R. E. 1990. Real-time heuristic search. Artificial Intelligence, 42:189-211.

Digital Library

Google Scholar

[8]

Kushmerick, N., Hanks, S., and Weld, D. 1993. An algorithm for probabilistic planning. Technical Report 93-06-04, University of Washington, Seattle.

Google Scholar

[9]

Pearl, J. 1984. Heuristics: Intelligent Search Strategies for Computer Problem Solving. Addison-Wesley, Reading, Massachusetts.

Digital Library

Google Scholar

[10]

Russell, S. J. and Wefald, E. 1991. Do the Right Thing: Studies in Limited Rationality. MIT Press, Cambridge.

Digital Library

Google Scholar

[11]

Tenenberg, J. D. 1991. Abstraction in planning. In Allen, J. F., Kautz, H. A., Pelavin, R. N., and Tenenberg, J. D., editors, Reasoning about Plans, pages 213-280. Morgan-Kaufmann, San Mateo.

Digital Library

Google Scholar

Cited By

View all

Pantke FEdelkamp SHerzog O(2014)Planning with Numeric Key Performance Indicators over Dynamic Organizations of Intelligent AgentsProceedings of the 12th German Conference on Multiagent System Technologies - Volume 873210.1007/978-3-319-11584-9_10(138-155)Online publication date: 23-Sep-2014
https://dl.acm.org/doi/10.1007/978-3-319-11584-9_10
Kearns MMansour YNg A(2002)A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision ProcessesMachine Language10.1023/A:101793242973749:2-3(193-208)Online publication date: 1-Nov-2002
https://dl.acm.org/doi/10.1023/A%3A1017932429737
Horvitz ESeiver A(1997)Time-critical actionProceedings of the Thirteenth conference on Uncertainty in artificial intelligence10.5555/2074226.2074256(250-257)Online publication date: 1-Aug-1997
https://dl.acm.org/doi/10.5555/2074226.2074256
Show More Cited By

Index Terms

Integrating planning and execution in stochastic domains
1. Computing methodologies
  1. Artificial intelligence
    1. Knowledge representation and reasoning
      1. Probabilistic reasoning
      2. Vagueness and fuzzy logic
2. Mathematics of computing
  1. Probability and statistics
    1. Stochastic processes

Recommendations

Planning and execution through variable resolution planning

Generating sequences of actions-plans-for robots using Automated Planning in stochastic and dynamic environments has been shown to be a difficult task with high computational complexity. These plans are composed of actions whose execution might fail due ...
An integrated approach to hierarchical planning, plan execution, and on-line learning in uncertain continuous domains
Cooperative planning and plan execution in partially observable dynamic domains
RoboCup 2004

In this paper we focus on plan execution in highly dynamic environments. Our plan execution procedure is part of a high-level planning system which controls the actions of our RoboCup team ”Mostly Harmless”. The used knowledge representation scheme is ...

Comments

Information & Contributors

Information

Published In

UAI'94: Proceedings of the Tenth international conference on Uncertainty in artificial intelligence

July 1994

616 pages

ISBN:1558603328

Editors:
Ramon Lopez De Mantaras
Artificial Intelligence Research Institute, CSIC
,
David Poole
University of British Columbia, Vancouver

Publisher

Morgan Kaufmann Publishers Inc.

San Francisco, CA, United States

Publication History

Published: 29 July 1994

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
44
Total Downloads

Downloads (Last 12 months)20
Downloads (Last 6 weeks)12

Reflects downloads up to 17 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Pantke FEdelkamp SHerzog O(2014)Planning with Numeric Key Performance Indicators over Dynamic Organizations of Intelligent AgentsProceedings of the 12th German Conference on Multiagent System Technologies - Volume 873210.1007/978-3-319-11584-9_10(138-155)Online publication date: 23-Sep-2014
https://dl.acm.org/doi/10.1007/978-3-319-11584-9_10
Kearns MMansour YNg A(2002)A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision ProcessesMachine Language10.1023/A:101793242973749:2-3(193-208)Online publication date: 1-Nov-2002
https://dl.acm.org/doi/10.1023/A%3A1017932429737
Horvitz ESeiver A(1997)Time-critical actionProceedings of the Thirteenth conference on Uncertainty in artificial intelligence10.5555/2074226.2074256(250-257)Online publication date: 1-Aug-1997
https://dl.acm.org/doi/10.5555/2074226.2074256
Boutilier C(1996)Planning, learning and coordination in multiagent decision processesProceedings of the 6th conference on Theoretical aspects of rationality and knowledge10.5555/1029693.1029710(195-210)Online publication date: 17-Mar-1996
https://dl.acm.org/doi/10.5555/1029693.1029710
Haddawy P(1996)Focusing attention in anytime decision-theoretic planningACM SIGART Bulletin10.1145/242587.2425967:2(34-40)Online publication date: 1-Apr-1996
https://dl.acm.org/doi/10.1145/242587.242596

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Planning and execution through variable resolution planning

An integrated approach to hierarchical planning, plan execution, and on-line learning in uncertain continuous domains

Cooperative planning and plan execution in partially observable dynamic domains

Comments

Published In

Sponsors

Publisher

Publication History

Qualifiers

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF

eReader

Login options

Full Access

Abstract

References

Cited By

Index Terms

Recommendations

Planning and execution through variable resolution planning

An integrated approach to hierarchical planning, plan execution, and on-line learning in uncertain continuous domains

Cooperative planning and plan execution in partially observable dynamic domains

Comments

Information

Published In

Sponsors

Publisher

Publication History

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

PDF

eReader

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations