skip to main content
10.5555/2074394.2074416guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article
Free access

Integrating planning and execution in stochastic domains

Published: 29 July 1994 Publication History

Abstract

We investigate planning in time-critical domains represented as Markov Decision Processes, showing that search based techniques can be a very powerful method for finding close to optimal plans. To reduce the computational cost of planning in these domains, we execute actions as we construct the plan, and sacrifice optimality by searching to a fixed depth and using a heuristic function to estimate the value of states. Although this paper concentrates on the search algorithm, we also discuss ways of constructing heuristic functions suitable for this approach. Our results show that by interleaving search and execution, close to optimal policies can be found without the computational requirements of other approaches.

References

[1]
Ballard, B. W. 1983. The *-minimax search procedure for trees containing chance nodes. Artificial Intelligence, 21:327-350.
[2]
Boutilier, C. and Dearden, R. 1994. Using abstractions for decision-theoretic planning with time constraints. In Proceedings of the Twelfth National Conference on Artificial Intelligence, Seattle. (to appear).
[3]
Dean, T., Kaelbling, L. P., Kirman, J., and Nicholson, A. 1993a. Deliberation scheduling for time-critical decision making. In Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence, pages 309- 316, Washington, D.C.
[4]
Dean, T., Kaelbling, L. P., Kirman, J., and Nicholson, A. 1993b. Planning with deadlines in stochastic domains. In Proceedings of the Eleventh National Conference on Artificial Intelligence, pages 574-579, Washington, D.C.
[5]
Fikes, R. E. and Nilsson, N. J. 1971. Strips: A new approach to the application of theorem proving to problem solving. Artificial Intelligence, 2:189-208.
[6]
Howard, R. A. 1971. Dynamic Probabilistic Systems. Wiley, New York.
[7]
Korf, R. E. 1990. Real-time heuristic search. Artificial Intelligence, 42:189-211.
[8]
Kushmerick, N., Hanks, S., and Weld, D. 1993. An algorithm for probabilistic planning. Technical Report 93-06-04, University of Washington, Seattle.
[9]
Pearl, J. 1984. Heuristics: Intelligent Search Strategies for Computer Problem Solving. Addison-Wesley, Reading, Massachusetts.
[10]
Russell, S. J. and Wefald, E. 1991. Do the Right Thing: Studies in Limited Rationality. MIT Press, Cambridge.
[11]
Tenenberg, J. D. 1991. Abstraction in planning. In Allen, J. F., Kautz, H. A., Pelavin, R. N., and Tenenberg, J. D., editors, Reasoning about Plans, pages 213-280. Morgan-Kaufmann, San Mateo.

Cited By

View all
  • (2014)Planning with Numeric Key Performance Indicators over Dynamic Organizations of Intelligent AgentsProceedings of the 12th German Conference on Multiagent System Technologies - Volume 873210.1007/978-3-319-11584-9_10(138-155)Online publication date: 23-Sep-2014
  • (2002)A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision ProcessesMachine Language10.1023/A:101793242973749:2-3(193-208)Online publication date: 1-Nov-2002
  • (1997)Time-critical actionProceedings of the Thirteenth conference on Uncertainty in artificial intelligence10.5555/2074226.2074256(250-257)Online publication date: 1-Aug-1997
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
UAI'94: Proceedings of the Tenth international conference on Uncertainty in artificial intelligence
July 1994
616 pages
ISBN:1558603328

Sponsors

  • Information Extraction and Transportation
  • HUGIN: HUGIN
  • Microsoft: Microsoft
  • KI: Knowledge Industries

Publisher

Morgan Kaufmann Publishers Inc.

San Francisco, CA, United States

Publication History

Published: 29 July 1994

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)12
Reflects downloads up to 17 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2014)Planning with Numeric Key Performance Indicators over Dynamic Organizations of Intelligent AgentsProceedings of the 12th German Conference on Multiagent System Technologies - Volume 873210.1007/978-3-319-11584-9_10(138-155)Online publication date: 23-Sep-2014
  • (2002)A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision ProcessesMachine Language10.1023/A:101793242973749:2-3(193-208)Online publication date: 1-Nov-2002
  • (1997)Time-critical actionProceedings of the Thirteenth conference on Uncertainty in artificial intelligence10.5555/2074226.2074256(250-257)Online publication date: 1-Aug-1997
  • (1996)Planning, learning and coordination in multiagent decision processesProceedings of the 6th conference on Theoretical aspects of rationality and knowledge10.5555/1029693.1029710(195-210)Online publication date: 17-Mar-1996
  • (1996)Focusing attention in anytime decision-theoretic planningACM SIGART Bulletin10.1145/242587.2425967:2(34-40)Online publication date: 1-Apr-1996

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media