Doina Precup's Publications

Note: Publications from 2002 are not yet listed!

DISSERTATION

Precup, D. (2000). "Temporal Abstraction in Reinforcement Learning". Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst.


JOURNAL ARTICLES

Sutton, R. S., Precup, D., Singh, S. (1999). "Between MDPs and semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning". In Artificial Intelligence, vol. 112, pp. 181-211.
An earlier version appeared as Technical Report UM-CS-1998-74, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610.


REFEREED CONFERENCES AND WORKSHOPS

Letia, I.A., Precup, D., Craciun, F. (2001) "Developing collaborative Golog agents by reinforcement learning". To appear in Proceedings of the Thirteenth Conference on Intelligent Tools with Artificial Intelligence (ICTAI 2001). IEEE Computer Press.

Precup, D., Sutton, R.S., Dasgupta, S. (2001) "Off-policy temporal-difference learning with function approximation". In Proceedings of the Eighteenth Conference on Machine Learning (ICML 2001), pp. 417-424. Morgan Kaufmann.

Precup, D., Sutton, R. S., Singh, S. (2000) "Eligibility Traces for Off-Policy Policy Evaluation". In Proceedings of the Seventeenth Conference on Machine Learning (ICML 2000), pp. 759-766. Morgan Kaufmann.

Sutton, R. S., Singh, S., Precup, D., Ravindran, B. (1999) "Improved Switching among Temporally Abstract Actions". In Advances in Neural Information Processing Systems 11 (Proceedings of NIPS'98), pp. 1066-1072. MIT Press.

Sutton, R. S., Precup, D., Singh, S. (1998). "Intra-Option Learning about Temporally Abstract Actions". In Proceedings of the Fifteenth International Conference on Machine Learning (ICML'98), pp. 556-564. Morgan Kaufmann.

Precup, D., Utgoff, P.E. (1998). "Classification using Phi-machines and constructive function approximation". In Proceedings of the Fifteenth International Conference on Machine Learning (ICML'98), pp. 439-444. Morgan Kaufmann.
An earlier version appeared as Technical Report UM-CS-1997-005, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610.

Precup, D., Sutton, R. S., Singh, S. (1998). "Theoretical Results on Reinforcement Learning with Temporally Abstract Options". In Machine Learning: ECML-98. 10th European Conference on Machine Learning, Chemnitz, Germany, April 1998. Proceedings, pp. 382-393. Springer Verlag.

Precup, D., Sutton, R. S. (1998). "Multi-Time Models for Temporally Abstract Planning". In Advances in Neural Information Processing Systems 10 (Proceedings of NIPS'97), pp. 1050-1056. MIT Press.

Moss, J. E. B., Utgoff, P. E., Cavazos, J., Precup, D., Stefanovic, D., Brodley, C. E., Scheeff, D. T. (1998). "Learning to Schedule Straight-Line Code". In Advances in Neural Information Processing Systems 10 (Proceedings of NIPS'97), pp. 929-935. MIT Press.

Precup, D., Sutton, R. S., Singh, S. (1997). "Planning with Closed-Loop Macro Actions". In Working Notes of the AAAI Fall Symposium '97 on Model-directed Autonomous Systems, pp. 70-76.

Precup, D., Sutton, R. S. (1997) "Multi-Time Models for Reinforcement Learning". In Proceedings of the ICML'97 Workshop on Modelling in Reinforcement Learning.

Precup, D., Sutton, R. S. (1997) "Exponentiated Gradient Methods for Reinforcement Learning". In Proceedings of the Fourteenth International Conference on Machine Learning (ICML'97), pp. 272-277. Morgan Kaufmann.
An earlier version appeared as Technical Report UM-CS-1996-070, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610.

McGeoch, C.C., Precup, D., Cohen, P.R. (1997) "How to Find Big-Oh in Your Data Set (and How Not To)". In "Advances in Intelligent Data Analysis: Reasoning about Data." Proceedings of the Second International Symposium on Intelligent Data Analysis, IDA-97, pp. 41-52. Springer Verlag.

Letia, I.A., Precup, D. (1995) "Knowledge Transfer when Learning a Second Programming Language". In Proceedings of the 6th IFIP World Conference on Computers in Education, pp. 97-106. Chapman and Hall.

Precup, D., Precup, T. (1995) "Trajectory Simulation and Optimization for Fuzzy Controlled Mobile Robot". In Proceedings of the 3rd IFAC/IFIP/IFORS Workshop, Intelligent Manufacturing Systems, IMS'95, Bucharest, Romania. Preprints, pp. 39-43. Also accepted for publication in the post-prints, edited by Elsevier Science.


BOOK CHAPTERS

Utgoff, P.E., Precup, D. (1998). "Constructive function approximation". In Motoda & Liu (Eds.), Feature extraction, construction, and selection: A data-mining perspective. Kluwer.
An earlier version appeared as Technical Report UM-CS-1997-004, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610.

UNREFEREED PUBLICATIONS

McGovern, A., Precup, D., Ravindran, B., Singh, S., Sutton, R. S. (1998). "Hierarchical Optimal Control of MDPs". In Proceedings of the Tenth Yale Workshop on Adaptive and Learning Systems, pp. 186-191.

Maties, V., Precup, T., Precup, D., Sipos, C. (1994). "Simulation of a Fuzzy-Guided Mobile Robot in a Static Environment". In Proceedings of the National Conference on Systems Theory, Robotics and Automatic Control SINTES 7, Craiova, Romania, pp. 203-208.

Precup, D., Precup, T., Sipos, C. (1994). "Simulation of Automatic Fuzzy Controllers for Robot Guidance". In Proceedings of the Basis of Electronics Workshop, Cluj-Napoca, Romania, pp. 58-63.


TECHNICAL REPORTS WHICH DO NOT OVERLAP WITH PREVIOUS PUBLICATIONS

Perkins, T.J., Precup, D. (1999) "Using Options for Knowledge Transfer in Reinforcement Learning", Technical Report UM-CS-1999-034, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610.

Utgoff, P.E., Precup, D. (1997) "Relative Value Function Approximation", Technical Report UM-CS-1997-003, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610.


OTHER DOCUMENTS

Using Analogic Reasoning for Natural Language Semantic Processing, "Advanced studies in computer science" MScCSE graduation project, Technical University Cluj-Napoca, Romania, 1995.

Student Modeling for Learning a Second Programming Language, BScCSE graduation project, Technical University Cluj-Napoca, Romania, 1995.