Refereed Publications
2013
- W. Hamilton, M. M. Fard, J. Pineau. "Modelling sparse dynamical systems with compressed predictive state representations". International Conference on Machine Learning (ICML). 2013.
[.pdf]
- G. Panuccio, A. Guez, R. Vincent, M. Avoli, J. Pineau. "Adaptive control of epileptiform excitability in an in vivo model of limbic seizures". Experimental Neurology. To Appear.
[.pdf (unformatted)]
- J. Frank, S. Mannor, J. Pineau, D. Precup. "Time Series Analysis Using Geometric Template Matching". Transactions on Pattern Analysis and Machine Intelligence (PAMI). To Appear.
[Link]
- C. Hundt, P. Panangaden, J. Pineau, D. Precup, M. Dinculescu. "The duality of state and observation in probabilistic transition systems". TbiLLC 2011 (original presentation). Lecture Notes in Computer Science (LNCS) 7758. Springer. pp.206-230. 2013.
[.pdf]
- C. Paduraru, D. Precup, J. Pineau, G. Comanici. "An empirical analysis of off-policy learning in discrete MDPs". JMLR: Workshop and Conference Proceedings. 10th European Workshop on Reinforcement Learning. vol.24. pp.89-101.
[.pdf]
2012
- A.M.S. Barreto, D. Precup, J. Pineau. "On-line reinforcement learning using incremental kernel-based stochastic factorization". Neural Information Processing Systems (NIPS). 2012.
[.pdf] (main paper)
[.pdf] (supplemental material)
- S. Png, J. Pineau, B. Chaib-draa. "Building adaptive dialogue systems via Bayes-adaptive POMDP". IEEE Journal of Selected Topics in Signal Processing. vol.6(8). 2012.
[.pdf]
- G. Shani, J. Pineau, R. Kaplow. "A survey of point-based POMDP solvers". Autonomous Agents and Multi-Agent Systems. 2012.
[.pdf]
- K. Bush, G. Panuccio, M. Avoli, J. Pineau. "Evidence-based modeling of network discharge dynamics during periodic pacing to control epileptiform activity". Journal of Neuroscience Methods. 2012. vol.204. pp.318-325.
[.pdf]
- F. Doshi-Velez, J. Pineau, N. Roy. "Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs". Artificial Intelligence. vol.187-188. August 2012. pp.115-132.
[.pdf]
- M. M. Fard, Y. Grinberg, J. Pineau, D. Precup, "Compressed Least-Squares Regression on Sparse Spaces", Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI'12). 2012.
[.pdf]
- M. M. Fard, Y. Grinberg, J. Pineau, D. Precup. "Random Projections Preserve Linearity in Sparse Spaces". Technical Report. 2012. (Companion to our AAAI'12 paper.)
[.pdf]
- E. Tsang, S. C. W. Ong, J. Pineau. "Design and Evaluation of a Flexible Interface for Spatial Navigation". Canadian Conference on Computer and Robot Vision. 2012.
[.pdf]
- C. Paduraru, D. Precup, J. Pineau, G. Comanici. "A Study of Off-policy Learning in Computational Sustainability". European Workshop on Reinforcement Learning (EWRL). 2012.
[.pdf]
- M. M. Fard, Y. Grinberg, J. Pineau, D. Precup. "Bellman Error Based Feature Generation Using Random Projections". European Workshop on Reinforcement Learning (EWRL). 2012.
[.pdf]
2011
- A. M. S. Barreto, D. Precup, J. Pineau. "Reinforcement Learning using Kernel-Based Stochastic Factorization". Neural Information Processing Systems (NIPS-24). 2011.
[.pdf]
- S. Ross, J. Pineau, B. Chaib-draa, P. Kreitmann. "A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes". Journal of Machine Learning Research (JMLR). 12. pp.1655-1696. 2011.
[.pdf]
- M. M. Fard, J. Pineau. "Non-Deterministic Policies in Markovian Decision Processes". Journal of Artificial Intelligence Research (JAIR). 40. pp.1-24. 2011.
[.pdf]
- S. M. Shortreed, E. Laber, D. J. Lizotte, S. Stroup, J. Pineau, S. Murphy. "Informing sequential clinical decision-making through reinforcement learning: an empirical study". Machine Learning. 84(1). pp.109-136. 2011.
[Link] (Or email me for a copy.)
- J. Pineau, R. West, A. Atrash, J. Villemure, F. Routhier. "On the Feasibility of Using a Standardized Test for Evaluating a Speech-Controlled Smart Wheelchair". International Journal of Intelligent Control and Systems. 16(2). pp.121-128. 2011.
[.pdf]
- R.D. Vincent, A. Courville, J. Pineau. "A bistable computational model of recurring epileptiform activity as observed in rodent slice preparation". Neural Networks. 24(6) pp.526-537. 2011.
[.pdf]
- M. Fard, J. Pineau, C. Szepesvari. "PAC-Bayesian Policy Evaluation for Reinforcement Learning". Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI). 2011.
[.pdf]
- K. Deng, J. Pineau and S.A. Murphy. "Active Learning for Developing Personalized Treatment". Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI). 2011.
[.pdf]
- S. Png, J. Pineau. "Bayesian Reinforcement Learning for POMDP-based dialogue systems". International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2011.
[.pdf]
- A. K. Moghaddam, J. Pineau, J. Frank, P. S. Archambault, F. Routhier, T. Audet, J. Polgar, F. Michaud, P. Boissy. "Mobility Profile and Wheelchair Driving Skills of Powered Wheelchair Users: Sensor-Based Event Recognition Using a Support Vector Machine Classifier". 33rd Annual International IEEE EMBS Conference. 2011.
[.pdf]
- S.C.W. Ong, Y. Grinberg, J. Pineau. "Goal-directed online learning of predictive models". European Workshop on Reinforcement Learning (EWRL). LNCS. Springer. 2011.
- C. Paduraru, D. Precup, J. Pineau. "A framework for computing bounds for the return of a policy". European Workshop on Reinforcement Learning (EWRL). LNCS. Springer. 2011.
- K. Deng, J. Pineau, S. Murphy. "Active Learning for Personalizing Treatment". IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL). 2011.
[.pdf]
- Y. Grinberg, M. M. Fard, J. Pineau, "Least-Squares Regression on Sparse Spaces", NIPS Workshop on Sparse Representation and Low-rank Approximation (NIPS'11). 2011.
[.pdf]
- Y. Grinberg, M. M. Fard, J. Pineau, "LSTD on Sparse Spaces", NIPS Workshop on New Frontiers in Model Order Selection (NIPS'11). 2011.
[.pdf]
2010
- M. M. Fard, J. Pineau. "PAC-Bayesian Model Selection for Reinforcement Learning". Neural Information Processing Systems (NIPS-23). 2010.
[.pdf]
- R. West, D. Precup, J. Pineau. "Automatically Suggesting Topics for Augmenting Text Documents". The 19th ACM Conference on Information and Knowledge Management (CIKM). 2010.
[.pdf]
- W. Honore, A. Atrash, P. Boucher, R. Kaplow, S. Kelouwani, H. Nguyen, J. Villemure, R. West, F. Routhier, P. Stone, C. Dufour, J.-P. Dussault, D. Rock, P. Cohen, L. Demers, R. Forget, J. Pineau. "Human-Oriented Design and Initial Validation of an Intelligent Powered Wheelchair". RESNA Annual Conference. 2010.
[.pdf]
[.html]
- A. Guez, J. Pineau. "Multi-Tasking SLAM". International Conference on Robotics and Automation (ICRA). 2010.
[.pdf]
- R. Kaplow, A. Atrash, J. Pineau. "Variable Resolution Decomposition For Robotic Navigation Under a POMDP Framework". International Conference on Robotics and Automation (ICRA). 2010.
[.pdf]
- J. Pineau, R. West, A. Atrash, J. Villemure, F. Routhier. "Towards a Standardized Test for Intelligent Wheelchairs". Performance Metrics for Intelligent Systems (PerMIS). 2010.
[.pdf]
- J. Pineau, A. Atrash, R. Kaplow, J. Villemure. "On the design and validation of an intelligent powered wheelchair: Lessons from the SmartWheeler project". CIM Symposium on Brain, Body and Machine. 2010.
[.pdf]
2009
- A. Atrash, R. Kaplow, J. Villemure, R. West, H. Yamani, J. Pineau. "Development and Validation of a Robust Interface for Improved Human-Robot Interaction". International Journal of Social Robotics. 2009.
[.pdf]
- J. Pineau, A. Guez, R. Vincent, G. Panuccio, M. Avoli. "Treating epilepsy via adaptive neurostimulation: A reinforcement learning approach". International Journal of Neural Systems. 19(4). pp.227-240. 2009.
(Email me for a copy.)
- K. Bush, J. Pineau. "Manifold Embeddings for Model-Based Reinforcement Learning under Partial Observability". Neural Information Processing Systems (NIPS-22). 2009.
[.pdf]
- R. West, D. Precup, J. Pineau. "Completing Wikipedia's Hyperlink Structure through Dimensionality Reduction". The 18th ACM Conference on Information and Knowledge Management (CIKM). 2009.
[.pdf]
- R. West, J. Pineau and D. Precup. "Wikispeedia: An Online Game for Inferring Semantic Distances between Concepts". International Joint Conferences on Artificial Intelligence (IJCAI). 2009.
[.pdf]
- A. Atrash and J. Pineau. "A Bayesian Reinforcement Learning Approach for Customizing Human-Robot Interfaces". International Conference on Intelligent User Interfaces (IUI). 2009.
[.pdf]
- K. Bush, J. Pineau & M. Avoli. "Manifold Embeddings for Model-Based Reinforcement Learning of Neurostimulation Policies". ICML/UAI/COLT Workshop on Abstraction in Reinforcement Learning. 2009.
[.pdf]
2008
- S. Ross, J. Pineau, S. Paquet, B. Chaib-draa. "Online Planning Algorithms for POMDPs". Journal of Artificial Intelligence Research (JAIR). 32. pp.663-704. 2008.
[.pdf]
- M. M. Fard and J. Pineau. "MDPs with Non-Deterministic Policies". Neural Information Processing Systems (NIPS-21). 2008.
[.pdf]
- S. Ross & J. Pineau. "Model-Based Bayesian Reinforcement Learning in Large Structured Domains". Uncertainty in Artificial Intelligence (UAI). 2008.
[.pdf]
- M. M. Fard, J. Pineau and P. Sun. "A Variance Analysis for POMDP Policy Evaluation". AAAI Conference on Artificial Intelligence. 2008.
[.pdf]
- A. Guez, R. Vincent, M. Avoli, & J. Pineau. "Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning". Innovative Applications of Artificial Intelligence (IAAI). 2008.
[.pdf]
- F. Doshi, J. Pineau and N. Roy. "Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs". International Conference on Machine Learning (ICML). 2008.
[.pdf]
- S. Ross, B. Chaib-draa, & J. Pineau. "Bayesian Reinforcement Learning in Continuous POMDPs with Application to Robot Navigation". International Conference on Robotics and Automation (ICRA). 2008.
[.pdf]
2007
- J. Pineau, M.G. Bellemare, A.J. Rush, A. Ghizaru, & S.A. Murphy.
"Constructing evidence-based treatment strategies using methods from computer science."
Drug and Alcohol Dependence. 88S. Elsevier. pp.S52-S60. 2007. (Email me for a copy.)
- R. Jaulmes, J. Pineau & D. Precup.
"Apprentissage actif dans les Processus Decisionnels de Markov Partiellement Observables".
Revue d'Intelligence Artificielle. 2(1). pp.9-34. 2007.
[.pdf]
- S. Ross, B. Chaib-draa, & J. Pineau. "Bayes-Adaptive POMDPs". Neural Information Processing Systems (NIPS-20). 2007.
[.pdf]
Tech report version with the proofs:
[.pdf]
- S. Ross, J. Pineau & B. Chaib-draa. "Theoretical Analysis of Heuristic Search Methods for Online POMDPs". Neural Information Processing Systems (NIPS-20). 2007.
[.pdf]
- R. Jaulmes, J. Pineau & D. Precup.
"A formal framework for robot learning and control under model uncertainty."
IEEE International Conference on Robotics and Automation (ICRA). 2007.
[.pdf]
- R. Vincent, J. Pineau, P. de Guzman & M. Avoli.
"Recurrent Boosting for Classification of Natural and Synthetic Time-Series Data".
Canadian Conference on Artificial Intelligence (CanAI). pp.192-293. 2007.
[.pdf]
- J. Pineau & A. Atrash for the SmartWheeler team.
"SmartWheeler: A robotic wheelchair test-bed for investigating new models of human-robot interaction."
AAAI Spring Symposium on Multidisciplinary Collaboration for Socially Assistive Robotics. 2007.
[.pdf]
2006
- J. Pineau, G. Gordon & S. Thrun.
"Anytime point-based approximations for large POMDPs."
Journal of Artificial Intelligence Research (JAIR). 27. pp.335-380. 2006.
[.pdf]
- R. Gavalda, P.W. Keller, J. Pineau and D. Precup.
"PAC-Learning of Markov Models with Hidden State".
European Conference on Machine Learning (ECML). 2006.
[.pdf]
- C. Hundt, P. Panangaden, J. Pineau & D. Precup.
"Representing systems with hidden state".
National Conference on Artificial Intelligence (AAAI). 2006.
[.pdf]
- D. Burfoot, J. Pineau & D. Dudek.
"RRT-Plan: a Randomized Algorithm for STRIPS Planning".
International Conference on Automated Planning and Scheduling (ICAPS). 2006.
[.pdf]
- A. Atrash & J. Pineau
"Efficient Planning and Tracking in POMDPs with Large Observation Spaces".
AAAI-06 Workshop on Empirical and Statistical Approaches for Spoken Dialogue Systems. 2006.
[.pdf]
- R. Vincent, J. Pineau, P. de Guzman, & M. Avoli
"Recurrent Boosting Method for Time-Dependent Classification of Epileptiform Signals".
North East Student Colloquium on Artificial Intelligence (NESCAI). 2006.
[.pdf]
2005
- J. Pineau & G. Gordon.
"POMDP Planning for Robust Robot Control". International Symposium on Robotics Research (ISRR).
San Francisco, CA. 2005.
[.pdf]
- R. Jaulmes, J. Pineau & D. Precup
"Active Learning in Partially Observable Markov Decision Processes". European Conference on Machine Learning
(ECML). Porto, Portugal. 2005.
[.pdf]
- R. Jaulmes, J. Pineau & D. Precup
"Active Learning in Partially Observable Markov Decision Processes".
NIPS Workshop on Value of Information in Inference, Learning and Decision-Making. Whistler, Canada. 2005
[.pdf]
- R. Jaulmes, J. Pineau & D. Precup
"Probabilistic Robot Planning Under Model Uncertainty: An Active Learning Approach".
NIPS Workshop on Machine Learning Based Robotics in Unstructured Environments. Whistler, Canada. 2005
[.pdf]
- R. Jaulmes, J. Pineau & D. Precup
"Learning in Non-Stationary Partially Observable Markov Decision Processes".
ECML Workshop on Reinforcement Learning in Non-Stationary Environments. Porto, Portugal. 2005.
[.pdf]
2004
- J. Pineau, G. Gordon & S. Thrun.
"Applying Metric-Trees to Belief-Point POMDPs". Neural Information Processing Systems
(NIPS-16). Vancouver, Canada. 2004.
[.ps]
[.pdf]
2003
- J. Pineau, M. Montemerlo, M. Pollack, N. Roy, & S. Thrun.
"Towards robotic assistants in nursing homes: Challenges and results".
Special issue on Socially Interactive Robots, Robotics and Autonomous Systems 42 (3-4). pp.271-281. 2003.
[.pdf]
- J. Pineau, G. Gordon & S. Thrun.
"Point-based value iteration: An anytime algorithm for POMDPs". International
Joint Conference on Artificial Intelligence (IJCAI). Acapulco, Mexico. pp. 1025-1032. Aug. 2003.
[.ps]
[.pdf]
- J. Pineau, G. Gordon & S. Thrun.
"Policy-contingent abstraction for robust robot control". Conference on Uncertainty
in Artificial Intelligence (UAI). Acapulco, Mexico. pp. 477-484. Aug. 2003.
[.ps]
[.pdf]
2002
- M. Montemerlo, J. Pineau, N. Roy, S. Thrun & V. Verma.
"Experiences with a Mobile Robotic Guide for the Elderly". National
Conference on Artificial Intelligence (AAAI). Edmonton, AB. pp. 587-592. Aug. 2002.
[.ps]
[.pdf]
- J. Pineau, M. Montemerlo, M. Pollack, N. Roy & S. Thrun.
"Probabilistic control of human robot interaction: Experiments with a robotic assistant for Nursing Homes". The second IARP/IEEE/RAS Joint Workshop on Technical Challenges for Robots in Human Environments (DRHE). Toulouse, France. Oct. 2002.
[.ps]
[.pdf]
- J. Pineau & S. Thrun.
"High-level robot behaviour control with POMDPs".
AAAI Workshop on Cognitive Robotics. Edmonton, Canada. Aug. 2002.
[.ps]
[.pdf]
[Presentation slides (.ppt)]
- M. Pollack, S. Engberg, J.T. Matthews, S. Thrun, L. Brown, D. Colbry, C. Orosz, B. Peintner, S. Ramakrishnan, J. Dunbar-Jacob, C. McCarthy, M. Montemerlo, J. Pineau, & N. Roy.
"Pearl: A Mobile Robotic Assistant for the Elderly".
Workshop on Automation as Caregiver: the Role of Intelligent Technology in Elder Care (AAAI).
Edmonton, AB. Aug. 2002.
2001
- J. Pineau, N. Roy & S. Thrun.
"A Hierarchical Approach to POMDP Planning and Execution".
ICML Workshop on Hierarchy and Memory in Reinforcement Learning. Williams College, MA. June 2001.
[.ps]
[.pdf]
- N. Roy, G. Baltus, D. Fox, F. Gemperle, J. Goetz, T. Hirsch, D. Margaritis, M. Montemerlo, J. Pineau, J. Schulte & S. Thrun.
"Towards Personal Service Robots for the Elderly". Workshop on Interactive Robots and Entertainment (WIRE). Pittsburgh, PA. 2000.
[.ps]
[.pdf]
[.html]
2000
- N. Roy, J. Pineau & S. Thrun.
"Spoken Dialog Management Using Probabilistic Reasoning". Association for Computational
Linguistics (ACL). Hong Kong, Oct. 2000.
[.ps]
[.pdf]
- D. Goddeau & J. Pineau.
"Fast Reinforcement Learning of Dialog Strategies". IEEE Conference on Acoustics,
Speech and Signal Processing (ICASSP). Istanbul, Turkey. June 2000.
[.ps]
[.pdf]
[Presentation slides (.html)]
Book chapters
- N. Roy & J. Pineau.
"Robotics and Independence for the Elderly". In Growing Old in a Technological Society. G. Lesnoff-Caravaglia (Ed.) C. Thomas of Springfield. Illinois, USA. pp.209-242. 2007.
Edited volumes
- N. Vlassis, G. Gordon, & J. Pineau, editors.
"Reasoning with Uncertainty in Robotics".
Proceedings from the IJCAI-05 Workshop on Reasoning with Uncertainty in Robotics (RUR-05).
International Joint Conference on Artificial Intelligence. Edinburgh, Scotland. 2005
[.pdf]
Technical Reports
- S. Ross, B. Chaib-draa, & J. Pineau. "Bayes-Adaptive POMDPs". SOCS-TR-2007. School of Computer Science. McGill University. 2007. (NOTE: This is a long version of the identically-titled NIPS paper above. It includes proofs for the lemmas and theorems presented in the paper.)
[.pdf]
- D. Burfoot, J. Pineau & D. Dudek.
"RRT-Plan: a Randomized Algorithm for STRIPS Planning".
CIM-TR-2006. McGill University, Center for Intelligent Machines. 2006. (NOTE: This is a long version of the identically-titled ICAPS paper above.)
[.pdf]
- J. Pineau, G. Gordon & S. Thrun.
"Point-based approximations for fast POMDP solving".
SOCS-TR-2005.4. McGill University. School of Computer Science. 2005.
[.pdf]
- J. Pineau & S. Thrun.
"An integrated approach to hierarchy and abstraction for POMDPs".
CMU-RI-TR-02-21. Carnegie Mellon University. Robotics Institute. 2002.
[.ps]
[.pdf]
Thesis
- J. Pineau.
"Tractable Planning Under Uncertainty: Exploiting Structure". PhD Thesis. August 2004.
[.ps]
[.pdf]
[Presentation slides (.pdf)]
- J. Pineau.
"Hierarchical Methods for Planning under Uncertainty". Thesis proposal. June 2001.
[.ps]
[.pdf]
Last modified: Thu May 18 10:04:28 EDT 2005