site:www.cs.utexas.edu

www.cs.utexas.edu1d

Our students and faculty are changing the world through their contributions to computing education, research, and industry. These awards received by members of the UT Computer Science community make ...

www.cs.utexas.edu4d

Overlapping Layered Learning

Patrick MacAlpine and Peter Stone.

www.cs.utexas.edu4d

Transfer Learning for Reinforcement Learning Domains: A Survey

Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.

www.cs.utexas.edu4d

TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots

TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots. Todd Hester and Peter Stone. Machine Learning, 90(3):385–429, 2013.

www.cs.utexas.edu4d

Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism

Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism. Kurt Dresner and Peter Stone. In The Third International Joint Conference on Autonomous Agents and Multiagent Systems ...

www.cs.utexas.edu4d

Grounded Action Transformation for Robot Learning in Simulation

Grounded Action Transformation for Robot Learning in Simulation. Josiah Hanna and Peter Stone. @InProceedings{AAAI17-Hanna, author = {Josiah Hanna and Peter Stone}, title = {Grounded Action ...

www.cs.utexas.edu4d

The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications

In reinforcement learning (RL), a reward function that aligns exactly with a task's true performance metric is often sparse. For example, a true task metric might encode a reward of 1 upon success and ...

www.cs.utexas.edu4d

Mobile Robot Planning using Action Language BC with an Abstraction Hierarchy

Mobile Robot Planning using Action Language BC with an Abstraction Hierarchy. Shiqi Zhang, Fangkai Yang, Piyush Khandelwal, and Peter Stone. In Proceedings of the 13th International Conference on ...

www.cs.utexas.edu4d

Multiagent Systems: A survey from a machine learning perspective

Multiagent Systems: A survey from a machine learning perspective. Peter Stone and Manuela Veloso. Autonomous Robots, 8(3):345–383, July 2000. @Article(MASsurvey, Author="Peter Stone and Manuela Veloso ...

www.cs.utexas.edu4d

Generative Adversarial Imitation from Observation

Imitation from observation (IfO) is the problem of learning directly from state-only demonstrations without having access to the demonstrator's actions.The lack of action information both ...

www.cs.utexas.edu4d

Adversarial Imitation Learning from Video using a State Observer

Haresh Karnan, Garrett Warnell, Faraz Torabi, and Peter Stone. Adversarial Imitation Learning from Video using a State Observer. In International Conference on Robotics and Automation, 2022, May 2022.

www.cs.utexas.edu5d

CS 377P: Programming for Performance

In this assignment, you will implement parallel programs to compute an approximation to pi using the numerical integration program discussed in class. You will implement several variations of this ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results