default search action

combined dblp search
author search
venue search
publication search

ask others

Scott Niekum

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c73]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChuckFQSA0N25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChuckFQSA0N25
Caleb Chuck, Fan Feng, Carl Qi, Chang Shi, Siddhant Agarwal, Amy Zhang, Scott Niekum:
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning. ICLR 2025
[c72]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/XuLSN025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/XuLSN025
Haoran Xu, Shuozhe Li, Harshit Sikchi, Scott Niekum, Amy Zhang:
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning. ICLR 2025
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-06416
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-06416
Stephane Hatgis-Kessell, W. Bradley Knox, Serena Booth, Scott Niekum, Peter Stone:
Influencing Humans to Conform to Preference Models for RLHF. CoRR abs/2501.06416 (2025)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-18447
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-18447
Will Schwarzer, Jordan Schneider, Philip S. Thomas, Scott Niekum:
Supervised Reward Inference. CoRR abs/2502.18447 (2025)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-07896
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-07896
Harshit Sikchi, Andrea Tirinzoni, Ahmed Touati, Yingchen Xu, Anssi Kanervisto, Scott Niekum, Amy Zhang, Alessandro Lazaric, Matteo Pirotta:
Fast Adaptation with Behavioral Foundation Models. CoRR abs/2504.07896 (2025)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-13368
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-13368
Haoran Xu, Shuozhe Li, Harshit Sikchi, Scott Niekum, Amy Zhang:
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning. CoRR abs/2504.13368 (2025)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-14716
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-14716
Tuhina Tripathi, Manya Wadhwa, Greg Durrett, Scott Niekum:
Pairwise or Pointwise? Evaluating Feedback Protocols for Bias in LLM-Based Evaluation. CoRR abs/2504.14716 (2025)
[i65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-03172
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-03172
Caleb Chuck, Fan Feng, Carl Qi, Chang Shi, Siddhant Agarwal, Amy Zhang, Scott Niekum:
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning. CoRR abs/2505.03172 (2025)
[i64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-01692
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-01692
Sylee Dandekar, Shripad Deshmukh, Frank Chiu, W. Bradley Knox, Scott Niekum:
A Descriptive and Normative Theory of Human Beliefs in RLHF. CoRR abs/2506.01692 (2025)
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-08266
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-08266
Yaswanth Chittepu, Blossom Metevier, Will Schwarzer, Austin Hoag, Scott Niekum, Philip S. Thomas:
Reinforcement Learning from Human Feedback with High-Confidence Safety Constraints. CoRR abs/2506.08266 (2025)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-08574
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-08574
Ameya Agaskar, Sriram Siva, William Pickering, Kyle O'Brien, Charles Kekeh, Ang Li, Brianna Gallo Sarker, Alicia Chua, Mayur Nemade, Charun Thattai, Jiaming Di, Isaac Iyengar, Ramya Dharoor, Dino Kirouani, Jimmy Erskine, Tamir Hegazy, Scott Niekum, Usman A. Khan, Federico Pecora, Joseph W. Durham:
DeepFleet: Multi-Agent Foundation Models for Mobile Robots. CoRR abs/2508.08574 (2025)
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-19464
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-19464
Shripad Vilasrao Deshmukh, Will Schwarzer, Scott Niekum:
Evaluation-Aware Reinforcement Learning. CoRR abs/2509.19464 (2025)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-22851
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-22851
Yaswanth Chittepu, Prasann Singhal, Greg Durrett, Scott Niekum:
Adaptive Margin RLHF via Preference over Preferences. CoRR abs/2509.22851 (2025)
2024
[j9]
- view
  - electronic edition @ umass.edu (open access)
  - details & citations
- export record
  dblp key:
  - conf/rlc/RudolphCBLN024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rlc/RudolphCBLN024
Max Rudolph, Caleb Chuck, Kevin Black, Misha Lvovsky, Scott Niekum, Amy Zhang:
Learning Action-based Representations Using Invariance. RLJ 1: 342-365 (2024)
[j8]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/HannaCTW0N24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/HannaCTW0N24
Josiah P. Hanna, Yash Chandak, Philip S. Thomas, Martha White, Peter Stone, Scott Niekum:
Data-Efficient Policy Evaluation Through Behavior Policy Search. J. Mach. Learn. Res. 25: 313:1-313:58 (2024)
[j7]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/ChuckBAZN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/ChuckBAZN24
Caleb Chuck, Kevin Black, Aditya Arjun, Yuke Zhu, Scott Niekum:
Granger Causal Interaction Skill Chains. Trans. Mach. Learn. Res. 2024 (2024)
[j6]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/KnoxHBNSA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/KnoxHBNSA24
W. Bradley Knox, Stephane Hatgis-Kessell, Serena Booth, Scott Niekum, Peter Stone, Alessandro Gabriele Allievi:
Models of human preference for learning reward functions. Trans. Mach. Learn. Res. 2024 (2024)
[c71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/KnoxHABDSN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/KnoxHABDSN24
W. Bradley Knox, Stephane Hatgis-Kessell, Sigurdur O. Adalgeirsson, Serena Booth, Anca D. Dragan, Peter Stone, Scott Niekum:
Learning Optimal Advantage from Preferences and Mistaking It for Reward. AAAI 2024: 10066-10073
[c70]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/BiswasPCHNAA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/BiswasPCHNAA24
Abhijat Biswas, Badal Arun Pardhi, Caleb Chuck, Jarrett Holtz, Scott Niekum, Henny Admoni, Alessandro Allievi:
Gaze Supervision for Mitigating Causal Confusion in Driving Agents. AAMAS 2024: 2159-2161
[c69]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/SikchiC0N24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/SikchiC0N24
Harshit Sikchi, Caleb Chuck, Amy Zhang, Scott Niekum:
A Dual Approach to Imitation Learning from Observations with Offline Datasets. CoRL 2024: 1125-1147
[c68]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/HejnaRSFNKS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HejnaRSFNKS24
Joey Hejna, Rafael Rafailov, Harshit Sikchi, Chelsea Finn, Scott Niekum, W. Bradley Knox, Dorsa Sadigh:
Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning. ICLR 2024
[c67]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/SikchiCTG0N24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SikchiCTG0N24
Harshit Sikchi, Rohan Chitnis, Ahmed Touati, Alborz Geramifard, Amy Zhang, Scott Niekum:
Score Models for Offline Goal-Conditioned Reinforcement Learning. ICLR 2024
[c66]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/SikchiZ0N24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SikchiZ0N24
Harshit Sikchi, Qinqing Zheng, Amy Zhang, Scott Niekum:
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning. ICLR 2024
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/ivs/BiswasPCHNAA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ivs/BiswasPCHNAA24
Abhijat Biswas, Badal Arun Pardhi, Caleb Chuck, Jarrett Holtz, Scott Niekum, Henny Admoni, Alessandro Allievi:
Gaze Supervision for Mitigating Causal Confusion in Driving Agents. IV 2024: 2331-2338
[c64]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChungN024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChungN024
Stephen Chung, Scott Niekum, David Krueger:
Predicting Future Actions of Reinforcement Learning Agents. NeurIPS 2024
[c63]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/RafailovCPSHKFN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/RafailovCPSHKFN24
Rafael Rafailov, Yaswanth Chittepu, Ryan Park, Harshit Sikchi, Joey Hejna, W. Bradley Knox, Chelsea Finn, Scott Niekum:
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms. NeurIPS 2024
[c62]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WangHCCM0N024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WangHCCM0N024
Zizhao Wang, Jiaheng Hu, Caleb Chuck, Stephen Chen, Roberto Martín-Martín, Amy Zhang, Scott Niekum, Peter Stone:
SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions. NeurIPS 2024
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-16369
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-16369
Max Rudolph, Caleb Chuck, Kevin Black, Misha Lvovsky, Scott Niekum, Amy Zhang:
Learning Action-based Representations Using Invariance. CoRR abs/2403.16369 (2024)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-10883
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-10883
Caleb Chuck, Sankaran Vaidyanathan, Stephen Giguere, Amy Zhang, David Jensen, Scott Niekum:
Automated Discovery of Functional Actual Causes in Complex Environments. CoRR abs/2404.10883 (2024)
[i57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-01511
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-01511
Prasann Singhal, Nathan Lambert, Scott Niekum, Tanya Goyal, Greg Durrett:
D2PO: Discriminator-Guided DPO with Response Evaluation Models. CoRR abs/2405.01511 (2024)
[i56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-03113
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-03113
Caleb Chuck, Carl Qi, Michael J. Munje, Shuozhe Li, Max Rudolph, Chang Shi, Siddhant Agarwal, Harshit Sikchi, Abhinav Peri, Sarthak Dayal, Evan Kuo, Kavan Mehta, Anthony Wang, Peter Stone, Amy Zhang, Scott Niekum:
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning. CoRR abs/2405.03113 (2024)
[i55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02900
Rafael Rafailov, Yaswanth Chittepu, Ryan Park, Harshit Sikchi, Joey Hejna, W. Bradley Knox, Chelsea Finn, Scott Niekum:
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms. CoRR abs/2406.02900 (2024)
[i54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-08805
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-08805
Harshit Sikchi, Caleb Chuck, Amy Zhang, Scott Niekum:
A Dual Approach to Imitation Learning from Observations with Offline Datasets. CoRR abs/2406.08805 (2024)
[i53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-15599
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-15599
Ryan Boldi, Li Ding, Lee Spector, Scott Niekum:
Pareto-Optimal Learning from Preferences with Hidden Context. CoRR abs/2406.15599 (2024)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-18416
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-18416
Zizhao Wang, Jiaheng Hu, Caleb Chuck, Stephen Chen, Roberto Martín-Martín, Amy Zhang, Scott Niekum, Peter Stone:
SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions. CoRR abs/2410.18416 (2024)
[i51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-22459
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-22459
Stephen Chung, Scott Niekum, David Krueger:
Predicting Future Actions of Reinforcement Learning Agents. CoRR abs/2410.22459 (2024)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-05718
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-05718
Harshit Sikchi, Siddhant Agarwal, Pranaya Jajoo, Samyak Parajuli, Caleb Chuck, Max Rudolph, Peter Stone, Amy Zhang, Scott Niekum:
RL Zero: Zero-Shot Language to Behaviors without any Supervision. CoRR abs/2412.05718 (2024)
2023
[j5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/SikchiSGN23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/SikchiSGN23
Harshit Sikchi, Akanksha Saran, Wonjoon Goo, Scott Niekum:
A Ranking Game for Imitation Learning. Trans. Mach. Learn. Res. 2023 (2023)
[c61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/BoothKSNSA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/BoothKSNSA23
Serena Booth, W. Bradley Knox, Julie Shah, Scott Niekum, Peter Stone, Alessandro Allievi:
The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications. AAAI 2023: 5920-5929
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-09770
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-09770
Prasoon Goyal, Raymond J. Mooney, Scott Niekum:
Language-guided Task Adaptation for Imitation Learning. CoRR abs/2301.09770 (2023)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08560
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08560
Harshit Sikchi, Amy Zhang, Scott Niekum:
Imitation from Arbitrary Experience: A Dual Unification of Reinforcement and Imitation Learning Methods. CoRR abs/2302.08560 (2023)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-09509
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-09509
Caleb Chuck, Kevin Black, Aditya Arjun, Yuke Zhu, Scott Niekum:
Granger-Causal Hierarchical Skill Discovery. CoRR abs/2306.09509 (2023)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-02728
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-02728
Andrew Levy, Sreehari Rammohan, Alessandro Allievi, Scott Niekum, George Konidaris:
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill-Learning. CoRR abs/2307.02728 (2023)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-02456
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-02456
W. Bradley Knox, Stephane Hatgis-Kessell, Sigurdur O. Adalgeirsson, Serena Booth, Anca D. Dragan, Peter Stone, Scott Niekum:
Learning Optimal Advantage from Preferences and Mistaking it for Reward. CoRR abs/2310.02456 (2023)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-13639
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-13639
Joey Hejna, Rafael Rafailov, Harshit Sikchi, Chelsea Finn, Scott Niekum, W. Bradley Knox, Dorsa Sadigh:
Contrastive Preference Learning: Learning from Human Feedback without RL. CoRR abs/2310.13639 (2023)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-02013
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-02013
Harshit Sikchi, Rohan Chitnis, Ahmed Touati, Alborz Geramifard, Amy Zhang, Scott Niekum:
Score Models for Offline Goal-Conditioned Reinforcement Learning. CoRR abs/2311.02013 (2023)
2022
[c60]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/0001MSBTN22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0001MSBTN22
Stephen Giguere, Blossom Metevier, Bruno Castro da Silva, Yuriy Brun, Philip S. Thomas, Scott Niekum:
Fairness Guarantees under Demographic Shift. ICLR 2022
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/SaranDCLTN22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/SaranDCLTN22
Akanksha Saran, Kush Desai, Mai Lee Chang, Rudolf Lioutikov, Andrea Thomaz, Scott Niekum:
Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots. IROS 2022: 9172-9179
[c58]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/l4dc/CuiNGKR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/l4dc/CuiNGKR22
Yuchen Cui, Scott Niekum, Abhinav Gupta, Vikash Kumar, Aravind Rajeswaran:
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation? L4DC 2022: 893-905
[i42]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-03481
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-03481
Harshit Sikchi, Akanksha Saran, Wonjoon Goo, Scott Niekum:
A Ranking Game for Imitation Learning. CoRR abs/2202.03481 (2022)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-11134
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-11134
Yuchen Cui, Scott Niekum, Abhinav Gupta, Vikash Kumar, Aravind Rajeswaran:
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation? CoRR abs/2204.11134 (2022)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-00695
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-00695
Wonjoon Goo, Scott Niekum:
Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL. CoRR abs/2206.00695 (2022)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-02231
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-02231
W. Bradley Knox, Stephane Hatgis-Kessell, Serena Booth, Scott Niekum, Peter Stone, Alessandro Allievi:
Models of human preference for learning reward functions. CoRR abs/2206.02231 (2022)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-00352
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00352
Akanksha Saran, Kush Desai, Mai Lee Chang, Rudolf Lioutikov, Andrea Thomaz, Scott Niekum:
Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots. CoRR abs/2211.00352 (2022)
2021
[j4]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/KroemerNK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/KroemerNK21
Oliver Kroemer, Scott Niekum, George Konidaris:
A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms. J. Mach. Learn. Res. 22: 30:1-30:82 (2021)
[j3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ml/HannaNS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/HannaNS21
Josiah P. Hanna, Scott Niekum, Peter Stone:
Importance sampling in reinforcement learning with an estimated behavior policy. Mach. Learn. 110(6): 1267-1317 (2021)
[c57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/CuiZJASNK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/CuiZJASNK21
Yuchen Cui, Qiping Zhang, Sahil Jain, Alessandro Allievi, Peter Stone, Scott Niekum, W. Bradley Knox:
Demonstration of the EMPATHIC Framework for Task Learning from Implicit Human Feedback. AAAI 2021: 16017-16019
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/SaranZSN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/SaranZSN21
Akanksha Saran, Ruohan Zhang, Elaine Schaertl Short, Scott Niekum:
Efficiently Guiding Imitation Learning Agents with Human Gaze. AAMAS 2021: 1109-1117
[c55]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/KimND21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/KimND21
Mincheol Kim, Scott Niekum, Ashish D. Deshpande:
SCAPE: Learning Stiffness Control from Augmented Position Control Experiences. CoRL 2021: 1512-1521
[c54]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/GooN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/GooN21
Wonjoon Goo, Scott Niekum:
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL. CoRL 2021: 1543-1553
[c53]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/JainGLN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/JainGLN21
Ajinkya Jain, Stephen Giguere, Rudolf Lioutikov, Scott Niekum:
Distributional Depth-Based Estimation of Object Articulation Models. CoRL 2021: 1611-1621
[c52]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/BrownSDN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BrownSDN21
Daniel S. Brown, Jordan Schneider, Anca D. Dragan, Scott Niekum:
Value Alignment Verification. ICML 2021: 1105-1115
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/JainLCN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/JainLCN21
Ajinkya Jain, Rudolf Lioutikov, Caleb Chuck, Scott Niekum:
ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory. ICRA 2021: 13670-13677
[c50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/CuiKANSSF21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/CuiKANSSF21
Yuchen Cui, Pallavi Koppol, Henny Admoni, Scott Niekum, Reid G. Simmons, Aaron Steinfeld, Tesca Fitzgerald:
Understanding the Relationship between Interactions and Outcomes in Human-in-the-Loop Machine Learning. IJCAI 2021: 4382-4391
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/MemarianGLNT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/MemarianGLNT21
Farzan Memarian, Wonjoon Goo, Rudolf Lioutikov, Scott Niekum, Ufuk Topcu:
Self-Supervised Online Reward Shaping in Sparse-Reward Environments. IROS 2021: 2369-2375
[c48]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/DurugkarTNS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DurugkarTNS21
Ishan Durugkar, Mauricio Tec, Scott Niekum, Peter Stone:
Adversarial Intrinsic Motivation for Reinforcement Learning. NeurIPS 2021: 8622-8636
[c47]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YuanCGTN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YuanCGTN21
Christina J. Yuan, Yash Chandak, Stephen Giguere, Philip S. Thomas, Scott Niekum:
SOPE: Spectrum of Off-Policy Estimators. NeurIPS 2021: 18958-18969
[c46]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChandakNSLBT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChandakNSLBT21
Yash Chandak, Scott Niekum, Bruno C. da Silva, Erik G. Learned-Miller, Emma Brunskill, Philip S. Thomas:
Universal Off-Policy Evaluation. NeurIPS 2021: 27475-27490
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-08442
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-08442
Mincheol Kim, Scott Niekum, Ashish D. Deshpande:
SCAPE: Learning Stiffness Control from Augmented Position Control Experiences. CoRR abs/2102.08442 (2021)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-04529
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-04529
Farzan Memarian, Wonjoon Goo, Rudolf Lioutikov, Ufuk Topcu, Scott Niekum:
Self-Supervised Online Reward Shaping in Sparse-Reward Environments. CoRR abs/2103.04529 (2021)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-12820
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-12820
Yash Chandak, Scott Niekum, Bruno Castro da Silva, Erik G. Learned-Miller, Emma Brunskill, Philip S. Thomas:
Universal Off-Policy Evaluation. CoRR abs/2104.12820 (2021)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-13345
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-13345
Ishan Durugkar, Mauricio Tec, Scott Niekum, Peter Stone:
Adversarial Intrinsic Motivation for Reinforcement Learning. CoRR abs/2105.13345 (2021)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-02972
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-02972
Prasoon Goyal, Raymond J. Mooney, Scott Niekum:
Zero-shot Task Adaptation using Natural Language. CoRR abs/2106.02972 (2021)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-00116
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-00116
Farzan Memarian, Abolfazl Hashemi, Scott Niekum, Ufuk Topcu:
Robust Generative Adversarial Imitation Learning via Local Lipschitzness. CoRR abs/2107.00116 (2021)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-05875
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-05875
Ajinkya Jain, Stephen Giguere, Rudolf Lioutikov, Scott Niekum:
Distributional Depth-Based Estimation of Object Articulation Models. CoRR abs/2108.05875 (2021)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-02304
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-02304
Wonjoon Goo, Scott Niekum:
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL. CoRR abs/2110.02304 (2021)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-03936
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-03936
Christina J. Yuan, Yash Chandak, Stephen Giguere, Philip S. Thomas, Scott Niekum:
SOPE: Spectrum of Off-Policy Estimators. CoRR abs/2111.03936 (2021)
2020
[c45]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/GoyalNM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/GoyalNM20
Prasoon Goyal, Scott Niekum, Raymond J. Mooney:
PixL2R: Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards. CoRL 2020: 485-497
[c44]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/CuiZKASN20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/CuiZKASN20
Yuchen Cui, Qiping Zhang, W. Bradley Knox, Alessandro Allievi, Peter Stone, Scott Niekum:
The EMPATHIC Framework for Task Learning from Implicit Human Feedback. CoRL 2020: 604-626
[c43]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/BrownCSN20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BrownCSN20
Daniel S. Brown, Russell Coleman, Ravi Srinivasan, Scott Niekum:
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences. ICML 2020: 1165-1177
[c42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/ZhangSLZGNBH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/ZhangSLZGNBH20
Ruohan Zhang, Akanksha Saran, Bo Liu, Yifeng Zhu, Sihang Guo, Scott Niekum, Dana H. Ballard, Mary M. Hayhoe:
Human Gaze Assisted Artificial Intelligence: A Review. IJCAI 2020: 4951-4958
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/JainN20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/JainN20
Ajinkya Jain, Scott Niekum:
Learning Hybrid Object Kinematics for Efficient Hierarchical Planning Under Uncertainty. IROS 2020: 5253-5260
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/ChuckCN20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/ChuckCN20
Caleb Chuck, Supawit Chockchowwat, Scott Niekum:
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning. IROS 2020: 5572-5579
[c39]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/BrownNP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BrownNP20
Daniel S. Brown, Scott Niekum, Marek Petrik:
Bayesian Robust Optimization for Imitation Learning. NeurIPS 2020
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-03272
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-03272
Wonjoon Goo, Scott Niekum:
Local Nonparametric Meta-Learning. CoRR abs/2002.03272 (2020)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-09089
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-09089
Daniel S. Brown, Russell Coleman, Ravi Srinivasan, Scott Niekum:
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences. CoRR abs/2002.09089 (2020)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-12500
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-12500
Akanksha Saran, Ruohan Zhang, Elaine Schaertl Short, Scott Niekum:
Efficiently Guiding Imitation Learning Algorithms with Human Gaze. CoRR abs/2002.12500 (2020)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-12315
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-12315
Daniel S. Brown, Scott Niekum, Marek Petrik:
Bayesian Robust Optimization for Imitation Learning. CoRR abs/2007.12315 (2020)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-15543
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-15543
Prasoon Goyal, Scott Niekum, Raymond J. Mooney:
PixL2R: Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards. CoRR abs/2007.15543 (2020)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-10518
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-10518
Ajinkya Jain, Rudolf Lioutikov, Scott Niekum:
ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory. CoRR abs/2008.10518 (2020)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-13649
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-13649
Yuchen Cui, Qiping Zhang, Alessandro Allievi, Peter Stone, Scott Niekum, W. Bradley Knox:
The EMPATHIC Framework for Task Learning from Implicit Human Feedback. CoRR abs/2009.13649 (2020)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-01557
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-01557
Daniel S. Brown, Jordan Schneider, Scott Niekum:
Value Alignment Verification. CoRR abs/2012.01557 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/BrownN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/BrownN19
Daniel S. Brown, Scott Niekum:
Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications. AAAI 2019: 7749-7758
[c37]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/BrownGN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/BrownGN19
Daniel S. Brown, Wonjoon Goo, Scott Niekum:
Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrations. CoRL 2019: 330-359
[c36]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/SaranSTN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/SaranSTN19
Akanksha Saran, Elaine Schaertl Short, Andrea Thomaz, Scott Niekum:
Understanding Teacher Gaze Patterns for Robot Learning. CoRL 2019: 1247-1258
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/hri/GutierrezSNT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hri/GutierrezSNT19
Reymundo A. Gutierrez, Elaine Schaertl Short, Scott Niekum, Andrea Lockerd Thomaz:
Learning from Corrective Demonstrations. HRI 2019: 712-714
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/hri/SaranSTN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hri/SaranSTN19
Akanksha Saran, Elaine Schaertl Short, Andrea Thomaz, Scott Niekum:
Enhancing Robot Learning with Human Social Cues. HRI 2019: 745-747
[c33]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/BrownGNN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BrownGNN19
Daniel S. Brown, Wonjoon Goo, Prabhat Nagarajan, Scott Niekum:
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations. ICML 2019: 783-792
[c32]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HannaNS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HannaNS19
Josiah Hanna, Scott Niekum, Peter Stone:
Importance Sampling Policy Evaluation with an Estimated Behavior Policy. ICML 2019: 2605-2613
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/CuiINF19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/CuiINF19
Yuchen Cui, David Isele, Scott Niekum, Kikuo Fujimura:
Uncertainty-Aware Data Aggregation for Deep Imitation Learning. ICRA 2019: 761-767
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/GooN19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/GooN19
Wonjoon Goo, Scott Niekum:
One-Shot Learning of Multi-Step Tasks from Observation via Activity Localization in Auxiliary Video. ICRA 2019: 7755-7761
[c29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/GoyalNM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/GoyalNM19
Prasoon Goyal, Scott Niekum, Raymond J. Mooney:
Using Natural Language for Reward Shaping in Reinforcement Learning. IJCAI 2019: 2385-2391
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-02161
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-02161
Daniel S. Brown, Yuchen Cui, Scott Niekum:
Risk-Aware Active Inverse Reinforcement Learning. CoRR abs/1901.02161 (2019)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-02020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-02020
Prasoon Goyal, Scott Niekum, Raymond J. Mooney:
Using Natural Language for Reward Shaping in Reinforcement Learning. CoRR abs/1903.02020 (2019)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-06387
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-06387
Daniel S. Brown, Wonjoon Goo, Prabhat Nagarajan, Scott Niekum:
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations. CoRR abs/1904.06387 (2019)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-02780
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-02780
Yuchen Cui, David Isele, Scott Niekum, Kikuo Fujimura:
Uncertainty-Aware Data Aggregation for Deep Imitation Learning. CoRR abs/1905.02780 (2019)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-01408
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-01408
Caleb Chuck, Supawit Chockchowwat, Scott Niekum:
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning. CoRR abs/1906.01408 (2019)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-03146
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-03146
Oliver Kroemer, Scott Niekum, George Dimitri Konidaris:
A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms. CoRR abs/1907.03146 (2019)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-03976
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-03976
Daniel S. Brown, Wonjoon Goo, Scott Niekum:
Ranking-Based Reward Extrapolation without Rankings. CoRR abs/1907.03976 (2019)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-07202
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-07202
Akanksha Saran, Elaine Schaertl Short, Andrea Thomaz, Scott Niekum:
Understanding Teacher Gaze Patterns for Robot Learning. CoRR abs/1907.07202 (2019)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-09014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-09014
Ajinkya Jain, Scott Niekum:
Learning Hybrid Object Kinematics for Efficient Hierarchical Planning Under Uncertainty. CoRR abs/1907.09014 (2019)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-04472
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-04472
Daniel S. Brown, Scott Niekum:
Deep Bayesian Reward Learning from Preferences. CoRR abs/1912.04472 (2019)
2018
[c28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/AlshiekhBEKNT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/AlshiekhBEKNT18
Mohammed Alshiekh, Roderick Bloem, Rüdiger Ehlers, Bettina Könighofer, Scott Niekum, Ufuk Topcu:
Safe Reinforcement Learning via Shielding. AAAI 2018: 2669-2678
[c27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/BrownN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/BrownN18
Daniel S. Brown, Scott Niekum:
Efficient Probabilistic Performance Bounds for Inverse Reinforcement Learning. AAAI 2018: 2754-2762
[c26]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/BrownCN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/BrownCN18
Daniel S. Brown, Yuchen Cui, Scott Niekum:
Risk-Aware Active Inverse Reinforcement Learning. CoRL 2018: 362-372
[c25]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/JainN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/JainN18
Ajinkya Jain, Scott Niekum:
Efficient Hierarchical Robot Motion Planning Under Uncertainty and Hybrid Dynamics. CoRL 2018: 757-766
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/hri/FaulknerNT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hri/FaulknerNT18
Taylor Kessler Faulkner, Scott Niekum, Andrea Thomaz:
Asking for Help Effectively via Modeling of Human Beliefs. HRI (Companion) 2018: 149-150
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/GutierrezCTN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/GutierrezCTN18
Reymundo A. Gutierrez, Vivian Chu, Andrea Lockerd Thomaz, Scott Niekum:
Incremental Task Modification via Corrective Demonstrations. ICRA 2018: 1126-1133
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/CuiN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/CuiN18
Yuchen Cui, Scott Niekum:
Active Reward Learning from Critiques. ICRA 2018: 6907-6914
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/SaranMSTN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/SaranMSTN18
Akanksha Saran, Srinjoy Majumdar, Elaine Schaertl Short, Andrea Thomaz, Scott Niekum:
Human Gaze Following for Human-Robot Interaction. IROS 2018: 8615-8621
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-04205
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-04205
Ajinkya Jain, Scott Niekum:
Efficient Hierarchical Robot Motion Planning Under Uncertainty and Hybrid Dynamics. CoRR abs/1802.04205 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-07687
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-07687
Daniel S. Brown, Scott Niekum:
Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications. CoRR abs/1805.07687 (2018)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-01347
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-01347
Josiah Hanna, Scott Niekum, Peter Stone:
Importance Sampling Policy Evaluation with an Estimated Behavior Policy. CoRR abs/1806.01347 (2018)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-11244
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-11244
Wonjoon Goo, Scott Niekum:
Learning Multi-Step Robotic Tasks from Observation. CoRR abs/1806.11244 (2018)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1810-01036
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-01036
Reymundo A. Gutierrez, Elaine Schaertl Short, Scott Niekum, Andrea Lockerd Thomaz:
Towards Online Learning from Corrective Demonstrations. CoRR abs/1810.01036 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-03563
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-03563
Yuqian Jiang, Nick Walker, Minkyu Kim, Nicolas Brissonneau, Daniel S. Brown, Justin W. Hart, Scott Niekum, Luis Sentis, Peter Stone:
LAAIR: A Layered Architecture for Autonomous Interactive Robots. CoRR abs/1811.03563 (2018)
2017
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HannaSN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HannaSN17
Josiah P. Hanna, Peter Stone, Scott Niekum:
Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation. AAAI 2017: 4933-4934
[c19]
- view
- export record
  dblp key:
  - conf/aaaifs/BrownN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaaifs/BrownN17
Daniel S. Brown, Scott Niekum:
Toward Probabilistic Safety Bounds for Robot Learning from Demonstration. AAAI Fall Symposia 2017: 10-18
[c18]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/HannaSN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/HannaSN17
Josiah P. Hanna, Peter Stone, Scott Niekum:
Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation. AAMAS 2017: 538-546
[c17]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HannaTSN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HannaTSN17
Josiah P. Hanna, Philip S. Thomas, Peter Stone, Scott Niekum:
Data-Efficient Policy Evaluation Through Behavior Policy Search. ICML 2017: 1394-1403
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/PoonawalaANT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/PoonawalaANT17
Hasan A. Poonawala, Mohammed Alshiekh, Scott Niekum, Ufuk Topcu:
Classification error correction: A case study in brain-computer interfacing. IROS 2017: 3006-3012
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/SaranLMHN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/SaranLMHN17
Akanksha Saran, Branka Lakic, Srinjoy Majumdar, Jürgen Hess, Scott Niekum:
Viewpoint selection for visual failure detection. IROS 2017: 5437-5444
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/HannaTSN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HannaTSN17
Josiah P. Hanna, Philip S. Thomas, Peter Stone, Scott Niekum:
Data-Efficient Policy Evaluation Through Behavior Policy Search. CoRR abs/1706.03469 (2017)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/BrownN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/BrownN17
Daniel S. Brown, Scott Niekum:
Efficient Probabilistic Performance Bounds for Inverse Reinforcement Learning. CoRR abs/1707.00724 (2017)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1708-08611
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1708-08611
Mohammed Alshiekh, Roderick Bloem, Rüdiger Ehlers, Bettina Könighofer, Scott Niekum, Ufuk Topcu:
Safe Reinforcement Learning via Shielding. CoRR abs/1708.08611 (2017)
2016
[c14]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KhandelwalLNS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KhandelwalLNS16
Piyush Khandelwal, Elad Liebman, Scott Niekum, Peter Stone:
On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search. ICML 2016: 1319-1328
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/HannaSN16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HannaSN16
Josiah P. Hanna, Peter Stone, Scott Niekum:
High Confidence Off-Policy Evaluation with Models. CoRR abs/1606.06126 (2016)
2015
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/ijrr/NiekumOKCMB15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijrr/NiekumOKCMB15
Scott Niekum, Sarah Osentoski, George Dimitri Konidaris, Sachin Chitta, Bhaskara Marthi, Andrew G. Barto:
Learning grounded finite-state representations from unstructured demonstrations. Int. J. Robotics Res. 34(2): 131-157 (2015)
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/NiekumOAB15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/NiekumOAB15
Scott Niekum, Sarah Osentoski, Christopher G. Atkeson, Andrew G. Barto:
Online Bayesian changepoint detection for articulated motion models. ICRA 2015: 1468-1475
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/HausmanNOS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/HausmanNOS15
Karol Hausman, Scott Niekum, Sarah Osentoski, Gaurav S. Sukhatme:
Active articulation model estimation through interactive perception. ICRA 2015: 3305-3312
[c11]
- view
- export record
  dblp key:
  - conf/nips/ThomasNTK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ThomasNTK15
Philip S. Thomas, Scott Niekum, Georgios Theocharous, George Dimitri Konidaris:
Policy Evaluation Using the Ω-Return. NIPS 2015: 334-342
2014
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/humanoids/YamaguchiANO14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/humanoids/YamaguchiANO14
Akihiko Yamaguchi, Christopher G. Atkeson, Scott Niekum, Tsukasa Ogasawara:
Learning pouring skills from demonstration and practice. Humanoids 2014: 908-915
2013
[c9]
- view
  - electronic edition @ aaai.org
  - details & citations
- export record
  dblp key:
  - conf/aaaiss/Niekum13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaaiss/Niekum13
Scott Niekum:
An Integrated System for Learning Multi-Step Robotic Tasks from Unstructured Demonstrations. AAAI Spring Symposium: Designing Intelligent Robots 2013
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/rss/NiekumCBMO13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rss/NiekumCBMO13
Scott Niekum, Sachin Chitta, Andrew G. Barto, Bhaskara Marthi, Sarah Osentoski:
Incremental Semantically Grounded Learning from Demonstration. Robotics: Science and Systems 2013
2012
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/Niekum12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Niekum12
Scott Niekum:
Complex Task Learning from Unstructured Demonstrations. AAAI 2012: 2402-2403
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/NiekumOKB12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/NiekumOKB12
Scott Niekum, Sarah Osentoski, George Dimitri Konidaris, Andrew G. Barto:
Learning and generalization of complex tasks from unstructured demonstrations. IROS 2012: 5239-5246
2011
[c5]
- view
- export record
  dblp key:
  - conf/aaai/NiekumB11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/NiekumB11
Scott Niekum, Andrew G. Barto:
Clustering via Dirichlet Process Mixture Models for Portable Skill Discovery. Lifelong Learning 2011
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/gecco/NiekumSB11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/gecco/NiekumSB11
Scott Niekum, Lee Spector, Andrew G. Barto:
Evolution of reward functions for reinforcement learning. GECCO (Companion) 2011: 177-178
[c3]
- view
- export record
  dblp key:
  - conf/nips/NiekumB11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/NiekumB11
Scott Niekum, Andrew G. Barto:
Clustering via Dirichlet Process Mixture Models for Portable Skill Discovery. NIPS 2011: 1818-1826
[c2]
- view
- export record
  dblp key:
  - conf/nips/KonidarisNT11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KonidarisNT11
George Dimitri Konidaris, Scott Niekum, Philip S. Thomas:
TD_gamma: Re-evaluating Complex Backups in Temporal Difference Learning. NIPS 2011: 2402-2410
2010
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tamd/NiekumBS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tamd/NiekumBS10
Scott Niekum, Andrew G. Barto, Lee Spector:
Genetic Programming for Reward Function Search. IEEE Trans. Auton. Ment. Dev. 2(2): 83-90 (2010)
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/Niekum10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Niekum10
Scott Niekum:
Evolved Intrinsic Reward Functions for Reinforcement Learning. AAAI 2010: 1955-1956

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.