default search action
18th AAMAS 2019: Montreal, QC, Canada
- Edith Elkind, Manuela Veloso, Noa Agmon, Matthew E. Taylor:
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS '19, Montreal, QC, Canada, May 13-17, 2019. International Foundation for Autonomous Agents and Multiagent Systems 2019, ISBN 978-1-4503-6309-9
Keynote Talks
- Subbarao Kambhampati:
Synthesizing Explainable Behavior for Human-AI Collaboration. 1-2 - Francesca Rossi, Andrea Loreggia:
Preferences and Ethical Priorities: Thinking Fast and Slow in AI. 3-4 - Carles Sierra:
Responsible Autonomy. 5 - Doina Precup:
Building Knowledge for AI Agents with Reinforcement Learning. 6
1A: Reinforcement Learning 1
- Sammie Katt, Frans A. Oliehoek, Christopher Amato:
Bayesian Reinforcement Learning in Factored POMDPs. 7-15 - Jiang Rong, Tao Qin, Bo An:
Competitive Bridge Bidding with Deep Neural Networks. 16-24 - Sanmit Narvekar, Peter Stone:
Learning Curriculum Policies for Reinforcement Learning. 25-33 - Bohan Wu, Jayesh K. Gupta, Mykel J. Kochenderfer:
Model Primitive Hierarchical Lifelong Reinforcement Learning. 34-42 - Gregory Palmer, Rahul Savani, Karl Tuyls:
Negative Update Intervals in Deep Multi-Agent Reinforcement Learning. 43-51 - Yang Liu, Yifeng Zeng, Yingke Chen, Jing Tang, Yinghui Pan:
Self-Improving Generative Adversarial Reinforcement Learning. 52-60
1B: Socially Intelligent Agents 1
- Mike Ligthart, Timo Fernhout, Mark A. Neerincx, Kelly L. A. van Bindsbergen, Martha A. Grootenhuis, Koen V. Hindriks:
A Child and a Robot Getting Acquainted - Interaction Design for Eliciting Self-Disclosure. 61-70 - Pooja Prajod, Mohammed Al Owayyed, Tim Rietveld, Jaap-Jan van der Steeg, Joost Broekens:
The Effect of Virtual Agent Warmth on Human-Agent Negotiation. 71-76 - O. Can Görür, Benjamin Rosman, Sahin Albayrak:
Anticipatory Bayesian Policy Selection for Online Adaptation of Collaborative Robots to Unknown Human Types. 77-85 - Hannes Ritschel, Ilhan Aslan, David Sedlbauer, Elisabeth André:
Irony Man: Augmenting a Social Robot with the Ability to Use Irony in Multimodal Communication with Humans. 86-94 - Kim Baraka, Marta Couto, Francisco S. Melo, Manuela Veloso:
An Optimization Approach for Structured Agent-Based Provider/Receiver Tasks. 95-103 - Sepehr Janghorbani, Ashutosh Modi, Jakob Buhmann, Mubbasir Kapadia:
Domain Authoring Assistant for Intelligent Virtual Agent. 104-112
1C: Multi-Robot Systems
- Michael Amir, Alfred M. Bruckstein:
Minimizing Travel in the Uniform Dispersal Problem for Robotic Sensors. 113-121 - Rui Liu, Fan Jia, Wenhao Luo, Meghan Chandarana, Changjoo Nam, Michael Lewis, Katia P. Sycara:
Trust-Aware Behavior Reflection for Robot Swarm Self-Healing. 122-130 - Florence Ho, Ana Salta, Rúben Geraldes, Artur Goncalves, Marc Cavazza, Helmut Prendinger:
Multi-Agent Path Finding for UAV Traffic Management. 131-139 - Pierre Thalamy, Benoît Piranda, Julien Bourgeois:
Distributed Self-Reconfiguration using a Deterministic Autonomous Scaffolding Structure. 140-148 - Yinon Douchan, Ran Wolf, Gal A. Kaminka:
Swarms Can be Rational. 149-157 - Ebtehal Turki Saho Alotaibi:
A Complete Multi-Robot Path-Planning Algorithm: JAAMAS Track. 158-160
1D: Verification and Validation
- Alessio Lomuscio, Edoardo Pirovano:
A Counter Abstraction Technique for the Verification of Probabilistic Swarm Systems. 161-169 - Natasha Alechina, Mehdi Dastani, Brian Logan:
Decidable Model Checking with Uniform Strategies. 170-178 - Panagiotis Kouvaros, Alessio Lomuscio, Edoardo Pirovano, Hashan Punchihewa:
Formal Verification of Open Multi-Agent Systems. 179-187 - Giuseppe Perelli:
Enforcing Equilibria in Multi-Agent Systems. 188-196 - Damian Kurpiewski, Michal Knapik, Wojciech Jamroga:
On Domination and Control in Strategic Ability. 197-205 - Francesco Belardinelli, Stéphane Demri:
Resource-bounded ATL: the Quest for Tractable Fragments. 206-214
1E: Economic Paradigms: Learning and Adaptation
- Weiran Shen, Pingzhong Tang, Song Zuo:
Automated Mechanism Design via Neural Networks. 215-223 - Michal Sustr, Vojtech Kovarík, Viliam Lisý:
Monte Carlo Continual Resolving for Online Strategy Computation in Imperfect Information Games. 224-232 - James P. Bailey, Georgios Piliouras:
Multi-Agent Learning in Network Zero-Sum Games is a Hamiltonian System. 233-241 - Yasser F. O. Mohammad, Shinji Nakadai:
Optimal Value of Information Based Elicitation During Negotiation. 242-250 - Jayakumar Subramanian, Aditya Mahajan:
Reinforcement Learning in Stationary Mean-field Games. 251-259 - Jasper Bakker, Aron Hammond, Daan Bloembergen, Tim Baarslag:
RLBOA: A Modular Reinforcement Learning Framework for Autonomous Negotiating Agents. 260-268
1F: Agent Societies and Societal Issues 1
- Jason Xu, Julián García, Toby Handfield:
Cooperation with Bottom-up Reputation Dynamics. 269-276 - Yi Yang, Quan Bai, Qing Liu:
Dynamic Source Weight Computation for Truth Inference over Data Streams. 277-285 - Nanda Kishore Sreenivas, Shrisha Rao:
Egocentric Bias and Doubt in Cognitive Agents. 286-295 - Fan Yang, Bo Liu, Wen Dong:
Optimal Control of Complex Systems through Variational Inference with a Discrete Event Decision Process. 296-304 - Kai Zhou, Tomasz P. Michalak, Marcin Waniek, Talal Rahwan, Yevgeniy Vorobeychik:
Attacking Similarity-Based Link Prediction in Social Networks. 305-313 - Sixie Yu, Yevgeniy Vorobeychik:
Removing Malicious Nodes from Networks. 314-322
2A: Reinforcement Learning 2
- Yuxiang Yang, Ken Caluwaerts, Atil Iscen, Jie Tan, Chelsea Finn:
NoRML: No-Reward Meta Learning. 323-331 - Banafsheh Rafiee, Sina Ghiassian, Adam White, Richard S. Sutton:
Prediction in Intelligence: An Empirical Comparison of Off-policy Algorithms on Robots. 332-340 - Chao Yu, Xin Wang, Jianye Hao, Zhanbo Feng:
Reinforcement Learning for Cooperative Overtaking. 341-349 - Richard Klíma, Daan Bloembergen, Michael Kaisers, Karl Tuyls:
Robust Temporal Difference Learning for Critical Domains. 350-358 - Changjian Li, Krzysztof Czarnecki:
Urban Driving with Multi-Objective Deep Reinforcement Learning. 359-367 - Xinlei Pan, Weiyao Wang, Xiaoshuai Zhang, Bo Li, Jinfeng Yi, Dawn Song:
How You Act Tells a Lot: Privacy-Leaking Attack on Deep Reinforcement Learning. 368-376
2B: Practicial Applications of Game Theory
- Haris Aziz, Serge Gaspers, Zhaohong Sun, Toby Walsh:
From Matching with Diversity Constraints to Matching with Regional Quotas. 377-385 - David Mguni, Joel Jennings, Emilio Sison, Sergio Valcarcel Macua, Sofia Ceppi, Enrique Munoz de Cote:
Coordinating the Crowd: Inducing Desirable Equilibria in Non-Cooperative Systems. 386-394 - Shahrzad Gholami, Amulya Yadav, Long Tran-Thanh, Bistra Dilkina, Milind Tambe:
Don't Put All Your Strategies in One Basket: Playing Green Security Games with Imperfect Prior Knowledge. 395-403 - Chenxi Qiu, Anna Cinzia Squicciarini, Benjamin V. Hanrahan:
Incentivizing Distributive Fairness for Crowdsourcing Workers. 404-412 - Péter Biró, Walter Kern, Dömötör Pálvölgyi, Daniël Paulusma:
Generalized Matching Games for International Kidney Exchange. 413-421 - Hongyao Ma, Reshef Meir, David C. Parkes, James Y. Zou:
Contingent Payment Mechanisms for Resource Utilization. 422-430
2C: Knowledge Representation and Reasoning
- Andrew Perrault, Craig Boutilier:
Experiential Preference Elicitation for Autonomous Heating and Cooling Systems. 431-439 - Peta Masters, Sebastian Sardiña:
Goal Recognition for Rational and Irrational Agents. 440-448 - Min He, Hongliang Guo:
Interleaved Q-Learning with Partially Coupled Training Process. 449-457 - Nikhil Bhargava, Brian C. Williams:
Multiagent Disjunctive Temporal Networks. 458-466 - Luis Enrique Pineda, Shlomo Zilberstein:
Soft Labeling in Stochastic Shortest Path Problems. 467-475 - Atena M. Tabakhi, William Yeoh, Makoto Yokoo:
Parameterized Heuristics for Incomplete Weighted CSPs with Elicitation Costs. 476-484
2D: Social Choice Theory 1
- Luis Sánchez Fernández, Jesús A. Fisteus:
Monotonicity Axioms in Approval-based Multi-winner Voting Rules. 485-493 - Markus Brill, Piotr Faliszewski, Frank Sommer, Nimrod Talmon:
Approximation Algorithms for BalancedCC Multiwinner Rules. 494-502 - Aizhong Zhou, Yongjie Yang, Jiong Guo:
Parameterized Complexity of Committee Elections with Dichotomous and Trichotomous Votes. 503-510 - Sushmita Gupta, Pallavi Jain, Sanjukta Roy, Saket Saurabh, Meirav Zehavi:
Gehrlein Stability in Committee Selection: Parameterized Hardness and Algorithms. 511-519 - Felix Brandt, Johannes Hofbauer, Martin Strobel:
Exploring the No-Show Paradox for Condorcet Extensions Using Ehrhart Theory and Computer Simulations. 520-528 - Jasper Lu, David Kai Zhang, Zinovi Rabinovich, Svetlana Obraztsova, Yevgeniy Vorobeychik:
Manipulating Elections by Selecting Issues. 529-537
2E: Game Theory 1
- Gabriel Istrate, Cosmin Bonchis, Alin Brîndusescu:
Attacking Power Indices by Manipulating Player Reliability. 538-546 - Kai Jin, Ce Jin, Zhaoquan Gu:
Cooperation via Codes in Restricted Hat Guessing Games. 547-555 - Arunesh Sinha, Michael P. Wellman:
Incentivizing Collaboration in a Competition. 556-564 - Robert Bredereck, Edith Elkind, Ayumi Igarashi:
Hedonic Diversity Games. 565-573 - Raffaello Carosi, Gianpiero Monaco, Luca Moscardelli:
Local Core Stability in Simple Symmetric Fractional Hedonic Games. 574-582 - Naoyuki Kamiyama:
Many-to-Many Stable Matchings with Ties, Master Preference Lists, and Matroid Constraints. 583-591
2F: Agent Societies and Societal Issues 2
- Vahid Yazdanpanah, Mehdi Dastani, Wojciech Jamroga, Natasha Alechina, Brian Logan:
Strategic Responsibility Under Imperfect Information. 592-600 - Candice Schumann, Samsara N. Counts, Jeffrey S. Foster, John P. Dickerson:
The Diverse Cohort Selection Problem. 601-609 - Nicolas De Bufala, Jean-Daniel Kant:
An Evolutionary Approach to Find Optimal Policies with an Agent-Based Simulation. 610-618 - Jie Gao, Grant Schoenebeck, Fang-Yi Yu:
The Volatility of Weak Ties: Co-evolution of Selection and Influence in Social Networks. 619-627 - Palash Dey, Sourav Medya:
Covert Networks: How Hard is It to Hide? 628-637 - Ferdinando Fioretto, Pascal Van Hentenryck:
Privacy-Preserving Federated Data Sharing. 638-646
3A: Learning and Adaptation
- Riccardo Sartea, Alessandro Farinelli, Matteo Murari:
Agent Behavioral Analysis Based on Absorbing Markov Chains. 647-655 - Oscar Chang, Robert Kwiatkowski, Siyuan Chen, Hod Lipson:
Agent Embeddings: A Latent Representation for Pole-Balancing Networks. 656-664 - Panayiotis Danassis, Boi Faltings:
Courtesy as a Means to Coordinate. 665-673 - Rohith Dwarakanath Vallam, Sarthak Ahuja, Surya Shravan Kumar Sajja, Ritwik Chaudhuri, Rakesh Pimplikar, Kushal Mukherjee, Ramasuri Narayanam, Gyana R. Parija:
Dynamic Particle Allocation to Solve Interactive POMDP Models for Social Decision Making. 674-682 - Jane X. Wang, Edward Hughes, Chrisantha Fernando, Wojciech M. Czarnecki, Edgar A. Duéñez-Guzmán, Joel Z. Leibo:
Evolving Intrinsic Motivations for Altruistic Behavior. 683-692 - Ryan Lowe, Jakob N. Foerster, Y-Lan Boureau, Joelle Pineau, Yann N. Dauphin:
On the Pitfalls of Measuring Emergent Communication. 693-701
3B: Socially Intelligent Agents 2
- Gabriel Castillo, Michael Neff:
What do we express without knowing?: Emotion in Gesture. 702-710 - Yaqian Zhang, Wooi-Boon Goh:
Bootstrapped Policy Gradient for Difficulty Adaptation in Intelligent Tutoring Systems. 711-719 - Samantha Krening, Karen M. Feigh:
Newtonian Action Advice: Integrating Human Verbal Instruction with Reinforcement Learning. 720-727 - Taylor Kessler Faulkner, Reymundo A. Gutierrez, Elaine Schaertl Short, Guy Hoffman, Andrea Lockerd Thomaz:
Active Attention-Modified Policy Shaping: Socially Interactive Agents Track. 728-736 - Kallirroi Georgila, Mark G. Core, Benjamin D. Nye, Shamya Karumbaiah, Daniel Auerbach, Maya Ram:
Using Reinforcement Learning to Optimize the Policies of an Intelligent Tutoring System for Interpersonal Skills Training. 737-745 - Jize Chen, Changhong Wang:
Reaching Cooperation using Emerging Empathy and Counter-empathy. 746-753
3C: Engineering Multiagent Systems 1
- Buster A. Bernstein, Jasper C. M. Geurtz, Vincent J. Koeman:
Evaluating the Effectiveness of Multi-Agent Organisational Paradigms in a Real-Time Strategy Environment: Engineering Multiagent Systems Track. 754-762 - Mohammad Al-Zinati, Rym Wenkstern:
Agent-Environment Interactions in Large-Scale Multi-Agent Based Simulation Systems. 763-771 - Sandra Garcia-Rodriguez, Jorge J. Gómez-Sanz:
Robust Decentralised Agent Based Approach for Microgrid Energy Management. 772-780 - Akin Günay, Amit K. Chopra, Munindar P. Singh:
Supple: Multiagent Communication Protocols with Causal Types. 781-789 - Alessandro Ricci, Andrei Ciortea, Simon Mayer, Olivier Boissier, Rafael H. Bordini, Jomi Fred Hübner:
Engineering Scalable Distributed Environments and Organizations for MAS. 790-798 - Rafael C. Cardoso, Rafael H. Bordini:
Decentralised Planning for Multi-Agent Programming Platforms. 799-807
3D: Social Choice Theory 2
- Robert Bredereck, Junjie Luo:
Complexity of Manipulation in Premise-Based Judgment Aggregation with Simple Formulas. 819-827 - Sirin Botan, Umberto Grandi, Laurent Perrussel:
Multi-Issue Opinion Diffusion under Constraints. 828-836 - Hadi Hosseini, Kate Larson:
Multiple Assignment Problems under Lexicographic Preferences. 837-845 - Gábor Erdélyi, Christian Reger, Yongjie Yang:
Towards Completing the Puzzle: Solving Open Problems for Control in Elections. 846-854 - Palash Dey, Swaprava Nath, Garima Shakya:
Testing Preferential Domains Using Sampling. 855-863 - Jingyan Wang, Nihar B. Shah:
Your 2 is My 1, Your 3 is My 9: Handling Arbitrary Miscalibrations in Ratings. 864-872
3E: Game Theory 2
- Gianpiero Monaco, Luca Moscardelli, Yllka Velaj:
On the Performance of Stable Outcomes in Modified Fractional Hedonic Games with Egalitarian Social Welfare. 873-881 - Hendrik Fichtenberger, Amer Krivosija, Anja Rey:
Testing Individual-Based Stability Properties in Graphical Hedonic Games. 882-890 - Anna Maria Kerkmann, Jörg Rothe:
Stability in FEN-Hedonic Games for Single-Player Deviations. 891-899 - Aurélie Beynier, Sylvain Bouveret, Michel Lemaître, Nicolas Maudet, Simon Rey, Parham Shams:
Efficiency, Sequenceability and Deal-Optimality in Fair Division of Indivisible Goods. 900-908 - Andrea Celli, Stefano Coniglio, Nicola Gatti:
Computing Optimal Ex Ante Correlated Equilibria in Two-Player Sequential Games. 909-917 - Yossi Azar, Allan Borodin, Michal Feldman, Amos Fiat, Kineret Segal:
Efficient Allocation of Free Stuff. 918-925
3F: Logics for Agents
- Christoph Schwering, Maurice Pagnucco:
A Representation Theorem for Reasoning in First-Order Multi-Agent Knowledge Bases. 926-934 - Xinliang Song, Tonghan Wang, Chongjie Zhang:
Convergence of Multi-Agent Learning with a Finite Step Size in General-Sum Games. 935-943 - Emiliano Lorini, Fabián Romero:
Decision Procedures for Epistemic Logic Exploiting Belief Bases. 944-952 - Tim French, Rustam Galimullin, Hans van Ditmarsch, Natasha Alechina:
Groups Versus Coalitions: On the Relative Expressivity of GAL and CAL. 953-961 - Wojciech Jamroga, Vadim Malvone, Aniello Murano:
Natural Strategic Ability under Imperfect Information. 962-970 - Aurèle Barrière, Bastien Maubert, Aniello Murano, Sasha Rubin:
Reasoning about Changes of Observational Power in Logics of Knowledge and Time. 971-979
4A: Learning Agent Capabilities
- Xihan Li, Jia Zhang, Jiang Bian, Yunhai Tong, Tie-Yan Liu:
A Cooperative Multi-Agent Reinforcement Learning Framework for Resource Balancing in Complex Logistics Network. 980-988 - Siyuan Li, Fangda Gu, Guangxiang Zhu, Chongjie Zhang:
Context-Aware Policy Reuse. 989-997 - Giuseppe Cuccu, Julian Togelius, Philippe Cudré-Mauroux:
Playing Atari with Six Neurons. 998-1006 - Tong Mu, Karan Goel, Emma Brunskill:
PLOTS: Procedure Learning from Observations using subTask Structure. 1007-1015 - Josiah P. Hanna, Peter Stone:
Reducing Sampling Error in Policy Gradient Learning. 1016-1024 - Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Pan:
TBQ(σ): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning. 1025-1032
4B: Multimodal Interaction
- Prashan Madumal, Tim Miller, Liz Sonenberg, Frank Vetere:
A Grounded Interaction Protocol for Explainable Artificial Intelligence. 1033-1041 - Akshat Agarwal, Swaminathan Gurumurthy, Vasu Sharma, Mike Lewis, Katia P. Sycara:
Community Regularization of Visually-Grounded Dialog. 1042-1050 - Kathrin Janowski, Elisabeth André:
What If I Speak Now?: A Decision-Theoretic Approach to Personality-Based Turn-Taking. 1051-1059 - Dan Feng, Elín Carstensdóttir, Magy Seif El-Nasr, Stacy Marsella:
Exploring Improvisational Approaches to Social Knowledge Acquisition. 1060-1068 - Julie Porteous, Alan Lindsay:
Protagonist vs Antagonist PROVANT: Narrative Generation as Counter Planning. 1069-1077 - Sule Anjomshoae, Amro Najjar, Davide Calvaresi, Kary Främling:
Explainable Agents and Robots: Results from a Systematic Literature Review. 1078-1088
4C: Deep Learning
- Hyun-Rok Lee, Taesik Lee:
Improved Cooperative Multi-agent Reinforcement Learning Algorithm Augmented by Mixing Demonstrations from Centralized Policy. 1089-1098 - Joel Z. Leibo, Julien Pérolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar A. Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel:
Malthusian Reinforcement Learning. 1099-1107 - Hangyu Mao, Zhengchao Zhang, Zhen Xiao, Zhibo Gong:
Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG. 1108-1116 - Diana Borsa, Nicolas Heess, Bilal Piot, Siqi Liu, Leonard Hasenclever, Rémi Munos, Olivier Pietquin:
Observational Learning by Reinforcement Learning. 1117-1124 - Ondrej Biza, Robert Platt Jr.:
Online Abstraction with MDP Homomorphisms for Deep Learning. 1125-1133 - Dylan Banarse, Yoram Bachrach, Siqi Liu, Guy Lever, Nicolas Heess, Chrisantha Fernando, Pushmeet Kohli, Thore Graepel:
The Body is Not a Given: Joint Agent Policy Learning and Morphology Evolution. 1134-1142
4D: Robotics
- Mikko Lauri, Joni Pajarinen, Jan Peters:
Information Gathering in Decentralized POMDPs by Policy Graph Improvement. 1143-1151 - Minghua Liu, Hang Ma, Jiaoyang Li, Sven Koenig:
Task and Path Planning for Multi-Agent Pickup and Delivery. 1152-1160 - Benjamin Schnieders, Shan Luo, Gregory Palmer, Karl Tuyls:
Fully Convolutional One-Shot Object Segmentation for Industrial Robotics. 1161-1169 - Saurabh Arora, Prashant Doshi, Bikramjit Banerjee:
Online Inverse Reinforcement Learning Under Occlusion. 1170-1178 - Hao-Tsung Yang, Shih-Yu Tsai, Kin Sum Liu, Shan Lin, Jie Gao:
Patrol Scheduling Against Adversaries with Varying Attack Durations. 1179-1188 - Gokarna Sharma, Ayan Dutta, Jong-Hoon Kim:
Optimal Online Coverage Path Planning with Energy Constraints. 1189-1197