default search action
Remi Tachet des Combes
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c27]Zhang-Wei Hong, Pulkit Agrawal, Remi Tachet des Combes, Romain Laroche:
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting. ICLR 2023 - [c26]Hongyu Zang, Xin Li, Jie Yu, Chen Liu, Riashat Islam, Remi Tachet des Combes, Romain Laroche:
Behavior Prior Representation learning for Offline Reinforcement Learning. ICLR 2023 - [c25]Siqi Zeng, Remi Tachet des Combes, Han Zhao:
Learning Structured Representations by Embedding Class Hierarchy. ICLR 2023 - [c24]Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Rajiv Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford:
Principled Offline RL in the Presence of Rich Exogenous Information. ICML 2023: 14390-14421 - [c23]Romain Laroche, Remi Tachet des Combes:
On the Occupancy Measure of Non-Markovian Policies in Continuous MDPs. ICML 2023: 18548-18562 - [c22]Shangtong Zhang, Remi Tachet des Combes, Romain Laroche:
On the Convergence of SARSA with Linear Function Approximation. ICML 2023: 41613-41646 - [c21]Hongyu Zang, Xin Li, Leiji Zhang, Yang Liu, Baigui Sun, Riashat Islam, Remi Tachet des Combes, Romain Laroche:
Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning. NeurIPS 2023 - [i33]Zhang-Wei Hong, Pulkit Agrawal, Rémi Tachet des Combes, Romain Laroche:
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting. CoRR abs/2306.13085 (2023) - [i32]Hongyu Zang, Xin Li, Leiji Zhang, Yang Liu, Baigui Sun, Riashat Islam, Remi Tachet des Combes, Romain Laroche:
Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning. CoRR abs/2310.17139 (2023) - 2022
- [j2]Shangtong Zhang, Remi Tachet des Combes, Romain Laroche:
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch. J. Mach. Learn. Res. 23: 343:1-343:91 (2022) - [c20]Romain Laroche, Remi Tachet des Combes:
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms. AISTATS 2022: 5658-5688 - [c19]Shangtong Zhang, Romain Laroche, Harm van Seijen, Shimon Whiteson, Remi Tachet des Combes:
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms. AAMAS 2022: 1491-1499 - [c18]Jesse Dodge, Taylor Prewitt, Remi Tachet des Combes, Erika Odmark, Roy Schwartz, Emma Strubell, Alexandra Sasha Luccioni, Noah A. Smith, Nicole DeCario, Will Buchanan:
Measuring the Carbon Intensity of AI in Cloud Instances. FAccT 2022: 1877-1894 - [c17]Riashat Islam, Hongyu Zang, Anirudh Goyal, Alex Lamb, Kenji Kawaguchi, Xin Li, Romain Laroche, Yoshua Bengio, Remi Tachet des Combes:
Discrete Compositional Representations as an Abstraction for Goal Conditioned Reinforcement Learning. NeurIPS 2022 - [i31]Shangtong Zhang, Remi Tachet des Combes, Romain Laroche:
On the Chattering of SARSA with Linear Function Approximation. CoRR abs/2202.06828 (2022) - [i30]Romain Laroche, Remi Tachet des Combes:
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms. CoRR abs/2202.07496 (2022) - [i29]Romain Laroche, Remi Tachet des Combes, Jacob Buckman:
Non-Markovian policies occupancy measures. CoRR abs/2205.13950 (2022) - [i28]David Brandfonbrener, Remi Tachet des Combes, Romain Laroche:
Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning. CoRR abs/2206.01085 (2022) - [i27]Jesse Dodge, Taylor Prewitt, Remi Tachet des Combes, Erika Odmark, Roy Schwartz, Emma Strubell, Alexandra Sasha Luccioni, Noah A. Smith, Nicole DeCario, Will Buchanan:
Measuring the Carbon Intensity of AI in Cloud Instances. CoRR abs/2206.05229 (2022) - [i26]Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford:
Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information. CoRR abs/2211.00164 (2022) - [i25]Riashat Islam, Hongyu Zang, Anirudh Goyal, Alex Lamb, Kenji Kawaguchi, Xin Li, Romain Laroche, Yoshua Bengio, Remi Tachet des Combes:
Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning. CoRR abs/2211.00247 (2022) - [i24]Hongyu Zang, Xin Li, Jie Yu, Chen Liu, Riashat Islam, Remi Tachet des Combes, Romain Laroche:
Behavior Prior Representation learning for Offline Reinforcement Learning. CoRR abs/2211.00863 (2022) - 2021
- [c16]Yadollah Yaghoobzadeh, Soroush Mehri, Remi Tachet des Combes, Timothy J. Hazen, Alessandro Sordoni:
Increasing Robustness to Spurious Correlations using Forgettable Examples. EACL 2021: 3319-3332 - [c15]Alexia Jolicoeur-Martineau, Rémi Piché-Taillefer, Ioannis Mitliagkas, Remi Tachet des Combes:
Adversarial score matching and improved sampling for image generation. ICLR 2021 - [c14]Alessandro Sordoni, Nouha Dziri, Hannes Schulz, Geoffrey J. Gordon, Philip Bachman, Remi Tachet des Combes:
Decomposed Mutual Information Estimation for Contrastive Representation Learning. ICML 2021: 9859-9869 - [c13]Sébastien Bubeck, Yeshwanth Cherapanamjeri, Gauthier Gidel, Remi Tachet des Combes:
A single gradient step finds adversarial examples on random two-layers neural networks. NeurIPS 2021: 10081-10091 - [c12]Romain Laroche, Remi Tachet des Combes:
Dr Jekyll & Mr Hyde: the strange case of off-policy policy updates. NeurIPS 2021: 24442-24454 - [i23]James Vuckovic, Aristide Baratin, Remi Tachet des Combes:
On the Regularity of Attention. CoRR abs/2102.05628 (2021) - [i22]Sébastien Bubeck, Yeshwanth Cherapanamjeri, Gauthier Gidel, Rémi Tachet des Combes:
A single gradient step finds adversarial examples on random two-layers neural networks. CoRR abs/2104.03863 (2021) - [i21]Alessandro Sordoni, Nouha Dziri, Hannes Schulz, Geoffrey J. Gordon, Philip Bachman, Remi Tachet des Combes:
Decomposed Mutual Information Estimation for Contrastive Representation Learning. CoRR abs/2106.13401 (2021) - [i20]Romain Laroche, Remi Tachet des Combes:
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates. CoRR abs/2109.14727 (2021) - [i19]Shangtong Zhang, Remi Tachet des Combes, Romain Laroche:
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch. CoRR abs/2111.02997 (2021) - 2020
- [c11]Ching-An Cheng, Remi Tachet des Combes, Byron Boots, Geoffrey J. Gordon:
A Reduction from Reinforcement Learning to No-Regret Online Learning. AISTATS 2020: 3514-3524 - [c10]Thiago D. Simão, Romain Laroche, Rémi Tachet des Combes:
Safe Policy Improvement with an Estimated Baseline Policy. AAMAS 2020: 1269-1277 - [c9]Dmitrii Krylov, Remi Tachet des Combes, Romain Laroche, Michael Rosenblum, Dmitry V. Dylov:
Reinforcement Learning Framework for Deep Brain Stimulation Study. IJCAI 2020: 2847-2854 - [c8]Remi Tachet des Combes, Han Zhao, Yu-Xiang Wang, Geoffrey J. Gordon:
Domain Adaptation with Conditional Distribution Matching and Generalized Label Shift. NeurIPS 2020 - [c7]Bogdan Mazoure, Remi Tachet des Combes, Thang Doan, Philip Bachman, R. Devon Hjelm:
Deep Reinforcement and InfoMax Learning. NeurIPS 2020 - [i18]Dmitrii Krylov, Rémi Tachet des Combes, Romain Laroche, Michael Rosenblum, Dmitry V. Dylov:
Reinforcement Learning Framework for Deep Brain Stimulation Study. CoRR abs/2002.10948 (2020) - [i17]Remi Tachet des Combes, Han Zhao, Yu-Xiang Wang, Geoffrey J. Gordon:
Domain Adaptation with Conditional Distribution Matching and Generalized Label Shift. CoRR abs/2003.04475 (2020) - [i16]Bogdan Mazoure, Remi Tachet des Combes, Thang Doan, Philip Bachman, R. Devon Hjelm:
Deep Reinforcement and InfoMax Learning. CoRR abs/2006.07217 (2020) - [i15]James Vuckovic, Aristide Baratin, Remi Tachet des Combes:
A Mathematical Theory of Attention. CoRR abs/2007.02876 (2020) - [i14]Alexia Jolicoeur-Martineau, Rémi Piché-Taillefer, Rémi Tachet des Combes, Ioannis Mitliagkas:
Adversarial score matching and improved sampling for image generation. CoRR abs/2009.05475 (2020) - [i13]Shangtong Zhang, Romain Laroche, Harm van Seijen, Shimon Whiteson, Remi Tachet des Combes:
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms. CoRR abs/2010.01069 (2020)
2010 – 2019
- 2019
- [j1]Dániel Kondor, Hongmou Zhang, Remi Tachet des Combes, Paolo Santi, Carlo Ratti:
Estimating Savings in Parking Demand Using Shared Vehicles for Home-Work Commuting. IEEE Trans. Intell. Transp. Syst. 20(8): 2903-2912 (2019) - [c6]Mariya Toneva, Alessandro Sordoni, Remi Tachet des Combes, Adam Trischler, Yoshua Bengio, Geoffrey J. Gordon:
An Empirical Study of Example Forgetting during Deep Neural Network Learning. ICLR (Poster) 2019 - [c5]Romain Laroche, Paul Trichelair, Remi Tachet des Combes:
Safe Policy Improvement with Baseline Bootstrapping. ICML 2019: 3652-3661 - [c4]Han Zhao, Remi Tachet des Combes, Kun Zhang, Geoffrey J. Gordon:
On Learning Invariant Representations for Domain Adaptation. ICML 2019: 7523-7532 - [c3]Kimia Nadjahi, Romain Laroche, Rémi Tachet des Combes:
Safe Policy Improvement with Soft Baseline Bootstrapping. ECML/PKDD (3) 2019: 53-68 - [i12]Han Zhao, Remi Tachet des Combes, Kun Zhang, Geoffrey J. Gordon:
On Learning Invariant Representation for Domain Adaptation. CoRR abs/1901.09453 (2019) - [i11]Kimia Nadjahi, Romain Laroche, Rémi Tachet des Combes:
Safe Policy Improvement with Soft Baseline Bootstrapping. CoRR abs/1907.05079 (2019) - [i10]Thiago D. Simão, Romain Laroche, Rémi Tachet des Combes:
Safe Policy Improvement with an Estimated Baseline Policy. CoRR abs/1909.05236 (2019) - [i9]Yadollah Yaghoobzadeh, Remi Tachet des Combes, Timothy J. Hazen, Alessandro Sordoni:
Robust Natural Language Inference Models with Example Forgetting. CoRR abs/1911.03861 (2019) - [i8]Ching-An Cheng, Remi Tachet des Combes, Byron Boots, Geoffrey J. Gordon:
A Reduction from Reinforcement Learning to No-Regret Online Learning. CoRR abs/1911.05873 (2019) - 2018
- [c2]Remi Tachet des Combes, Philip Bachman, Harm van Seijen:
Learning Invariances for Policy Generalization. ICLR (Workshop) 2018 - [i7]Xingdi Yuan, Marc-Alexandre Côté, Alessandro Sordoni, Romain Laroche, Remi Tachet des Combes, Matthew J. Hausknecht, Adam Trischler:
Counting to Explore and Generalize in Text-based Games. CoRR abs/1806.11525 (2018) - [i6]Remi Tachet des Combes, Philip Bachman, Harm van Seijen:
Learning Invariances for Policy Generalization. CoRR abs/1809.02591 (2018) - [i5]Remi Tachet des Combes, Mohammad Pezeshki, Samira Shabanian, Aaron C. Courville, Yoshua Bengio:
On the Learning Dynamics of Deep Neural Networks. CoRR abs/1809.06848 (2018) - [i4]Mariya Toneva, Alessandro Sordoni, Remi Tachet des Combes, Adam Trischler, Yoshua Bengio, Geoffrey J. Gordon:
An Empirical Study of Example Forgetting during Deep Neural Network Learning. CoRR abs/1812.05159 (2018) - 2017
- [i3]Dániel Kondor, Hongmou Zhang, Remi Tachet des Combes, Paolo Santi, Carlo Ratti:
Estimating savings in parking demand using shared vehicles for home-work commuting. CoRR abs/1710.04983 (2017) - 2015
- [i2]Stanislav Sobolevsky, Izabela Sitko, Remi Tachet des Combes, Bartosz Hawelka, Juan Murillo Arias, Carlo Ratti:
Cities through the Prism of People's Spending Behavior. CoRR abs/1505.03854 (2015) - 2014
- [c1]Stanislav Sobolevsky, Izabela Sitko, Remi Tachet des Combes, Bartosz Hawelka, Juan Murillo Arias, Carlo Ratti:
Money on the Move: Big Data of Bank Card Transactions as the New Proxy for Human Mobility Patterns and Regional Delineation. The Case of Residents and Foreign Visitors in Spain. BigData Congress 2014: 136-143 - [i1]Stanislav Sobolevsky, Izabela Sitko, Sebastian Grauwin, Remi Tachet des Combes, Bartosz Hawelka, Juan Murillo Arias, Carlo Ratti:
Mining Urban Performance: Scale-Independent Classification of Cities Based on Individual Economic Transactions. CoRR abs/1405.4301 (2014)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint