default search action

combined dblp search
author search
venue search
publication search

ask others

Sertan Girgin

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c27]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/CideronAFGEBPR25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/CideronAFGEBPR25
Geoffrey Cideron, Andrea Agostinelli, Johan Ferret, Sertan Girgin, Romuald Elie, Olivier Bachem, Sarah Perrin, Alexandre Ramé:
Diversity-Rewarded CFG Distillation. ICLR 2025
[c26]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/SessaDHFVRSPFCG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SessaDHFVRSPFCG25
Pier Giuseppe Sessa, Robert Dadashi-Tazehozi, Léonard Hussenot, Johan Ferret, Nino Vieillard, Alexandre Ramé, Bobak Shahriari, Sarah Perrin, Abram L. Friesen, Geoffrey Cideron, Sertan Girgin, Piotr Stanczyk, Andrea Michi, Danila Sinopalnikov, Sabela Ramos Garea, Amélie Héliou, Aliaksei Severyn, Matthew Hoffman, Nikola Momchev, Olivier Bachem:
BOND: Aligning LLMs with Best-of-N Distillation. ICLR 2025
2024
[c25]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/CideronGVVKBMUB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/CideronGVVKBMUB24
Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli:
MusicRL: Aligning Music Generation to Human Preferences. ICML 2024
[c24]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/MunosVCARGTGMFM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MunosVCARGTGMFM24
Rémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland, Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Côme Fiegel, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, Bilal Piot:
Nash Learning from Human Feedback. ICML 2024
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-04229
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-04229
Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli:
MusicRL: Aligning Music Generation to Human Preferences. CoRR abs/2402.04229 (2024)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-07839
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-07839
Aleksandar Botev, Soham De, Samuel L. Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Armand Joulin, Noah Fiedel, Evan Senter, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, David Budden, Arnaud Doucet, Sharad Vikram, Adam Paszke, Trevor Gale, Sebastian Borgeaud, Charlie Chen, Andy Brock, Antonia Paterson, Jenny Brennan, Meg Risdal, Raj Gundluru, Nesh Devanathan, Paul Mooney, Nilay Chauhan, Phil Culliton, Luiz Gustavo Martins, Elisa Bandy, David Huntsperger, Glenn Cameron, Arthur Zucker, Tris Warkentin, Ludovic Peran, Minh Giang, Zoubin Ghahramani, Clément Farabet, Koray Kavukcuoglu, Demis Hassabis, Raia Hadsell, Yee Whye Teh, Nando de Frietas:
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models. CoRR abs/2404.07839 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-16768
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-16768
Alexandre Ramé, Johan Ferret, Nino Vieillard, Robert Dadashi, Léonard Hussenot, Pierre-Louis Cedoz, Pier Giuseppe Sessa, Sertan Girgin, Arthur Douillard, Olivier Bachem:
WARP: On the Benefits of Weight Averaged Rewarded Policies. CoRR abs/2406.16768 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-14622
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-14622
Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Nino Vieillard, Alexandre Ramé, Bobak Shahriari, Sarah Perrin, Abe Friesen, Geoffrey Cideron, Sertan Girgin, Piotr Stanczyk, Andrea Michi, Danila Sinopalnikov, Sabela Ramos, Amélie Héliou, Aliaksei Severyn, Matt Hoffman, Nikola Momchev, Olivier Bachem:
BOND: Aligning LLMs with Best-of-N Distillation. CoRR abs/2407.14622 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-00118
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-00118
Morgane Rivière, Shreya Pathak, Pier Giuseppe Sessa, Cassidy Hardin, Surya Bhupatiraju, Léonard Hussenot, Thomas Mesnard, Bobak Shahriari, Alexandre Ramé, Johan Ferret, Peter Liu, Pouya Tafti, Abe Friesen, Michelle Casbon, Sabela Ramos, Ravin Kumar, Charline Le Lan, Sammy Jerome, Anton Tsitsulin, Nino Vieillard, Piotr Stanczyk, Sertan Girgin, Nikola Momchev, Matt Hoffman, Shantanu Thakoor, Jean-Bastien Grill, Behnam Neyshabur, Olivier Bachem, Alanna Walton, Aliaksei Severyn, Alicia Parrish, Aliya Ahmad, Allen Hutchison, Alvin Abdagic, Amanda Carl, Amy Shen, Andy Brock, Andy Coenen, Anthony Laforge, Antonia Paterson, Ben Bastian, Bilal Piot, Bo Wu, Brandon Royal, Charlie Chen, Chintu Kumar, Chris Perry, Chris Welty, Christopher A. Choquette-Choo, Danila Sinopalnikov, David Weinberger, Dimple Vijaykumar, Dominika Rogozinska, Dustin Herbison, Elisa Bandy, Emma Wang, Eric Noland, Erica Moreira, Evan Senter, Evgenii Eltyshev, Francesco Visin, Gabriel Rasskin, Gary Wei, Glenn Cameron, Gus Martins, Hadi Hashemi, Hanna Klimczak-Plucinska, Harleen Batra, Harsh Dhand, Ivan Nardini, Jacinda Mein, Jack Zhou, James Svensson, Jeff Stanway, Jetha Chan, Jin Peng Zhou, Joana Carrasqueira, Joana Iljazi, Jocelyn Becker, Joe Fernandez, Joost van Amersfoort, Josh Gordon, Josh Lipschultz, Josh Newlan, Ju-yeong Ji, Kareem Mohamed, Kartikeya Badola, Kat Black, Katie Millican, Keelin McDonell, Kelvin Nguyen, Kiranbir Sodhia, Kish Greene, Lars Lowe Sjösund, Lauren Usui, Laurent Sifre, Lena Heuermann, Leticia Lago, Lilly McNealus:
Gemma 2: Improving Open Language Models at a Practical Size. CoRR abs/2408.00118 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-06084
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-06084
Geoffrey Cideron, Andrea Agostinelli, Johan Ferret, Sertan Girgin, Romuald Elie, Olivier Bachem, Sarah Perrin, Alexandre Ramé:
Diversity-Rewarded CFG Distillation. CoRR abs/2410.06084 (2024)
2023
[j6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/tacl/KharitonovVBMGP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tacl/KharitonovVBMGP23
Eugene Kharitonov, Damien Vincent, Zalán Borsos, Raphaël Marinier, Sertan Girgin, Olivier Pietquin, Matt Sharifi, Marco Tagliasacchi, Neil Zeghidour:
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision. Trans. Assoc. Comput. Linguistics 11: 1703-1718 (2023)
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/RoitFSACDGGHKMG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/RoitFSACDGGHKMG23
Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos Garea, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor:
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback. ACL (1) 2023: 6252-6272
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-03540
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-03540
Eugene Kharitonov, Damien Vincent, Zalán Borsos, Raphaël Marinier, Sertan Girgin, Olivier Pietquin, Matthew Sharifi, Marco Tagliasacchi, Neil Zeghidour:
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision. CoRR abs/2302.03540 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-01400
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-01400
Geoffrey Cideron, Baruch Tabanpour, Sebastian Curi, Sertan Girgin, Léonard Hussenot, Gabriel Dulac-Arnold, Matthieu Geist, Olivier Pietquin, Robert Dadashi:
Get Back Here: Robust Imitation by Return-to-Distribution Planning. CoRR abs/2305.01400 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-00186
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-00186
Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor:
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback. CoRR abs/2306.00186 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-00886
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-00886
Rémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland, Zhaohan Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, Bilal Piot:
Nash Learning from Human Feedback. CoRR abs/2312.00886 (2023)
2022
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/CabannesLPMGPPB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/CabannesLPMGPPB22
Theophile Cabannes, Mathieu Laurière, Julien Pérolat, Raphaël Marinier, Sertan Girgin, Sarah Perrin, Olivier Pietquin, Alexandre M. Bayen, Eric Goubault, Romuald Elie:
Solving N-Player Dynamic Routing Games with Congestion: A Mean-Field Approach. AAMAS 2022: 1557-1559
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/AdolphsHBGBCH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/AdolphsHBGBCH22
Leonard Adolphs, Michelle Chen Huebscher, Christian Buck, Sertan Girgin, Olivier Bachem, Massimiliano Ciaramita, Thomas Hofmann:
Decoding a Neural Retriever's Latent Space for Query Suggestion. EMNLP 2022: 8786-8804
[c20]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/DadashiHVGRGP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/DadashiHVGRGP22
Robert Dadashi, Léonard Hussenot, Damien Vincent, Sertan Girgin, Anton Raichuk, Matthieu Geist, Olivier Pietquin:
Continuous Control with Action Quantization from Demonstrations. ICML 2022: 4537-4557
[c19]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LaurierePGMJCPP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LaurierePGMJCPP22
Mathieu Laurière, Sarah Perrin, Sertan Girgin, Paul Muller, Ayush Jain, Theophile Cabannes, Georgios Piliouras, Julien Pérolat, Romuald Elie, Olivier Pietquin, Matthieu Geist:
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games. ICML 2022: 12078-12095
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-11973
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-11973
Mathieu Laurière, Sarah Perrin, Sertan Girgin, Paul Muller, Ayush Jain, Theophile Cabannes, Georgios Piliouras, Julien Pérolat, Romuald Élie, Olivier Pietquin, Matthieu Geist:
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games. CoRR abs/2203.11973 (2022)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-06792
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-06792
Geoffrey Cideron, Sertan Girgin, Anton Raichuk, Olivier Pietquin, Olivier Bachem, Léonard Hussenot:
vec2text with Round-Trip Translations. CoRR abs/2209.06792 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-12084
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-12084
Leonard Adolphs, Michelle Chen Huebscher, Christian Buck, Sertan Girgin, Olivier Bachem, Massimiliano Ciaramita, Thomas Hofmann:
Decoding a Neural Retriever's Latent Space for Query Suggestion. CoRR abs/2210.12084 (2022)
2021
[c18]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/AndrychowiczRSO21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/AndrychowiczRSO21
Marcin Andrychowicz, Anton Raichuk, Piotr Stanczyk, Manu Orsini, Sertan Girgin, Raphaël Marinier, Léonard Hussenot, Matthieu Geist, Olivier Pietquin, Marcin Michalski, Sylvain Gelly, Olivier Bachem:
What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study. ICLR 2021
[c17]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HussenotAVDRRMG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HussenotAVDRRMG21
Léonard Hussenot, Marcin Andrychowicz, Damien Vincent, Robert Dadashi, Anton Raichuk, Sabela Ramos, Nikola Momchev, Sertan Girgin, Raphaël Marinier, Lukasz Stafiniak, Manu Orsini, Olivier Bachem, Matthieu Geist, Olivier Pietquin:
Hyperparameter Selection for Imitation Learning. ICML 2021: 4511-4522
[c16]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/FreemanFRGMB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/FreemanFRGMB21
C. Daniel Freeman, Erik Frey, Anton Raichuk, Sertan Girgin, Igor Mordatch, Olivier Bachem:
Brax - A Differentiable Physics Engine for Large Scale Rigid Body Simulation. NeurIPS Datasets and Benchmarks 2021
[c15]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/OrsiniRHVDGGBPA21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/OrsiniRHVDGGBPA21
Manu Orsini, Anton Raichuk, Léonard Hussenot, Damien Vincent, Robert Dadashi, Sertan Girgin, Matthieu Geist, Olivier Bachem, Olivier Pietquin, Marcin Andrychowicz:
What Matters for Adversarial Imitation Learning? NeurIPS 2021: 14656-14668
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-12034
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-12034
Léonard Hussenot, Marcin Andrychowicz, Damien Vincent, Robert Dadashi, Anton Raichuk, Lukasz Stafiniak, Sertan Girgin, Raphaël Marinier, Nikola Momchev, Sabela Ramos, Manu Orsini, Olivier Bachem, Matthieu Geist, Olivier Pietquin:
Hyperparameter Selection for Imitation Learning. CoRR abs/2105.12034 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-00672
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-00672
Manu Orsini, Anton Raichuk, Léonard Hussenot, Damien Vincent, Robert Dadashi, Sertan Girgin, Matthieu Geist, Olivier Bachem, Olivier Pietquin, Marcin Andrychowicz:
What Matters for Adversarial Imitation Learning? CoRR abs/2106.00672 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-13281
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-13281
C. Daniel Freeman, Erik Frey, Anton Raichuk, Sertan Girgin, Igor Mordatch, Olivier Bachem:
Brax - A Differentiable Physics Engine for Large Scale Rigid Body Simulation. CoRR abs/2106.13281 (2021)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-10149
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-10149
Robert Dadashi, Léonard Hussenot, Damien Vincent, Sertan Girgin, Anton Raichuk, Matthieu Geist, Olivier Pietquin:
Continuous Control with Action Quantization from Demonstrations. CoRR abs/2110.10149 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-11943
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-11943
Theophile Cabannes, Mathieu Laurière, Julien Pérolat, Raphaël Marinier, Sertan Girgin, Sarah Perrin, Olivier Pietquin, Alexandre M. Bayen, Éric Goubault, Romuald Elie:
Solving N-player dynamic routing games with congestion: a mean field approach. CoRR abs/2110.11943 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-02767
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-02767
Sabela Ramos, Sertan Girgin, Léonard Hussenot, Damien Vincent, Hanna Yakubovich, Daniel Toyama, Anita Gergely, Piotr Stanczyk, Raphaël Marinier, Jeremiah Harmsen, Olivier Pietquin, Nikola Momchev:
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning. CoRR abs/2111.02767 (2021)
2020
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-05990
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-05990
Marcin Andrychowicz, Anton Raichuk, Piotr Stanczyk, Manu Orsini, Sertan Girgin, Raphaël Marinier, Léonard Hussenot, Matthieu Geist, Olivier Pietquin, Marcin Michalski, Sylvain Gelly, Olivier Bachem:
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study. CoRR abs/2006.05990 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2017
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icmi/DogruozPGJO17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmi/DogruozPGJO17
A. Seza Dogruöz, Natalia Ponomareva, Sertan Girgin, Reshu Jain, Christoph Oehler:
Text based user comments as a signal for automatic language identification of online videos. ICMI 2017: 374-378
2013
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/ijmi/KuruGAB13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijmi/KuruGAB13
Kaya Kuru, Sertan Girgin, Kemal Arda, Ugur Bozlar:
A novel report generation approach for medical applications: The SISDS methodology and its applications. Int. J. Medical Informatics 82(5): 435-447 (2013)
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/bigdataconf/SealesCYG13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bigdataconf/SealesCYG13
W. Brent Seales, Steve Crossan, Mark Yoshitake, Sertan Girgin:
From assets to stories via the Google Cultural Institute Platform. IEEE BigData 2013: 71-76
2012
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/fcsc/GirginMPN12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/fcsc/GirginMPN12
Sertan Girgin, Jérémie Mary, Philippe Preux, Olivier Nicol:
Managing advertising campaigns - an approximate planning approach. Frontiers Comput. Sci. 6(2): 209-229 (2012)
2011
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/prib/KuruG11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/prib/KuruG11
Kaya Kuru, Sertan Girgin:
A Bilinear Interpolation Based Approach for Optimizing Hematoxylin and Eosin Stained Microscopical Images. PRIB 2011: 168-178
2010
[j3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ml/GirginPA10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/GirginPA10
Sertan Girgin, Faruk Polat, Reda Alhajj:
Improving reinforcement learning by using sequence trees. Mach. Learn. 81(3): 283-331 (2010)
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icdm/GirginMPN10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icdm/GirginMPN10
Sertan Girgin, Jérémie Mary, Philippe Preux, Olivier Nicol:
Advertising Campaigns Management: Should We Be Greedy? ICDM 2010: 821-826

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/adprl/PreuxGL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/adprl/PreuxGL09
Philippe Preux, Sertan Girgin, Manuel Loth:
Feature discovery in approximate dynamic programming. ADPRL 2009: 109-116
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/aime/KuruGA09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aime/KuruGA09
Kaya Kuru, Sertan Girgin, Kemal Arda:
A Novel Multilingual Report Generation System for Medical Applications. AIME 2009: 201-205
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/ksem/KuruGABA09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ksem/KuruGABA09
Kaya Kuru, Sertan Girgin, Kemal Arda, Ugur Bozlar, Veysel Akgün:
Developing Diagnostic DSSs Based on a Novel Data Collection Methodology. KSEM 2009: 110-121
2008
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/eurogp/GirginP08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eurogp/GirginP08
Sertan Girgin, Philippe Preux:
Feature Discovery in Reinforcement Learning Using Genetic Programming. EuroGP 2008: 218-229
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/ewrl/GirginP08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ewrl/GirginP08
Sertan Girgin, Philippe Preux:
Basis Expansion in Natural Actor Critic Methods. EWRL 2008: 110-123
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icmla/GirginP08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmla/GirginP08
Sertan Girgin, Philippe Preux:
Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture. ICMLA 2008: 75-82
[p1]
- view
  authority control:
- export record
  dblp key:
  - series/ncs/SahinGBT08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/series/ncs/SahinGBT08
Erol Sahin, Sertan Girgin, Levent Bayindir, Ali Emre Turgut:
Swarm Robotics. Swarm Intelligence 2008: 87-100
[e1]
- view
  authority control:
- export record
  dblp key:
  - conf/ewrl/2008
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ewrl/2008
Sertan Girgin, Manuel Loth, Rémi Munos, Philippe Preux, Daniil Ryabko:
Recent Advances in Reinforcement Learning, 8th European Workshop, EWRL 2008, Villeneuve d'Ascq, France, June 30 - July 3, 2008, Revised and Selected Papers. Lecture Notes in Computer Science 5323, Springer 2008, ISBN 978-3-540-89721-7 [contents]
2007
[b1]
- view
  - electronic edition @ yok.gov.tr
  - details & citations
- export record
  dblp key:
  - phd/tr/Girgin07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/tr/Girgin07
Sertan Girgin:
Abstraction in reinforcement learning (Pekiştirmeli öğrenmede soyutlama). Middle East Technical University, Turkey, 2007
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/tsmc/GirginPA07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tsmc/GirginPA07
Sertan Girgin, Faruk Polat, Reda Alhajj:
Positive Impact of State Similarity on Reinforcement Learning Performance. IEEE Trans. Syst. Man Cybern. Part B 37(5): 1256-1270 (2007)
[c4]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/GirginPA07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/GirginPA07
Sertan Girgin, Faruk Polat, Reda Alhajj:
State Similarity Based Approach for Improving Performance in RL. IJCAI 2007: 817-822
2006
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/arobots/SahinGU06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/arobots/SahinGU06
Erol Sahin, Sertan Girgin, Emre Ugur:
Area measurement of large closed regions with a mobile robot. Auton. Robots 21(3): 255-266 (2006)
[c3]
- no documents available
  - details & citations
- export record
  dblp key:
  - conf/ecai/GirginPA06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ecai/GirginPA06
Sertan Girgin, Faruk Polat, Reda Alhajj:
Learning by Automatic Option Discovery from Conditionally Terminating Sequences. ECAI 2006: 494-498
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/ideal/GirginPA06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ideal/GirginPA06
Sertan Girgin, Faruk Polat, Reda Alhajj:
Effectiveness of Considering State Similarity for Reinforcement Learning. IDEAL 2006: 163-171
2005
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/cimca/GirginP05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cimca/GirginP05
Sertan Girgin, Faruk Polat:
Option Discovery in Reinforcement Learning using Frequent Common Subsequences of Actions. CIMCA/IAWTIC 2005: 371-376

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.