default search action
Sertan Girgin
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c25]Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli:
MusicRL: Aligning Music Generation to Human Preferences. ICML 2024 - [c24]Rémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland, Zhaohan Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Côme Fiegel, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, Bilal Piot:
Nash Learning from Human Feedback. ICML 2024 - [i20]Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli:
MusicRL: Aligning Music Generation to Human Preferences. CoRR abs/2402.04229 (2024) - [i19]Aleksandar Botev, Soham De, Samuel L. Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Armand Joulin, Noah Fiedel, Evan Senter, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, David Budden, Arnaud Doucet, Sharad Vikram, Adam Paszke, Trevor Gale, Sebastian Borgeaud, Charlie Chen, Andy Brock, Antonia Paterson, Jenny Brennan, Meg Risdal, Raj Gundluru, Nesh Devanathan, Paul Mooney, Nilay Chauhan, Phil Culliton, Luiz GUStavo Martins, Elisa Bandy, David Huntsperger, Glenn Cameron, Arthur Zucker, Tris Warkentin, Ludovic Peran, Minh Giang, Zoubin Ghahramani, Clément Farabet, Koray Kavukcuoglu, Demis Hassabis, Raia Hadsell, Yee Whye Teh, Nando de Frietas:
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models. CoRR abs/2404.07839 (2024) - [i18]Alexandre Ramé, Johan Ferret, Nino Vieillard, Robert Dadashi, Léonard Hussenot, Pierre-Louis Cedoz, Pier Giuseppe Sessa, Sertan Girgin, Arthur Douillard, Olivier Bachem:
WARP: On the Benefits of Weight Averaged Rewarded Policies. CoRR abs/2406.16768 (2024) - [i17]Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Nino Vieillard, Alexandre Ramé, Bobak Shahriari, Sarah Perrin, Abe Friesen, Geoffrey Cideron, Sertan Girgin, Piotr Stanczyk, Andrea Michi, Danila Sinopalnikov, Sabela Ramos, Amélie Héliou, Aliaksei Severyn, Matt Hoffman, Nikola Momchev, Olivier Bachem:
BOND: Aligning LLMs with Best-of-N Distillation. CoRR abs/2407.14622 (2024) - [i16]Morgane Rivière, Shreya Pathak, Pier Giuseppe Sessa, Cassidy Hardin, Surya Bhupatiraju, Léonard Hussenot, Thomas Mesnard, Bobak Shahriari, Alexandre Ramé, Johan Ferret, Peter Liu, Pouya Tafti, Abe Friesen, Michelle Casbon, Sabela Ramos, Ravin Kumar, Charline Le Lan, Sammy Jerome, Anton Tsitsulin, Nino Vieillard, Piotr Stanczyk, Sertan Girgin, Nikola Momchev, Matt Hoffman, Shantanu Thakoor, Jean-Bastien Grill, Behnam Neyshabur, Olivier Bachem, Alanna Walton, Aliaksei Severyn, Alicia Parrish, Aliya Ahmad, Allen Hutchison, Alvin Abdagic, Amanda Carl, Amy Shen, Andy Brock, Andy Coenen, Anthony Laforge, Antonia Paterson, Ben Bastian, Bilal Piot, Bo Wu, Brandon Royal, Charlie Chen, Chintu Kumar, Chris Perry, Chris Welty, Christopher A. Choquette-Choo, Danila Sinopalnikov, David Weinberger, Dimple Vijaykumar, Dominika Rogozinska, Dustin Herbison, Elisa Bandy, Emma Wang, Eric Noland, Erica Moreira, Evan Senter, Evgenii Eltyshev, Francesco Visin, Gabriel Rasskin, Gary Wei, Glenn Cameron, Gus Martins, Hadi Hashemi, Hanna Klimczak-Plucinska, Harleen Batra, Harsh Dhand, Ivan Nardini, Jacinda Mein, Jack Zhou, James Svensson, Jeff Stanway, Jetha Chan, Jin Peng Zhou, Joana Carrasqueira, Joana Iljazi, Jocelyn Becker, Joe Fernandez, Joost van Amersfoort, Josh Gordon, Josh Lipschultz, Josh Newlan, Ju-yeong Ji, Kareem Mohamed, Kartikeya Badola, Kat Black, Katie Millican, Keelin McDonell, Kelvin Nguyen, Kiranbir Sodhia, Kish Greene, Lars Lowe Sjösund, Lauren Usui, Laurent Sifre, Lena Heuermann, Leticia Lago, Lilly McNealus:
Gemma 2: Improving Open Language Models at a Practical Size. CoRR abs/2408.00118 (2024) - [i15]Geoffrey Cideron, Andrea Agostinelli, Johan Ferret, Sertan Girgin, Romuald Elie, Olivier Bachem, Sarah Perrin, Alexandre Ramé:
Diversity-Rewarded CFG Distillation. CoRR abs/2410.06084 (2024) - 2023
- [j6]Eugene Kharitonov, Damien Vincent, Zalán Borsos, Raphaël Marinier, Sertan Girgin, Olivier Pietquin, Matt Sharifi, Marco Tagliasacchi, Neil Zeghidour:
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision. Trans. Assoc. Comput. Linguistics 11: 1703-1718 (2023) - [c23]Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos Garea, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor:
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback. ACL (1) 2023: 6252-6272 - [i14]Eugene Kharitonov, Damien Vincent, Zalán Borsos, Raphaël Marinier, Sertan Girgin, Olivier Pietquin, Matthew Sharifi, Marco Tagliasacchi, Neil Zeghidour:
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision. CoRR abs/2302.03540 (2023) - [i13]Geoffrey Cideron, Baruch Tabanpour, Sebastian Curi, Sertan Girgin, Léonard Hussenot, Gabriel Dulac-Arnold, Matthieu Geist, Olivier Pietquin, Robert Dadashi:
Get Back Here: Robust Imitation by Return-to-Distribution Planning. CoRR abs/2305.01400 (2023) - [i12]Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor:
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback. CoRR abs/2306.00186 (2023) - [i11]Rémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland, Zhaohan Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, Bilal Piot:
Nash Learning from Human Feedback. CoRR abs/2312.00886 (2023) - 2022
- [c22]Theophile Cabannes, Mathieu Laurière, Julien Pérolat, Raphaël Marinier, Sertan Girgin, Sarah Perrin, Olivier Pietquin, Alexandre M. Bayen, Eric Goubault, Romuald Elie:
Solving N-Player Dynamic Routing Games with Congestion: A Mean-Field Approach. AAMAS 2022: 1557-1559 - [c21]Leonard Adolphs, Michelle Chen Huebscher, Christian Buck, Sertan Girgin, Olivier Bachem, Massimiliano Ciaramita, Thomas Hofmann:
Decoding a Neural Retriever's Latent Space for Query Suggestion. EMNLP 2022: 8786-8804 - [c20]Robert Dadashi, Léonard Hussenot, Damien Vincent, Sertan Girgin, Anton Raichuk, Matthieu Geist, Olivier Pietquin:
Continuous Control with Action Quantization from Demonstrations. ICML 2022: 4537-4557 - [c19]Mathieu Laurière, Sarah Perrin, Sertan Girgin, Paul Muller, Ayush Jain, Theophile Cabannes, Georgios Piliouras, Julien Pérolat, Romuald Elie, Olivier Pietquin, Matthieu Geist:
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games. ICML 2022: 12078-12095 - [i10]Mathieu Laurière, Sarah Perrin, Sertan Girgin, Paul Muller, Ayush Jain, Theophile Cabannes, Georgios Piliouras, Julien Pérolat, Romuald Élie, Olivier Pietquin, Matthieu Geist:
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games. CoRR abs/2203.11973 (2022) - [i9]Geoffrey Cideron, Sertan Girgin, Anton Raichuk, Olivier Pietquin, Olivier Bachem, Léonard Hussenot:
vec2text with Round-Trip Translations. CoRR abs/2209.06792 (2022) - [i8]Leonard Adolphs, Michelle Chen Huebscher, Christian Buck, Sertan Girgin, Olivier Bachem, Massimiliano Ciaramita, Thomas Hofmann:
Decoding a Neural Retriever's Latent Space for Query Suggestion. CoRR abs/2210.12084 (2022) - 2021
- [c18]Marcin Andrychowicz, Anton Raichuk, Piotr Stanczyk, Manu Orsini, Sertan Girgin, Raphaël Marinier, Léonard Hussenot, Matthieu Geist, Olivier Pietquin, Marcin Michalski, Sylvain Gelly, Olivier Bachem:
What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study. ICLR 2021 - [c17]Léonard Hussenot, Marcin Andrychowicz, Damien Vincent, Robert Dadashi, Anton Raichuk, Sabela Ramos, Nikola Momchev, Sertan Girgin, Raphaël Marinier, Lukasz Stafiniak, Manu Orsini, Olivier Bachem, Matthieu Geist, Olivier Pietquin:
Hyperparameter Selection for Imitation Learning. ICML 2021: 4511-4522 - [c16]C. Daniel Freeman, Erik Frey, Anton Raichuk, Sertan Girgin, Igor Mordatch, Olivier Bachem:
Brax - A Differentiable Physics Engine for Large Scale Rigid Body Simulation. NeurIPS Datasets and Benchmarks 2021 - [c15]Manu Orsini, Anton Raichuk, Léonard Hussenot, Damien Vincent, Robert Dadashi, Sertan Girgin, Matthieu Geist, Olivier Bachem, Olivier Pietquin, Marcin Andrychowicz:
What Matters for Adversarial Imitation Learning? NeurIPS 2021: 14656-14668 - [i7]Léonard Hussenot, Marcin Andrychowicz, Damien Vincent, Robert Dadashi, Anton Raichuk, Lukasz Stafiniak, Sertan Girgin, Raphaël Marinier, Nikola Momchev, Sabela Ramos, Manu Orsini, Olivier Bachem, Matthieu Geist, Olivier Pietquin:
Hyperparameter Selection for Imitation Learning. CoRR abs/2105.12034 (2021) - [i6]Manu Orsini, Anton Raichuk, Léonard Hussenot, Damien Vincent, Robert Dadashi, Sertan Girgin, Matthieu Geist, Olivier Bachem, Olivier Pietquin, Marcin Andrychowicz:
What Matters for Adversarial Imitation Learning? CoRR abs/2106.00672 (2021) - [i5]C. Daniel Freeman, Erik Frey, Anton Raichuk, Sertan Girgin, Igor Mordatch, Olivier Bachem:
Brax - A Differentiable Physics Engine for Large Scale Rigid Body Simulation. CoRR abs/2106.13281 (2021) - [i4]Robert Dadashi, Léonard Hussenot, Damien Vincent, Sertan Girgin, Anton Raichuk, Matthieu Geist, Olivier Pietquin:
Continuous Control with Action Quantization from Demonstrations. CoRR abs/2110.10149 (2021) - [i3]Theophile Cabannes, Mathieu Laurière, Julien Pérolat, Raphaël Marinier, Sertan Girgin, Sarah Perrin, Olivier Pietquin, Alexandre M. Bayen, Éric Goubault, Romuald Elie:
Solving N-player dynamic routing games with congestion: a mean field approach. CoRR abs/2110.11943 (2021) - [i2]Sabela Ramos, Sertan Girgin, Léonard Hussenot, Damien Vincent, Hanna Yakubovich, Daniel Toyama, Anita Gergely, Piotr Stanczyk, Raphaël Marinier, Jeremiah Harmsen, Olivier Pietquin, Nikola Momchev:
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning. CoRR abs/2111.02767 (2021) - 2020
- [i1]Marcin Andrychowicz, Anton Raichuk, Piotr Stanczyk, Manu Orsini, Sertan Girgin, Raphaël Marinier, Léonard Hussenot, Matthieu Geist, Olivier Pietquin, Marcin Michalski, Sylvain Gelly, Olivier Bachem:
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study. CoRR abs/2006.05990 (2020)
2010 – 2019
- 2017
- [c14]A. Seza Dogruöz, Natalia Ponomareva, Sertan Girgin, Reshu Jain, Christoph Oehler:
Text based user comments as a signal for automatic language identification of online videos. ICMI 2017: 374-378 - 2013
- [j5]Kaya Kuru, Sertan Girgin, Kemal Arda, Ugur Bozlar:
A novel report generation approach for medical applications: The SISDS methodology and its applications. Int. J. Medical Informatics 82(5): 435-447 (2013) - [c13]W. Brent Seales, Steve Crossan, Mark Yoshitake, Sertan Girgin:
From assets to stories via the Google Cultural Institute Platform. IEEE BigData 2013: 71-76 - 2012
- [j4]Sertan Girgin, Jérémie Mary, Philippe Preux, Olivier Nicol:
Managing advertising campaigns - an approximate planning approach. Frontiers Comput. Sci. 6(2): 209-229 (2012) - 2011
- [c12]Kaya Kuru, Sertan Girgin:
A Bilinear Interpolation Based Approach for Optimizing Hematoxylin and Eosin Stained Microscopical Images. PRIB 2011: 168-178 - 2010
- [j3]Sertan Girgin, Faruk Polat, Reda Alhajj:
Improving reinforcement learning by using sequence trees. Mach. Learn. 81(3): 283-331 (2010) - [c11]Sertan Girgin, Jérémie Mary, Philippe Preux, Olivier Nicol:
Advertising Campaigns Management: Should We Be Greedy? ICDM 2010: 821-826
2000 – 2009
- 2009
- [c10]Philippe Preux, Sertan Girgin, Manuel Loth:
Feature discovery in approximate dynamic programming. ADPRL 2009: 109-116 - [c9]Kaya Kuru, Sertan Girgin, Kemal Arda:
A Novel Multilingual Report Generation System for Medical Applications. AIME 2009: 201-205 - [c8]Kaya Kuru, Sertan Girgin, Kemal Arda, Ugur Bozlar, Veysel Akgün:
Developing Diagnostic DSSs Based on a Novel Data Collection Methodology. KSEM 2009: 110-121 - 2008
- [c7]Sertan Girgin, Philippe Preux:
Feature Discovery in Reinforcement Learning Using Genetic Programming. EuroGP 2008: 218-229 - [c6]Sertan Girgin, Philippe Preux:
Basis Expansion in Natural Actor Critic Methods. EWRL 2008: 110-123 - [c5]Sertan Girgin, Philippe Preux:
Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture. ICMLA 2008: 75-82 - [p1]Erol Sahin, Sertan Girgin, Levent Bayindir, Ali Emre Turgut:
Swarm Robotics. Swarm Intelligence 2008: 87-100 - [e1]Sertan Girgin, Manuel Loth, Rémi Munos, Philippe Preux, Daniil Ryabko:
Recent Advances in Reinforcement Learning, 8th European Workshop, EWRL 2008, Villeneuve d'Ascq, France, June 30 - July 3, 2008, Revised and Selected Papers. Lecture Notes in Computer Science 5323, Springer 2008, ISBN 978-3-540-89721-7 [contents] - 2007
- [b1]Sertan Girgin:
Abstraction in reinforcement learning (Pekiştirmeli öğrenmede soyutlama). Middle East Technical University, Turkey, 2007 - [j2]Sertan Girgin, Faruk Polat, Reda Alhajj:
Positive Impact of State Similarity on Reinforcement Learning Performance. IEEE Trans. Syst. Man Cybern. Part B 37(5): 1256-1270 (2007) - [c4]Sertan Girgin, Faruk Polat, Reda Alhajj:
State Similarity Based Approach for Improving Performance in RL. IJCAI 2007: 817-822 - 2006
- [j1]Erol Sahin, Sertan Girgin, Emre Ugur:
Area measurement of large closed regions with a mobile robot. Auton. Robots 21(3): 255-266 (2006) - [c3]Sertan Girgin, Faruk Polat, Reda Alhajj:
Learning by Automatic Option Discovery from Conditionally Terminating Sequences. ECAI 2006: 494-498 - [c2]Sertan Girgin, Faruk Polat, Reda Alhajj:
Effectiveness of Considering State Similarity for Reinforcement Learning. IDEAL 2006: 163-171 - 2005
- [c1]Sertan Girgin, Faruk Polat:
Option Discovery in Reinforcement Learning using Frequent Common Subsequences of Actions. CIMCA/IAWTIC 2005: 371-376
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-19 20:48 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint