default search action

combined dblp search
author search
venue search
publication search

ask others

Search dblp

Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

> Home

Author search results

Exact matches

Sadegh Talebi

Likely matches

Mohammad Sadegh Talebi
aka: M. Sadegh Talebi
Mohammad Sadegh Talebi Mazraeh Shahi

Publication search results

found 76 matches

2024
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-15662
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-15662
Odalric-Ambrym Maillard, Mohammad Sadegh Talebi:
How to Shrink Confidence Sets for Many Equivalent Discrete Distributions? CoRR abs/2407.15662 (2024)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-02747
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-02747
Ahana Deb, Roberto Cipollone, Anders Jonsson, Alessandro Ronca, Mohammad Sadegh Talebi:
Tractable Offline Learning of Regular Decision Processes. CoRR abs/2409.02747 (2024)
2023
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/entropy/LyuCZT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/entropy/LyuCZT23
Yunlian Lyu, Aymeric Côme, Yijie Zhang, Mohammad Sadegh Talebi:
Scaling Up Q-Learning via Exploiting State-Action Equivalence. Entropy 25(4): 584 (2023)
- view
  authority control:
- export record
  dblp key:
  - journals/npl/LyuT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/npl/LyuT23
Yunlian Lyu, Mohammad Sadegh Talebi:
Double Graph Attention Networks for Visual Semantic Navigation. Neural Process. Lett. 55(7): 9019-9040 (2023)
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/acml/SaberPMT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acml/SaberPMT23
Hassan Saber, Fabien Pesquerel, Odalric-Ambrym Maillard, Mohammad Sadegh Talebi:
Logarithmic regret in communicating MDPs: Leveraging known dynamics with bandits. ACML 2023: 1167-1182
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/aistats/BourelJMT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/BourelJMT23
Hippolyte Bourel, Anders Jonsson, Odalric-Ambrym Maillard, Mohammad Sadegh Talebi:
Exploration in Reward Machines with Low Regret. AISTATS 2023: 4114-4146
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/00020RT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/00020RT23
Roberto Cipollone, Anders Jonsson, Alessandro Ronca, Mohammad Sadegh Talebi:
Provably Efficient Offline Reinforcement Learning in Regular Decision Processes. NeurIPS 2023
2022
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/ojcs/NomikosTCW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ojcs/NomikosTCW22
Nikolaos Nomikos, Mohammad Sadegh Talebi, Themistoklis Charalambous, Risto Wichman:
Bandit-Based Power Control in Full-Duplex Cooperative Relay Networks With Strict-Sense Stationary and Non-Stationary Wireless Communication Channels. IEEE Open J. Commun. Soc. 3: 366-378 (2022)
- view
  authority control:
- export record
  dblp key:
  - journals/pvldb/SkitsasPTKKK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pvldb/SkitsasPTKKK22
Konstantinos Skitsas, Ioannis G. Papageorgiou, Mohammad Sadegh Talebi, Vasiliki Kantere, Michael N. Katehakis, Panagiotis Karras:
SIFTER: Space-Efficient Value Iteration for Finite-Horizon MDPs. Proc. VLDB Endow. 16(1): 90-98 (2022)
2021
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/aistats/TalebiJM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/TalebiJM21
Mohammad Sadegh Talebi, Anders Jonsson, Odalric Maillard:
Improved Exploration in Factored Average-Reward MDPs. AISTATS 2021: 3988-3996
2020
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/BourelMT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BourelMT20
Hippolyte Bourel, Odalric Maillard, Mohammad Sadegh Talebi:
Tightening Exploration in Upper Confidence Reinforcement Learning. ICML 2020: 1056-1066
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/NomikosTWC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/NomikosTWC20
Nikolaos Nomikos, Sadegh Talebi, Risto Wichman, Themistoklis Charalambous:
Bandit-Based Relay Selection in Cooperative Networks Over Unknown Stationary Channels. MLSP 2020: 1-6
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/YangHTLW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangHTLW20
Lin Yang, Mohammad Hassan Hajiesmaili, Mohammad Sadegh Talebi, John C. S. Lui, Wing Shing Wong:
Adversarial Bandits with Corruptions: Regret Lower Bound and No-regret Algorithm. NeurIPS 2020
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2004-09656
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-09656
Hippolyte Bourel, Odalric-Ambrym Maillard, Mohammad Sadegh Talebi:
Tightening Exploration in Upper Confidence Reinforcement Learning. CoRR abs/2004.09656 (2020)
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-04575
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-04575
Mohammad Sadegh Talebi, Anders Jonsson, Odalric-Ambrym Maillard:
Improved Exploration in Factored Average-Reward MDPs. CoRR abs/2009.04575 (2020)
2019
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/acml/AsadiTBM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acml/AsadiTBM19
Mahsa Asadi, Mohammad Sadegh Talebi, Hippolyte Bourel, Odalric-Ambrym Maillard:
Model-Based Reinforcement Learning Exploiting State-Action Equivalence. ACML 2019: 204-219
- view
- export record
  dblp key:
  - conf/nips/TalebiM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/TalebiM19
Mohammad Sadegh Talebi, Odalric-Ambrym Maillard:
Learning Multiple Markov Chains via Adaptive Allocation. NeurIPS 2019: 13322-13332
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1905-11128
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-11128
M. Sadegh Talebi, Odalric-Ambrym Maillard:
Learning Multiple Markov Chains via Adaptive Allocation. CoRR abs/1905.11128 (2019)
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-04077
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-04077
Mahsa Asadi, Mohammad Sadegh Talebi, Hippolyte Bourel, Odalric-Ambrym Maillard:
Model-Based Reinforcement Learning Exploiting State-Action Equivalence. CoRR abs/1910.04077 (2019)
2018
- view
  authority control:
- export record
  dblp key:
  - journals/pomacs/TalebiP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pomacs/TalebiP18
Mohammad Sadegh Talebi, Alexandre Proutière:
Learning Proportionally Fair Allocations with Low Regret. Proc. ACM Meas. Anal. Comput. Syst. 2(2): 36:1-36:31 (2018)
- view
  authority control:
- export record
  dblp key:
  - journals/tac/TalebiZCPJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tac/TalebiZCPJ18
Mohammad Sadegh Talebi, Zhenhua Zou, Richard Combes, Alexandre Proutière, Mikael Johansson:
Stochastic Online Shortest Path Routing: The Value of Feedback. IEEE Trans. Autom. Control. 63(4): 915-930 (2018)
- view
  authority control:
- export record
  dblp key:
  - journals/tcns/HajiesmailiTK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcns/HajiesmailiTK18
Mohammad Hassan Hajiesmaili, Mohammad Sadegh Talebi, Ahmad Khonsari:
Multiperiod Network Rate Allocation With End-to-End Delay Constraints. IEEE Trans. Control. Netw. Syst. 5(3): 1087-1097 (2018)
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/alt/TalebiM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/alt/TalebiM18
Mohammad Sadegh Talebi, Odalric-Ambrym Maillard:
Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs. ALT 2018: 770-805
- view
  authority control:
- export record
  dblp key:
  - conf/iwqos/AliniaTHYC18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwqos/AliniaTHYC18
Bahram Alinia, Mohammad Sadegh Talebi, Mohammad Hassan Hajiesmaili, Ali Yekkehkhany, Noël Crespi:
Competitive Online Scheduling Algorithms with Applications in Deadline-Constrained EV Charging. IWQoS 2018: 1-10
- view
  authority control:
- export record
  dblp key:
  - conf/sigmetrics/TalebiP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigmetrics/TalebiP18
Mohammad Sadegh Talebi, Alexandre Proutière:
Learning Proportionally Fair Allocations with Low Regret. SIGMETRICS (Abstracts) 2018: 50-52
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1803-01626
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-01626
Mohammad Sadegh Talebi, Odalric-Ambrym Maillard:
Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs. CoRR abs/1803.01626 (2018)
2017
- view
  - electronic edition @ nbn-resolving.org
  - no references & citations available
  authority control:
- export record
  dblp key:
  - phd/basesearch/Shahi17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/basesearch/Shahi17
Mohammad Sadegh Talebi Mazraeh Shahi:
Minimizing Regret in Combinatorial Bandits and Reinforcement Learning. Royal Institute of Technology, Stockholm, Sweden, 2017
- view
  authority control:
- export record
  dblp key:
  - journals/rairo/JahanshahlooTLS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/rairo/JahanshahlooTLS17
Gholam Reza Jahanshahloo, Bahram Talebian, F. Hosseinzadeh Lotfi, Jafar Sadeghi:
Finding a solution for Multi-Objective Linear Fractional Programming problem based on goal programming and Data Envelopment Analysis. RAIRO Oper. Res. 51(1): 199-210 (2017)
2016
- view
  - electronic edition @ acm.org
  - no references & citations available
- export record
  dblp key:
  - conf/atal/TalebiP16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/TalebiP16
Mohammad Sadegh Talebi, Alexandre Proutière:
An Optimal Algorithm for Stochastic Matroid Bandit Optimization. AAMAS 2016: 548-556
2015
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/HajiesmailiTK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/HajiesmailiTK15
Mohammad Hassan Hajiesmaili, Mohammad Sadegh Talebi, Ahmad Khonsari:
Utility-optimal dynamic rate allocation under average end-to-end delay requirements. CDC 2015: 4842-4847

skipping 46 more matches

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.

Search dblp

Full-text search

Please enter a search query

Author search results

Venue search results

Refine list

Publication search results