![](https://dblp.org/img/logo.ua.320x120.png)
![](https://dblp.org/img/dropdown.dark.16x16.png)
![](https://dblp.org/img/peace.dark.16x16.png)
Остановите войну!
for scientists:
![search dblp search dblp](https://dblp.org/img/search.dark.16x16.png)
![search dblp](https://dblp.org/img/search.dark.16x16.png)
default search action
Search dblp
Full-text search
- > Home
Please enter a search query
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal" - exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics" - boolean and: separate words by space
e.g., codd model - boolean or: connect words by pipe symbol (|)
e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search result, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
Author search results
Exact matches
Venue search results
no matches
Refine list
refine by author
- no options
- temporarily not available
refine by venue
- no options
- temporarily not available
refine by type
- no options
- temporarily not available
refine by access
- no options
- temporarily not available
refine by year
- no options
- temporarily not available
Publication search results
found 37 matches
- 2024
- Ruiyang Jin, Zaiwei Chen, Yiheng Lin, Jie Song, Adam Wierman:
Approximate Global Convergence of Independent Learning in Multi-Agent Systems. CoRR abs/2405.19811 (2024) - 2023
- Yizhou Zhang
, Guannan Qu
, Pan Xu
, Yiheng Lin
, Zaiwei Chen
, Adam Wierman
:
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning. Proc. ACM Meas. Anal. Comput. Syst. 7(1): 13:1-13:51 (2023) - Zaiwei Chen
, John-Paul Clarke, Siva Theja Maguluri:
Target Network and Truncation Overcome the Deadly Triad in \(\boldsymbol{Q}\)-Learning. SIAM J. Math. Data Sci. 5(4): 1078-1101 (2023) - Chenwei Wu, Li Erran Li, Stefano Ermon, Patrick Haffner, Rong Ge, Zaiwei Zhang:
The Role of Linguistic Priors in Measuring Compositional Generalization of Vision-Language Models. ICBINB 2023: 118-126 - Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman:
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games. NeurIPS 2023 - Yizhou Zhang
, Guannan Qu
, Pan Xu
, Yiheng Lin
, Zaiwei Chen
, Adam Wierman
:
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning. SIGMETRICS (Abstracts) 2023: 83-84 - Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman:
Convergence rates for localized actor-critic in networked Markov potential games. UAI 2023: 2563-2573 - Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman:
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games. CoRR abs/2303.03100 (2023) - Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman:
Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games. CoRR abs/2303.04865 (2023) - Zaiwei Chen, Siva Theja Maguluri, Martin Zubeldia:
Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise. CoRR abs/2303.15740 (2023) - Haitao Yang, Zaiwei Zhang, Xiangru Huang, Min Bai, Chen Song, Bo Sun, Li Erran Li, Qixing Huang:
LiDAR-Based 3D Object Detection via Hybrid 2D Semantic Scene Generation. CoRR abs/2304.01519 (2023) - Chenwei Wu, Li Erran Li, Stefano Ermon, Patrick Haffner, Rong Ge, Zaiwei Zhang:
The Role of Linguistic Priors in Measuring Compositional Generalization of Vision-Language Models. CoRR abs/2310.02777 (2023) - Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman E. Ozdaglar, Adam Wierman:
Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games. CoRR abs/2312.04905 (2023) - 2022
- Zaiwei Chen, Sheng Zhang, Thinh T. Doan, John-Paul Clarke, Siva Theja Maguluri:
Finite-sample analysis of nonlinear stochastic approximation with applications in reinforcement learning. Autom. 146: 110623 (2022) - Zaiwei Chen
, Sajad Khodadadian
, Siva Theja Maguluri
:
Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation. IEEE Control. Syst. Lett. 6: 2611-2616 (2022) - Zaiwei Chen, Shancong Mou
, Siva Theja Maguluri:
Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization. Proc. ACM Meas. Anal. Comput. Syst. 6(1): 19:1-19:24 (2022) - Zaiwei Chen:
A Unified Lyapunov Framework for Finite-Sample Analysis of Reinforcement Learning Algorithms. SIGMETRICS Perform. Evaluation Rev. 50(3): 12-15 (2022) - Zaiwei Chen, Siva Theja Maguluri:
Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation. AISTATS 2022: 11195-11214 - Ziqian Bai
, Yimin Wang
, Jiawei Guo
, Anqi Wang
, Hua Yue
, Ze Gao
, Zesheng Castiel Chen
, Xiaoyu Zhou
, Ziwei Pan
, Hao Li
, Yanming Sheng
, Jianwei Zheng
, Zaiwei Liu
, Kejia Zhang
, Yunhua Zhang
, Zerong Hong
, Liang Tan
:
Interspaces Between Nature and the City. CCHI 2022: 291-303 - Zaiwei Chen, Shancong Mou, Siva Theja Maguluri:
Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization. SIGMETRICS (Abstracts) 2022: 109-110 - Zaiwei Chen, John-Paul Clarke, Siva Theja Maguluri:
Target Network and Truncation Overcome The Deadly triad in Q-Learning. CoRR abs/2203.02628 (2022) - Zaiwei Chen, Siva Theja Maguluri:
Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation. CoRR abs/2208.03247 (2022) - Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman:
Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning. CoRR abs/2211.17116 (2022) - 2021
- Sajad Khodadadian, Zaiwei Chen, Siva Theja Maguluri:
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm. ICML 2021: 5420-5431 - Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam
:
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators. NeurIPS 2021: 21440-21452 - Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam:
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants. CoRR abs/2102.01567 (2021) - Sajad Khodadadian, Zaiwei Chen, Siva Theja Maguluri:
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm. CoRR abs/2102.09318 (2021) - Fanruiqi Zeng, Zaiwei Chen, John-Paul Clarke, David Goldsman:
Nested Vehicle Routing Problem: Optimizing Drone-Truck Surveillance Operations. CoRR abs/2103.01528 (2021) - Zaiwei Chen, Sajad Khodadadian, Siva Theja Maguluri:
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation. CoRR abs/2105.12540 (2021) - Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam:
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators. CoRR abs/2106.12729 (2021)
skipping 7 more matches
loading more results
failed to load more results, please try again later
![](https://dblp.org/img/cog.dark.24x24.png)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
retrieved on 2024-07-27 08:58 CEST from data curated by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint