default search action
Mengdi Wang
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j57]Huicheng Hao, Mengdi Wang, Hongyu Wang:
Optimization of Emergency Supply and Distribution of Fresh Agricultural Products Under Public Health Emergencies. IEEE Access 12: 28636-28653 (2024) - [j56]Amrit Singh Bedi, Anjaly Parayil, Junyu Zhang, Mengdi Wang, Alec Koppel:
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control. J. Mach. Learn. Res. 25: 39:1-39:58 (2024) - [j55]Zhenghao Xu, Xiang Ji, Minshuo Chen, Mengdi Wang, Tuo Zhao:
Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds. J. Mach. Learn. Res. 25: 226:1-226:67 (2024) - [j54]Min Jiang, Mengdi Wang, Jun Kong:
Prototype equilibrium network with group emotional contagion for few-shot emotion recognition in conversation. Int. J. Mach. Learn. Cybern. 15(6): 2229-2246 (2024) - [j53]Yanyi Chu, Dan Yu, Yupeng Li, Kaixuan Huang, Yue Shen, Le Cong, Jason Zhang, Mengdi Wang:
A 5′ UTR language model for decoding untranslated regions of mRNA and function predictions. Nat. Mac. Intell. 6(4): 449-460 (2024) - [j52]Josepha Godivier, Elizabeth A. Lawrence, Mengdi Wang, Chrissy L. Hammond, Niamh C. Nowlan:
Compressive stress gradients direct mechanoregulation of anisotropic growth in the zebrafish jaw joint. PLoS Comput. Biol. 20(2) (2024) - [j51]Jiandong Mu, Mengdi Wang, Feiwen Zhu, Jun Yang, Wei Lin, Wei Zhang:
Boosting the Convergence of Reinforcement Learning-Based Auto-Pruning Using Historical Data. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 43(2): 548-561 (2024) - [j50]Minshuo Chen, Jie Meng, Yu Bai, Yinyu Ye, H. Vincent Poor, Mengdi Wang:
Efficient Reinforcement Learning With Impaired Observability: Learning to Act With Delayed and Missing State Observations. IEEE Trans. Inf. Theory 70(10): 7251-7272 (2024) - [j49]Liang-Yong Xia, Yu Wu, Longfei Zhao, Leying Chen, Shiyi Zhang, Mengdi Wang, Jie Luo:
Redefining the Game: MVAE-DFDPnet's Low-Dimensional Embeddings for Superior Drug-Protein Interaction Predictions. IEEE J. Biomed. Health Informatics 28(7): 4317-4324 (2024) - [j48]Zheng Yu, Junyu Zhang, Zheng Wen, Andrea Tacchetti, Mengdi Wang, Ian Gemp:
Teamwork Reinforcement Learning With Concave Utilities. IEEE Trans. Mob. Comput. 23(5): 5709-5721 (2024) - [c113]Jiahao Qiu, Hui Yuan, Jinghong Zhang, Wentao Chen, Huazheng Wang, Mengdi Wang:
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization. AAAI 2024: 14686-14694 - [c112]Mengdi Wang, Anna Bodonhelyi, Efe Bozkir, Enkelejda Kasneci:
TurboSVM-FL: Boosting Federated Learning through SVM Aggregation for Lazy Clients. AAAI 2024: 15546-15554 - [c111]Xiangyu Qi, Kaixuan Huang, Ashwinee Panda, Peter Henderson, Mengdi Wang, Prateek Mittal:
Visual Adversarial Examples Jailbreak Aligned Large Language Models. AAAI 2024: 21527-21536 - [c110]Zihao Li, Xiang Ji, Minshuo Chen, Mengdi Wang:
Policy Evaluation for Reinforcement Learning from Human Feedback: A Sample Complexity Analysis. AISTATS 2024: 2737-2745 - [c109]Fuping Li, Ying Wang, Yujie Wang, Mengdi Wang, Yinhe Han, Huawei Li, Xiaowei Li:
Chipletizer: Repartitioning SoCs for Cost-Effective Chiplet Integration. ASPDAC 2024: 58-64 - [c108]Efe Bozkir, Süleyman Özdel, Ka Hei Carrie Lau, Mengdi Wang, Hong Gao, Enkelejda Kasneci:
Embedding Large Language Models into Extended Reality: Opportunities and Challenges for Inclusion, Engagement, and Privacy. CUI 2024: 38 - [c107]Souradip Chakraborty, Amrit S. Bedi, Alec Koppel, Huazheng Wang, Dinesh Manocha, Mengdi Wang, Furong Huang:
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback. ICLR 2024 - [c106]Jiacheng Guo, Minshuo Chen, Huan Wang, Caiming Xiong, Mengdi Wang, Yu Bai:
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight. ICLR 2024 - [c105]Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Dinesh Manocha, Furong Huang, Amrit S. Bedi, Mengdi Wang:
MaxMin-RLHF: Alignment with Diverse Human Preferences. ICML 2024 - [c104]Zehao Dou, Minshuo Chen, Mengdi Wang, Zhuoran Yang:
Theory of Consistency Diffusion Models: Distribution Estimation Meets Fast Sampling. ICML 2024 - [c103]Alec Koppel, Sujay Bhatt, Jiacheng Guo, Joe Eappen, Mengdi Wang, Sumitra Ganesh:
Information-Directed Pessimism for Offline Reinforcement Learning. ICML 2024 - [c102]Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson:
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications. ICML 2024 - [c101]Yuchen Wu, Minshuo Chen, Zihao Li, Mengdi Wang, Yuting Wei:
Theoretical insights for diffusion guidance: A case study for Gaussian mixture models. ICML 2024 - [c100]Lei Zhao, Mengdi Wang, Yu Bai:
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective. ICML 2024 - [c99]Shuhua Yang, Hui Yuan, Xiaoying Zhang, Mengdi Wang, Hong Zhang, Huazheng Wang:
Conversational Dueling Bandits in Generalized Linear Models. KDD 2024: 3806-3817 - [i123]Joseph C. Kim, David Bloore, Karan Kapoor, Jun Feng, Ming-Hong Hao, Mengdi Wang:
Scalable Normalizing Flows Enable Boltzmann Generators for Macromolecules. CoRR abs/2401.04246 (2024) - [i122]Jiahao Qiu, Hui Yuan, Jinghong Zhang, Wentao Chen, Huazheng Wang, Mengdi Wang:
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization. CoRR abs/2401.06173 (2024) - [i121]Mengdi Wang, Anna Bodonhelyi, Efe Bozkir, Enkelejda Kasneci:
TurboSVM-FL: Boosting Federated Learning through SVM Aggregation for Lazy Clients. CoRR abs/2401.12012 (2024) - [i120]Efe Bozkir, Süleyman Özdel, Ka Hei Carrie Lau, Mengdi Wang, Hong Gao, Enkelejda Kasneci:
Embedding Large Language Models into Extended Reality: Opportunities and Challenges for Inclusion, Engagement, and Privacy. CoRR abs/2402.03907 (2024) - [i119]Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson:
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications. CoRR abs/2402.05162 (2024) - [i118]Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Singh Bedi, Mengdi Wang:
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences. CoRR abs/2402.08925 (2024) - [i117]Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang:
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning. CoRR abs/2402.10810 (2024) - [i116]Yuchen Wu, Minshuo Chen, Zihao Li, Mengdi Wang, Yuting Wei:
Theoretical Insights for Diffusion Guidance: A Case Study for Gaussian Mixture Models. CoRR abs/2403.01639 (2024) - [i115]Zihao Li, Hui Lan, Vasilis Syrgkanis, Mengdi Wang, Masatoshi Uehara:
Regularized DeepIV with Model Selection. CoRR abs/2403.04236 (2024) - [i114]Kaiyan Chang, Kun Wang, Nan Yang, Ying Wang, Dantong Jin, Wenlong Zhu, Zhirong Chen, Cangyuan Li, Hao Yan, Yunhao Zhou, Zhuoliang Zhao, Yuan Cheng, Yudong Pan, Yiqi Liu, Mengdi Wang, Shengwen Liang, Yinhe Han, Huawei Li, Xiaowei Li:
Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation framework. CoRR abs/2403.11202 (2024) - [i113]Haque Ishfaq, Thanh Nguyen-Tang, Songtao Feng, Raman Arora, Mengdi Wang, Ming Yin, Doina Precup:
Offline Multitask Representation Learning for Reinforcement Learning. CoRR abs/2403.11574 (2024) - [i112]Hengyu Fu, Zhuoran Yang, Mengdi Wang, Minshuo Chen:
Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory. CoRR abs/2403.11968 (2024) - [i111]Xudong Guo, Kaixuan Huang, Jiale Liu, Wenhui Fan, Natalia Vélez, Qingyun Wu, Huazheng Wang, Thomas L. Griffiths, Mengdi Wang:
Embodied LLM Agents Learn to Cooperate in Organized Teams. CoRR abs/2403.12482 (2024) - [i110]Zihao Li, Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Yinyu Ye, Minshuo Chen, Mengdi Wang:
Diffusion Model for Data-Driven Black-Box Optimization. CoRR abs/2403.13219 (2024) - [i109]Minshuo Chen, Song Mei, Jianqing Fan, Mengdi Wang:
An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization. CoRR abs/2404.07771 (2024) - [i108]Yingqing Guo, Hui Yuan, Yukang Yang, Minshuo Chen, Mengdi Wang:
Gradient Guidance for Diffusion Models: An Optimization Perspective. CoRR abs/2404.14743 (2024) - [i107]Kaixuan Huang, Yuanhao Qu, Henry Cousins, William A. Johnson, Di Yin, Mihir Shah, Denny Zhou, Russ B. Altman, Mengdi Wang, Le Cong:
CRISPR-GPT: An LLM Agent for Automated Design of Gene-Editing Experiments. CoRR abs/2404.18021 (2024) - [i106]Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Geiping, Luxi He, Kaixuan Huang, Udari Madhushani, Vikash Sehwag, Weijia Shi, Boyi Wei, Tinghao Xie, Danqi Chen, Pin-Yu Chen, Jeffrey Ding, Ruoxi Jia, Jiaqi Ma, Arvind Narayanan, Weijie J. Su, Mengdi Wang, Chaowei Xiao, Bo Li, Dawn Song, Peter Henderson, Prateek Mittal:
AI Risk Management Should Incorporate Both Safety and Security. CoRR abs/2405.19524 (2024) - [i105]Kaixuan Huang, Xudong Guo, Mengdi Wang:
SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths. CoRR abs/2405.19715 (2024) - [i104]Souradip Chakraborty, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang:
Transfer Q Star: Principled Decoding for LLM Alignment. CoRR abs/2405.20495 (2024) - [i103]Xiang Ji, Sanjeev Kulkarni, Mengdi Wang, Tengyang Xie:
Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models. CoRR abs/2406.04274 (2024) - [i102]Mucong Ding, Souradip Chakraborty, Vibhu Agrawal, Zora Che, Alec Koppel, Mengdi Wang, Amrit S. Bedi, Furong Huang:
SAIL: Self-Improving Efficient Online Alignment of Large Language Models. CoRR abs/2406.15567 (2024) - [i101]Zehao Dou, Minshuo Chen, Mengdi Wang, Zhuoran Yang:
Provable Statistical Rates for Consistency Diffusion Models. CoRR abs/2406.16213 (2024) - [i100]Jibang Wu, Siyu Chen, Mengdi Wang, Huazheng Wang, Haifeng Xu:
Contractual Reinforcement Learning: Pulling Arms with Invisible Hands. CoRR abs/2407.01458 (2024) - [i99]Kaiyan Chang, Zhirong Chen, Yunhao Zhou, Wenlong Zhu, Kun Wang, Haobo Xu, Cangyuan Li, Mengdi Wang, Shengwen Liang, Huawei Li, Yinhe Han, Ying Wang:
Natural language is not enough: Benchmarking multi-modal generative AI for Verilog generation. CoRR abs/2407.08473 (2024) - [i98]Hengyu Fu, Zehao Dou, Jiawei Guo, Mengdi Wang, Minshuo Chen:
Diffusion Transformer Captures Spatial-Temporal Dependencies: A Theory for Gaussian Process Data. CoRR abs/2407.16134 (2024) - [i97]Shuhua Yang, Hui Yuan, Xiaoying Zhang, Mengdi Wang, Hong Zhang, Huazheng Wang:
Conversational Dueling Bandits in Generalized Linear Models. CoRR abs/2407.18488 (2024) - [i96]Binshuai Wang, Qiwei Di, Ming Yin, Mengdi Wang, Quanquan Gu, Peng Wei:
Relative-Translation Invariant Wasserstein Distance. CoRR abs/2409.02416 (2024) - [i95]Kaixuan Huang, Yukang Yang, Kaidi Fu, Yanyi Chu, Le Cong, Mengdi Wang:
Latent Diffusion Models for Controllable RNA Sequence Generation. CoRR abs/2409.09828 (2024) - 2023
- [j47]Chengzhuo Ni, Yaqi Duan, Munther A. Dahleh, Mengdi Wang, Anru R. Zhang:
Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition. J. Mach. Learn. Res. 24: 115:1-115:53 (2023) - [j46]Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang:
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning. J. Mach. Learn. Res. 24: 385:1-385:43 (2023) - [j45]Mingbao Lin, Yuxin Zhang, Yuchao Li, Bohong Chen, Fei Chao, Mengdi Wang, Shen Li, Yonghong Tian, Rongrong Ji:
1xN Pattern for Pruning Convolutional Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 45(4): 3999-4008 (2023) - [j44]Junyu Zhang, Mengdi Wang, Mingyi Hong, Shuzhong Zhang:
Primal-Dual First-Order Methods for Affinely Constrained Multi-block Saddle Point Problems. SIAM J. Optim. 33(2): 1035-1060 (2023) - [j43]Mengdi Wang, Di Xiao, Jia Liang, Guiqiang Hu:
Distributed privacy-preserving nested compressed sensing for multiclass data collection with identity authentication. Signal Process. 204: 108823 (2023) - [j42]Mengdi Wang, Hung Chau, Khushboo Thaker, Peter Brusilovsky, Daqing He:
Knowledge Annotation for Intelligent Textbooks. Technol. Knowl. Learn. 28(1): 1-22 (2023) - [c98]Huiqing Xu, Kuang Mao, Quihong Pan, Zhaorong Tang, Mengdi Wang, Ying Wang:
Deep Learning Compiler Optimization on Multi-Chiplet Architecture. AICAS 2023: 1-5 - [c97]Yiding Chen, Xuezhou Zhang, Kaiqing Zhang, Mengdi Wang, Xiaojin Zhu:
Byzantine-Robust Online and Offline Distributed Reinforcement Learning. AISTATS 2023: 3230-3269 - [c96]Alekh Agarwal, Yuda Song, Wen Sun, Kaiwen Wang, Mengdi Wang, Xuezhou Zhang:
Provable Benefits of Representational Transfer in Reinforcement Learning. COLT 2023: 2114-2187 - [c95]Chengsi Gao, Ying Wang, Cheng Liu, Mengdi Wang, Weiwei Chen, Yinhe Han, Lei Zhang:
Layer-Puzzle: Allocating and Scheduling Multi-task on Multi-core NPUs by Using Layer Heterogeneity. DATE 2023: 1-6 - [c94]Hui Huang, Di Xiao, Mengdi Wang:
Hierarchical Privacy-Preserving and Communication-Efficient Compression via Compressed Sensing. DCC 2023: 342 - [c93]Mengdi Wang, You Wu, Tao Ding, Xingwei Zhao, Bo Tao:
The Construction of Intelligent Grasping System Based on EEG. ICIRA (5) 2023: 245-256 - [c92]Xiang Ji, Minshuo Chen, Mengdi Wang, Tuo Zhao:
Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks. ICLR 2023 - [c91]Chuanhao Li, Huazheng Wang, Mengdi Wang, Hongning Wang:
Learning Kernelized Contextual Bandits in a Distributed and Asynchronous Environment. ICLR 2023 - [c90]Chengzhuo Ni, Yuda Song, Xuezhou Zhang, Zihan Ding, Chi Jin, Mengdi Wang:
Representation Learning for Low-rank General-sum Markov Games. ICLR 2023 - [c89]Ming Yin, Mengdi Wang, Yu-Xiang Wang:
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient. ICLR 2023 - [c88]Zheng Yu, Yikuan Li, Joseph C. Kim, Kaixuan Huang, Yuan Luo, Mengdi Wang:
Deep Reinforcement Learning for Cost-Effective Medical Diagnosis. ICLR 2023 - [c87]Souradip Chakraborty, Amrit S. Bedi, Alec Koppel, Mengdi Wang, Furong Huang, Dinesh Manocha:
STEERING : Stein Information Directed Exploration for Model-Based Reinforcement Learning. ICML 2023: 3949-3978 - [c86]Minshuo Chen, Kaixuan Huang, Tuo Zhao, Mengdi Wang:
Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data. ICML 2023: 4672-4712 - [c85]Jiacheng Guo, Zihao Li, Huazheng Wang, Mengdi Wang, Zhuoran Yang, Xuezhou Zhang:
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP. ICML 2023: 11967-11997 - [c84]Zixuan Zhang, Minshuo Chen, Mengdi Wang, Wenjing Liao, Tuo Zhao:
Effective Minkowski Dimension of Deep Nonparametric Regression: Function Approximation and Statistical Theories. ICML 2023: 40911-40931 - [c83]Minshuo Chen, Yu Bai, H. Vincent Poor, Mengdi Wang:
Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations. NeurIPS 2023 - [c82]Nikki Lijing Kuang, Ming Yin, Mengdi Wang, Yu-Xiang Wang, Yian Ma:
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation. NeurIPS 2023 - [c81]Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Minshuo Chen, Mengdi Wang:
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement. NeurIPS 2023 - [c80]Zeyu Zhang, Yi Su, Hui Yuan, Yiran Wu, Rishab Balasubramanian, Qingyun Wu, Huazheng Wang, Mengdi Wang:
Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective. NeurIPS 2023 - [i94]Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Mengdi Wang, Furong Huang, Dinesh Manocha:
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning. CoRR abs/2301.12038 (2023) - [i93]Minshuo Chen, Kaixuan Huang, Tuo Zhao, Mengdi Wang:
Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data. CoRR abs/2302.07194 (2023) - [i92]Zheng Yu, Yikuan Li, Joseph C. Kim, Kaixuan Huang, Yuan Luo, Mengdi Wang:
Deep Reinforcement Learning for Cost-Effective Medical Diagnosis. CoRR abs/2302.10261 (2023) - [i91]Kaiyan Chang, Ying Wang, Haimeng Ren, Mengdi Wang, Shengwen Liang, Yinhe Han, Huawei Li, Xiaowei Li:
ChipGPT: How far are we from natural language hardware design. CoRR abs/2305.14019 (2023) - [i90]Efe Bozkir, Süleyman Özdel, Mengdi Wang, Brendan David-John, Hong Gao, Kevin R. B. Butler, Eakta Jain, Enkelejda Kasneci:
Eye-tracked Virtual Reality: A Comprehensive Survey on Methods and Privacy Challenges. CoRR abs/2305.14080 (2023) - [i89]Zihao Li, Zhuoran Yang, Mengdi Wang:
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism. CoRR abs/2305.18438 (2023) - [i88]Zichen Wang, Rishab Balasubramanian, Hui Yuan, Chenyu Song, Mengdi Wang, Huazheng Wang:
Adversarial Attacks on Online Learning to Rank with Stochastic Click Models. CoRR abs/2305.19218 (2023) - [i87]Minshuo Chen, Yu Bai, H. Vincent Poor, Mengdi Wang:
Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations. CoRR abs/2306.01243 (2023) - [i86]Zeyu Zhang, Yi Su, Hui Yuan, Yiran Wu, Rishab Balasubramanian, Qingyun Wu, Huazheng Wang, Mengdi Wang:
Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective. CoRR abs/2306.07528 (2023) - [i85]Jiacheng Guo, Zihao Li, Huazheng Wang, Mengdi Wang, Zhuoran Yang, Xuezhou Zhang:
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP. CoRR abs/2306.12356 (2023) - [i84]Xiangyu Qi, Kaixuan Huang, Ashwinee Panda, Mengdi Wang, Prateek Mittal:
Visual Adversarial Examples Jailbreak Large Language Models. CoRR abs/2306.13213 (2023) - [i83]Zixuan Zhang, Minshuo Chen, Mengdi Wang, Wenjing Liao, Tuo Zhao:
Effective Minkowski Dimension of Deep Nonparametric Regression: Function Approximation and Statistical Theories. CoRR abs/2306.14859 (2023) - [i82]Kaiqi Zhang, Zixuan Zhang, Minshuo Chen, Mengdi Wang, Tuo Zhao, Yu-Xiang Wang:
Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks. CoRR abs/2307.01649 (2023) - [i81]Tianle Cai, Kaixuan Huang, Jason D. Lee, Mengdi Wang:
Scaling In-Context Demonstrations with Structured Attention. CoRR abs/2307.02690 (2023) - [i80]Jiacheng Guo, Minshuo Chen, Huan Wang, Caiming Xiong, Mengdi Wang, Yu Bai:
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight. CoRR abs/2307.02884 (2023) - [i79]Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Minshuo Chen, Mengdi Wang:
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement. CoRR abs/2307.07055 (2023) - [i78]Xiang Ji, Huazheng Wang, Minshuo Chen, Tuo Zhao, Mengdi Wang:
Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems. CoRR abs/2307.12975 (2023) - [i77]Siyu Chen, Mengdi Wang, Zhuoran Yang:
Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks. CoRR abs/2307.14085 (2023) - [i76]Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Dinesh Manocha, Huazheng Wang, Furong Huang, Mengdi Wang:
Aligning Agent Policy with Externalities: Reward Design via Bilevel RL. CoRR abs/2308.02585 (2023) - [i75]Yikuan Li, Chengsheng Mao, Kaixuan Huang, Hanyin Wang, Zheng Yu, Mengdi Wang, Yuan Luo:
Deep Reinforcement Learning for Efficient and Fair Allocation of Health Care Resources. CoRR abs/2309.08560 (2023) - [i74]Zhenghao Xu, Xiang Ji, Minshuo Chen, Mengdi Wang, Tuo Zhao:
Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds. CoRR abs/2309.13915 (2023) - [i73]Shuoguang Yang, Xuezhou Zhang, Mengdi Wang:
Federated Multi-Level Optimization over Decentralized Networks. CoRR abs/2310.06217 (2023) - [i72]Zihao Li, Xiang Ji, Minshuo Chen, Mengdi Wang:
Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks. CoRR abs/2310.10556 (2023) - [i71]Nikki Lijing Kuang, Ming Yin, Mengdi Wang, Yu-Xiang Wang, Yi-An Ma:
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation. CoRR abs/2310.18919 (2023) - [i70]Lei Zhao, Mengdi Wang, Yu Bai:
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? CoRR abs/2312.00054 (2023) - 2022
- [j41]Ziwei Zhu, Xudong Li, Mengdi Wang, Anru Zhang:
Learning Markov Models Via Low-Rank Optimization. Oper. Res. 70(4): 2384-2398 (2022) - [j40]Xifeng Xu, Yunni Xia, Feng Zeng, Fan Li, Hong Xie, Xiaodong Fu, Mengdi Wang:
A novel vehicular task deployment method in hybrid MEC. J. Cloud Comput. 11: 88 (2022) - [j39]Le Xie, Tong Huang, Xiangtian Zheng, Yan Liu, Mengdi Wang, Vijay Vittal, P. R. Kumar, Srinivas Shakkottai, Yi Cui:
Energy system digitization in the era of AI: A three-layered approach toward carbon neutrality. Patterns 3(12): 100640 (2022) - [j38]Qitong Xu, Chang Liu, Enshan Yang, Mengdi Wang:
An Improved Convolutional Capsule Network for Compound Fault Diagnosis of RV Reducers. Sensors 22(17): 6442 (2022) - [j37]Zhijian Zhou, Yihang Wang, Weijie Sun, Mengdi Wang:
Weight Optimization of the Induction Magnetometer at Low Frequency. IEEE Trans. Instrum. Meas. 71: 1-6 (2022) - [j36]Shiying Xiong, Zhecheng Wang, Mengdi Wang, Bo Zhu:
A clebsch method for free-surface vortical flow simulation. ACM Trans. Graph. 41(4): 116:1-116:13 (2022) - [j35]Yitong Deng, Mengdi Wang, Xiangxin Kong, Shiying Xiong, Zangyueyang Xian, Bo Zhu:
A moving eulerian-lagrangian particle method for thin film and foam simulation. ACM Trans. Graph. 41(4): 154:1-154:17 (2022) - [j34]Jinyuan Liu, Mengdi Wang, Fan Feng, Annie Tang, Qiqin Le, Bo Zhu:
Hydrophobic and Hydrophilic Solid-Fluid Interaction. ACM Trans. Graph. 41(6): 256:1-256:15 (2022) - [c79]Junyu Zhang, Amrit Singh Bedi, Mengdi Wang, Alec Koppel:
Multi-Agent Reinforcement Learning with General Utilities via Decentralized Shadow Reward Actor-Critic. AAAI 2022: 9031-9039 - [c78]Chenyu Wang, Joseph C. Kim, Le Cong, Mengdi Wang:
Neural Bandits for Protein Sequence Optimization. CISS 2022: 188-193 - [c77]