


default search action
Lu Wang 0029
Person information
- affiliation: Microsoft, Beijing, China
- affiliation: East China Normal University, School of Computer Science and Technology, Shanghai, China
Other persons with the same name
- Lu Wang — disambiguation page
- Lu Wang 0001
— Northeastern University, College of Information Science and Engineering, Shenyang, China (and 2 more)
- Lu Wang 0002
— Shenzhen University, College of Computer Science and Software Engineering, China (and 2 more)
- Lu Wang 0003
— Nanyang Technological University, School of Electrical and Electronic Engineering, Singapore (and 2 more)
- Lu Wang 0004
— University of Houston, TX, USA (and 3 more)
- Lu Wang 0005
— University of Southampton, Faculty of Engineering and the Environment, UK
- Lu Wang 0006 — KTH Royal Institute of Technology, Centre for Autonomous Systems and the Computer Vision and Active Perception Lab, Stockholm, Sweden
- Lu Wang 0007
— Shandong University, Department of Computer Science, China
- Lu Wang 0008
— University of Michigan, Computer Science and Engineering, Ann Arbor, MI, USA (and 2 more)
- Lu Wang 0009
— Anhui University of Technology, Department of Mechanical Engineering, Ma'anshan, China
- Lu Wang 0010
— Harbin Engineering University, Harbin, China (and 1 more)
- Lu Wang 0011 — Arizona State University, Tempe, USA
- Lu Wang 0012 — University of Rennes 1, LTSI, France (and 2 more)
- Lu Wang 0013 — University of Southern California, Department of Computer Science, Los Angeles, CA, USA
- Lu Wang 0014
— Xidian University, Software Engineering Institute, Xi'an, China
- Lu Wang 0015
— Central South University, Department of Statistics, Changsha, China (and 1 more)
- Lu Wang 0016
— Northwestern Polytechnical University, School of Aeronautics, Xi'an, China
- Lu Wang 0017
— Southwest Jiaotong University, College of Mathematics, Chengdu, China
- Lu Wang 0018
— University of International Business and Economics, Research Institute for Shenzhen, China (and 2 more)
- Lu Wang 0019
— Ghent University, Department of Electronics and Information Systems, Belgium (and 1 more)
- Lu Wang 0020
— Zhongkai University of Agriculture and Engineering, Guangzhou, China
- Lu Wang 0021
— Wuhan University, School of Resource and Environment Sciences, China
- Lu Wang 0022
— University of Sheffield, Department of Electronic and Electrical Engineering, UK
- Lu Wang 0023
— Zhejiang University of Finance and Economics, School of Information Management and Artificial Intelligence, Hangzhou, China
- Lu Wang 0024
— Qinghai University, State Key Laboratory of Plateau Ecology and Agriculture, Xining, China
- Lu Wang 0025
— Chinese Academy of Sciences, Institute of Information Engineering, Beijing, China
- Lu Wang 0026
— China Three Gorges University, College of Electrical Engineering and New Energy, Yichang, China
- Lu Wang 0027
— Guangdong University of Technology, School of Information Engineering, Guangzhou, China
- Lu Wang 0028
— Delft University of Technology, TU Delft, Department of Electrical Sustainable Energy, DCE&S group, Netherlands
- Lu Wang 0030
— Florida State University, Department of Electrical and Computer Engineering, Center for Advanced Power Systems, Tallahassee, FL, USA
- Lu Wang 0031
— JD.com, Beijing, China (and 1 more)
- Lu Wang 0032 — University of Trier, Computational Linguistics and Digital Humanities, Trier, Germany
- Lu Wang 0033
— City University of Hong Kong, Hong Kong, SAR, China
- Lu Wang 0034
— Shanghai University of Finance and Economics, Shanghai, China
- Lu Wang 0035
— SD Vocational and Technical University of International Studies, Rizhao, Shandong, China
- Lu Wang 0036
— Stevens Institute of Technology, Hoboken, NJ, USA
- Lu Wang 0037
— Macao Polytechnic University, Macao, SAR, China
- Lu Wang 0038
— China University of Geosciences, Beijing, China
- Lu Wang 0039
— The First Affiliated Hospital of Jinan University, Guangzhou, China
- Lu Wang 0040
— Hunan University, Changsha, China
- Lu Wang 0041
— Southeast University, Nanjing, Jiangsu, China
- Lu Wang 0042
— University of Science and Technology of China, Hefei, China
- Lu Wang 0043
— Liaoning Normal University, Dalian, China
- Lu Wang 0044
— Chang'an University, Xi'an, China
- Lu Wang 0045
— Technical University of Darmstadt, Darmstadt, Germany
- Lu Wang 0046
— Wuhan University, Wuhan, Hubei, China
- Lu Wang 0047
— Southeast University, Nanjing, China
- Lu Wang 0048
— Shanghai Jiao Tong University, Shanghai, China
- Lu Wang 0049
— Xi'an Jiaotong University, Xi'an, China
- Lu Wang 0050
— Xidian University, Xi'an, Shaanxi, China
- Lu Wang 0051
— Fudan University, Shanghai, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j4]Shangding Gu
, Bilgehan Sel, Yuhao Ding
, Lu Wang, Qingwei Lin
, Alois Knoll
, Ming Jin
:
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning. IEEE Trans. Pattern Anal. Mach. Intell. 47(5): 3322-3331 (2025) - [j3]Lu Wang, Fangkai Yang, Chaoyun Zhang, Junting Lu, Jiaxu Qian, Shilin He, Pu Zhao, Bo Qiao, He Huang, Si Qin, Qisheng Su, Jiayi Ye, Yudi Zhang, Jian-Guang Lou, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
Large Action Models: From Inception to Implementation. Trans. Mach. Learn. Res. 2025 (2025) - [j2]Jiajun Cui
, Hong Qian
, Chanjin Zheng
, Lu Wang
, Mo Yu
, Wei Zhang
:
Rebalancing Discriminative Responses for Knowledge Tracing. ACM Trans. Inf. Syst. 43(3): 75:1-75:25 (2025) - [c37]Huawen Feng, Pu Zhao, Qingfeng Sun, Can Xu, Fangkai Yang, Lu Wang, Qianli Ma, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models. ACL (1) 2025: 4955-4969 - [c36]Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
AXIS: Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents. ACL (1) 2025: 7711-7743 - [c35]Yudi Zhang, Pei Xiao, Lu Wang, Chaoyun Zhang, Meng Fang, Yali Du, Yevgeniy Puzyrev, Randolph Yao, Si Qin, Qingwei Lin, Mykola Pechenizkiy, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
RuAG: Learned-rule-augmented Generation for Large Language Models. ICLR 2025 - [c34]Chenghua Huang, Zhizhen Fan, Lu Wang, Fangkai Yang, Pu Zhao, Zeqi Lin, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
Self-Evolved Reward Learning for LLMS. ICLR 2025 - [c33]Pei Xiao
, Lu Wang
, Fangkai Yang
, Guoqing Geng
, Haoran Li
, Jeff Zhu
, Yu Kang
, Yifan Li
, Terry Chen
, Yue Chen
, Saravan Rajmohan
, Qi Zhang
:
Te-PID: An Adaptive Erasure Coding Temperature Management System for Optimized Cloud Storage. SIGSOFT FSE Companion 2025: 111-121 - [i42]Chenghua Huang, Lu Wang, Fangkai Yang, Pu Zhao, Zhixu Li, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance. CoRR abs/2502.16944 (2025) - [i41]Jiani Zheng, Lu Wang, Fangkai Yang, Chaoyun Zhang, Lingrui Mei, Wenjie Yin, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model. CoRR abs/2502.18906 (2025) - [i40]Yudi Zhang, Lu Wang, Meng Fang, Yali Du, Chenghua Huang, Jun Wang, Qingwei Lin, Mykola Pechenizkiy, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones? CoRR abs/2502.19557 (2025) - [i39]Chaoyun Zhang, He Huang, Chiming Ni, Jian Mu, Si Qin, Shilin He, Lu Wang, Fangkai Yang, Pu Zhao, Chao Du, Liqun Li, Yu Kang, Zhao Jiang, Suzhen Zheng, Rujia Wang, Jiaxu Qian, Minghua Ma, Jian-Guang Lou, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang:
UFO2: The Desktop AgentOS. CoRR abs/2504.14603 (2025) - [i38]Mingrui Wu, Lu Wang, Pu Zhao, Fangkai Yang, Jianjin Zhang, Jianfeng Liu, Yuefeng Zhan, Weihao Han, Hao Sun, Jiayi Ji, Xiaoshuai Sun, Qingwei Lin, Weiwei Deng, Dongmei Zhang, Feng Sun, Qi Zhang, Rongrong Ji:
RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning. CoRR abs/2505.17540 (2025) - [i37]Hanyang Wang, Lu Wang, Chaoyun Zhang, Tianjun Mao, Si Qin, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang:
Text2Grad: Reinforcement Learning from Natural Language Feedback. CoRR abs/2505.22338 (2025) - [i36]Lu Wang, Di Zhang, Fangkai Yang, Pu Zhao, Jianfeng Liu, Yuefeng Zhan, Hao Sun, Qingwei Lin, Weiwei Deng, Dongmei Zhang, Feng Sun, Qi Zhang:
LettinGo: Explore User Profile Generation for Recommendation System. CoRR abs/2506.18309 (2025) - [i35]Yue Chen, Minghua He, Fangkai Yang, Pu Zhao, Lu Wang, Yu Kang, Yifei Dong, Yuefeng Zhan, Hao Sun, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang:
WarriorMath: Enhancing the Mathematical Ability of Large Language Models with a Defect-aware Framework. CoRR abs/2508.01245 (2025) - [i34]Runchuan Zhu, Bowen Jiang, Lingrui Mei, Fangkai Yang, Lu Wang, Haoxiang Gao, Fengshuo Bai, Pu Zhao, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang:
AdaptFlow: Adaptive Workflow Optimization via Meta-Learning. CoRR abs/2508.08053 (2025) - 2024
- [c32]Shangding Gu, Bilgehan Sel, Yuhao Ding, Lu Wang, Qingwei Lin, Ming Jin, Alois Knoll:
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation. AAAI 2024: 21099-21106 - [c31]Ruomeng Ding, Chaoyun Zhang, Lu Wang, Yong Xu, Minghua Ma, Wei Zhang, Si Qin
, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation. ACL (Findings) 2024: 1638-1662 - [c30]Lu Wang
, Mayukh Das
, Fangkai Yang
, Chao Du
, Bo Qiao
, Hang Dong
, Chetan Bansal
, Si Qin
, Saravan Rajmohan
, Qingwei Lin
, Dongmei Zhang
, Qi Zhang
:
COIN: Chance-Constrained Imitation Learning for Safe and Adaptive Resource Oversubscription under Uncertainty. CIKM 2024: 4939-4947 - [c29]Kaikai An, Fangkai Yang, Junting Lu, Liqun Li, Zhixing Ren, Hao Huang, Lu Wang, Pu Zhao, Yu Kang, Hua Ding, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
Nissist: An Incident Mitigation Copilot based on Troubleshooting Guides. ECAI 2024: 4471-4474 - [c28]Jia Fu, Xiaoting Qin, Fangkai Yang, Lu Wang, Jue Zhang, Qingwei Lin, Yubo Chen, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation. EMNLP (Findings) 2024: 3875-3891 - [c27]Zezhong Wang, Fangkai Yang, Lu Wang, Pu Zhao, Hongru Wang, Liang Chen, Qingwei Lin, Kam-Fai Wong:
SELF-GUARD: Empower the LLM to Safeguard Itself. NAACL-HLT 2024: 1648-1668 - [c26]Tong Cheng, Hang Dong, Lu Wang, Bo Qiao, Qingwei Lin, Saravan Rajmohan, Thomas Moscibroda:
SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-view Total Correlation. UAI 2024: 698-717 - [c25]Tianxiang Zhao
, Wenchao Yu
, Suhang Wang
, Lu Wang
, Xiang Zhang
, Yuncong Chen
, Yanchi Liu
, Wei Cheng
, Haifeng Chen
:
Interpretable Imitation Learning with Dynamic Causal Relations. WSDM 2024: 967-975 - [i33]Lu Wang, Mayukh Das, Fangkai Yang, Junjie Sheng, Bo Qiao, Hang Dong, Si Qin, Victor Rühle, Chetan Bansal, Eli Cortez, Íñigo Goiri, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Risk-aware Adaptive Virtual CPU Oversubscription in Microsoft Cloud via Prototypical Human-in-the-loop Imitation Learning. CoRR abs/2401.07033 (2024) - [i32]Lu Wang, Mayukh Das, Fangkai Yang, Chao Du, Bo Qiao, Hang Dong, Si Qin, Chetan Bansal, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
COIN: Chance-Constrained Imitation Learning for Uncertainty-aware Adaptive Resource Oversubscription Policy. CoRR abs/2401.07051 (2024) - [i31]Lu Wang, Chao Du, Pu Zhao, Chuan Luo, Zhangchi Zhu, Bo Qiao, Wei Zhang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
Contrastive Learning with Negative Sampling Correction. CoRR abs/2401.08690 (2024) - [i30]Kaikai An, Fangkai Yang, Liqun Li, Zhixing Ren, Hao Huang, Lu Wang, Pu Zhao, Yu Kang, Hua Ding, Qingwei Lin, Saravan Rajmohan, Qi Zhang:
Nissist: An Incident Mitigation Copilot based on Troubleshooting Guides. CoRR abs/2402.17531 (2024) - [i29]Dapeng Li, Hang Dong, Lu Wang, Bo Qiao, Si Qin, Qingwei Lin, Dongmei Zhang, Qi Zhang, Zhiwei Xu, Bin Zhang, Guoliang Fan:
Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning. CoRR abs/2404.17780 (2024) - [i28]Shangding Gu, Bilgehan Sel, Yuhao Ding, Lu Wang, Qingwei Lin, Ming Jin, Alois Knoll:
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation. CoRR abs/2405.01677 (2024) - [i27]Shangding Gu, Bilgehan Sel, Yuhao Ding, Lu Wang, Qingwei Lin, Alois Knoll, Ming Jin:
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning. CoRR abs/2405.16390 (2024) - [i26]Kaikai An, Fangkai Yang, Liqun Li, Junting Lu, Sitao Cheng, Lu Wang, Pu Zhao, Lele Cao, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation. CoRR abs/2406.13372 (2024) - [i25]Jia Fu, Xiaoting Qin, Fangkai Yang, Lu Wang, Jue Zhang, Qingwei Lin, Yubo Chen, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation. CoRR abs/2406.19251 (2024) - [i24]Junting Lu, Zhiyang Zhang, Fangkai Yang, Jue Zhang, Lu Wang, Chao Du, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents. CoRR abs/2409.17140 (2024) - [i23]Chenghua Huang, Zhizhen Fan, Lu Wang, Fangkai Yang, Pu Zhao, Zeqi Lin, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
Self-Evolved Reward Learning for LLMs. CoRR abs/2411.00418 (2024) - [i22]Yichen Ouyang, Lu Wang, Fangkai Yang, Pu Zhao, Chenghua Huang, Jianfeng Liu, Bochen Pang, Yaming Yang, Yuefeng Zhan, Hao Sun, Qingwei Lin, Saravan Rajmohan, Weiwei Deng, Dongmei Zhang, Feng Sun, Qi Zhang:
Token-level Proximal Policy Optimization for Query Generation. CoRR abs/2411.00722 (2024) - [i21]Yudi Zhang, Pei Xiao, Lu Wang, Chaoyun Zhang, Meng Fang, Yali Du, Yevgeniy Puzyrev, Randolph Yao, Si Qin, Qingwei Lin, Mykola Pechenizkiy, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
RuAG: Learned-rule-augmented Generation for Large Language Models. CoRR abs/2411.03349 (2024) - [i20]Lu Wang, Fangkai Yang, Chaoyun Zhang, Junting Lu, Jiaxu Qian, Shilin He, Pu Zhao, Bo Qiao, Ray Huang, Si Qin, Qisheng Su, Jiayi Ye, Yudi Zhang, Jian-Guang Lou, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
Large Action Models: From Inception to Implementation. CoRR abs/2412.10047 (2024) - [i19]Huawen Feng
, Pu Zhao, Qingfeng Sun, Can Xu, Fangkai Yang, Lu Wang, Qianli Ma, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang:
WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models. CoRR abs/2412.17395 (2024) - 2023
- [c24]Fangkai Yang, Lu Wang, Zhenyu Xu
, Jue Zhang, Liqun Li, Bo Qiao, Camille Couturier, Chetan Bansal, Soumya Ram, Si Qin
, Zhen Ma, Íñigo Goiri, Eli Cortez, Terry Yang, Victor Rühle, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Snape: Reliable and Low-Cost Computing with Mixture of Spot and On-Demand VMs. ASPLOS (3) 2023: 631-643 - [c23]Fangkai Yang, Pu Zhao, Zezhong Wang, Lu Wang, Bo Qiao, Jue Zhang, Mohit Garg, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang:
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering. EMNLP (Industry Track) 2023: 294-312 - [c22]Yushan Jiang
, Wenchao Yu
, Dongjin Song
, Lu Wang
, Wei Cheng
, Haifeng Chen
:
FedSkill: Privacy Preserved Interpretable Skill Learning via Imitation. KDD 2023: 1010-1019 - [c21]Tianxiang Zhao
, Wenchao Yu
, Suhang Wang
, Lu Wang
, Xiang Zhang
, Yuncong Chen
, Yanchi Liu
, Wei Cheng
, Haifeng Chen
:
Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations. KDD 2023: 3513-3524 - [c20]Zhangchi Zhu
, Lu Wang
, Pu Zhao
, Chao Du
, Wei Zhang
, Hang Dong
, Bo Qiao
, Qingwei Lin
, Saravan Rajmohan
, Dongmei Zhang
:
Robust Positive-Unlabeled Learning via Noise Negative Sample Self-correction. KDD 2023: 3663-3673 - [c19]Lu Wang
, Chaoyun Zhang
, Ruomeng Ding
, Yong Xu
, Qihang Chen
, Wentao Zou
, Qingjun Chen
, Meng Zhang
, Xuedong Gao
, Hao Fan
, Saravan Rajmohan
, Qingwei Lin
, Dongmei Zhang
:
Root Cause Analysis for Microservice Systems via Hierarchical Reinforcement Learning from Human Feedback. KDD 2023: 5116-5125 - [c18]Fangkai Yang
, Jue Zhang
, Lu Wang
, Bo Qiao
, Di Weng
, Xiaoting Qin
, Gregory Weber
, Durgesh Nandini Das
, Srinivasan Rakhunathan
, Ranganathan Srikanth
, Qingwei Lin
, Dongmei Zhang
:
Contextual Self-attentive Temporal Point Process for Physical Decommissioning Prediction of Cloud Assets. KDD 2023: 5372-5381 - [c17]Liting Chen, Jie Yan, Zhengdao Shao, Lu Wang, Qingwei Lin, Saravanakumar Rajmohan, Thomas Moscibroda, Dongmei Zhang:
Conservative State Value Estimation for Offline Reinforcement Learning. NeurIPS 2023 - [c16]Ruomeng Ding
, Chaoyun Zhang
, Lu Wang
, Yong Xu
, Minghua Ma
, Xiaomin Wu
, Meng Zhang
, Qingjun Chen
, Xin Gao
, Xuedong Gao
, Hao Fan
, Saravan Rajmohan
, Qingwei Lin
, Dongmei Zhang
:
TraceDiag: Adaptive, Interpretable, and Efficient Root Cause Analysis on Large-Scale Microservice Systems. ESEC/SIGSOFT FSE 2023: 1762-1773 - [c15]Fangkai Yang
, Wenjie Yin
, Lu Wang
, Tianci Li
, Pu Zhao
, Bo Liu
, Paul Wang
, Bo Qiao
, Yudong Liu
, Mårten Björkman
, Saravan Rajmohan
, Qingwei Lin
, Dongmei Zhang
:
Diffusion-Based Time Series Data Imputation for Cloud Failure Prediction at Microsoft 365. ESEC/SIGSOFT FSE 2023: 2050-2055 - [c14]Tong Cheng
, Hang Dong
, Lu Wang
, Bo Qiao
, Si Qin
, Qingwei Lin
, Dongmei Zhang
, Saravan Rajmohan
, Thomas Moscibroda
:
Multi-Agent Reinforcement Learning with Shared Policy for Cloud Quota Management Problem. WWW (Companion Volume) 2023: 391-395 - [c13]Junjie Sheng
, Lu Wang
, Fangkai Yang
, Bo Qiao
, Hang Dong
, Xiangfeng Wang
, Bo Jin
, Jun Wang
, Si Qin
, Saravan Rajmohan
, Qingwei Lin
, Dongmei Zhang
:
Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning. WWW 2023: 2927-2936 - [i18]Liting Chen, Jie Yan, Zhengdao Shao, Lu Wang, Qingwei Lin, Dongmei Zhang:
Conservative State Value Estimation for Offline Reinforcement Learning. CoRR abs/2302.06884 (2023) - [i17]Zezhong Wang, Fangkai Yang, Pu Zhao, Lu Wang, Jue Zhang, Mohit Garg, Qingwei Lin, Dongmei Zhang:
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering. CoRR abs/2305.11541 (2023) - [i16]Liting Chen, Lu Wang, Hang Dong, Yali Du, Jie Yan, Fangkai Yang, Shuang Li, Pu Zhao, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Introspective Tips: Large Language Model for In-Context Decision Making. CoRR abs/2305.11598 (2023) - [i15]Tianxiang Zhao, Wenchao Yu, Suhang Wang
, Lu Wang, Xiang Zhang, Yuncong Chen, Yanchi Liu, Wei Cheng, Haifeng Chen:
Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations. CoRR abs/2306.07919 (2023) - [i14]Zhangchi Zhu, Lu Wang, Pu Zhao, Chao Du, Wei Zhang, Hang Dong, Bo Qiao, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang:
Robust Positive-Unlabeled Learning via Noise Negative Sample Self-correction. CoRR abs/2308.00279 (2023) - [i13]Chao Yang, Lu Wang, Kun Gao
, Shuang Li:
Reinforcement Logic Rule Learning for Temporal Point Processes. CoRR abs/2308.06094 (2023) - [i12]Bilgehan Sel, Ahmad Al-Tawaha, Vanshaj Khattar, Lu Wang, Ruoxi Jia, Ming Jin:
Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models. CoRR abs/2308.10379 (2023) - [i11]Fangkai Yang, Wenjie Yin, Lu Wang, Tianci Li, Pu Zhao, Bo Liu, Paul Wang, Bo Qiao, Yudong Liu, Mårten Björkman, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Diffusion-based Time Series Data Imputation for Microsoft 365. CoRR abs/2309.02564 (2023) - [i10]Tianxiang Zhao, Wenchao Yu, Suhang Wang
, Lu Wang, Xiang Zhang, Yuncong Chen, Yanchi Liu, Wei Cheng, Haifeng Chen:
Dynamic DAG Discovery for Interpretable Imitation Learning. CoRR abs/2310.00489 (2023) - [i9]Zezhong Wang, Fangkai Yang, Lu Wang, Pu Zhao, Hongru Wang, Liang Chen, Qingwei Lin, Kam-Fai Wong:
Self-Guard: Empower the LLM to Safeguard Itself. CoRR abs/2310.15851 (2023) - [i8]Ruomeng Ding, Chaoyun Zhang, Lu Wang, Yong Xu, Minghua Ma, Xiaomin Wu, Meng Zhang, Qingjun Chen, Xin Gao, Xuedong Gao, Hao Fan, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
TraceDiag: Adaptive, Interpretable, and Efficient Root Cause Analysis on Large-Scale Microservice Systems. CoRR abs/2310.18740 (2023) - [i7]Ruomeng Ding, Chaoyun Zhang, Lu Wang, Yong Xu, Minghua Ma, Wei Zhang, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation. CoRR abs/2311.04254 (2023) - [i6]Bo Qiao, Liqun Li, Xu Zhang, Shilin He, Yu Kang, Chaoyun Zhang, Fangkai Yang, Hang Dong, Jue Zhang, Lu Wang, Minghua Ma, Pu Zhao, Si Qin, Xiaoting Qin, Chao Du, Yong Xu, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang:
TaskWeaver: A Code-First Agent Framework. CoRR abs/2311.17541 (2023) - 2022
- [j1]Lu Wang
, Lei Han, Xinru Chen, Chengchang Li, Junzhou Huang, Weinan Zhang
, Wei Zhang
, Xiaofeng He
, Dijun Luo:
Hierarchical Multiagent Reinforcement Learning for Allocating Guaranteed Display Ads. IEEE Trans. Neural Networks Learn. Syst. 33(10): 5361-5373 (2022) - [c12]Shuang Li, Mingquan Feng, Lu Wang, Abdelmajid Essofi, Yufeng Cao, Junchi Yan, Le Song:
Explaining Point Processes by Learning Interpretable Temporal Logic Rules. ICLR 2022 - [c11]Pu Zhao, Chuan Luo, Bo Qiao, Lu Wang, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
T-SMOTE: Temporal-oriented Synthetic Minority Oversampling Technique for Imbalanced Time Series Classification. IJCAI 2022: 2406-2412 - [c10]Lu Wang, Pu Zhao, Chao Du, Chuan Luo, Mengna Su, Fangkai Yang, Yudong Liu, Qingwei Lin, Min Wang, Yingnong Dang, Hongyu Zhang, Saravan Rajmohan, Dongmei Zhang:
NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365. KDD 2022: 4032-4040 - [c9]Minghua Ma, Yudong Liu, Yuang Tong
, Haozhe Li, Pu Zhao, Yong Xu, Hongyu Zhang, Shilin He, Lu Wang, Yingnong Dang, Saravanakumar Rajmohan, Qingwei Lin:
An empirical investigation of missing data handling in cloud node failure prediction. ESEC/SIGSOFT FSE 2022: 1453-1464 - [c8]Fangkai Yang, Bowen Pang, Jue Zhang, Bo Qiao, Lu Wang, Camille Couturier, Chetan Bansal, Soumya Ram, Si Qin
, Zhen Ma, Iñigo Goiri, Eli Cortez, Senthil Baladhandayutham, Victor Rühle, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Spot Virtual Machine Eviction Prediction in Microsoft Cloud. WWW (Companion Volume) 2022: 152-156 - [i5]Junjie Sheng
, Lu Wang, Fangkai Yang, Bo Qiao, Hang Dong, Xiangfeng Wang, Bo Jin, Jun Wang, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning. CoRR abs/2211.11759 (2022) - 2021
- [i4]Lu Wang, Xiaofu Chang, Shuang Li, Yunfei Chu, Hui Li, Wei Zhang, Xiaofeng He, Le Song, Jingren Zhou, Hongxia Yang:
TCL: Transformer-based Dynamic Graph Modelling via Contrastive Learning. CoRR abs/2105.07944 (2021) - 2020
- [c7]Xinyun Chen, Lu Wang, Yizhe Hang, Heng Ge, Hongyuan Zha:
Infinite-horizon Off-Policy Policy Evaluation with Multiple Behavior Policies. ICLR 2020 - [c6]Lu Wang, Wenchao Yu, Xiaofeng He, Wei Cheng, Martin Renqiang Ren, Wei Wang
, Bo Zong, Haifeng Chen, Hongyuan Zha:
Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes✱. WWW 2020: 1785-1795
2010 – 2019
- 2019
- [c5]Lu Wang, Wei Zhang, Xiaofeng He:
Continuous Patient-Centric Sequence Generation via Sequentially Coupled Adversarial Learning. DASFAA (2) 2019: 36-52 - [c4]Lu Wang, Wenchao Yu, Wei Wang
, Wei Cheng, Wei Zhang, Hongyuan Zha, Xiaofeng He, Haifeng Chen:
Learning Robust Representations with Graph Denoising Policy Network. ICDM 2019: 1378-1383 - [i3]Lu Wang, Wenchao Yu, Wei Wang, Wei Cheng, Wei Zhang, Hongyuan Zha, Xiaofeng He, Haifeng Chen:
Learning Robust Representations with Graph Denoising Policy Network. CoRR abs/1910.01784 (2019) - [i2]Xinyun Chen, Lu Wang, Yizhe Hang, Heng Ge, Hongyuan Zha:
Infinite-horizon Off-Policy Policy Evaluation with Multiple Behavior Policies. CoRR abs/1910.04849 (2019) - 2018
- [c3]Lu Wang, Wei Zhang, Xiaofeng He, Hongyuan Zha:
Personalized Prescription for Comorbidity. DASFAA (2) 2018: 3-19 - [c2]Lu Wang, Wei Zhang, Xiaofeng He, Hongyuan Zha:
Supervised Reinforcement Learning with Recurrent Neural Network for Dynamic Treatment Recommendation. KDD 2018: 2447-2456 - [i1]Lu Wang, Wei Zhang, Xiaofeng He, Hongyuan Zha:
Supervised Reinforcement Learning with Recurrent Neural Network for Dynamic Treatment Recommendation. CoRR abs/1807.01473 (2018) - 2014
- [c1]Weishan Dong, Renjie Yao, Chunyang Ma, Changsheng Li, Lei Shi, Lu Wang, Yu Wang, Peng Gao, Junchi Yan:
Maximizing Multi-scale Spatial Statistical Discrepancy. CIKM 2014: 471-480
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-09-19 00:00 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint