default search action
Shaohan Huang
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j9]Hongyu Wang, Shuming Ma, Li Dong, Shaohan Huang, Dongdong Zhang, Furu Wei:
DeepNet: Scaling Transformers to 1,000 Layers. IEEE Trans. Pattern Anal. Mach. Intell. 46(10): 6761-6774 (2024) - [j8]Jiaxing Qi, Zhongzhi Luan, Shaohan Huang, Carol J. Fung, Hailong Yang:
LogSay: An Efficient Comprehension System for Log Numerical Reasoning. IEEE Trans. Computers 73(7): 1809-1821 (2024) - [j7]Jiaxing Qi, Zhongzhi Luan, Shaohan Huang, Carol J. Fung, Hailong Yang, Depei Qian:
SpikeLog: Log-Based Anomaly Detection via Potential-Assisted Spiking Neuron Network. IEEE Trans. Knowl. Data Eng. 36(12): 9322-9335 (2024) - [c62]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Text Diffusion with Reinforced Conditioning. AAAI 2024: 14069-14077 - [c61]Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang:
Se²: Sequential Example Selection for In-Context Learning. ACL (Findings) 2024: 5262-5284 - [c60]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition. ACL (1) 2024: 7641-7660 - [c59]Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
ResLoRA: Identity Residual Mapping in Low-Rank Adaption. ACL (Findings) 2024: 8870-8884 - [c58]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Calibrating LLM-Based Evaluator. LREC/COLING 2024: 2638-2656 - [c57]Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei:
Instruction Pre-Training: Language Models are Supervised Multitask Learners. EMNLP 2024: 2529-2550 - [c56]Ting Jiang, Shaohan Huang, Zhongzhi Luan, Deqing Wang, Fuzhen Zhuang:
Scaling Sentence Embeddings with Large Language Models. EMNLP (Findings) 2024: 3182-3196 - [c55]Shaohan Huang, Zhongzhi Luan:
Semantic-Aware Log Understanding and Analysis. HPDC 2024: 413-416 - [c54]Daixuan Cheng, Shaohan Huang, Furu Wei:
Adapting Large Language Models via Reading Comprehension. ICLR 2024 - [c53]Xichen Pan, Li Dong, Shaohan Huang, Zhiliang Peng, Wenhu Chen, Furu Wei:
Kosmos-G: Generating Images in Context with Multimodal Large Language Models. ICLR 2024 - [c52]Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Qixiang Ye, Furu Wei:
Grounding Multimodal Large Language Models to the World. ICLR 2024 - [c51]Xun Wu, Shaohan Huang, Furu Wei:
Mixture of LoRA Experts. ICLR 2024 - [c50]Shaohan Huang, Yi Liu, Jiaxing Qi, Jing Shang, Zhiwen Xiao, Carol J. Fung, Zhihui Wu, Hailong Yang, Zhongzhi Luan, Depei Qian:
Gloss: Guiding Large Language Models to Answer Questions from System Logs. SANER 2024: 91-101 - [i66]Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang:
Improving Domain Adaptation through Extended-Text Reading Comprehension. CoRR abs/2401.07284 (2024) - [i65]Haoran Li, Qingxiu Dong, Zhengyang Tang, Chaojun Wang, Xingxing Zhang, Haoyang Huang, Shaohan Huang, Xiaolong Huang, Zeqiang Huang, Dongdong Zhang, Yuxian Gu, Xin Cheng, Xun Wang, Si-Qing Chen, Li Dong, Wei Lu, Zhifang Sui, Benyou Wang, Wai Lam, Furu Wei:
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models. CoRR abs/2402.13064 (2024) - [i64]Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang:
Se2: Sequential Example Selection for In-Context Learning. CoRR abs/2402.13874 (2024) - [i63]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Text Diffusion with Reinforced Conditioning. CoRR abs/2402.14843 (2024) - [i62]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition. CoRR abs/2402.15754 (2024) - [i61]Shuming Ma, Hongyu Wang, Lingxiao Ma, Lei Wang, Wenhui Wang, Shaohan Huang, Li Dong, Ruiping Wang, Jilong Xue, Furu Wei:
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits. CoRR abs/2402.17764 (2024) - [i60]Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
ResLoRA: Identity Residual Mapping in Low-Rank Adaption. CoRR abs/2402.18039 (2024) - [i59]Yizhen Li, Shaohan Huang, Jiaxing Qi, Lei Quan, Dongran Han, Zhongzhi Luan:
Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge. CoRR abs/2403.09164 (2024) - [i58]Xun Wu, Shaohan Huang, Furu Wei:
Mixture of LoRA Experts. CoRR abs/2404.13628 (2024) - [i57]Xun Wu, Shaohan Huang, Wenhui Wang, Furu Wei:
Multi-Head Mixture-of-Experts. CoRR abs/2404.15045 (2024) - [i56]Xun Wu, Shaohan Huang, Furu Wei:
Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation. CoRR abs/2404.15100 (2024) - [i55]Yutao Sun, Li Dong, Yi Zhu, Shaohan Huang, Wenhui Wang, Shuming Ma, Quanlu Zhang, Jianyong Wang, Furu Wei:
You Only Cache Once: Decoder-Decoder Architectures for Language Models. CoRR abs/2405.05254 (2024) - [i54]Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang:
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning. CoRR abs/2405.12130 (2024) - [i53]Jiaxing Qi, Zhongzhi Luan, Shaohan Huang, Carol J. Fung, Hailong Yang, Depei Qian:
FDLoRA: Personalized Federated Learning of Large Language Model via Dual LoRA Tuning. CoRR abs/2406.07925 (2024) - [i52]Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei:
Instruction Pre-Training: Language Models are Supervised Multitask Learners. CoRR abs/2406.14491 (2024) - 2023
- [j6]Shaohan Huang, Yi Liu, Carol J. Fung, He Wang, Hailong Yang, Zhongzhi Luan:
Improving Log-Based Anomaly Detection by Pre-Training Hierarchical Transformers. IEEE Trans. Computers 72(9): 2656-2667 (2023) - [j5]Jiaxing Qi, Zhongzhi Luan, Shaohan Huang, Carol J. Fung, Hailong Yang, Hanlu Li, Danfeng Zhu, Depei Qian:
LogEncoder: Log-Based Contrastive Representation Learning for Anomaly Detection. IEEE Trans. Netw. Serv. Manag. 20(2): 1378-1391 (2023) - [c49]Yuan Xie, Shaohan Huang, Tianyu Chen, Furu Wei:
MoEC: Mixture of Expert Clusters. AAAI 2023: 13807-13815 - [c48]Beiduo Chen, Shaohan Huang, Zihan Zhang, Wu Guo, Zhenhua Ling, Haizhen Huang, Furu Wei, Weiwei Deng, Qi Zhang:
Pre-training Language Model as a Multi-perspective Course Learner. ACL (Findings) 2023: 114-128 - [c47]Ziheng Li, Shaohan Huang, Zihan Zhang, Zhi-Hong Deng, Qiang Lou, Haizhen Huang, Jian Jiao, Furu Wei, Weiwei Deng, Qi Zhang:
Dual-Alignment Pre-training for Cross-lingual Sentence Embedding. ACL (1) 2023: 3466-3478 - [c46]Jian Yang, Shuming Ma, Li Dong, Shaohan Huang, Haoyang Huang, Yuwei Yin, Dongdong Zhang, Liqun Yang, Furu Wei, Zhoujun Li:
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator. ACL (1) 2023: 9394-9412 - [c45]Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang, Alon Benhaim, Vishrav Chaudhary, Xia Song, Furu Wei:
A Length-Extrapolatable Transformer. ACL (1) 2023: 14590-14604 - [c44]Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei, Vishrav Chaudhary, Xia Song:
Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning. ACL (1) 2023: 15354-15373 - [c43]Zhaoyang Wang, Shaohan Huang, Yuxuan Liu, Jiahai Wang, Minghui Song, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Democratizing Reasoning Ability: Tailored Learning from Large Language Model. EMNLP 2023: 1948-1966 - [c42]Daixuan Cheng, Shaohan Huang, Junyu Bi, Yuefeng Zhan, Jianfeng Liu, Yujing Wang, Hao Sun, Furu Wei, Weiwei Deng, Qi Zhang:
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation. EMNLP 2023: 12318-12337 - [c41]Jiaxing Qi, Shaohan Huang, Zhongzhi Luan, Shu Yang, Carol J. Fung, Hailong Yang, Depei Qian, Jing Shang, Zhiwen Xiao, Zhihui Wu:
LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection. HPCC/DSS/SmartCity/DependSys 2023: 273-280 - [c40]Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei:
Magneto: A Foundation Transformer. ICML 2023: 36077-36092 - [c39]Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Nils Johan Bertil Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei:
Language Is Not All You Need: Aligning Perception with Language Models. NeurIPS 2023 - [i51]Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei:
Language Is Not All You Need: Aligning Perception with Language Models. CoRR abs/2302.14045 (2023) - [i50]Daixuan Cheng, Shaohan Huang, Junyu Bi, Yuefeng Zhan, Jianfeng Liu, Yujing Wang, Hao Sun, Furu Wei, Denvy Deng, Qi Zhang:
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation. CoRR abs/2303.08518 (2023) - [i49]Shaohan Huang, Yi Liu, Carol J. Fung, Jiaxing Qi, Hailong Yang, Zhongzhi Luan:
LogQA: Question Answering in Unstructured Logs. CoRR abs/2303.11715 (2023) - [i48]Beiduo Chen, Shaohan Huang, Zihan Zhang, Wu Guo, Zhenhua Ling, Haizhen Huang, Furu Wei, Weiwei Deng, Qi Zhang:
Pre-training Language Model as a Multi-perspective Course Learner. CoRR abs/2305.03981 (2023) - [i47]Ziheng Li, Shaohan Huang, Zihan Zhang, Zhi-Hong Deng, Qiang Lou, Haizhen Huang, Jian Jiao, Furu Wei, Weiwei Deng, Qi Zhang:
Dual-Alignment Pre-training for Cross-lingual Sentence Embedding. CoRR abs/2305.09148 (2023) - [i46]Tianyu Chen, Yuan Xie, Shuai Zhang, Shaohan Huang, Haoyi Zhou, Jianxin Li:
Learning Music Sequence Representation from Text Supervision. CoRR abs/2305.19602 (2023) - [i45]Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Furu Wei:
Kosmos-2: Grounding Multimodal Large Language Models to the World. CoRR abs/2306.14824 (2023) - [i44]Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, Wenhui Wang, Nanning Zheng, Furu Wei:
LongNet: Scaling Transformers to 1, 000, 000, 000 Tokens. CoRR abs/2307.02486 (2023) - [i43]Yutao Sun, Li Dong, Shaohan Huang, Shuming Ma, Yuqing Xia, Jilong Xue, Jianyong Wang, Furu Wei:
Retentive Network: A Successor to Transformer for Large Language Models. CoRR abs/2307.08621 (2023) - [i42]Ting Jiang, Shaohan Huang, Zhongzhi Luan, Deqing Wang, Fuzhen Zhuang:
Scaling Sentence Embeddings with Large Language Models. CoRR abs/2307.16645 (2023) - [i41]Jiaxing Qi, Shaohan Huang, Zhongzhi Luan, Carol J. Fung, Hailong Yang, Depei Qian:
LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection. CoRR abs/2309.01189 (2023) - [i40]Daixuan Cheng, Shaohan Huang, Furu Wei:
Adapting Large Language Models via Reading Comprehension. CoRR abs/2309.09530 (2023) - [i39]Tengchao Lv, Yupan Huang, Jingye Chen, Lei Cui, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, Li Dong, Weiyao Luo, Shaoxiang Wu, Guoxin Wang, Cha Zhang, Furu Wei:
Kosmos-2.5: A Multimodal Literate Model. CoRR abs/2309.11419 (2023) - [i38]Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Calibrating LLM-Based Evaluator. CoRR abs/2309.13308 (2023) - [i37]Xichen Pan, Li Dong, Shaohan Huang, Zhiliang Peng, Wenhu Chen, Furu Wei:
Kosmos-G: Generating Images in Context with Multimodal Large Language Models. CoRR abs/2310.02992 (2023) - [i36]Hongyu Wang, Shuming Ma, Li Dong, Shaohan Huang, Huaijie Wang, Lingxiao Ma, Fan Yang, Ruiping Wang, Yi Wu, Furu Wei:
BitNet: Scaling 1-bit Transformers for Large Language Models. CoRR abs/2310.11453 (2023) - [i35]Zhaoyang Wang, Shaohan Huang, Yuxuan Liu, Jiahai Wang, Minghui Song, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang:
Democratizing Reasoning Ability: Tailored Learning from Large Language Model. CoRR abs/2310.13332 (2023) - 2022
- [j4]Xin Sui, Guifen Shi, Guanchong Hou, Shaohan Huang, Yanshuang Li:
Impacts of COVID-19 on the Return and Volatility Nexus among Cryptocurrency Market. Complex. 2022: 5346080:1-5346080:15 (2022) - [j3]Shaozhi Dai, Zhongzhi Luan, Shaohan Huang, Carol J. Fung, He Wang, Hailong Yang, Depei Qian:
REVAL: Recommend Which Variables to Log With Pretrained Model and Graph Neural Network. IEEE Trans. Netw. Serv. Manag. 19(4): 4045-4057 (2022) - [c38]Tianyu Chen, Hangbo Bao, Shaohan Huang, Li Dong, Binxing Jiao, Daxin Jiang, Haoyi Zhou, Jianxin Li, Furu Wei:
THE-X: Privacy-Preserving Transformer Inference with Homomorphic Encryption. ACL (Findings) 2022: 3510-3520 - [c37]Zewen Chi, Shaohan Huang, Li Dong, Shuming Ma, Bo Zheng, Saksham Singhal, Payal Bajaj, Xia Song, Xian-Ling Mao, Heyan Huang, Furu Wei:
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA. ACL (1) 2022: 6170-6182 - [c36]Shaohan Huang, Yi Liu, Carol J. Fung, Hailong Yang, Zhongzhi Luan:
Black-box Attacks to Log-based Anomaly Detection. CNSM 2022: 310-316 - [c35]Jian Yang, Shaohan Huang, Shuming Ma, Yuwei Yin, Li Dong, Dongdong Zhang, Hongcheng Guo, Zhoujun Li, Furu Wei:
CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation. EMNLP (Findings) 2022: 486-496 - [c34]Daixuan Cheng, Shaohan Huang, Jianfeng Liu, Yuefeng Zhan, Hao Sun, Furu Wei, Denvy Deng, Qi Zhang:
Snapshot-Guided Domain Adaptation for ELECTRA. EMNLP (Findings) 2022: 2226-2232 - [c33]Ting Jiang, Jian Jiao, Shaohan Huang, Zihan Zhang, Deqing Wang, Fuzhen Zhuang, Furu Wei, Haizhen Huang, Denvy Deng, Qi Zhang:
PromptBERT: Improving BERT Sentence Embeddings with Prompts. EMNLP 2022: 8826-8837 - [c32]Tianyu Chen, Yuan Xie, Shuai Zhang, Shaohan Huang, Haoyi Zhou, Jianxin Li:
Learning Music Sequence Representation From Text Supervision. ICASSP 2022: 4583-4587 - [c31]Zewen Chi, Li Dong, Shaohan Huang, Damai Dai, Shuming Ma, Barun Patra, Saksham Singhal, Payal Bajaj, Xia Song, Xian-Ling Mao, Heyan Huang, Furu Wei:
On the Representation Collapse of Sparse Mixture of Experts. NeurIPS 2022 - [c30]Yunzhi Yao, Shaohan Huang, Li Dong, Furu Wei, Huajun Chen, Ningyu Zhang:
Kformer: Knowledge Injection in Transformer Feed-Forward Layers. NLPCC (1) 2022: 131-143 - [c29]Jiaxing Qi, Zhongzhi Luan, Shaohan Huang, Yukun Wang, Carol J. Fung, Hailong Yang, Depei Qian:
Adanomaly: Adaptive Anomaly Detection for System Logs with Adversarial Learning. NOMS 2022: 1-5 - [i34]Ting Jiang, Shaohan Huang, Zihan Zhang, Deqing Wang, Fuzhen Zhuang, Furu Wei, Haizhen Huang, Liangjie Zhang, Qi Zhang:
PromptBERT: Improving BERT Sentence Embeddings with Prompts. CoRR abs/2201.04337 (2022) - [i33]Yunzhi Yao, Shaohan Huang, Ningyu Zhang, Li Dong, Furu Wei, Huajun Chen:
Kformer: Knowledge Injection in Transformer Feed-Forward Layers. CoRR abs/2201.05742 (2022) - [i32]Hongyu Wang, Shuming Ma, Li Dong, Shaohan Huang, Dongdong Zhang, Furu Wei:
DeepNet: Scaling Transformers to 1, 000 Layers. CoRR abs/2203.00555 (2022) - [i31]Zewen Chi, Li Dong, Shaohan Huang, Damai Dai, Shuming Ma, Barun Patra, Saksham Singhal, Payal Bajaj, Xia Song, Furu Wei:
On the Representation Collapse of Sparse Mixture of Experts. CoRR abs/2204.09179 (2022) - [i30]Tianyu Chen, Hangbo Bao, Shaohan Huang, Li Dong, Binxing Jiao, Daxin Jiang, Haoyi Zhou, Jianxin Li, Furu Wei:
THE-X: Privacy-Preserving Transformer Inference with Homomorphic Encryption. CoRR abs/2206.00216 (2022) - [i29]Tianyu Chen, Shaohan Huang, Yuan Xie, Binxing Jiao, Daxin Jiang, Haoyi Zhou, Jianxin Li, Furu Wei:
Task-Specific Expert Pruning for Sparse Mixture-of-Experts. CoRR abs/2206.00277 (2022) - [i28]Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei:
Language Models are General-Purpose Interfaces. CoRR abs/2206.06336 (2022) - [i27]Yuan Xie, Shaohan Huang, Tianyu Chen, Furu Wei:
MoEC: Mixture of Expert Clusters. CoRR abs/2207.09094 (2022) - [i26]Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei:
Foundation Transformers. CoRR abs/2210.06423 (2022) - [i25]Jian Yang, Shaohan Huang, Shuming Ma, Yuwei Yin, Li Dong, Dongdong Zhang, Hongcheng Guo, Zhoujun Li, Furu Wei:
CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation. CoRR abs/2210.07022 (2022) - [i24]Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei, Vishrav Chaudhary, Xia Song:
Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning. CoRR abs/2210.14867 (2022) - [i23]Shuming Ma, Hongyu Wang, Shaohan Huang, Wenhui Wang, Zewen Chi, Li Dong, Alon Benhaim, Barun Patra, Vishrav Chaudhary, Xia Song, Furu Wei:
TorchScale: Transformers at Scale. CoRR abs/2211.13184 (2022) - [i22]Jian Yang, Shuming Ma, Li Dong, Shaohan Huang, Haoyang Huang, Yuwei Yin, Dongdong Zhang, Liqun Yang, Zhoujun Li, Furu Wei:
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator. CoRR abs/2212.10218 (2022) - [i21]Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang, Alon Benhaim, Vishrav Chaudhary, Xia Song, Furu Wei:
A Length-Extrapolatable Transformer. CoRR abs/2212.10554 (2022) - 2021
- [c28]Yunzhi Yao, Shaohan Huang, Wenhui Wang, Li Dong, Furu Wei:
Adapt-and-Distill: Developing Small, Fast and Effective Pretrained Language Models for Domains. ACL/IJCNLP (Findings) 2021: 460-470 - [c27]Wenhui Wang, Hangbo Bao, Shaohan Huang, Li Dong, Furu Wei:
MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers. ACL/IJCNLP (Findings) 2021: 2140-2151 - [c26]Bo Zheng, Li Dong, Shaohan Huang, Wenhui Wang, Zewen Chi, Saksham Singhal, Wanxiang Che, Ting Liu, Xia Song, Furu Wei:
Consistency Regularization for Cross-Lingual Fine-Tuning. ACL/IJCNLP (1) 2021: 3403-3417 - [c25]Zewen Chi, Li Dong, Bo Zheng, Shaohan Huang, Xian-Ling Mao, Heyan Huang, Furu Wei:
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment. ACL/IJCNLP (1) 2021: 3418-3430 - [c24]Ruiyuan Gao, Hailong Yang, Shaohan Huang, Ming Dun, Mingzhen Li, Zerong Luan, Zhongzhi Luan, Depei Qian:
PriPro: Towards Effective Privacy Protection on Edge-Cloud System running DNN Inference. CCGRID 2021: 334-343 - [c23]Zewen Chi, Li Dong, Shuming Ma, Shaohan Huang, Saksham Singhal, Xian-Ling Mao, Heyan Huang, Xia Song, Furu Wei:
mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs. EMNLP (1) 2021: 1671-1683 - [c22]Bo Zheng, Li Dong, Shaohan Huang, Saksham Singhal, Wanxiang Che, Ting Liu, Xia Song, Furu Wei:
Allocating Large Vocabulary Capacity for Cross-Lingual Language Model Pre-Training. EMNLP (1) 2021: 3203-3215 - [c21]Jian Yang, Shuming Ma, Haoyang Huang, Dongdong Zhang, Li Dong, Shaohan Huang, Alexandre Muzio, Saksham Singhal, Hany Hassan, Xia Song, Furu Wei:
Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task. WMT@EMNLP 2021: 446-455 - [i20]Zewen Chi, Li Dong, Shuming Ma, Shaohan Huang, Xian-Ling Mao, Heyan Huang, Furu Wei:
mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs. CoRR abs/2104.08692 (2021) - [i19]Zewen Chi, Li Dong, Bo Zheng, Shaohan Huang, Xian-Ling Mao, Heyan Huang, Furu Wei:
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment. CoRR abs/2106.06381 (2021) - [i18]Bo Zheng, Li Dong, Shaohan Huang, Wenhui Wang, Zewen Chi, Saksham Singhal, Wanxiang Che, Ting Liu, Xia Song, Furu Wei:
Consistency Regularization for Cross-Lingual Fine-Tuning. CoRR abs/2106.08226 (2021) - [i17]Yunzhi Yao, Shaohan Huang, Wenhui Wang, Li Dong, Furu Wei:
Adapt-and-Distill: Developing Small, Fast and Effective Pretrained Language Models for Domains. CoRR abs/2106.13474 (2021) - [i16]Shuming Ma, Li Dong, Shaohan Huang, Dongdong Zhang, Alexandre Muzio, Saksham Singhal, Hany Hassan Awadalla, Xia Song, Furu Wei:
DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders. CoRR abs/2106.13736 (2021) - [i15]Zewen Chi, Shaohan Huang, Li Dong, Shuming Ma, Saksham Singhal, Payal Bajaj, Xia Song, Furu Wei:
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA. CoRR abs/2106.16138 (2021) - [i14]Bo Zheng, Li Dong, Shaohan Huang, Saksham Singhal, Wanxiang Che, Ting Liu, Xia Song, Furu Wei:
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training. CoRR abs/2109.07306 (2021) - [i13]Ting Jiang, Shaohan Huang, Zihan Zhang, Deqing Wang, Fuzhen Zhuang, Furu Wei, Haizhen Huang, Liangjie Zhang, Qi Zhang:
Improving Non-autoregressive Generation with Mixup Training. CoRR abs/2110.11115 (2021) - [i12]Jian Yang, Shuming Ma, Haoyang Huang, Dongdong Zhang, Li Dong, Shaohan Huang, Alexandre Muzio, Saksham Singhal, Hany Hassan Awadalla, Xia Song, Furu Wei:
Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task. CoRR abs/2111.02086 (2021) - 2020
- [j2]Qingyu Zhou, Nan Yang, Furu Wei, Shaohan Huang, Ming Zhou, Tiejun Zhao:
A Joint Sentence Scoring and Selection Framework for Neural Extractive Document Summarization. IEEE ACM Trans. Audio Speech Lang. Process. 28: 671-681 (2020) - [j1]Shaohan Huang, Yi Liu, Carol J. Fung, Rong He, Yining Zhao, Hailong Yang, Zhongzhi Luan:
HitAnomaly: Hierarchical Transformers for Anomaly Detection in System Log. IEEE Trans. Netw. Serv. Manag. 17(4): 2064-2076 (2020) - [c20]Shaohan Huang, Yi Liu, Carol J. Fung, Rong He, Yining Zhao, Hailong Yang, Zhongzhi Luan:
Transfer Log-based Anomaly Detection with Pseudo Labels. CNSM 2020: 1-5 - [c19]Minghao Li, Yiheng Xu, Lei Cui, Shaohan Huang, Furu Wei, Zhoujun Li, Ming Zhou:
DocBank: A Benchmark Dataset for Document Layout Analysis. COLING 2020: 949-960 - [c18]Shaohan Huang, Furu Wei, Lei Cui, Xingxing Zhang, Ming Zhou:
Unsupervised Fine-tuning for Text Clustering. COLING 2020: 5530-5534 - [c17]Haozhe Ji, Pei Ke, Shaohan Huang, Furu Wei, Xiaoyan Zhu, Minlie Huang:
Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph. EMNLP (1) 2020: 725-736 - [c16]Shaohan Huang, Yi Liu, Carol J. Fung, Wanhe An, Rong He, Yining Zhao, Hailong Yang, Zhongzhi Luan:
A Gated Few-shot Learning Model For Anomaly Detection. ICOIN 2020: 505-509 - [c15]