


Bhiksha Raj
Bhiksha Ramakrishnan
Person information

- affiliation: Carnegie Mellon University, Pittsburgh, USA
2020 – today
- 2023
- [j36]Viet-Khoa Vo-Ho, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, Ngan Le:
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation. Int. J. Comput. Vis. 131(1): 302-323 (2023) - [j35]Weiyang Liu, Yandong Wen, Bhiksha Raj, Rita Singh, Adrian Weller:
SphereFace Revived: Unifying Hyperspherical Face Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 2458-2474 (2023) - [c224]Xiang Li, Haoyuan Cao, Shijie Zhao, Junlin Li, Li Zhang, Bhiksha Raj:
Panoramic Video Salient Object Detection with Ambisonic Audio Guidance. AAAI 2023: 1424-1432 - [c223]Kashu Yamazaki, Khoa Vo, Quang Sang Truong, Bhiksha Raj, Ngan Le:
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning. AAAI 2023: 3081-3090 - [c222]Thanh-Dat Truong, Ngan Le, Bhiksha Raj, Jackson D. Cothren, Khoa Luu:
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding. CVPR 2023: 19988-19997 - [c221]Hao Chen, Ran Tao, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Bhiksha Raj, Marios Savvides:
SoftMatch: Addressing the Quantity-Quality Tradeoff in Semi-supervised Learning. ICLR 2023 - [c220]Yidong Wang, Hao Chen, Qiang Heng, Wenxin Hou, Yue Fan, Zhen Wu, Jindong Wang, Marios Savvides, Takahiro Shinozaki, Bhiksha Raj, Bernt Schiele, Xing Xie:
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning. ICLR 2023 - [c219]Raphaël Olivier, Bhiksha Raj:
How Many Perturbations Break This Model? Evaluating Robustness Beyond Adversarial Accuracy. ICML 2023: 26583-26598 - [i114]Samiran Gode, Supreeth Bare, Bhiksha Raj, Hyungon Yoo:
Understanding Political Polarisation using Language Models: A dataset and method. CoRR abs/2301.00891 (2023) - [i113]Hao Chen, Ran Tao, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Bhiksha Raj, Marios Savvides:
SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning. CoRR abs/2301.10921 (2023) - [i112]Yunyang Zeng, Joseph Konan, Shuo Han, David Bick, Muqiao Yang, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement. CoRR abs/2302.08088 (2023) - [i111]Muqiao Yang, Joseph Konan, David Bick, Yunyang Zeng, Shuo Han, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement. CoRR abs/2302.08095 (2023) - [i110]Laurie M. Heller, Benjamin Elizalde, Bhiksha Raj, Soham Deshmukh:
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session. CoRR abs/2302.09719 (2023) - [i109]Ankit Shah, Shuyi Chen, Kejun Zhou, Yue Chen, Bhiksha Raj:
Approach to Learning Generalized Audio Representation Through Batch Embedding Covariance Regularization and Constant-Q Transforms. CoRR abs/2303.03591 (2023) - [i108]Joseph Konan, Ojas Bhargave, Shikhar Agnihotri, Hojeong Lee, Ankit Shah, Shuo Han, Yunyang Zeng, Amanda Shu, Haohui Liu, Xuankai Chang, Hamza Khalid, Minseon Gwak, Kawon Lee, Minjeong Kim, Bhiksha Raj:
Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms. CoRR abs/2303.09048 (2023) - [i107]Thanh-Dat Truong, Ngan Le, Bhiksha Raj, Jackson David Cothren, Khoa Luu:
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding. CoRR abs/2304.02135 (2023) - [i106]Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj:
GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content. CoRR abs/2305.07969 (2023) - [i105]Hao Chen, Ankit Shah, Jindong Wang, Ran Tao, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj:
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations. CoRR abs/2305.12715 (2023) - [i104]Thanh-Dat Truong, Hoang-Quan Nguyen, Bhiksha Raj, Khoa Luu:
Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments. CoRR abs/2305.15700 (2023) - [i103]Xiang Li, Chung-Ching Lin, Yinpeng Chen, Zicheng Liu, Jinglu Wang, Bhiksha Raj:
PaintSeg: Training-free Segmentation via Painting. CoRR abs/2305.19406 (2023) - [i102]Pha A. Nguyen, Kha Gia Quach, John Gauch, Samee U. Khan, Bhiksha Raj, Khoa Luu:
UTOPIA: Unconstrained Tracking Objects without Preliminary Examination via Cross-Domain Adaptation. CoRR abs/2306.09613 (2023) - [i101]Roshan S. Sharma, Kenneth Zheng, Siddhant Arora, Shinji Watanabe, Rita Singh, Bhiksha Raj:
BASS: Block-wise Adaptation for Speech Summarization. CoRR abs/2307.08217 (2023) - [i100]Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj:
Rethinking Voice-Face Correlation: A Geometry View. CoRR abs/2307.13948 (2023) - [i99]Liao Qu, Xianwei Zou, Xiang Li, Yandong Wen, Rita Singh, Bhiksha Raj:
The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features. CoRR abs/2307.13953 (2023) - [i98]Muhammad A. Shah, Bhiksha Raj:
Training on Foveated Images Improves Robustness to Adversarial Attacks. CoRR abs/2308.00854 (2023) - [i97]Muhammad Ahmed Shah, Bhiksha Raj:
Fixed Inter-Neuron Covariability Induces Adversarial Robustness. CoRR abs/2308.03956 (2023) - [i96]Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang:
Training Audio Captioning Models without Audio. CoRR abs/2309.07372 (2023) - [i95]Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan S. Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-yi Lee:
Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech. CoRR abs/2309.09510 (2023) - [i94]Ankit Shah, Fuyu Tang, Zelin Ye, Rita Singh, Bhiksha Raj:
Importance of negative sampling in weak label learning. CoRR abs/2309.13227 (2023) - 2022
- [c218]Roshan Sharma, Bhiksha Raj:
Cross-utterance context for multimodal video transcription. IEEECONF 2022: 1321-1325 - [c217]Yandong Wen, Weiyang Liu, Adrian Weller, Bhiksha Raj, Rita Singh:
SphereFace2: Binary Classification is All You Need for Deep Face Recognition. ICLR 2022 - [c216]Hira Dhamyal, Bhiksha Raj, Rita Singh:
Positional Encoding for Capturing Modality Specific Cadence for Emotion Detection. INTERSPEECH 2022: 166-170 - [c215]Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Towards End-to-End Private Automatic Speaker Recognition. INTERSPEECH 2022: 2798-2802 - [c214]Muqiao Yang, Joseph Konan, David Bick, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
Improving Speech Enhancement through Fine-Grained Speech Characteristics. INTERSPEECH 2022: 2953-2957 - [c213]Raphaël Olivier, Bhiksha Raj:
Recent improvements of ASR models in the face of adversarial attacks. INTERSPEECH 2022: 4113-4117 - [c212]Yidong Wang, Hao Chen, Yue Fan, Wang Sun, Ran Tao, Wenxin Hou, Renjie Wang, Linyi Yang, Zhi Zhou, Lan-Zhe Guo, Heli Qi, Zhen Wu, Yu-Feng Li, Satoshi Nakamura, Wei Ye, Marios Savvides, Bhiksha Raj, Takahiro Shinozaki, Bernt Schiele, Jindong Wang, Xing Xie, Yue Zhang:
USB: A Unified Semi-supervised Learning Benchmark for Classification. NeurIPS 2022 - [i93]Larry Tang, Po Hao Chou, Yi Yu Zheng, Ziqian Ge, Ankit Shah, Bhiksha Raj:
Ontological Learning from Weak Labels. CoRR abs/2203.02483 (2022) - [i92]Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse H. Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk:
HEAR 2021: Holistic Evaluation of Audio Representations. CoRR abs/2203.03022 (2022) - [i91]Shentong Mo, Jingfei Xia, Xiaoqing Tan, Bhiksha Raj:
Point3D: tracking actions as moving points with 3D CNNs. CoRR abs/2203.10584 (2022) - [i90]Raphaël Olivier, Bhiksha Raj:
Recent improvements of ASR models in the face of adversarial attacks. CoRR abs/2203.16536 (2022) - [i89]Ankit Shah, Hira Dhamyal, Yang Gao, Rita Singh, Bhiksha Raj:
On the pragmatism of using binary classifiers over data intensive neural network classifiers for detection of COVID-19 from voice. CoRR abs/2204.04802 (2022) - [i88]Yidong Wang, Hao Chen, Qiang Heng, Wenxin Hou, Yue Fan, Zhen Wu, Marios Savvides, Takahiro Shinozaki, Bhiksha Raj, Bernt Schiele:
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning. CoRR abs/2205.07246 (2022) - [i87]Chonghan Chen, Qi Jiang, Chih-Hao Wang, Noel Chen, Haohan Wang, Xiang Li, Bhiksha Raj:
Bear the Query in Mind: Visual Grounding with Query-conditioned Convolution. CoRR abs/2206.09114 (2022) - [i86]Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Towards End-to-End Private Automatic Speaker Recognition. CoRR abs/2206.11750 (2022) - [i85]Roshan Sharma, Tyler Vuong, Mark Lindsey, Hira Dhamyal, Rita Singh, Bhiksha Raj:
Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction. CoRR abs/2206.12568 (2022) - [i84]Muqiao Yang, Joseph Konan, David Bick, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
Improving Speech Enhancement through Fine-Grained Speech Characteristics. CoRR abs/2207.00237 (2022) - [i83]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiao Li, Yan Lu, Bhiksha Raj:
R^2VOS: Robust Referring Video Object Segmentation via Relational Multimodal Cycle Consistency. CoRR abs/2207.01203 (2022) - [i82]Raphaël Olivier, Bhiksha Raj:
Not all broken defenses are equal: The dead angles of adversarial accuracy. CoRR abs/2207.04129 (2022) - [i81]Xiang Li, Jinglu Wang, Xiaohao Xu, Bhiksha Raj, Yan Lu:
Online Video Instance Segmentation via Robust Context Fusion. CoRR abs/2207.05580 (2022) - [i80]Yidong Wang, Hao Chen, Yue Fan, Wang Sun, Ran Tao, Wenxin Hou, Renjie Wang, Linyi Yang, Zhi Zhou, Lan-Zhe Guo, Heli Qi, Zhen Wu, Yu-Feng Li, Satoshi Nakamura, Wei Ye, Marios Savvides, Bhiksha Raj, Takahiro Shinozaki, Bernt Schiele, Jindong Wang, Xing Xie, Yue Zhang:
USB: A Unified Semi-supervised Learning Benchmark. CoRR abs/2208.07204 (2022) - [i79]Raphaël Olivier, Hadi Abdullah, Bhiksha Raj:
Watch What You Pretrain For: Targeted, Transferable Adversarial Examples on Self-Supervised Speech Recognition models. CoRR abs/2209.13523 (2022) - [i78]Khoa Vo, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, Ngan Le:
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation. CoRR abs/2210.02578 (2022) - [i77]Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Privacy-preserving Automatic Speaker Diarization. CoRR abs/2210.14995 (2022) - [i76]Roshan Sharma, Hira Dhamyal, Bhiksha Raj, Rita Singh:
Unifying the Discrete and Continuous Emotion labels for Speech Emotion Recognition. CoRR abs/2210.16642 (2022) - [i75]Roshan Sharma, Bhiksha Raj:
XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers. CoRR abs/2210.16643 (2022) - [i74]Raphaël Olivier, Bhiksha Raj:
There is more than one kind of robustness: Fooling Whisper with adversarial examples. CoRR abs/2210.17316 (2022) - [i73]Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Describing emotions with acoustic property prompts for speech emotion recognition. CoRR abs/2211.07737 (2022) - [i72]Hao Chen, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Marios Savvides, Bhiksha Raj:
An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning. CoRR abs/2211.11086 (2022) - [i71]Xiang Li, Haoyuan Cao, Shijie Zhao, Junlin Li, Li Zhang, Bhiksha Raj:
Panoramic Video Salient Object Detection with Ambisonic Audio Guidance. CoRR abs/2211.14419 (2022) - [i70]Kashu Yamazaki, Khoa Vo, Sang Truong, Bhiksha Raj, Ngan Le:
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning. CoRR abs/2211.15103 (2022) - 2021
- [j34]Wenbo Liu, Ming Li, Xiaobing Zou, Bhiksha Raj:
Discriminative Dictionary Learning for Autism Spectrum Disorder Identification. Frontiers Comput. Neurosci. 15: 662401 (2021) - [c211]Shentong Mo, Jingfei Xia, Xiaoqing Tan, Bhiksha Raj:
Point3D: tracking actions as moving points with 3D CNNs. BMVC 2021: 259 - [c210]Raphaël Olivier, Bhiksha Raj:
Sequential Randomized Smoothing for Adversarially Robust Speech Recognition. EMNLP (1) 2021: 6372-6386 - [c209]Raphaël Olivier, Bhiksha Raj, Muhammad Shah:
High-Frequency Adversarial Defense for Speech and Audio. ICASSP 2021: 2995-2999 - [c208]Muhammad A. Shah, Raphaël Olivier, Bhiksha Raj:
Towards Adversarial Robustness Via Compact Feature Representations. ICASSP 2021: 3845-3849 - [c207]Ali Shahin Shamsabadi, Francisco Sepúlveda Teixeira, Alberto Abad, Bhiksha Raj, Andrea Cavallaro, Isabel Trancoso:
FoolHD: Fooling Speaker Identification by Highly Imperceptible Adversarial Disturbances. ICASSP 2021: 6159-6163 - [c206]Maria Joana Correia, Francisco Teixeira, Catarina Botelho, Isabel Trancoso, Bhiksha Raj:
The in-the-Wild Speech Medical Corpus. ICASSP 2021: 6973-6977 - [c205]Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu:
The Right to Talk: An Audio-Visual Transformer Approach. ICCV 2021: 1085-1094 - [c204]Kai Hu, Jie Shao, Yuan Liu, Bhiksha Raj, Marios Savvides, Zhiqiang Shen:
Contrast and Order Representations for Video Self-supervised Learning. ICCV 2021: 7919-7929 - [c203]Yandong Wen, Weiyang Liu, Bhiksha Raj, Rita Singh:
Self-Supervised 3D Face Reconstruction via Conditional Estimation. ICCV 2021: 13269-13278 - [c202]Soham Deshmukh, Bhiksha Raj, Rita Singh:
Improving Weakly Supervised Sound Event Detection with Self-Supervised Auxiliary Tasks. Interspeech 2021: 596-600 - [c201]Jiachen Lian, Aiswarya Vinod Kumar, Hira Dhamyal, Bhiksha Raj, Rita Singh:
Masked Proxy Loss for Text-Independent Speaker Verification. Interspeech 2021: 4638-4642 - [c200]Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse H. Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk:
HEAR: Holistic Evaluation of Audio Representations. NeurIPS (Competition and Demos) 2021: 125-145 - [c199]Yang Gao, Jiachen Lian, Bhiksha Raj, Rita Singh:
Detection and Evaluation of Human and Machine Generated Speech in Spoofing Attacks on Automatic Speaker Verification Systems. SLT 2021: 544-551 - [c198]Benjamin Elizalde, Radu Revutchi, Samarjit Das, Bhiksha Raj, Ian R. Lane, Laurie M. Heller:
Identifying Actions for Sound Event Classification. WASPAA 2021: 26-30 - [i69]Bronya Roni Chernyak, Bhiksha Raj, Tamir Hazan, Joseph Keshet:
Constant Random Perturbations Provide Adversarial Robustness with Minimal Effect on Accuracy. CoRR abs/2103.08265 (2021) - [i68]Anxiang Zhang, Ankit Shah, Bhiksha Raj:
Training image classifiers using Semi-Weak Label Data. CoRR abs/2103.10608 (2021) - [i67]Benjamin Elizalde, Radu Revutchi, Samarjit Das, Bhiksha Raj, Ian R. Lane, Laurie M. Heller:
Identifying Actions for Sound Event Classification. CoRR abs/2104.12693 (2021) - [i66]Soham Deshmukh, Bhiksha Raj, Rita Singh:
Improving weakly supervised sound event detection with self-supervised auxiliary tasks. CoRR abs/2106.06858 (2021) - [i65]Hao Liang, Lulan Yu, Guikang Xu, Bhiksha Raj, Rita Singh:
Controlled AutoEncoders to Generate Faces from Voices. CoRR abs/2107.07988 (2021) - [i64]Yandong Wen, Weiyang Liu, Adrian Weller, Bhiksha Raj, Rita Singh:
SphereFace2: Binary Classification is All You Need for Deep Face Recognition. CoRR abs/2108.01513 (2021) - [i63]Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu:
The Right to Talk: An Audio-Visual Transformer Approach. CoRR abs/2108.03256 (2021) - [i62]Weiyang Liu, Yandong Wen, Bhiksha Raj, Rita Singh, Adrian Weller:
SphereFace Revived: Unifying Hyperspherical Face Recognition. CoRR abs/2109.05565 (2021) - [i61]Yandong Wen, Weiyang Liu, Bhiksha Raj, Rita Singh:
Self-Supervised 3D Face Reconstruction via Conditional Estimation. CoRR abs/2110.04800 (2021) - [i60]Raphaël Olivier, Bhiksha Raj:
Sequential Randomized Smoothing for Adversarially Robust Speech Recognition. CoRR abs/2112.03000 (2021) - 2020
- [c197]Muhammad Ahmed Shah, Bhiksha Raj:
Deriving Compact Feature Representations Via Annealed Contraction. ICASSP 2020: 2068-2072 - [c196]Rowland Chen, Roger B. Dannenberg, Bhiksha Raj, Rita Singh:
Artificial Creative Intelligence: Breaking the Imitation Barrier. ICCC 2020: 319-325 - [c195]Wenbo Zhao, Yang Gao, Shahan Ali Memon, Bhiksha Raj, Rita Singh:
Hierarchical Routing Mixture of Experts. ICPR 2020: 7900-7906 - [c194]Muhammad Ahmed Shah, Raphaël Olivier, Bhiksha Raj:
Exploiting Non-Linear Redundancy for Neural Model Compression. ICPR 2020: 9928-9935 - [c193]Muhammad A. Shah, Raphaël Olivier, Bhiksha Raj:
Optimal Strategies For Comparing Covariates To Solve Matching Problems. ICPR 2020: 10622-10628 - [c192]Hira Dhamyal, Shahan Ali Memon, Bhiksha Raj, Rita Singh:
The Phonetic Bases of Vocal Expressed Emotion: Natural versus Acted. INTERSPEECH 2020: 3451-3455 - [c191]Felix Kreuk, Yossi Adi, Bhiksha Raj, Rita Singh, Joseph Keshet:
Hide and Speak: Towards Deep Neural Networks for Speech Steganography. INTERSPEECH 2020: 4656-4660 - [c190]Hao Liang, Lulan Yu, Guikang Xu, Bhiksha Raj, Rita Singh:
Controlled AutoEncoders to Generate Faces from Voices. ISVC (1) 2020: 476-487 - [c189]Maria Joana Correia, Isabel Trancoso, Bhiksha Raj:
Automatic In-the-wild Dataset Annotation with Deep Generalized Multiple Instance Learning. LREC 2020: 3542-3550 - [c188]Muhammad Ahmed Shah, Khaled A. Harras, Bhiksha Raj:
Sherlock: A Crowd-sourced System For Automatic Tagging Of Indoor Floor Plans. MASS 2020: 594-602 - [c187]Jie Shao, Kai Hu, Changhu Wang, Xiangyang Xue, Bhiksha Raj:
Is normalization indispensable for training deep neural network? NeurIPS 2020 - [i59]Yuichiro Koyama, Tyler Vuong, Stefan Uhlich, Bhiksha Raj:
Exploring the Best Loss Function for DNN-Based Low-latency Speech Enhancement with Temporal Convolutional Networks. CoRR abs/2005.11611 (2020) - [i58]Yuichiro Koyama, Oluwafemi Azeez, Bhiksha Raj:
Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation. CoRR abs/2005.11612 (2020) - [i57]Yuichiro Koyama, Bhiksha Raj:
Exploring Optimal DNN Architecture for End-to-End Beamformers Based on Time-frequency References. CoRR abs/2005.12683 (2020) - [i56]Muhammad Ahmed Shah, Raphaël Olivier, Bhiksha Raj:
Exploiting Non-Linear Redundancy for Neural Model Compression. CoRR abs/2005.14070 (2020) - [i55]Soham Deshmukh, Bhiksha Raj, Rita Singh:
Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection. CoRR abs/2008.07085 (2020) - [i54]Yang Gao, Jiachen Lian, Bhiksha Raj, Rita Singh:
Detection and Evaluation of human and machine generated speech in spoofing attacks on automatic speaker verification systems. CoRR abs/2011.03689 (2020) - [i53]Jiachen Lian, Aiswarya Vinod Kumar, Hira Dhamyal, Bhiksha Raj, Rita Singh:
Mask Proxy Loss for Text-Independent Speaker Recognition. CoRR abs/2011.04491 (2020) - [i52]Ali Shahin Shamsabadi, Francisco Sepúlveda Teixeira, Alberto Abad, Bhiksha Raj, Andrea Cavallaro, Isabel Trancoso:
FoolHD: Fooling speaker identification by Highly imperceptible adversarial Disturbances. CoRR abs/2011.08483 (2020)
2010 – 2019
- 2019
- [j33]Annamaria Mesaros, Aleksandr Diment, Benjamin Elizalde, Toni Heittola, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen:
Sound Event Detection in the DCASE 2017 Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 992-1006 (2019) - [c186]M. Joana Correia, Isabel Trancoso, Bhiksha Raj:
In-the-Wild End-to-End Detection of Speech Affecting Diseases. ASRU 2019: 734-741 - [c185]Hira Dhamyal, Tianyan Zhou, Bhiksha Raj, Rita Singh:
Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification. ASRU 2019: 742-748 - [c184]Abelino Jiménez, Bhiksha Raj:
Time Signal Classification Using Random Convolutional Features. ICASSP 2019: 3592-3596 - [c183]Benjamin Elizalde, Shuayb Zarar, Bhiksha Raj:
Cross Modal Audio Search and Retrieval with Joint Embeddings Based on Text and Audio. ICASSP 2019: 4095-4099 - [c182]Daanish Ali Khan, Saquib Razak, Bhiksha Raj, Rita Singh:
Human Behaviour Recognition Using Wifi Channel State Information. ICASSP 2019: 7625-7629 - [c181]Yandong Wen, Mahmoud Al Ismail, Weiyang Liu, Bhiksha Raj, Rita Singh:
Disjoint Mapping Network for Cross-modal Matching of Voices and Faces. ICLR (Poster) 2019 - [c180]Anurag Kumar, Ankit Shah, Alexander G. Hauptmann, Bhiksha Raj:
Learning Sound Events from Webly Labeled Data. IJCAI 2019: 2772-2778 - [c179]Shahan Ali Memon, Wenbo Zhao, Bhiksha Raj, Rita Singh:
Neural Regression Trees. IJCNN 2019: 1-8 - [c178]Yandong Wen, Bhiksha Raj, Rita Singh:
Face Reconstruction from Voice using Generative Adversarial Networks. NeurIPS 2019: 5266-5275 - [i51]Felix Kreuk, Yossi Adi, Bhiksha Raj, Rita Singh, Joseph Keshet:
Hide and Speak: Deep Neural Networks for Speech Steganography. CoRR abs/1902.03083 (2019) - [i50]Wenbo Zhao, Yang Gao, Shahan Ali Memon, Bhiksha Raj, Rita Singh:
Hierarchical Routing Mixture of Experts. CoRR abs/1903.07756 (2019) - [i49]Chirag Nagpal, Rohan Sangave, Amit Chahar, Parth Shah, Artur Dubrawski, Bhiksha Raj:
Nonlinear Semi-Parametric Models for Survival Analysis. CoRR abs/1905.05865 (2019) - [i48]Yandong Wen, Rita Singh, Bhiksha Raj:
Reconstructing faces from voices. CoRR abs/1905.10604 (2019) - [i47]Daanish Ali Khan, Linhong Li, Ninghao Sha, Zhuoran Liu, Abelino Jimenez, Bhiksha Raj, Rita Singh:
Non-Determinism in Neural Networks for Adversarial Robustness. CoRR abs/1905.10906 (2019) - [i46]Shahan Ali Memon, Hira Dhamyal, Oren Wright, Daniel Justice, Vijaykumar Palat, William Boler, Yandong Wen, Bhiksha Raj, Rita Singh:
Detecting gender differences in perception of emotion in crowdsourced data. CoRR abs/1910.11386 (2019) - [i45]