default search action
ICASSP 2014: Florence, Italy
- IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014. IEEE 2014
SS1: Signal Processing for Big Data
- Nikos D. Sidiropoulos, Evangelos E. Papalexakis, Christos Faloutsos:
A parallel algorithm for big tensor decomposition using randomly compressed cubes (PARACOMP). 1-5 - Shahab Basiri, Esa Ollila, Visa Koivunen:
Fast and robust bootstrap method for testing hypotheses in the ICA model. 6-10 - Adam C. Wilkerson, Harish Chintakunta, Hamid Krim:
Computing persistent features in big data: A distributed dimension reduction approach. 11-15 - Konstantinos Slavakis, Georgios B. Giannakis:
Online dictionary learning from big data using accelerated stochastic approximation algorithms. 16-20 - Bubacarr Bah, Stephen Becker, Volkan Cevher, Baran Gozcu:
Metric learning with rank and sparsity constraints. 21-25 - André Lima Férrer de Almeida, Alain Y. Kibangou:
Distributed large-scale tensor decomposition. 26-30
SPTM-L1: Sampling Theory and Methods I
- John Murray-Bruce, Pier Luigi Dragotti:
Spatio-temporal sampling and reconstruction of diffusion fields induced by point sources. 31-35 - Céline Aubel, David Stotz, Helmut Bölcskei:
Super-resolution from short-time Fourier transform measurements. 36-40 - Dyonisius Dony Ariananda, Geert Leus:
Non-uniform sampling for compressive cyclic spectrum reconstruction. 41-45 - Volker Pohl, Çagkan Yapar, Holger Boche, Fanny Yang:
A phase retrieval method for signals in modulation-invariant spaces. 46-50 - Christopher Gilliam, Thierry Blu:
Fitting instead of annihilation: Improved recovery of noisy FRI signals. 51-55 - Holger Boche, Ullrich J. Mönich:
No-Go theorem for sampling-based signal processing. 56-60
SAM-L1: Radar Array Processing
- Augusto Aubry, Antonio De Maio, Goffredo Foglia, Danilo Orlando, Chengpeng Hao:
Enhanced radar detection and range estimation via oversampled data. 61-65 - Arnaud Breloy, Guillaume Ginolhac, Frédéric Pascal, Philippe Forster:
Robust estimation of the clutter subspace for a Low Rank heterogeneous noise under high Clutter to Noise Ratio assumption. 66-70 - Mohammad Mahdi Naghsh, Mojtaba Soltanalian, Petre Stoica, Mahmoud Modarres-Hashemi, Antonio De Maio, Augusto Aubry:
A max-min design of transmit sequence and receive filter. 71-75 - Itay Cnaan-On, Stewart J. Thomas, Matthew S. Reynolds, Jeffrey L. Krolik:
Multichannel radar backscatter communication and localization. 76-80 - Pietro Stinco, Maria Sabrina Greco, Fulvio Gini, Mario La Manna:
Compressed spectrum Sensing in Cognitive Radar systems. 81-85 - Wei Zhu, Jun Tang, Shuang Wan:
Angular resolution limit of two closely-spaced point sources based on information theoretic criteria. 86-90
SLTC-L1: Speaker diarization
- Rong Zheng, Ce Zhang, Shanshan Zhang, Bo Xu:
Variational Bayes based I-vector for speaker diarization of telephone conversations. 91-95 - Sree Harsha Yella, Hervé Bourlard:
Information bottleneck based speaker diarization of meetings using non-speech as side information. 96-100 - Ashtosh Sapru, Sree Harsha Yella, Hervé Bourlard:
Improving speaker diarization using social role information. 101-105 - Alexey Sholokhov, Timur Pekhovsky, Oleg Kudashev, Andrey Shulipa, Tomi Kinnunen:
Bayesian analysis of similarity matrices for speaker diarization. 106-110 - Srikanth R. Madikeri, Hervé Bourlard:
Filterbank slope based features for speaker diarization. 111-115 - Paul Gay, Elie Khoury, Sylvain Meignier, Jean-Marc Odobez, Paul Deléglise:
A conditional random field approach for audio-visual people diarization. 116-120
SLTC-L2: Spoken Language Understanding I
- Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Classification of clean and noisy bilingual movie audio for speech-to-speech translation corpora design. 121-125 - Mohamed Morchid, Richard Dufour, Pierre-Michel Bousquet, Mohamed Bouallegue, Georges Linarès, Renato De Mori:
Improving dialogue classification using a topic space representation and a Gaussian classifier based on the decision rule. 126-130 - Seokhwan Kim, Rafael E. Banchs, Haizhou Li:
Wikipedia-based Kernels for dialogue topic tracking. 131-135 - Puyang Xu, Ruhi Sarikaya:
Contextual domain classification in spoken language understanding systems using recurrent neural network. 136-140 - Joris Pelemans, Kris Demuynck, Hugo Van hamme, Patrick Wambacq:
Coping with language data sparsity: Semantic head mapping of compound words. 141-145 - Benoît Favre, Mickael Rouvier, Frédéric Béchet:
Reranked aligners for interactive transcript correction. 146-150
IVMSP-L1: Image Quality Assessment
- Tanaya Guha, Ehsan Nezhadarya, Rabab K. Ward:
Learning sparse models for image quality assessment. 151-155 - Yuanhao Zhai, David L. Neuhoff, Thrasyvoulos N. Pappas:
Subjective similarity evaluation for scenic bilevel images. 156-160 - Joumana Farah, Marie-Rita Hojeij, Jihad Chrabieh, Frédéric Dufaux:
Full-reference and reduced-reference quality metrics based on SIFT. 161-165 - Dohyoung Lee, Konstantinos N. Plataniotis:
Towards a novel perceptual color difference metric using circular processing of hue components. 166-170 - Won-Dong Jang, Jae-Young Sim, Chang-Su Kim:
GEQM: A quality metric for gray-level edge maps based on structural matching. 171-174 - Takahiro Ogawa, Miki Haseyama:
Missing intensity restoration via perceptually optimized subspace projection based on entropy component analysis. 175-179
SLTC-P1: Deep Neural Networks in Speech Recognition I
- Simon Wiesler, Alexander Richard, Ralf Schlüter, Hermann Ney:
Mean-normalized stochastic gradient for large-scale deep learning. 180-184 - Yu Zhang, Ekapol Chuangsuwanich, James R. Glass:
Extracting deep neural network bottleneck features using low-rank matrix factorization. 185-189 - László Tóth:
Combining time- and frequency-domain convolution in convolutional neural network-based phone recognition. 190-194 - Shilin Liu, Khe Chai Sim:
On combining DNN and GMM with unsupervised speaker adaptation for robust automatic speech recognition. 195-199 - Bo Li, Khe Chai Sim:
An ideal hidden-activation mask for deep neural networks based noise-robust speech recognition. 200-204 - Po-Sen Huang, Haim Avron, Tara N. Sainath, Vikas Sindhwani, Bhuvana Ramabhadran:
Kernel methods match Deep Neural Networks on TIMIT. 205-209 - Vijayaditya Peddinti, Tara N. Sainath, Shay Maymon, Bhuvana Ramabhadran, David Nahamoo, Vaibhava Goel:
Deep Scattering Spectrum with deep neural networks. 210-214 - Xiaohui Zhang, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
Improving deep neural network acoustic models using generalized maxout networks. 215-219 - Ching-feng Yeh, Lin-Shan Lee:
Transcribing code-switched bilingual lectures using deep neural networks with unit merging in acoustic modeling. 220-224 - Andrew W. Senior, Ignacio López-Moreno:
Improving DNN speaker independence with I-vector inputs. 225-229 - Michiel Bacchiani, David Rybach:
Context dependent state tying for speech recognition using deep neural network acoustic models. 230-234 - Frank Seide, Hao Fu, Jasha Droppo, Gang Li, Dong Yu:
On parallelizability of stochastic gradient descent for speech DNNS. 235-239 - Wei Deng, Yanmin Qian, Yuchen Fan, Tianfan Fu, Kai Yu:
Stochastic data sweeping for fast DNN training. 240-244 - Tianxing He, Yuchen Fan, Yanmin Qian, Tian Tan, Kai Yu:
Reshaping deep neural network for fast decoding by node-pruning. 245-249
SLTC-P2: Stochastic Speech Synthesis
- Chung-Hsien Wu, Yi-Chin Huang, Shih-Lun Lin, Chia-Ping Chen:
Natural speech synthesis based on hybrid approach with candidate expansion and verification. 250-254 - Bajibabu Bollepalli, Jérôme Urbain, Tuomo Raitio, Joakim Gustafson, Hüseyin Çakmak:
A comparative evaluation of vocoding techniques for HMM-based laughter synthesis. 255-259 - Thomas Drugman, Tuomo Raitio:
Excitation modeling for HMM-based speech synthesis: Breaking down the impact of periodic and aperiodic components. 260-264 - Kazuhiro Nakamura, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
HMM-Based singing voice synthesis and its application to Japanese and English. 265-269 - Nirmesh J. Shah, Bhavik B. Vachhani, Hardik B. Sailor, Hemant A. Patil:
Effectiveness of PLP-based phonetic segmentation for speech synthesis. 270-274 - Florian Eyben, Yannis Agiomyrgiannakis:
A frequency-weighted post-filtering transform for compensation of the over-smoothing effect in HMM-based speech synthesis. 275-279 - Vincent Wan, Javier Latorre, Kayoko Yanagisawa, Mark J. F. Gales, Yannis Stylianou:
Cluster adaptive training of average voice models. 280-284 - Pierre Lanchantin, Mark J. F. Gales, Simon King, Junichi Yamagishi:
Multiple-average-voice-based speech synthesis. 285-289 - Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A postfilter to modify the modulation spectrum in HMM-based speech synthesis. 290-294 - Ran Zhang, Jianhua Tao, Ya Li, Zhengqi Wen:
A novel hybrid mandarin speech synthesis system using different base units for model training and concatenation. 295-299 - Vassilios Tsiaras, Ranniery Maia, Vassilios Diakoloukas, Yannis Stylianou, Vassilios Digalakis:
Linear dynamical models in speech synthesis. 300-304
SPTM-P1: Time Frequency Analysis, System Modelling and Estimation
- Roberto F. Leonarduzzi, Herwig Wendt, Stéphane Jaffard, Stéphane G. Roux, María E. Torres, Patrice Abry:
Extending multifractal analysis to negative regularity: P-exponents and P-leaders. 305-309 - Rodney A. Kennedy, Zubair Khalid, Parastoo Sadeghi:
Efficient kernel-based formulations of spatio-spectral and related transformations on the 2-sphere. 310-314 - Thomas Oberlin, Sylvain Meignen, Valérie Perrier:
The fourier-based synchrosqueezing transform. 315-319 - Douglas David Baptista de Souza, Jocelyn Chanussot, Anne-Catherine Favre, Pierre Borgnat:
A new nonparametric method for testing stationarity based on trend analysis in the time marginal distribution. 320-324 - Douglas David Baptista de Souza, Jocelyn Chanussot, Anne-Catherine Favre:
On selecting relevant intrinsic mode functions in empirical mode decomposition: An energy-based approach. 325-329 - Nouha Jaoua, François Septier, Emmanuel Duflos, Philippe Vanheeghe:
State and impulsive time-varying measurement noise density estimation in nonlinear dynamic systems using Dirichlet Process Mixtures. 330-334 - Daniele Angelosante:
Sparse regressions for joint segmentation and linear prediction. 335-339 - Scott Wisdom, Les Atlas, James Pittore:
Extending coherence time for analysis of modulated random processes. 340-344 - Moeness G. Amin, Yimin D. Zhang, Branka Jokanovic:
Time-frequency signature reconstruction from random observations using multiple measurement vectors. 345-349 - George-Othon Glentis, Andreas Jakobsson, Kostas Angelopoulos:
Block-recursive IAA-based spectral estimates with missing samples using data interpolation. 350-354
SPTM-P2: Signal and System Modelling, and Estimation I
- Zhenhua Yu, Robert John Baxley, G. Tong Zhou:
Distributions of upper PAPR and lower PAPR of OFDM signals in visible light communications. 355-359 - P. P. Vaidyanathan, Piya Pal:
The farey-dictionary for sparse representation of periodic signals. 360-364 - Ayush Bhandari, Achuta Kadambi, Ramesh Raskar:
Sparse Linear Operator identification without sparse regularization? Applications to mixed pixel problem in Time-of-Flight/Range imaging. 365-369 - Bogdan Dumitrescu, Bogdan C. Sicleru:
Optimization with sums of exponentials and applications. 370-374 - Syed Ahmed Pasha, Victor Solo:
Topology identification of dynamic point process networks. 375-378 - Boqiang Huang, Angela Kunoth:
A unique polar representation of the hyperanalytic signal. 379-383 - Scott C. Douglas, Danilo P. Mandic:
Autoconvolution and panorama: Augmenting second-order signal analysis. 384-388 - Tim Schwerdtfeger, Anton Kummert:
A multidimensional signal processing approach to Wave Digital Filters with topology-related delay-free loops. 389-393 - Mario H. Castañeda, Josef A. Nossek:
Estimation of rank deficient covariance matrices with Kronecker structure. 394-398 - Neha Thakre, Christian Debes, Roel Heremans, Abdelhak M. Zoubir:
Anomaly detection for dike monitoring using system identification. 399-403 - Nikola Rozic, Dinko Begusic, Josko Radic:
Noise squared norm in OFDM systems interfered by impulse noise. 404-408 - Magnus Mossberg:
Gaussian process parameter estimation using zero crossing data from wireless sensors. 409-413 - Tirza Routtenberg, Lang Tong:
The Cramér-Rao bound for estimation-after-selection. 414-418 - Paulo Jorge S. G. Ferreira, Armando J. Pinho:
Compression-based normal similarity measures for DNA sequences. 419-423
SPCOM-P1: Coordinated transmission in heterogeneous networks
- Anh H. Nguyen, Yichao Huang, Bhaskar D. Rao:
Order statistics based CDF scheduling methods in multiuser heterogeneous systems. 424-428 - Yair Noam, Amir Leshem, Hagit Messer:
Robust spectrum management with incomplete information. 429-433 - Jingran Lin, Yubai Li, Qicong Peng:
Joint power allocation, base station assignment and beamformer design for an uplink SIMO heterogeneous network. 434-438 - Nima Namvar, Walid Saad, Behrouz Maham, Stefan Valentin:
A context-aware matching game for user association in wireless small cell networks. 439-443 - Omid Semiari, Walid Saad, Stefan Valentin, Mehdi Bennis, Behrouz Maham:
Matching theory for priority-based cell association in the downlink of wireless small cell networks. 444-448 - Nassar Ksairi, Philippe Ciblat, Christophe J. Le Martret:
Optimal resource allocation for type-II HARQ based OFDMA ad hoc networks under individual rate and power constraints. 449-453 - Ruoyu Sun, Zhi-Quan Luo:
Globally optimal joint uplink base station association and power control for max-min fairness. 454-458 - Shixin Luo, Rui Zhang, Teng Joon Lim:
Coordinated downlink and uplink user association and beamforming for energy minimizationincloud radio access network. 459-463 - Wenhao Wu, Kun Wang, Zhi Ding, Chengshan Xiao:
Cooperative multi-cell MIMO downlink precoding for finite-alphabet inputs. 464-468 - Jarkko Kaleva, Randall Berry, Michael L. Honig, Antti Tölli, Markku J. Juntti:
Decentralized sum MSE minimization for coordinated multi-point transmission. 469-473 - Songze Li, Emrah Akyol, Urbashi Mitra:
Power allocation for Gaussian multiple access channel with noisy cooperative links. 474-478 - Rasmus Brandt, Emil Björnson, Mats Bengtsson:
Weighted sum rate optimization for multicell MIMO systems with hardware-impaired transceivers. 479-483 - Yao Cheng, Peng Li, Martin Haardt:
Coordinated beamforming in MIMO FBMC/OQAM systems. 484-488 - Qianrui Li, David Gesbert, Nicolas Gresset:
Joint precoding over a master-slave coordination link. 489-493
IVMSP-P1: Face Recognition
- Meriem Bendris, Benoît Favre, Delphine Charlet, Géraldine Damnati, Rémi Auguste:
Multiple-view constrained clustering for unsupervised face identification in TV-broadcast. 494-498 - Ramya Srinivasan, Abhishek Nagar, Anshuman Tewari, Donato Mitrani, Amit K. Roy-Chowdhury:
Face recognition based on SIGMA sets of image features. 499-503 - Yinyan Jiang, Yong Wu, Weifeng Li, Longbiao Wang, Qingmin Liao:
Log-domain polynomial filters for illumination-robust face recognition. 504-508 - Muhammad Khurram Shaikh, Muhammad Atif Tahir, Ahmed Bouridane:
Probabilistic Linear Discriminant Analysis for intermodality face recognition. 509-513 - Mohamed Anouar Borgi, Demetrio Labate, Maher El'arbi, Chokri Ben Amar:
Regularized Shearlet Network for face recognition using single sample per person. 514-518 - Jun Yi, Fei Su:
Histogram of Log-Gabor Magnitude Patterns for face recognition. 519-523 - Hehua Chi, Yu Hen Hu:
Facial image de-identification using identiy subspace decomposition. 524-528 - Cristina Bordei, Pascal Bourdon, Bertrand Augereau, Philippe Carré:
Polynomial based texture representation for facial expression recognition. 529-533
IVMSP-P2: Stereoscopic and 3D Processing
- Kuang-Tsu Shih, Chen-Yu Hsu, Cheng-Chieh Yang, Homer H. Chen:
Analysis of the effect of calibration error on light field super-resolution rendering. 534-538 - Yun Li, Mårten Sjöström, Roger Olsson, Ulf Jennehag:
Efficient intra prediction scheme for light field image compression. 539-543 - Xuyuan Xu, Lai-Man Po, Chun-Ho Cheung, Litong Feng, Kwok-Wai Cheung, Chi-Wang Ting, Ka-Ho Ng:
Adaptive block truncation filter for MVC depth image enhancement. 544-548 - Wenfei Jiang, Tao Luo, Fan Zhang, Jiang Tian, Pei Luo, Kangying Cai:
Generic 2D/3D smoothing via regional variation. 549-553 - Rodrigo Schramm, Cláudio Rosito Jung:
Temporally coherent stereo matching using kinematic constraints. 554-558 - Mitra Damghanian, Roger Olsson, Mårten Sjöström:
Performance analysis in Lytro camera: Empirical and model based approaches to assess refocusing quality. 559-563 - Zhen Zhang, Xiao Ai, C. K. Chan, Naim Dahnoun:
An efficient algorithm for pothole detection using stereo vision. 564-568 - Stuart Woolford, Ian S. Burnett:
Toward a one shot multi-projector profilometry system for full field of view object measurement. 569-573 - B. Budianto, Daniel Pak-Kong Lun:
Efficient 3-dimensional model reconstruction based on marker encoded fringe projection profilometry. 574-578 - Bruno Macchiavello, Camilo C. Dorea, Edson M. Hung, Gene Cheung, Ivan V. Bajic:
Low-saliency prior for disocclusion hole filling in DIBR-synthesized images. 579-583 - Zucheul Lee, Truong Q. Nguyen:
Hierarchical depth processing with adaptive search range and fusion. 584-588 - Iana Iatsun, Mohamed-Chaker Larabi, Christine Fernandez-Maloigne:
Using monocular depth cues for modeling stereoscopic 3D saliency. 589-593 - G. C. V. Perera, D. Varuna S. X. De Silva, Ahmet M. Kondoz, Safak Dogan:
An improved model of binocular energy calculation for full-reference stereoscopic image quality assessment. 594-598 - Richard Rzeszutek, Dimitrios Androutsos:
Label propagation through edge-preserving filters. 599-603
AASP-P1: Microphone Array Processing I, Music Analysis and Synthesis I
- Yoichi Haneda, Ken'ichi Furuya, Shoichi Koyama, Kenta Niwa:
Close-talking spherical microphone array using sound pressure interpolation based on spherical harmonic expansion. 604-608