ICME 2009: New York, NY, USA
- Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, ICME 2009, June 28 - July 2, 2009, New York City, NY, USA. IEEE 2009, ISBN 978-1-4244-4291-1
Compression and Coding I
- Xiulian Peng, Feng Wu, Jizheng Xu:
Directional filtering transform. 1-4
- Derek Pang, Xiaoyu Xiu, Jie Liang:
Multiview video coding using projective rectification-based view extrapolation and synthesis bias correction. 5-8
- Pei-Kuei Tsung, Wei-Yin Chen, Li-Fu Ding, Chuan-Yung Tsai, Tzu-Der Chuang, Liang-Gee Chen:
Single-iteration full-search fractional motion estimation for quad full HD H.264/AVC encoding. 9-12
- Xiaoyu Xiu, Jie Liang:
Rate-distortion analysis of rectification-based view interpolation for multiview video coding. 13-16
- Weipeng Ma, Shuyuan Yang, Li Gao, Chaoke Pei, Shefeng Yan:
Fast mode selection scheme for H.264/AVC inter prediction based on statistical learning method. 17-20
- Xiaobing Lu, Chuangbai Xiao:
A new strategy to predict the search range in H.264/AVC. 21-24
Media Conversion and Transcoding
- Qiu Shen, Houqiang Li, Feng Wu:
Content-based hierarchical motion description for multiple video adaptation. 25-28
- Hui Liu, Ye-Kui Wang, Ying Chen, Houqiang Li:
Spatial transcoding from Scalable Video Coding to H.264/AVC. 29-32
- Ka-Man Wong, Lai-Man Po, Kwok-Wai Cheung:
Block-Matching Translation and Zoom Motion-Compensated Prediction. 33-36
- Yiqing Huang, Qin Liu, Takeshi Ikenaga:
Content aware configurable architecture for H.264/AVC integer motion estimation engine. 37-40
- Ling Tian, Xin Yin, Yu Sun, Shixin Sun:
Accurate bit prediction for intra-only rate control. 41-44
- Ling Tian, Yu Sun, Shixin Sun:
Frame complexity prediction for H.264/AVC rate control. 45-48
Compression and Coding II
- Xiaoyan Sun, Feng Wu:
Fractional compensation for spatial scalable video coding. 49-52
- Shuixian Chen, Ruimin Hu, Shuhua Zhang:
Estimating spatial cues for audio coding in MDCT domain. 53-56
- Jong-Seok Lee, Francesca De Simone, Touradj Ebrahimi:
Video coding based on audio-visual attention. 57-60
- Chen-Kuo Chiang, Shang-Hong Lai:
Fast multi-reference motion estimation via statistical learning for H.264/AVC. 61-64
- Boxin Shi, Yangxi Li, Lin Liu, Chao Xu:
Block-based color correction algorithm for multi-view video coding. 65-68
- Sangkwon Na, Chong-Min Kyung:
A Multi-layer motion estimation scheme for spatial scalability in H.264/AVC scalable extension. 69-72
Image Processing
- Jianping Qiao, Ju Liu, Xiangzeng Meng, Wan-Chi Siu:
Kurtosis-based super-resolution algorithm. 73-76
- Shing Fat Tu, Oscar C. Au, Yannan Wu, Enming Luo, Chi Ho Yeung:
A robust spatial-temporal line-warping based deinterlacing method. 77-80
- Zhiwei Xiong, Yonghua Zhang, Xiaoyan Sun, Feng Wu:
Fast directional image interpolation with difference projection. 81-84
- Hsuan-Ying Chen, Jin-Jang Leou:
A new IKONOS imagery fusion approach using particle swarm optimization. 85-88
- Yi Yang, Oscar C. Au, Lu Fang, Xing Wen, Weiran Tang:
Perceptual compressive sensing for image signals. 89-92
- Ke Zhang, Jiangbo Lu, Gauthier Lafruit, Rudy Lauwereins, Luc Van Gool:
Accurate and efficient stereo matching with robust piecewise voting. 93-96
Compression and Coding III
- Demetrios Cantzos, Athanasios Mouchtaris, Chris Kyriakakis:
Bandwidth extension of low bitrate compressed audio based on statistical conversion. 97-100
- João Ascenso, Fernando Pereira:
Low complexity intra mode selection for efficient distributed video coding. 101-104
- Matthieu Urvoy, Nathalie Cammas, Stéphane Pateux, Olivier Déforges, Marie Babel, Muriel Pressigout:
Motion tubes for the representation of image sequences. 105-108
- En-Hui Yang, Longji Wang:
Entropy constrained color splitting for palette images. 109-112
- Zongju Peng, Gangyi Jiang, Mei Yu:
A fast multiview video coding algorithm based dynamic multi-threshold. 113-116
Multimedia Signal Processing
- Mehrez Souden, Zicheng Liu:
Optimal joint linear acoustic echo cancelation and blind source separation in the presence of loudspeaker nonlinearity. 117-120
- Liang Wang, Eliathamby Ambikairajah, Eric H. C. Choi:
Robust language identification based on fused phonotactic information with MLKSFM pre-classifier. 121-124
- Meng-Che Chuang, Yi-Nung Liu, Tsung-Huang Chen, Shao-Yi Chien:
Color filter array demosaicking using joint bilateral filter. 125-128
- Soo-Chang Pei, Yu-Zhe Hsiao, Chia-Ying Lee:
Empirical mode decomposition descriptor for plane closed curves. 129-132
- Dau-Cheng Lyu, Ren-Yuan Lyu, Ming-Tat Ko:
Acoustic modeling using an extended phone set considering cross-lingual pronunciation variations. 133-136
- Anan Liu, Jinghao Fei, Jianping Fan, Lin Pang, Yongdong Zhang, Jintao Li:
Confusion network based Video OCR post-processing approach. 137-140
Multimedia Coding and Processing
- Jinrong Zhang, Houqiang Li, Chang Wen Chen:
Distributed coding techniques for onboard lossless compression of multispectral images. 141-144
- Zhengyi Luo, Li Song, Shibao Zheng:
Offset based leaky prediction for error resilient ROI coding. 145-148
- Daniel Peintner, Harald Kosch, Jörg Heuer:
Efficient XML Interchange for rich internet applications. 149-152
- Anthony Griffin, Toni Hirvonen, Athanasios Mouchtaris, Panagiotis Tsakalides:
Encoding the sinusoidal model of an audio signal using compressed sensing. 153-156
- Ronggang Wang, Yongbing Zhang, Yuan Dong, Haila Wang:
Partition-level adaptive interpolation filter for video coding. 157-160
- Wei-Cheng Tai, Gwo-Long Li, Tian-Sheuan Chang:
Bandwidth-rate-distortion optimized motion estimation. 161-164
- Liping Wang, Lai-Man Po, Y. M. S. Uddin, Ka-Man Wong, Shenyuan Li:
A novel weighted cross prediction for H.264 intra coding. 165-168
- Joachim Schenk, Frank Wallhoff, Gerhard Rigoll:
Novel VQ with constraints on the quantization error distribution. 169-172
- Anmin Liu, Weisi Lin, Fan Zhang:
Lossless video compression with optimal compression plane determination. 173-176
- Murat B. Badem, Warnakulasuriya Anil Chandana Fernando, José Luis Martínez, Pedro Cuenca:
An iterative side information refinement technique for transform domain Distributed Video Coding. 177-180
- Qiwei Liu, Houqiang Li, Yan Song, Chang Wen Chen:
Distributed multiview video coding using the fusion of triple side information. 181-184
- Nükhet Özbek, A. Murat Tekalp:
Quality Layers in scalable multi-view video coding. 185-188
- Jiaying Liu, Yongjin Cho, Zongming Guo:
Frame-based bit allocation for spatial scalability in H.264/SVC. 189-192
- Jaemoon Kim, Jungsoo Kim, Chong-Min Kyung:
A lossless embedded compression algorithm for high definition video coding. 193-196
- Keng-Hsien Huang, Han-Ru Chen, Shao-Yi Chien:
Algorithm and architecture design of multi-layer video coding engine with hybrid scheme for wireless video links. 197-200
- S. Mohamad R. Soroushmehr, Shadrokh Samavi, Shahram Shirani:
Block matching algorithm based on local codirectionality of blocks. 201-204
- Tien Huu Vu, Supavadee Aramvith:
An error resilience technique based on FMO and error propagation for H.264 video coding in error-prone channels. 205-208
- Yixin Gao, Guizhong Liu:
Biorthogonal frequency-varying modulated lapped transform. 209-212
- Tong Gan, Bart Masschelein, Carolina Blanch, Antoine Dejonghe, Kristof Denolf:
Rate-distortion-complexity performance analysis of the SVC decoder. 213-216
- Yannan Wu, Oscar C. Au, Enming Luo, Dennis Tu, Leo Yeung:
A novel deringing method based on MAP image restoration. 217-220
- Eduardo Martínez-Enríquez, Fernando Díaz-de-María:
A hierarchical classification-based approach to Inter Mode Decision in H.264/AVC. 221-224
- Byung-Gyu Kim, Krishna Reddy, Kee-Wook Lim:
Dynamic search range control algorithm for inter-frame coding in scalable video coding. 225-228
- Weiran Tang, Oscar C. Au, Xing Wen, Yi Yang, Lu Fang:
LMMSE frequency merging for demosaicking. 229-232
- Xin Jin, Satoshi Goto, King Ngi Ngan:
Composite modeling of optical flow for artifacts reduction. 233-236
- Cihat Goktug Gurler, Anil Aksay, Gozde Bozdagi Akar, A. Murat Tekalp:
Multi-threaded architectures and benchmark tests for real-time multi-view video decoding. 237-240
- Chia-Ming Cheng, Shu-Jyuan Lin, Shang-Hong Lai, Jenq Kuen Lee:
Efficient multiple virtual view generation based on reduced depth stereo image for advanced autostereoscopic displays. 241-244
- Asha Vijayakumar:
Noise suppression using one-regular Unimodular filterbanks. 245-249
Content Analysis and Synthesis I
- Kouji Miyazato, Akisato Kimura, Shigeru Takagi, Junji Yamato:
Real-time estimation of human visual attention with dynamic Bayesian network and MCMC-based particle filter. 250-257
- Xiaoyu Zhang, Jian Cheng, Changsheng Xu, Hanqing Lu, Songde Ma:
Multi-view multi-label active learning for image classification. 258-261
- Byung Tae Oh, C.-C. Jay Kuo:
New PAR/NL scheme for stochastic texture interpolation. 262-265
- Zohra Saidane, Christophe Garcia, Jean-Luc Dugelay:
The image Text Recognition Graph (iTRG). 266-269
- Liang Shi, Jinqiao Wang, Lei Xu, Hanqing Lu, Changsheng Xu:
Context saliency based image summarization. 270-273
- Jinqiao Wang, Ling-Yu Duan, Bo Wang, Shi Chen, Yi Ouyang, Jing Liu, Hanqing Lu, Wen Gao:
Linking video ads with product or service information by web search. 274-277
Human Face / Emotion Analysis and Synthesis
- Yifan Zhang, Changsheng Xu, Jian Cheng, Hanqing Lu:
Naming faces in films using hypergraph matching. 278-281
- Chung-Chun Wang, Yi-Chueh Su, Chiou-Ting Hsu, Chia-Wen Lin, Hong-Yuan Mark Liao:
Bayesian age estimation on face images. 282-285
- Ping-Han Lee, Yi-Ping Hung:
Face synthesis using Facial Trait Code and its application to creating suspect's physical profiles. 286-289
- Xiang Ma, Junping Zhang, Chun Qi:
Position-based face hallucination method. 290-293
- Hao Tang, Stephen M. Chu, Mark Hasegawa-Johnson, Thomas S. Huang:
Emotion recognition from speech via boosted Gaussian mixture models. 294-297
- Stéphanie Lefevre, Jean-Marc Odobez:
Structure and appearance features for robust 3D facial actions tracking. 298-301
Content Analysis and Synthesis II
- Hua-Tsung Chen, Wen-Jiin Tsai, Suh-Yin Lee:
Stance-based strike zone shaping and visualization in broadcast baseball video: Providing reference for pitch location positioning. 302-305
- Mandis Beigi, Shih-Fu Chang, Shahram Ebadollahi, Dinesh C. Verma:
Multi-scale temporal segmentation and outlier detection in sensor networks. 306-309
- Yangyu Tao, Lin Liang, Yingqing Xu:
Learning probabilistic structure to group image edges for object extraction. 310-313
- Jie Xiao, Yun Fu, Yijuan Lu, Qi Tian:
Refining image retrieval using one-class classification. 314-317
- Gwang-Gook Lee, Hyeong-ki Kim, Whoi-Yul Kim:
Highlight generation for basketball video using probabilistic excitement. 318-321
Feature Extraction and Representation I
- Ke Gao, Shouxun Lin, Yongdong Zhang, Sheng Tang, Dongming Zhang:
Logo detection based on spatial-spectral saliency and partial spatial context. 322-329
- Sarah De Bruyne, Chris Poppe, Steven Verstockt, Peter Lambert, Rik Van de Walle:
Estimating motion reliability to improve moving object detection in the H.264/AVC domain. 330-333
- Alberto Del Bimbo, Walter Nunziati, Pietro Pala:
David: Discriminant analysis for verification of monuments in image data. 334-337
- EnShuo Tsau, Namgook Cho, C.-C. Jay Kuo:
Fundamental frequency estimation for music signals with modified Hilbert-Huang transform (HHT). 338-341
- Hon-Keat Pong, Ping Xue, Qi Tian:
Visual event detection using orientation histograms with feature point trajectory information. 342-345
- Nejla Essaddi, Mohamed Hamdi, Noureddine Boudriga:
An image-based tracking algorithm for hybrid Wireless Sensor Networks using epipolar geometry. 346-349
Content Understanding and Knowledge Molding I
- Dong Liu, Meng Wang, Linjun Yang, Xian-Sheng Hua, HongJiang Zhang:
Tag quality improvement for social images. 350-353
- Jinjun Wang, Yihong Gong:
Normalizing multi-subject variation for drivers' emotion recognition. 354-357
- Jie Yang, Jian Cheng, Hanqing Lu:
Human activity recognition based on the blob features. 358-361
- Nicholas Vretos, Nikos Nikolaidis, Ioannis Pitas:
A perceptual hashing algorithm using latent dirichlet allocation. 362-365
- Zhixin Li, Xi Liu, Zhiping Shi, Zhongzhi Shi:
Learning image semantics with latent aspect model. 366-369
- Dinesh Babu Jayagopi, Bogdan Raducanu, Daniel Gatica-Perez:
Characterizing conversational group dynamics using nonverbal behaviour. 370-373
Feature Extraction and Representation II
- Hairong Lv, Wen Jun Yin, Jin Dong:
Off-line signature verification based on deformable grid partition and Hidden Markov Models. 374-377
- Bailan Feng, Juan Cao, Shouxun Lin, Yongdong Zhang, Kun Tao:
Motion region-based trajectory analysis and re-ranking for video retrieval. 378-381
- Emiru Tsunoo, George Tzanetakis, Nobutaka Ono, Shigeki Sagayama:
Audio genre classification using percussive pattern clustering combined with timbral features. 382-385
- Qiong Liu, Hironori Yano, Don Kimber, Chunyuan Liao, Lynn Wilcox:
High accuracy and language independent document retrieval with a Fast Invariant Transform. 386-389
- Li Sun, Guizhong Liu, Xueming Qian, Danping Guo:
A novel text detection and localization method based on corner response. 390-393
- Nikolaos Gkalelis, Nikos Nikolaidis, Ioannis Pitas:
View independent human movement recognition from multi-view video exploiting a circular invariant posture representation. 394-397
Content Understanding and Knowledge Molding II
- Fuxiang Lu, Xiaokang Yang, Rui Zhang, Songyu Yu:
Image classification based on pyramid histogram of topics. 398-401
- Kun Tao, Shouxun Lin, Yongdong Zhang:
KNSC: A novel local classification method for multimedia semantic analysis. 402-405
- Tongwei Ren, Yan Liu, Gangshan Wu:
Image retargeting based on global energy optimization. 406-409
- Deepak S. Turaga, Rong Yan:
Resource-adaptive semantic concept detection using ensemble classifiers. 410-413
- Stefan Romberg, Eva Hörster, Rainer Lienhart:
Multimodal pLSA on visual features and tags. 414-417
- Lin Lin, Mei-Ling Shyu, Guy Ravitz, Shu-Ching Chen:
Video semantic concept detection via associative classification. 418-421
Feature Extraction and Representation III
- Promiti Dutta, Alexander Haubold:
Audio-based classification of speaker characteristics. 422-425
- Yijie Wang, Zhongding Jiang:
Distance measurement in panoramic video. 426-429
- Nicolas Hervé, Nozha Boujemaa:
Visual word pairs for automatic image annotation. 430-433
- Zhihua Xu, Hefei Ling, Fuhao Zou, Zhengding Lu, Ping Li, Tianjiang Wang:
Fast and robust video copy detection scheme using full DCT coefficients. 434-437
- Chia-Te Liao, Yu-Lin Wang, Shang-Hong Lai, Chiou-Ting Hsu:
A novel color-context descriptor and its applications. 438-441
- Jia Li, Yonghong Tian, Tiejun Huang, Wen Gao:
A dataset and evaluation methodology for visual saliency in video. 442-445
Content Understanding and Knowledge Molding III
- C. Gregor v. d. Boogaart, Rainer Lienhart:
Note onset detection for the transcription of polyphonic piano music. 446-449
- Keiji Yanai:
Web image gathering with region-based bag-of-features and multiple instance learning. 450-453
- Yi Ouyang, Ming Tang, Shi Chen, Jinqiao Wang, Hanqing Lu, Songde Ma:
Learning local features for object categorization. 454-457
- Lei Wang, Chng Eng Siong, Haizhou Li:
Efficient sparse self-similarity matrix construction for repeating sequence detection. 458-461
- Chunjie Zhang, Jing Liu, Hanqing Lu, Songde Ma:
Web image mining using concept sensitive Markov stationary features. 462-465
- Shiva Sundaram, Shrikanth S. Narayanan:
A divide-and-conquer approach to Latent Perceptual Indexing of audio for large Web 2.0 applications. 466-469
Audio Analysis and Synthesis
- Soo-Chang Pei, Nien-Teh Hsu:
A novel music similarity measure system based on instrumentation analysis. 470-473
- Lamberto Ballan, Alessio Bazzica, Marco Bertini, Alberto Del Bimbo, Giuseppe Serra:
Deep networks for audio event classification in soccer videos. 474-477
- Hiromi Ishizaki, Keiichiro Hoashi, Yasuhiro Takishima:
Autocorrelation-based beat estimation adaptive to drastic tempo change in a song. 478-481
- Yongwei Zhu, Hui Li Tan, Susanto Rahardja:
Drum loop pattern extraction from polyphonic music audio. 482-485
- Zhi Zeng, Shuwu Zhang, Heping Li, Wei Liang, Haibo Zheng:
A novel approach to musical genre classification using probabilistic latent semantic analysis model. 486-489
- Jessie Xin Zhang, Stephen Brooks, Jacqueline L. Whalley:
Audio classification based on adaptive partitioning. 490-493
Audio / Video / Image Segmentation
- Stephen M. Chu, Hao Tang, Thomas S. Huang:
Locality preserving speaker clustering. 494-497
- Manuel Reyes-Gomez, Nebojsa Jojic:
Speech separation by efficient combinatorial decoding of speech mixtures. 498-505
- Milena Bueno Pereira Carneiro, Antônio Cláudio Paschoarelli Veiga, Fernando Cordeiro de Castro, Edna Lúcia Flôres, Gilberto Arantes Carrijo:
Application of evolutionary algorithms for iris localization. 506-509