default search action
17th CIKM 2008: Napa Valley, California, USA
- James G. Shanahan, Sihem Amer-Yahia, Ioana Manolescu, Yi Zhang, David A. Evans, Aleksander Kolcz, Key-Sun Choi, Abdur Chowdhury:
Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM 2008, Napa Valley, California, USA, October 26-30, 2008. ACM 2008, ISBN 978-1-59593-991-3 - Rakesh Agrawal:
Humane data mining. 1-2
DB: faceted search, web query results presentation
- Debabrata Dash, Jun Rao, Nimrod Megiddo, Anastasia Ailamaki, Guy M. Lohman:
Dynamic faceted search for discovery-driven analysis. 3-12 - Senjuti Basu Roy, Haidong Wang, Gautam Das, Ullas Nambiar, Mukesh K. Mohania:
Minimum-effort driven dynamic faceted search in structured databases. 13-22 - Gloria Bordogna, Alessandro Campi, Giuseppe Psaila, Stefania Ronchi:
A language for manipulating clustered web documents results. 23-32 - Shui-Lung Chuang, Kevin Chen-Chuan Chang:
Integrating web query results: holistic schema matching. 33-42
IR: web search 1
- Filip Radlinski, Madhu Kurup, Thorsten Joachims:
How does clickthrough data reflect retrieval quality? 43-52 - Marc Najork, Nick Craswell:
Efficient and effective link analysis with precomputed salsa maps. 53-62 - Lian'en Huang, Lei Wang, Xiaoming Li:
Achieving both high precision and high recall in near-duplicate detection. 63-72 - Zhicheng Dou, Ruihua Song, Xiaojie Yuan, Ji-Rong Wen:
Are click-through data adequate for learning web search rankings? 73-82
KM: classification
- Jian Huang, Omid Madani, C. Lee Giles:
Error-driven generalist+experts (edge): a multi-stage ensemble framework for text categorization. 83-92 - Yang Song, Lu Zhang, C. Lee Giles:
A sparse gaussian processes classification framework for fast tag suggestions. 93-102 - Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He:
Transfer learning from multiple source domains via consensus regularization. 103-112 - Dell Zhang, Robert Mao:
Classifying networked entities with modularity kernels. 113-122
Industry research track
- Casey Whitelaw, Alexander Kehlenbeck, Nemanja Petrovic, Lyle H. Ungar:
Web-scale named entity recognition. 123-132 - Roy J. Byrd, Mary S. Neff, Wilfried Teiken, Youngja Park, Keh-Shin F. Cheng, Stephen C. Gates, Karthik Visweswariah:
Semi-automated logging of contact center telephone calls. 133-142 - Gang Luo, Chunqiang Tang, Hao Yang, Xing Wei:
MedSearch: a specialized search engine for medical information retrieval. 143-152 - Roger B. Bradford:
An empirical study of required dimensionality for large-scale latent semantic indexing applications. 153-162
DB: efficient maintenance and query optimization
- Gang Luo, Philip S. Yu:
Content-based filtering for efficient online materialized view maintenance. 163-172 - Gang Qian, Yisheng Dong:
A step towards incremental maintenance of the composed schema mapping. 173-182 - Mumtaz Ahmad, Ashraf Aboulnaga, Shivnath Babu, Kamesh Munagala:
Modeling and exploiting query interactions in database systems. 183-192 - Humberto Luiz Razente, Maria Camila Nardini Barioni, Agma J. M. Traina, Christos Faloutsos, Caetano Traina Jr.:
A novel optimization approach to efficiently process aggregate similarity queries in metric access methods. 193-202
IR: social search
- Kerstin Bischoff, Claudiu S. Firan, Wolfgang Nejdl, Raluca Paiu:
Can all tags be used for search? 193-202 - Anna Ritchie, Stephen Robertson, Simone Teufel:
Comparing citation contexts for information retrieval. 213-222 - Fabian M. Suchanek, Milan Vojnovic, Dinan Gunawardena:
Social tags: meaning and suggestions. 223-232 - Hao Ma, Haixuan Yang, Michael R. Lyu, Irwin King:
Mining social networks using heat diffusion processes for marketing candidates selection. 233-242
IR/KM: machine learning
- Leonardo Rocha, Fernando Mourão, Adriano C. M. Pereira, Marcos André Gonçalves, Wagner Meira Jr.:
Exploiting temporal contexts in text classification. 243-252 - Alessandro Moschitti:
Kernel methods, syntax and semantics for relational text categorization. 253-262 - George Forman:
BNS feature scaling: an improved representation over tf-idf for svm text classification. 263-270 - Guilherme Hoefel, Charles Elkan:
Learning a two-stage SVM/CRF sequence classifier. 271-278
KM: link and graph mining
- Ziv Bar-Yossef, Li-Tal Mashiach:
Local approximation of pagerank and reverse pagerank. 279-288 - Aleksandra Korolova, Rajeev Motwani, Shubha U. Nabar, Ying Xu:
Link privacy in social networks. 289-298 - Chen Chen, Cindy Xide Lin, Xifeng Yan, Jiawei Han:
On effective presentation of graph patterns: a structural representative approach. 299-308 - Qiankun Zhao, Sourav S. Bhowmick, Xin Zheng, Kai Yi:
Characterizing and predicting community members from evolutionary and heterogeneous networks. 309-318
KM: information filtering
- Marko A. Rodriguez, Johan Bollen:
An algorithm to determine peer-reviewers. 319-328 - Dongmei Jia, Wai Gen Yee, Ophir Frieder:
Spam characterization and detection in peer-to-peer file-sharing systems. 329-338 - Steve Webb, James Caverlee, Calton Pu:
Predicting web spam with HTTP session information. 339-348 - Nish Parikh, Neel Sundaresan:
Inferring semantic query relations from collective user behavior. 349-358
DB: stream processing
- George A. Mihaila, Ioana Stanoi, Christian A. Lang:
Anomaly-free incremental output in stream processing. 359-368 - Abhishek Mukherji, Elke A. Rundensteiner, David C. Brown, Venkatesh Raghavan:
SNIF TOOL: sniffing for patterns in continuous streams. 369-378 - Gang Luo, Rong Yan, Philip S. Yu:
Real-time new event detection for video streams. 379-388 - Giorgio Ghelli, Dario Colazzo, Carlo Sartiani:
Linear time membership in a class of regular expressions with interleaving and counting. 389-398
IR: theory
- Donald Metzler:
Generalized inverse document frequency. 399-408 - Derrick Coetzee:
TinyLex: static n-gram index pruning with perfect recall. 409-418 - David E. Losada, Leif Azzopardi, Mark Baillie:
Revisiting the relationship between document length and relevance. 419-428 - Lixin Shi, Jian-Yun Nie, Guihong Cao:
Relating dependent indexes using dempster-shafer theory. 429-438
IR: query analysis
- Claudia Hauff, Vanessa Murdock, Ricardo Baeza-Yates:
Improved query difficulty prediction for the web. 439-448 - Doug Downey, Susan T. Dumais, Daniel J. Liebling, Eric Horvitz:
Understanding the relationship between searchers' queries and information goals. 449-458 - Zuobing Xu, Ram Akella:
Active relevance feedback for difficult queries. 459-468 - Qiaozhu Mei, Dengyong Zhou, Kenneth Ward Church:
Query suggestion using hitting time. 469-478
KM: web mining
- Xuanhui Wang, ChengXiang Zhai:
Mining term association patterns from search logs for effective query reformulation. 479-488 - Krisztian Balog, Maarten de Rijke:
Non-local evidence for expert finding. 489-498 - Amit Goyal, Francesco Bonchi, Laks V. S. Lakshmanan:
Discovering leaders from community actions. 499-508 - David N. Milne, Ian H. Witten:
Learning to link with wikipedia. 509-518 - Pedro M. Domingos:
Markov logic: a unifying language for knowledge and information management. 519
DB/industry: XML data integration and XML query optimization
- Alex Thomo, Srinivasan Venkatesh:
Rewriting of visibly pushdown languages for xml data integration. 521-530 - Guangjun Xie, Qi Cheng, Jarek Gryz, Calisto Zuzarte:
Some rewrite optimizations of DB2 XQuery navigation. 531-540 - Bilel Gueni, Talel Abdessalem, Bogdan Cautis, Emmanuel Waller:
Pruning nested XQuery queries. 541-550 - Pawel Placek, Dimitri Theodoratos, Stefanos Souldatos, Theodore Dalamagas, Timos K. Sellis:
A heuristic approach for checking containment of generalized tree-pattern queries. 551-560
IR: evaluation
- Leif Azzopardi, Vishwa Vinay:
Retrievability: an evaluation measure for higher order information access tasks. 561-570 - William Webber, Alistair Moffat, Justin Zobel:
Statistical power in retrieval experimentation. 571-580 - Tetsuya Sakai:
Comparing metrics across TREC and NTCIR: the robustness to system bias. 581-590 - Kenneth A. Kinney, Scott B. Huffman, Juting Zhai:
How evaluator domain expertise affects search result relevance judgments. 591-598
KM: statistical techniques
- Christos Boutsidis, Jimeng Sun, Nikos Anerousis:
Clustered subset selection and its applications on it service metrics. 599-608 - Paolo Boldi, Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis, Sebastiano Vigna:
The query-flow graph: model and applications. 609-618 - Arnold P. Boedihardjo, Chang-Tien Lu, Feng Chen:
A framework for estimating complex probability density structures in data streams. 619-628 - Pinar Donmez, Jaime G. Carbonell:
Proactive learning: cost-sensitive active learning with multiple imperfect oracles. 619-628
Panel discussion
- David A. Evans, Jason R. Baron, Chris Buckley, Robert S. Bauer:
E-discovery. 1527
DB: indexing and physical query optimization
- Josep Aguilar-Saborit, Mohammad Jalali, Dave Sharpe, Victor Muntés-Mulero:
Exploiting pipeline interruptions for efficient memory allocation. 639-648 - Marina Barsky, Ulrike Stege, Alex Thomo, Chris Upton:
A new method for indexing genomes using on-disk suffix trees. 649-658 - Vuk Ercegovac, Vanja Josifovski, Ning Li, Maurício R. Mediano, Eugene J. Shekita:
Supporting sub-document updates and queries in an inverted index. 659-668 - Wei Dong, Zhe Wang, William Josephson, Moses Charikar, Kai Li:
Modeling LSH for performance tuning. 669-678
IR: web search 2
- Mingjie Zhu, Shuming Shi, Nenghai Yu, Ji-Rong Wen:
Can phrase indexing help to process non-phrase queries? 679-688 - Julia Luxenburger, Shady Elbassuoni, Gerhard Weikum:
Matching task profiles and user needs in personalized web search. 689-698 - Rosie Jones, Kristina Lisa Klinkner:
Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs. 699-708 - Hao Ma, Haixuan Yang, Irwin King, Michael R. Lyu:
Learning latent semantic relations from clickthrough data for query suggestion. 709-718
IR: multilingual & multimedia
- Kristen Parton, Kathleen R. McKeown, James Allan, Enrique Henestroza:
Simultaneous multilingual search for translingual information retrieval. 719-728 - Daqing He, Dan Wu:
Translation enhancement: a new relevance feedback method for cross-language information retrieval. 729-738 - Eduardo Valle, Matthieu Cord, Sylvie Philipp-Foliguet:
High-dimensional descriptor indexing for large multimedia databases. 739-748 - Yu-En Lu, Pietro Liò, Steven Hand:
On low dimensional random projections and similarity search. 749-758
KM: data mining
- Hanghang Tong, Yasushi Sakurai, Tina Eliassi-Rad, Christos Faloutsos:
Fast mining of complex time-stamped events. 759-768 - Darcy A. Davis, Nitesh V. Chawla, Nicholas Blumm, Nicholas A. Christakis, Albert-László Barabási:
Predicting individual disease risk based on medical history. 769-778 - Malika Mahoui, William John Teahan, Arvind Kumar Thirumalaiswamy Sekhar, Satyasaibabu Chilukuri:
Identification of gene function using prediction by partial matching (PPM) language models. 779-786 - Philon Nguyen, Nematollaah Shiri:
Fast correlation analysis on time series datasets. 787-796
KM: semantic techniques
- Rodolfo Stecher, Claudia Niederée, Wolfgang Nejdl:
Wildcards for lightweight information integration in virtual desktops. 797-806 - Simona Colucci, Eugenio Di Sciascio, Francesco M. Donini, Eufemia Tinelli:
Finding informative commonalities in concept collections. 807-817 - Masahiro Ito, Kotaro Nakayama, Takahiro Hara, Shojiro Nishio:
Association thesaurus construction methods based on link co-occurrence analysis for wikipedia. 817-826 - Christian Hütter, Conny Kühne, Klemens Böhm:
Peer production of structured knowledge -: an empirical study of ratings and incentive mechanisms. 827-842
DB: security and privacy
- Venkatesan T. Chakaravarthy, Himanshu Gupta, Prasan Roy, Mukesh K. Mohania:
Efficient techniques for document sanitization. 843-852 - Rosie Jones, Ravi Kumar, Bo Pang, Andrew Tomkins:
Vanity fair: privacy in querylog bundles. 853-862 - Haixun Wang, Jian Yin, Chang-Shing Perng, Philip S. Yu:
Dual encryption for query integrity assurance. 863-872 - Ahmed A. Ataullah, Ashraf Aboulnaga, Frank Wm. Tompa:
Records retention in relational database systems. 873-882
IR: medley
- Lisa Friedland, James Allan:
Joke retrieval: recognizing the same joke told differently. 883-892 - Jeremy Pickens, Gene Golovchinsky:
Ranked feature fusion models for ad hoc retrieval. 893-900 - Jin Zhang, Xueqi Cheng, Gaowei Wu, Hongbo Xu:
AdaSum: an adaptive model for summarization. 901-910 - Deng Cai, Qiaozhu Mei, Jiawei Han, Chengxiang Zhai:
Modeling hidden topics on document manifold. 911-920
IR: recommender systems
- Jinwen Guo, Shengliang Xu, Shenghua Bao, Yong Yu:
Tapping on the potential of q&a community by recommending answer providers. 921-930 - Hao Ma, Haixuan Yang, Michael R. Lyu, Irwin King:
SoRec: social recommendation using probabilistic matrix factorization. 931-940 - Yun Chi, Shenghuo Zhu, Yihong Gong, Yi Zhang:
Probabilistic polyadic factorization and its application to personalized recommendation. 941-950 - Derry Tanti Wijaya, Stéphane Bressan:
A random walk on the red carpet: rating movies with user reviews and pagerank. 951-960
KM: feature selection
- Xiang Zhang, Feng Pan, Wei Wang:
REDUS: finding reducible subspaces in high dimensional data. 961-970 - Elsa Loekito, James Bailey:
Mining influential attributes that capture class and group contrast behaviour. 971-980 - Ying Liu, Lucian Vlad Lita, Radu Stefan Niculescu, Kun Bai, Prasenjit Mitra, C. Lee Giles:
Real-time data pre-processing technique for efficient feature extraction in large scale datasets. 981-990 - Hongliang Fei, Jun Huan:
Structure feature selection for graph classification. 991-1000
Panel discussion 2
- David A. Evans, Susan Feldman, Ed H. Chi, Natasa Milic-Frayling, Igor Perisic:
The social (open) workspace. 1529 - W. Bruce Croft:
Unsolved problems in search: (and how we approach them). 1001
IR: advertising & filtering
- Andrei Z. Broder, Massimiliano Ciaramita, Marcus Fontoura, Evgeniy Gabrilovich, Vanja Josifovski, Donald Metzler, Vanessa Murdock, Vassilis Plachouras:
To swing or not to swing: learning when (not) to advertise. 1003-1012 - Andrei Z. Broder, Peter Ciccolo, Marcus Fontoura, Evgeniy Gabrilovich, Vanja Josifovski, Lance Riedel:
Search advertising using web relevance feedback. 1013-1022 - Yuefeng Li, Xujuan Zhou, Peter Bruza, Yue Xu, Raymond Y. K. Lau:
A two-stage text mining model for information filtering. 1023-1032 - Canhui Wang, Min Zhang, Liyun Ru, Shaoping Ma:
Automatic online news topic ranking using media focus and user attention based on aging theory. 1033-1042
IR: blog
- Craig Macdonald, Iadh Ounis:
Key blog distillation: ranking aggregates. 1043-1052 - Jangwon Seo, W. Bruce Croft:
Blog site search using resource selection. 1053-1062 - Ben He, Craig Macdonald, Jiyin He, Iadh Ounis:
An effective statistical approach to blog post opinion retrieval. 1063-1072
KM: clustering
- Chuan Duan, Jane Cleland-Huang, Bamshad Mobasher:
A consensus based approach to constrained clustering of software requirements. 1073-1082 - Ron Bekkerman, Martin Scholz:
Data weaving: scaling up the state-of-the-art in data clustering. 1083-1092 - Ira Assent, Ralph Krieger, Emmanuel Müller, Thomas Seidl:
EDSC: efficient density-based subspace clustering. 1093-1102 - Faris Alqadah, Raj Bhatnagar:
An effective algorithm for mining 3-clusters in vertically partitioned data. 1103-1112
IR: enterprise search
- Maryam Karimzadehgan, ChengXiang Zhai, Geneva G. Belford:
Multi-aspect expertise matching for review assignment. 1113-1122 - Barbara Poblete, Carlos Castillo, Aristides Gionis:
Dr. Searcher and Mr. Browser: a unified hyperlink-click graph. 1123-1132 - Pavel Serdyukov, Henning Rode, Djoerd Hiemstra:
Modeling multi-step relevance propagation for expert finding. 1133-1142 - Keke Chen, Rongqing Lu, C. K. Wong, Gordon Sun, Larry P. Heck, Belle L. Tseng:
Trada: tree based ranking function adaptation. 1143-1152
IR: structured documents
- Mir Sadek Ali, Mariano P. Consens, Gabriella Kazai, Mounia Lalmas:
Structural relevance: a common basis for the evaluation of structured document retrieval. 1153-1162 - Le Zhao, Jamie Callan:
A generative retrieval model for structured documents. 1163-1172 - Christian Kohlschütter, Wolfgang Nejdl:
A densitometric approach to web page segmentation. 1173-1182 - Sujith Ravi, Marius Pasca:
Using structured text for large-scale attribute extraction. 1183-1192
KM: text mining
- Anup Chalamalla, Sumit Negi, L. Venkata Subramaniam, Ganesh Ramakrishnan:
Identification of class specific discourse patterns. 1193-1202 - Huajing Li, Zaiqing Nie, Wang-Chien Lee, C. Lee Giles, Ji-Rong Wen:
Scalable community discovery on textual data with relations. 1203-1212