18th KDD 2012: Beijing, China
- Qiang Yang, Deepak Agarwal, Jian Pei:
The 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '12, Beijing, China, August 12-16, 2012. ACM 2012, ISBN 978-1-4503-1462-6
Keynote addresses
- Robin Li:
Nine real hard problems we'd like you to solve. 1
- Jiawei Han:
Mining heterogeneous information networks: the next frontier. 2-3
- Michael I. Jordan:
Divide-and-conquer and statistical inference for big data. 4
- Michael J. Kearns:
Experiments in social computation: (and the data they generate). 5
Research session a1: page rank and social networks
- Yasuko Matsubara, Yasushi Sakurai, B. Aditya Prakash, Lei Li, Christos Faloutsos:
Rise and fall patterns of information diffusion: model and implications. 6-14
- Yasuhiro Fujiwara, Makoto Nakatsuji, Takeshi Yamamuro, Hiroaki Shiokawa, Makoto Onizuka:
Efficient personalized pagerank with accuracy assurance. 15-23
- Bahman Bahmani, Ravi Kumar, Mohammad Mahdian, Eli Upfal:
PageRank on an evolving graph. 24-32
- Seth A. Myers, Chenguang Zhu, Jure Leskovec:
Information diffusion and external influence in networks. 33-41
- Rob Patro, Geet Duggal, Emre Sefer, Hao Wang, Darya Filippova, Carl Kingsford:
The missing models: a data-driven approach for learning how networks grow. 42-50
Research session a2: pattern mining
- Guimei Liu, Haojun Zhang, Limsoon Wong:
Finding minimum representative pattern sets. 51-59
- Kui Yu, Wei Ding, Dan A. Simovici, Xindong Wu:
Mining emerging patterns by streaming feature selection. 60-68
- Mario Boley, Sandy Moens, Thomas Gärtner:
Linear space direct pattern sampling using coupling from the past. 69-77
- Cheng-Wei Wu, Bai-En Shie, Vincent S. Tseng, Philip S. Yu:
Mining top-K high utility itemsets. 78-86
- Geng Li, Mohammed J. Zaki:
Sampling minimal frequent boolean (DNF) patterns. 87-95
Research session a3: probabilistic models
- Xu Chen, Mingyuan Zhou, Lawrence Carin:
The contextual focused topic model. 96-104
- Issei Sato, Kenichi Kurihara, Hiroshi Nakagawa:
Practical collapsed variational bayes inference for hierarchical dirichlet process. 105-113
- Lei Han, Guojie Song, Gao Cong, Kunqing Xie:
Overlapping decomposition for causal graphical modeling. 114-122
- Yu Wang, Eugene Agichtein, Michele Benzi:
TM-LDA: efficient online modeling of latent topic transitions in social media. 123-131
- Stephan Günnemann, Ines Färber, Thomas Seidl:
Multi-view clustering using mixture models in subspace projections. 132-140
Research session a4: supervised learning
- Te-Kang Jan, Da-Wei Wang, Chi-Hung Lin, Hsuan-Tien Lin:
A simple methodology for soft cost-sensitive classification. 141-149
- Yin Lou, Rich Caruana, Johannes Gehrke:
Intelligible models for classification and regression. 150-158
- Hua Ouyang, Alexander G. Gray:
NASA: achieving lower regrets and faster rates via adaptive stepsizes. 159-167
- Thomas Ryan Hoens, Nitesh V. Chawla:
Learning in non-stationary environments with class imbalance. 168-176
- Shin Matsushima, S. V. N. Vishwanathan, Alexander J. Smola:
Linear support vector machines via dual cached loops. 177-185
Industry/govt track a5: mobile computing
- Jing Yuan, Yu Zheng, Xing Xie:
Discovering regions of different functions in a city using human mobility and POIs. 186-194
- Ling-Yin Wei, Yu Zheng, Wen-Chih Peng:
Constructing popular routes from uncertain trajectories. 195-203
- Kent Shi, Kamal Ali:
GetJar mobile application recommendations with very sparse datasets. 204-212
- Rui Chen, Benjamin C. M. Fung, Bipin C. Desai, Nériah M. Sossou:
Differentially private transit data publication: a case study on the montreal transportation system. 213-221
Asia-Pacific track a6: session 1
- Deyi Li:
Interaction and collective intelligence in internet computing. 222
- Masaru Kitsuregawa:
Building an engine for big data. 223
- Bo Zhang:
A new challenge of information processing under the 21st century. 224
- Geoff Holmes:
Developing data mining applications. 225
Research session b1: social opinions
- Yuandong Tian, Jun Zhu:
Learning from crowds in the presence of schools of thought. 226-234
- Anirban Dasgupta, Ravi Kumar, D. Sivakumar:
Social sampling. 235-243
- Chunyan Wang, Mao Ye, Bernardo A. Huberman:
From user comments to on-line conversations. 244-252
- Jiliang Tang, Huiji Gao, Huan Liu, Atish Das Sarma:
eTrust: understanding trust evolution in an online world. 253-261
Research session b2: time series
- Thanawin Rakthanmanon, Bilson J. L. Campana, Abdullah Mueen, Gustavo E. A. P. A. Batista, M. Brandon Westover, Qiang Zhu, Jesin Zakaria, Eamonn J. Keogh:
Searching and mining trillions of time series subsequences under dynamic time warping. 262-270
- Yasuko Matsubara, Yasushi Sakurai, Christos Faloutsos, Tomoharu Iwata, Masatoshi Yoshikawa:
Fast mining and forecasting of complex time-stamped events. 271-279
- Iyad Batal, Dmitriy Fradkin, James H. Harrison Jr., Fabian Moerchen, Milos Hauskrecht:
Mining recent temporal patterns for event detection in multivariate time series data. 280-288
- Jason Lines, Luke M. Davis, Jon Hills, Anthony J. Bagnall:
A shapelet transform for time series classification. 289-297
Research session b3: matrices and tensors
- Yao Hu, Debing Zhang, Jun Liu, Jieping Ye, Xiaofei He:
Accelerated singular value thresholding for matrix completion. 298-306
- Liangda Li, Guy Lebanon, Haesun Park:
Fast bregman divergence NMF using taylor expansion and coordinate descent. 307-315
- U Kang, Evangelos E. Papalexakis, Abhay Harpale, Christos Faloutsos:
GigaTensor: scaling tensor analysis up by 100 times - algorithms and discoveries. 316-324
- Jorge G. Silva, Lawrence Carin:
Active learning for online bayesian matrix factorization. 325-333
Research session b4: unsupervised learning
- Shuiwang Ji, Wenlu Zhang, Jun Liu:
A sparsity-inducing formulation for evolutionary co-clustering. 334-342
- So Hirai, Kenji Yamanishi:
Detecting changes of clustering structures using normalized maximum likelihood coding. 343-351
- Stephan Günnemann, Ines Färber, Kittipat Virochsiri, Thomas Seidl:
Subspace correlation clustering: finding locally correlated dimensions in subspace projections of the data. 352-360
- Claudia Plant:
Dependency clustering across measurement scales. 361-369
Industry/govt track b5: social network analysis
- Xintian Yang, Amol Ghoting, Yiye Ruan, Srinivasan Parthasarathy:
A framework for summarizing and analyzing twitter feeds. 370-378
- Xinfan Meng, Furu Wei, Xiaohua Liu, Ming Zhou, Sujian Li, Houfeng Wang:
Entity-centric topic-oriented opinion summarization in twitter. 379-387
- Wenjun Zhou, Hongxia Jin, Yan Liu:
Community discovery and profiling with social messages. 388-396
- Ziad Al Bawab, George H. Mills, Jean-François Crespo:
Finding trending local topics in search queries for personalization of a recommendation system. 397-405
Industrial practice expo b6: session 1
- Yong Shi:
China's national personal credit scoring system: a real-life intelligent knowledge application. 406
- Rich Holada:
Maximizing return and minimizing cost with the right decision management systems. 407
Research session c1: social and web mining applications
- Theodoros Lappas, George Valkanas, Dimitrios Gunopulos:
Efficient and domain-invariant competitor mining. 408-416
- Peter Haider, Luca Chiarandini, Ulf Brefeld:
Discriminative clustering for market segmentation. 417-425
- Alex Beutel, B. Aditya Prakash, Roni Rosenfeld, Christos Faloutsos:
Interacting viruses in networks: can both survive? 426-434
- Rakesh Agrawal, Samuel Ieong:
Aggregating web offers to determine product prices. 435-443
Research session c2: event mining
- Zhenhui Li, Jingjing Wang, Jiawei Han:
Mining event periodicity from incomplete observations. 444-452
- Fei Wang, Noah Lee, Jianying Hu, Jimeng Sun, Shahram Ebadollahi:
Towards heterogeneous temporal clinical event pattern discovery: a convolutional approach. 453-461
- Nikolaj Tatti, Jilles Vreeken:
The long and the short of it: summarising event sequences with serial episodes. 462-470
- Bruno Cadonna, Johann Gamper, Michael H. Böhlen:
Efficient event pattern matching with match windows. 471-479
Research session c3: matrix approximation
- Shuo Xiang, Yunzhang Zhu, Xiaotong Shen, Jieping Ye:
Optimal exact least squares rank minimization. 480-488
- Vikas Sindhwani, Amol Ghoting:
Large-scale distributed non-negative sparse coding and sparse dictionary learning. 489-497
- Ke Zhou, Hongyuan Zha:
Learning binary codes for collaborative filtering. 498-506
- Cho-Jui Hsieh, Kai-Yang Chiang, Inderjit S. Dhillon:
Low rank modeling of signed networks. 507-515
Research session c4: supervised learning with multivariate data
- Madeleine Seeland, Andreas Karwath, Stefan Kramer:
A structural cluster kernel for learning on graphs. 516-524
- Sheng-Jun Huang, Yang Yu, Zhi-Hua Zhou:
Multi-label hypothesis reuse. 525-533
- Forrest Briggs, Xiaoli Z. Fern, Raviv Raich:
Rank-loss support instance machines for MIML instance annotation. 534-542
- Jintao Zhang, Jun Huan:
Inductive multi-task learning with multiple view data. 543-551
Industry/govt track c5: web applications
- Xinyu Xing, Yu-Li Liang, Sui Huang, Hanqiang Cheng, Richard Han, Qin Lv, Xue Liu, Shivakant Mishra, Yi Zhu:
Scalable misbehavior detection in online video chat services. 552-560
- Jun Zhang, Xiaoming Fan, Jianyong Wang, Lizhu Zhou:
Keyword-propagation-based information enriching and noise removal for web news videos. 561-569
- Lei Zhang, Linpeng Tang, Ping Luo, Enhong Chen, Limei Jiao, Min Wang, Guiquan Liu:
Harnessing the wisdom of the crowds for accurate web page clipping. 570-578
- Uwe F. Mayer:
Bootstrapped language identification for multi-site internet domains. 579-585
Industrial practice expo c6: session 2
- Wei-Ying Ma:
Semantic search and a new moore's law effect in knowledge engineering. 586
- Christian Posse:
Key lessons learned building recommender systems for large-scale social networks. 587
Research session a1: community mining
- Guan Wang, Yuchen Zhao, Xiaoxiao Shi, Philip S. Yu:
Magnet community identification on social networks. 588-596
- David F. Gleich, C. Seshadhri:
Vertex neighborhoods, low conductance cuts, and good seeds for local community methods. 597-605
- Yu Zhang, Dit-Yan Yeung:
Overlapping community detection via bounded nonnegative matrix tri-factorization. 606-614
- Michele Coscia, Giulio Rossetti, Fosca Giannotti, Dino Pedreschi:
DEMON: a local-first discovery method for overlapping communities. 615-623
- Bruno D. Abrahao, Sucheta Soundarajan, John E. Hopcroft, Robert Kleinberg:
On the separability of structural classes of communities. 624-632
Research session a2: sequential and spatio-temporal patterns
- Liang Tang, Tao Li, Larisa Shwartz:
Discovering lag intervals for temporal dependencies. 633-641
- Jaya Kawale, Snigdhansu Chatterjee, Dominick Ormsby, Karsten Steinhaeuser, Stefan Liess, Vipin Kumar:
Testing the significance of spatio-temporal teleconnection patterns. 642-650
- Jeffrey Chan, Wei Liu, Christopher Leckie, James Bailey, Kotagiri Ramamohanarao:
SeqiBloc: mining multi-time spanning blockmodels in dynamic graphs. 651-659
- Junfu Yin, Zhigang Zheng, Longbing Cao:
USpan: an efficient algorithm for mining high utility sequential patterns. 660-668
- Xuemei Liu, James Biagioni, Jakob Eriksson, Yin Wang, George Forman, Yanmin Zhu:
Mining large-scale, sparse GPS traces for map inference: comparison of approaches. 669-677
Research session a3: personalization and recommendation
- Khalid El-Arini, Ulrich Paquet, Ralf Herbrich, Jurgen Van Gael, Blaise Agüera y Arcas:
Transparent user models for personalization. 678-686
- Aristides Gionis, Theodoros Lappas, Evimaria Terzi:
Estimating entity importance via counting set covers. 687-695
- Erheng Zhong, Wei Fan, Junwei Wang, Lei Xiao, Yong Li:
ComSoc: adaptive transfer of user behaviors over composite social network. 696-704
- Karthik Raman, Pannaga Shivaswamy, Thorsten Joachims:
Online learning to diversify from implicit feedback. 705-713
- Shuo Chen, Joshua L. Moore, Douglas R. Turnbull, Thorsten Joachims:
Playlist prediction via metric embedding. 714-722
Research session a4: supervised learning with auxiliary information
- Ming Ji, Binbin Lin, Xiaofei He, Deng Cai, Jiawei Han:
Parallel field ranking. 723-731
- Fanhua Shang, Licheng Jiao, Fei Wang:
Semi-supervised learning with mixed knowledge information. 732-740
- Rita Chattopadhyay, Zheng Wang, Wei Fan, Ian Davidson, Sethuraman Panchanathan, Jieping Ye:
Batch mode active sampling based on marginal probability distribution matching. 741-749
- Ashesh Jain, S. V. N. Vishwanathan, Manik Varma:
SPF-GMKL: generalized multiple kernel learning with a million kernels. 750-758
- Pavel P. Kuksa, Vladimir Pavlovic:
Efficient evaluation of large sequence kernels. 759-767
Industry/govt track a5: computational advertising
- Kuang-chih Lee, Burkay Orten, Ali Dasdan, Wentong Li:
Estimating conversion rate in display advertising from past performance data. 768-776
- Haibin Cheng, Roelof van Zwol, Javad Azimi, Eren Manavoglu, Ruofei Zhang, Yang Zhou, Vidhya Navalpakkam:
Multimedia features for click prediction of new ads in display advertising. 777-785
- Ron Kohavi, Alex Deng, Brian Frasca, Roger Longbotham, Toby Walker, Ya Xu:
Trustworthy online controlled experiments: five puzzling outcomes explained. 786-794
- Ye Chen, Tak W. Yan:
Position-normalized click prediction in search advertising. 795-803
- Claudia Perlich, Brian Dalessandro, Rod Hook, Ori Stitelman, Troy Raeder, Foster J. Provost:
Bid optimizing and inventory scoring in targeted online advertising. 804-812
Asia-Pacific track a6: session 2
- Jianzhong Li:
Algorithms for mining uncertain graph data. 813
- Paul Compton:
Experience with discovering knowledge by acquiring it. 814
- Naonori Ueda:
Bayesian relational data analysis. 815
- Zhongzhi Shi:
Cross-media knowledge discovery. 816
Research session b1: review, discussion, and q & a
- Sihong Xie, Guan Wang, Shuyang Lin, Philip S. Yu:
Review spam detection via temporal pattern discovery. 823-831
- Theodoros Lappas, Mark Crovella, Evimaria Terzi:
Selecting a characteristic set of reviews. 832-840
- Arjun Mukherjee, Bing Liu:
Mining contentions from discussions and debates. 841-849
- Ashton Anderson, Daniel P. Huttenlocher, Jon M. Kleinberg, Jure Leskovec:
Discovering value from community activity on focused question answering sites: a case study of stack overflow. 850-858
Research session b2: outlier and intrusion detection
- Manish Gupta, Jing Gao, Yizhou Sun, Jiawei Han:
Integrating community matching and outlier detection for mining evolutionary community outliers. 859-867
- Wouter Duivesteijn, Ad Feelders, Arno J. Knobbe:
Different slopes for different folks: mining for exceptional regression models with cook's distance. 868-876
- Ninh Pham, Rasmus Pagh:
A near-linear time approximation algorithm for angle-based outlier detection in high-dimensional data. 877-885
- Qi Ding, Natallia Katenka, Paul Barford, Eric D. Kolaczyk, Mark Crovella:
Intrusion as (anti)social communication: characterization and detection. 886-894
Research session b3: feature selection
- Pinghua Gong, Jieping Ye, Changshui Zhang:
Robust multi-task feature learning. 895-903
- Jiliang Tang, Huan Liu:
Unsupervised feature selection for linked social media data. 904-912
- Adam Woznica, Phong Nguyen, Alexandros Kalousis:
Model mining for robust feature selection. 913-921
- Sen Yang, Lei Yuan, Ying-Cheng Lai, Xiaotong Shen, Peter Wonka, Jieping Ye:
Feature grouping and selection over an undirected graph. 922-930
Research session b4: nearest neighbors
- Parikshit Ram, Alexander G. Gray:
Maximum inner-product search using cone trees. 931-939
- Yi Zhen, Dit-Yan Yeung:
A probabilistic model for multimodal hash function learning. 940-948
- De-Nian Yang, Chih-Ya Shen, Wang-Chien Lee, Ming-Syan Chen:
On socio-spatial group query for location-based social networks. 949-957
- Caiming Xiong, David M. Johnson, Ran Xu, Jason J. Corso:
Random forests for metric learning with implicit pairwise position dependence. 958-966
Industry/govt track b5: business intelligence
- Rakesh Agrawal, Sunandan Chakraborty, Sreenivas Gollapudi, Anitha Kannan, Krishnaram Kenthapadi:
Empowering authors to diagnose comprehension burden in textbooks. 967-975
- Yin Song, Longbing Cao, Xindong Wu, Gang Wei, Wu Ye, Wei Ding:
Coupled behavior analysis for capturing coupling relationships in group-based market manipulations. 976-984
- Zhiang Wu, Junjie Wu, Jie Cao, Dacheng Tao:
HySAD: a semi-supervised hybrid shilling attack detector for trustworthy product recommendation. 985-993
- Gowtham Bellala, Manish Marwah, Martin F. Arlitt, Geoff Lyon, Cullen E. Bash:
Following the electrons: methods for power management in commercial buildings. 994-1002
Industrial practice expo b6: session 3
- Graham Williams:
Ensembles and model delivery for tax compliance. 1003