default search action
15th KDD 2009: Paris, France
- John F. Elder IV, Françoise Fogelman-Soulié, Peter A. Flach, Mohammed Javeed Zaki:
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28 - July 1, 2009. ACM 2009, ISBN 978-1-60558-495-9
Keynote talks
- David J. Hand:
Mismatched models, wrong results, and dreadful decisions: on choosing appropriate data mining tools. 1-2 - Ravi Kumar:
Mining web logs: applications and challenges. 3-4 - Heikki Mannila:
Randomization methods in data mining. 5-6 - Ashok N. Srivastava:
Data mining at NASA: from theory to applications. 7-8 - Stanley Wasserman:
Network science: an introduction to recent statistical approaches. 9-10
Panel
- Michael Zeller, Robert Grossman, Christoph Lingenfelder, Michael R. Berthold, Erik Marcadé, Rick Pechter, Mike Hoskins, Wayne Thompson, Rich Holada:
Open standards and cloud computing: KDD-2009 panel report. 11-18
Research track papers
- Deepak Agarwal, Bee-Chung Chen:
Regression-based latent factor models. 19-28 - Charu C. Aggarwal, Yan Li, Jianyong Wang, Jing Wang:
Frequent pattern mining with uncertain data. 29-38 - Amr Ahmed, Eric P. Xing, William W. Cohen, Robert F. Murphy:
Structured correspondence topic models for mining captioned figures in biological literature. 39-48 - Anurag Ambekar, Charles B. Ward, Jahangir Mohammed, Swapna Male, Steven Skiena:
Name-ethnicity classification from open sources. 49-58 - Shin Ando, Einoshin Suzuki:
Detection of unique temporal segments by information theoretic meta-clustering. 59-68 - Mafruz Zaman Ashrafi, See-Kiong Ng:
Collusion-resistant anonymous data collection method. 69-78 - Sitaram Asur, Srinivasan Parthasarathy:
A viewpoint-based approach for interaction graph analysis. 79-88 - Lars Backstrom, Jon M. Kleinberg, Ravi Kumar:
Optimizing web traffic via the media scheduling problem. 89-98 - Ron Bekkerman, Martin Scholz, Krishnamurthy Viswanathan:
Improving clustering stability with combinatorial MRFs. 99-108 - Michele Berlingerio, Fabio Pinelli, Mirco Nanni, Fosca Giannotti:
Temporal mining for interactive workflow data analysis. 109-118 - Thomas Bernecker, Hans-Peter Kriegel, Matthias Renz, Florian Verhein, Andreas Züfle:
Probabilistic frequent itemset mining in uncertain databases. 119-128 - Alina Beygelzimer, John Langford:
The offset tree for learning with partial labels. 129-138 - Albert Bifet, Geoffrey Holmes, Bernhard Pfahringer, Richard Kirkby, Ricard Gavaldà:
New ensemble methods for evolving data streams. 139-148 - Christian Böhm, Katrin Haegler, Nikola S. Müller, Claudia Plant:
CoCo: coding cost for parameter-free outlier detection. 149-158 - Yingyi Bu, Lei Chen, Ada Wai-Chee Fu, Dawei Liu:
Efficient anomaly monitoring over moving object trajectory streams. 159-168 - Jonathan D. Chang, Jordan L. Boyd-Graber, David M. Blei:
Connections between the lines: augmenting social networks with text. 169-178 - Bo Chen, Wai Lam, Ivor W. Tsang, Tak-Lam Wong:
Extracting discriminative concepts for domain adaptation in text mining. 179-188 - Minmin Chen, Yixin Chen, Michael R. Brent, Aaron E. Tenney:
Constrained optimization for validation-guided conditional random field learning. 189-198 - Wei Chen, Yajun Wang, Siyu Yang:
Efficient influence maximization in social networks. 199-208 - Ye Chen, Dmitry Pavlov, John F. Canny:
Large-scale behavioral targeting. 209-218 - Flavio Chierichetti, Ravi Kumar, Silvio Lattanzi, Michael Mitzenmacher, Alessandro Panconesi, Prabhakar Raghavan:
On compressing social networks. 219-228 - Erick Delage:
Regret-based online ranking for a growing digital library. 229-238 - Hongbo Deng, Michael R. Lyu, Irwin King:
A generalized Co-HITS algorithm and its application to bipartite graphs. 239-248 - Meghana Deodhar, Joydeep Ghosh:
Mining for the most certain predictions from dyadic data. 249-258 - Pinar Donmez, Jaime G. Carbonell, Jeff G. Schneider:
Efficiently learning the accuracy of labeling sources for selective sampling. 259-268 - Nan Du, Christos Faloutsos, Bai Wang, Leman Akoglu:
Large human communication networks: patterns and a utility-driven generator. 269-278 - Murat Dundar, E. Daniel Hirleman, Arun K. Bhunia, J. Paul Robinson, Bartek Rajwa:
Learning with a non-exhaustive training dataset: a case study: detection of bacteria cultures using optical-scattering technology. 279-288 - Khalid El-Arini, Gaurav Veda, Dafna Shahaf, Carlos Guestrin:
Turning down the noise in the blogosphere. 289-298 - George Forman, Martin Scholz, Shyamsundar Rajaram:
Feature shaping for linear SVM classifiers. 299-308 - Richard Frank, Martin Ester, Arno J. Knobbe:
A multi-relational approach to spatial classification. 309-318 - Antonino Freno, Edmondo Trentin, Marco Gori:
Scalable pseudo-likelihood estimation in hybrid random fields. 319-328 - João Gama, Raquel Sebastião, Pedro Pereira Rodrigues:
Issues in evaluation of stream learning algorithms. 329-338 - Jing Gao, Wei Fan, Yizhou Sun, Jiawei Han:
Heterogeneous source consensus learning via decision propagation and negotiation. 339-348 - Yong Ge, Hui Xiong, Wenjun Zhou, Ramendra K. Sahoo, Xiaofeng Gao, Weili Wu:
Multi-focal learning and its application to customer service support. 349-358 - Quanquan Gu, Jie Zhou:
Co-clustering on manifolds. 359-368 - Lei Guo, Enhua Tan, Songqing Chen, Xiaodong Zhang, Yihong Eric Zhao:
Analyzing patterns of user content generation in online social networks. 369-378 - Sami Hanhijärvi, Markus Ojala, Niko Vuokko, Kai Puolamäki, Nikolaj Tatti, Heikki Mannila:
Tell me something I don't know: randomization strategies for iterative data mining. 379-388 - Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, Xiaohua Zhou:
Exploiting Wikipedia as external knowledge for document clustering. 389-396 - Mohsen Jamali, Martin Ester:
TrustWalker: a random walk model for combining trust-based and item-based recommendation. 397-406 - Shuiwang Ji, Lei Yuan, Ying-Xin Li, Zhi-Hua Zhou, Sudhir Kumar, Jieping Ye:
Drosophila gene expression pattern annotation using sparse features and term-term interactions. 407-416 - Ruoming Jin, Yang Xiang, Lin Liu:
Cartesian contour: a concise representation for a collection of frequent sets. 417-426 - Aleksander Kolcz, Gordon V. Cormack:
Genre-based decomposition of email class noise. 427-436 - Arne Koopman, Arno Siebes:
Characteristic relational patterns. 437-446 - Yehuda Koren:
Collaborative filtering with temporal dynamics. 447-456 - Sayali Kulkarni, Amit Singh, Ganesh Ramakrishnan, Soumen Chakrabarti:
Collective annotation of Wikipedia entities in web text. 457-466 - Theodoros Lappas, Kun Liu, Evimaria Terzi:
Finding a team of experts in social networks. 467-476 - Theodoros Lappas, Benjamin Arai, Manolis Platakis, Dimitrios Kotsakos, Dimitrios Gunopulos:
On burstiness-aware search for document sequences. 477-486 - Mark Last:
Improving data mining utility with projective sampling. 487-496 - Jure Leskovec, Lars Backstrom, Jon M. Kleinberg:
Meme-tracking and the dynamics of the news cycle. 497-506 - Lei Li, James McCann, Nancy S. Pollard, Christos Faloutsos:
DynaMMo: mining and summarization of coevolving sequences with missing values. 507-516 - Tiancheng Li, Ninghui Li:
On the tradeoff between privacy and utility in data publishing. 517-526 - Yu-Ru Lin, Jimeng Sun, Paul C. Castro, Ravi B. Konuru, Hari Sundaram, Aisling Kelliher:
MetaFac: community discovery via relational hypergraph factorization. 527-536 - Chao Liu, Fan Guo, Christos Faloutsos:
BBM: bayesian browsing model from petabyte-scale data. 537-546 - Jun Liu, Jianhui Chen, Jieping Ye:
Large-scale sparse logistic regression. 547-556 - David Lo, Hong Cheng, Jiawei Han, Siau-Cheng Khoo, Chengnian Sun:
Classification of software behaviors for failure detection: a discriminative pattern mining approach. 557-566 - Steven Loscalzo, Lei Yu, Chris H. Q. Ding:
Consensus group stable feature selection. 567-576 - Aurélie C. Lozano, Naoki Abe, Yan Liu, Saharon Rosset:
Grouped graphical Granger modeling methods for temporal causal modeling. 577-586 - Aurélie C. Lozano, Hongfei Li, Alexandru Niculescu-Mizil, Yan Liu, Claudia Perlich, Jonathan R. M. Hosking, Naoki Abe:
Spatial-temporal causal modeling for climate change attribution. 587-596 - Sofus A. Macskassy:
Using graph-based metrics with empirical risk minimization to speed up active learning on networked data. 597-606 - R. Dean Malmgren, Jake M. Hofman, Luís A. Nunes Amaral, Duncan J. Watts:
Characterizing individual communication patterns. 607-616 - Andreas Maunz, Christoph Helma, Stefan Kramer:
Large-scale graph mining using backbone refinement classes. 617-626 - Frank McSherry, Ilya Mironov:
Differentially Private Recommender Systems: Building Privacy into the Netflix Prize Contenders. 627-636 - Anna Monreale, Fabio Pinelli, Roberto Trasarti, Fosca Giannotti:
WhereNext: a location predictor on trajectory pattern mining. 637-646 - Siegfried Nijssen, Tias Guns, Luc De Raedt:
Correlated itemset mining in ROC space: a constraint programming approach. 647-656 - Kensuke Onuma, Hanghang Tong, Christos Faloutsos:
TANGENT: a novel, 'Surprise me', recommendation algorithm. 657-666 - Rong Pan, Martin Scholz:
Mind the gaps: weighting the unknown in large-scale one-class collaborative filtering. 667-676 - Gaurav Pandey, Gowtham Atluri, Michael S. Steinbach, Chad L. Myers, Vipin Kumar:
An association analysis approach to biclustering. 677-686 - Ardian Kristanto Poernomo, Vivekanand Gopalkrishnan:
CP-summary: a concise representation for browsing frequent itemsets. 687-696 - Ardian Kristanto Poernomo, Vivekanand Gopalkrishnan:
Towards efficient mining of proportional fault-tolerant frequent itemsets. 697-706 - Foster J. Provost, Brian Dalessandro, Rod Hook, Xiaohan Zhang, Alan Murray:
Audience selection for on-line brand advertising: privacy-friendly social network targeting. 707-716 - Zijie Qi, Ian Davidson:
A principled and flexible framework for finding alternative clusterings. 717-726 - Steffen Rendle, Leandro Balby Marinho, Alexandros Nanopoulos, Lars Schmidt-Thieme:
Learning optimal ranking with tensor factorization for tag recommendation. 727-736 - Venu Satuluri, Srinivasan Parthasarathy:
Scalable graph clustering using stochastic flows: applications to community discovery. 737-746 - Jerry Scripps, Pang-Ning Tan, Abdol-Hossein Esfahanian:
Measuring the effects of preprocessing decisions and network forces in dynamic network analysis. 747-756 - Bao-Hong Shen, Shuiwang Ji, Jieping Ye:
Mining discrete patterns via binary matrix factorization. 757-766 - Lei Shi, Vandana Pursnani Janeja:
Anomalous window discovery through scan statistics for linear intersecting paths (SSLIP). 767-776 - Xiaolin Shi, Jun Zhu, Rui Cai, Lei Zhang:
User grouping behavior in online forums. 777-786 - Takashi Shibuya, Tatsuya Harada, Yasuo Kuniyoshi:
Causality quantification and its applications: structuring and modeling of multivariate time series. 787-796 - Yizhou Sun, Yintao Yu, Jiawei Han:
Ranking-based clustering of heterogeneous information networks with star network schema. 797-806 - Jie Tang, Jimeng Sun, Chi Wang, Zi Yang:
Social influence analysis in large-scale networks. 807-816 - Lei Tang, Huan Liu:
Relational learning via latent social dimensions. 817-826 - Chayant Tantipathananandh, Tanya Y. Berger-Wolf:
Constant-factor approximation algorithms for identifying dynamic communities. 827-836 - Charalampos E. Tsourakakis, U Kang, Gary L. Miller, Christos Faloutsos:
DOULION: counting triangles in massive graphs with a coin. 837-846 - Pavan Vatturi, Weng-Keen Wong:
Category detection using hierarchical mean shift. 847-856 - Ting Wang, Mudhakar Srivatsa, Dakshi Agrawal, Ling Liu:
Learning, indexing, and diagnosing network faults. 857-866 - Xuanhui Wang, Deepayan Chakrabarti, Kunal Punera:
Mining broad latent query aspects from search sessions. 867-876 - Junjie Wu, Hui Xiong, Jian Chen:
Adapting the right measures for K-means clustering. 877-886 - Mingxi Wu, Xiuyao Song, Chris Jermaine, Sanjay Ranka, John Gums:
A LRT framework for fast spatial anomaly detection. 887-896 - Jack Chongjie Xue, Gary M. Weiss:
Quantification and semi-supervised classification methods for handling changes in class distribution. 897-906 - Donghui Yan, Ling Huang, Michael I. Jordan:
Fast approximate spectral clustering. 907-916 - Bishan Yang, Jian-Tao Sun, Tengjiao Wang, Zheng Chen:
Effective multi-label active learning for text classification. 917-926 - Tianbao Yang, Rong Jin, Yun Chi, Shenghuo Zhu:
Combining link and content for community detection: a discriminative approach. 927-936 - Limin Yao, David M. Mimno, Andrew McCallum:
Efficient methods for topic model inference on streaming document collections. 937-946 - Lexiang Ye, Eamonn J. Keogh:
Time series shapelets: a new primitive for data mining. 947-956 - Zhijun Yin, Rui Li, Qiaozhu Mei, Jiawei Han:
Exploring social tagging graph for web object classification. 957-966 - Shinjae Yoo, Yiming Yang, Frank Lin, Il-Chul Moon:
Mining social networks for personalized email prioritization. 967-976 - Chang Hun You, Lawrence B. Holder, Diane J. Cook:
Learning patterns in the dynamics of biological networks. 977-986 - Xiangliang Zhang, Cyril Furtlehner, Julien Perez, Cécile Germain-Renaud, Michèle Sebag:
Toward autonomic grids: analyzing the job flow with affinity streaming. 987-996 - Yuzhou Zhang, Jianyong Wang, Yi Wang, Lizhu Zhou:
Parallel community detection on large networks with propinquity dynamics. 997-1006 - Elena Zheleva, Hossam Sharara, Lise Getoor:
Co-evolution of social and affiliation networks. 1007-1016 - Lei Zheng, Shaojun Wang, Yan Liu, Chi-Hoon Lee:
Information theoretic regularization for semi-supervised boosting. 1017-1026 - Erheng Zhong, Wei Fan, Jing Peng, Kun Zhang, Jiangtao Ren, Deepak S. Turaga, Olivier Verscheure:
Cross domain distribution adaptation via kernel mapping. 1027-1036 - Guangyu Zhu, Gilad Mishne:
Mining rich session context to improve web search. 1037-1046 - Jun Zhu, Eric P. Xing, Bo Zhang:
Primal sparse Max-margin Markov networks. 1047-1056 - Qiang Zhu, Xiaoyue Wang, Eamonn J. Keogh, Sang-Hee Lee:
Augmenting the generalized hough transform to enable the mining of petroglyphs. 1057-1066
Industrial track papers
- Josh Attenberg, Sandeep Pandey, Torsten Suel:
Modeling and predicting user behavior in sponsored search. 1067-1076 - Indrajit Bhattacharya, Shantanu Godbole, Ajay Gupta, Ashish Verma, Jeff Achtermann, Kevin English:
Enabling analysts in managed services for CRM analytics. 1077-1086 - Ludmila Cherkasova, Kave Eshghi, Charles B. Morrey III, Joseph A. Tucek, Alistair C. Veitch:
Applying syntactic similarity algorithms for enterprise information management. 1087-1096 - Wei Chu, Seung-Taek Park, Todd Beaupre, Nitin Motgi, Amit Phadke, Seinjuti Chakraborty, Joe Zachariah:
A case study of behavior-driven conjoint analysis on Yahoo!: front page today module. 1097-1104 - Thomas Crook, Brian Frasca, Ron Kohavi, Roger Longbotham:
Seven pitfalls to avoid when running controlled experiments on the web. 1105-1114 - Srivatsava Daruru, Nena M. Marin, Matt Walker, Joydeep Ghosh:
Pervasive parallelism in data mining: dataflow solution to co-clustering large and sparse Netflix data. 1115-1124 - Xiaowen Ding, Bing Liu, Lei Zhang:
Entity discovery and assignment for opinion mining applications. 1125-1134 - Xiaoxi Du, Ruoming Jin, Liang Ding, Victor E. Lee, John H. Thornton Jr.:
Migration motif: a spatial - temporal pattern mining approach for financial markets. 1135-1144 - Ariel Fuxman, Anitha Kannan, Andrew B. Goldberg, Rakesh Agrawal, Panayiotis Tsaparas, John C. Shafer:
Improving classification accuracy using automatically extracted training data. 1145-1154 - Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang, Zhong Su:
Address standardization with latent semantic association. 1155-1164 - Sonal Gupta, Mikhail Bilenko, Matthew Richardson:
Catching the drift: learning broad matches from clickthrough data. 1165-1174 - Mohammad Al Hasan, W. Scott Spangler, Thomas D. Griffin, Alfredo Alba:
COA: finding novel patents through text analysis. 1175-1184 - Shunsuke Hirose, Kenji Yamanishi, Takayuki Nakata, Ryohei Fujimaki:
Network anomaly detection based on Eigen equation compression. 1185-1194 - Wei Jin, Hung Hay Ho, Rohini K. Srihari:
OpinionMiner: a novel machine learning system for web opinion mining and extraction. 1195-1204 - Jongwuk Lee, Seung-won Hwang, Zaiqing Nie, Ji-Rong Wen:
Query result clustering for object-level search. 1205-1214 - Ming Li, M. Benjamin Dias, Ian H. Jarman, Wael El-Deredy, Paulo J. G. Lisboa:
Grocery shopping recommendations based on basket-sensitive random walk. 1215-1224 - Yan Liu, Jayant R. Kalagnanam, Oivind Johnsen:
Learning dynamic temporal graphs for oil-production equipment monitoring system. 1225-1234 - Ping Luo, Fen Lin, Yuhong Xiong, Yong Zhao, Zhongzhi Shi:
Towards combining web classification and web information extraction: a case study. 1235-1244 - Justin Ma, Lawrence K. Saul, Stefan Savage, Geoffrey M. Voelker:
Beyond blacklists: learning to detect malicious web sites from suspicious URLs. 1245-1254 - Adetokunbo Makanju, Nur Zincir-Heywood, Evangelos E. Milios:
Clustering event logs using iterative partitioning. 1255-1264 - Mary McGlohon, Stephen Bay, Markus G. Anderle, David M. Steier, Christos Faloutsos:
SNARE: a link analytic system for graph labeling and risk detection. 1265-1274 - Prem Melville, Wojciech Gryc, Richard D. Lawrence:
Sentiment analysis of blogs by combining lexical knowledge with text classification. 1275-1284 - Noman Mohammed, Benjamin C. M. Fung, Patrick C. K. Hung, Cheuk-kwong Lee:
Anonymizing healthcare data: a case study on the blood transfusion service. 1285-1294 - Kivanc M. Ozonat, Donald Young:
Towards a universal marketplace over the web: statistical multi-label classification of service provider forms with simulated annealing. 1295-1304 - Debprakash Patnaik, Manish Marwah, Ratnesh K. Sharma, Naren Ramakrishnan:
Sustainable operation and management of data center chillers using temporal data mining. 1305-1314 - B. Aditya Prakash, Nicholas Valler, David G. Andersen, Michalis Faloutsos, Christos Faloutsos:
BGP-lens: patterns and anomalies in internet routing updates. 1315-1324 - D. Sculley, Robert G. Malkin, Sugato Basu, Roberto J. Bayardo:
Predicting bounce rates in sponsored search advertisements. 1325-1334 - Liang Sun, Rinkal Patel, Jun Liu, Kewei Chen, Teresa Wu, Jing Li, Eric Reiman, Jieping Ye:
Mining brain region connectivity for alzheimer's disease study via sparse inverse covariance estimation. 1335-1344 - Junfeng Wang, Chun Chen, Can Wang, Jian Pei, Jiajun Bu, Ziyu Guan, Wei Vivian Zhang:
Can we learn a template-independent wrapper for news article extraction from a single training site? 1345-1354 - Kuansan Wang, Toby Walker, Zijian Zheng:
PSkip: estimating relevance ranking quality from web search clickthrough data. 1355-1364 - Gu Xu, Shuang-Hong Yang, Hang Li:
Named entity mining from click-through data using weakly supervised latent dirichlet allocation. 1365-1374 - Jiang-Ming Yang, Rui Cai, Chunsong Wang, Hua Huang, Lei Zhang, Wei-Ying Ma:
Incorporating site-level knowledge for incremental crawling of web forums: a list-wise strategy. 1375-1384 - Yanfang Ye, Tao Li, Qingshan Jiang, Zhixue Han, Li Wan:
Intelligent file scoring system for malware detection from the gray list. 1385-1394 - Bin Zhou, Daxin Jiang, Jian Pei, Hang Li:
OLAP on search logs: an infrastructure supporting data-driven applications in search engines. 1395-1404
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.