default search action
ACM SIGMOD Conference 2015: Melbourne, Victoria, Australia
- Timos K. Sellis, Susan B. Davidson, Zachary G. Ives:
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31 - June 4, 2015. ACM 2015, ISBN 978-1-4503-2758-9
Keynote 1
- Jignesh M. Patel:
From Data to Insights @ Bare Metal Speed. 1
Research Session 1 - Cloud: Parallel Execution
- Ying Yan, Jiaxing Zhang, Bojun Huang, Xuzhan Sun, Jiaqi Mu, Zheng Zhang, Thomas Moscibroda:
Distributed Outlier Detection using Compressive Sensing. 3-16 - Erfan Zamanian, Carsten Binnig, Abdallah Salama:
Locality-aware Partitioning in Parallel Database Systems. 17-30 - Ziqiang Feng, Eric Lo, Ben Kao, Wenjian Xu:
ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout. 31-46 - Alexander Alexandrov, Andreas Kunft, Asterios Katsifodimos, Felix Schüler, Lauritz Thamsen, Odej Kao, Tobias Herb, Volker Markl:
Implicit Parallelism through Deep Language Embedding. 47-61 - Shumo Chu, Magdalena Balazinska, Dan Suciu:
From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System. 63-78
Research Session 2 - Matrix and Array Computations
- Tarek Elgamal, Maysam Yabandeh, Ashraf Aboulnaga, Waleed Mustafa, Mohamed Hefeeda:
sPCA: Scalable Principal Component Analysis for Big Data on Distributed Platforms. 79-91 - Lele Yu, Yingxia Shao, Bin Cui:
Exploiting Matrix Dependency for Efficient Distributed Matrix Computation. 93-105 - Christina Teflioudi, Rainer Gemulla, Olga Mykytiuk:
LEMP: Fast Retrieval of Large Entries in a Matrix Product. 107-122 - Jennie Duggan, Olga Papaemmanouil, Leilani Battle, Michael Stonebraker:
Skew-Aware Join Optimization for Array Databases. 123-135 - Botong Huang, Matthias Boehm, Yuanyuan Tian, Berthold Reinwald, Shirish Tatikonda, Frederick R. Reiss:
Resource Elasticity for Large-Scale Machine Learning. 137-152
Research Session 3 - Security and Access Control
- Kerim Yasin Oktay, Sharad Mehrotra, Vaibhav Khadilkar, Murat Kantarcioglu:
SEMROD: Secure and Efficient MapReduce Over HybriD Clouds. 153-166 - Qian Chen, Haibo Hu, Jianliang Xu:
Authenticated Online Data Integration Services. 167-181 - Isabelle Hang, Florian Kerschbaum, Ernesto Damiani:
ENKI: Access Control for Encrypted Query Processing. 183-196 - Vera Zaychik Moffitt, Julia Stoyanovich, Serge Abiteboul, Gerome Miklau:
Collaborative Access Control in WebdamLog. 197-211 - Prasang Upadhyaya, Magdalena Balazinska, Dan Suciu:
Automatic Enforcement of Data Use Policies with DataLawyer. 213-225
Industry Session 1 - Streaming/Real-Time/Active
- Yanxiang Huang, Bin Cui, Wenyu Zhang, Jie Jiang, Ying Xu:
TencentRec: Real-time Stream Recommendation in Practice. 227-238 - Sanjeev Kulkarni, Nikunj Bhagat, Maosong Fu, Vikas Kedigehalli, Christopher Kellogg, Sailesh Mittal, Jignesh M. Patel, Karthik Ramasamy, Siddarth Taneja:
Twitter Heron: Stream Processing at Scale. 239-250 - Lucas Braun, Thomas Etter, Georgios Gasparis, Martin Kaufmann, Donald Kossmann, Daniel Widmer, Aharon Avitzur, Anthony Iliopoulos, Eliezer Levy, Ning Liang:
Analytics in Motion: High Performance Event-Processing AND Real-Time Analytics in the Same Database. 251-264 - Paul Suganthan G. C., Chong Sun, Krishna Gayatri K., Haojun Zhang, Frank Yang, Narasimhan Rampalli, Shishir Prasad, Esteban Arcaute, Ganesh Krishnan, Rohit Deep, Vijay Raghavendra, AnHai Doan:
Why Big Data Industrial Systems Need Rules and What We Can Do About It. 265-276
Tutorial 1
- Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri:
Overview of Data Exploration Techniques. 277-281
Panel
- Christopher Ré, Divy Agrawal, Magdalena Balazinska, Michael J. Cafarella, Michael I. Jordan, Tim Kraska, Raghu Ramakrishnan:
Machine Learning and Databases: The Sound of Things to Come or a Cacophony of Hype? 283-284
Research Session 4 - Cloud: Fault Tolerance, Reconfiguration
- Abdallah Salama, Carsten Binnig, Tim Kraska, Erfan Zamanian:
Cost-based Fault-tolerance for Parallel Data Processing. 285-297 - Aaron J. Elmore, Vaibhav Arora, Rebecca Taft, Andrew Pavlo, Divyakant Agrawal, Amr El Abbadi:
Squall: Fine-Grained Live Reconfiguration for Partitioned Main Memory Databases. 299-313 - Takeshi Mishima, Yasuhiro Fujiwara:
Madeus: Database Live Migration Middleware under Heavy Workloads for Cloud Environment. 315-329 - Peter Alvaro, Joshua Rosen, Joseph M. Hellerstein:
Lineage-driven Fault Injection. 331-346
Research Session 5 - Keyword Search and Text
- Lisi Chen, Gao Cong:
Diversity-Aware Top-k Publish/Subscribe for Text Stream. 347-362 - Georgios John Fakas, Zhi Cai, Nikos Mamoulis:
Diverse and Proportional Size-l Object Summaries for Keyword Search. 363-375 - Xiaochun Yang, Yaoshu Wang, Bin Wang, Wei Wang:
Local Filtering: Improving the Performance of Approximate Queries on String Collections. 377-392 - Minhao Jiang, Ada Wai-Chee Fu, Raymond Chi-Wing Wong:
Exact Top-k Nearest Keyword Search in Large Networks. 393-404 - Tao Guo, Xin Cao, Gao Cong:
Efficient Algorithms for Answering the m-Closest Keywords Query. 405-418
Research Session 6 - Graph Primitives
- Silu Huang, Ada Wai-Chee Fu, Ruifeng Liu:
Minimum Spanning Trees in Temporal Graphs. 419-430 - Devora Berlowitz, Sara Cohen, Benny Kimelfeld:
Efficient Enumeration of Maximal k-Plexes. 431-444 - Zhiwei Zhang, Jeffrey Xu Yu, Lu Qin, Zechao Shang:
Divide & Conquer: I/O Efficient Depth-First Search. 445-458 - Lijun Chang, Xuemin Lin, Lu Qin, Jeffrey Xu Yu, Wenjie Zhang:
Index-based Optimal Algorithms for Computing Steiner Components with Maximum Connectivity. 459-474
Research Session 7 - Data Mining
- Saket Gurukar, Sayan Ranu, Balaraman Ravindran:
COMMIT: A Scalable Approach to Mining Communication Motifs from Dynamic Networks. 475-489 - Kaustubh Beedkar, Rainer Gemulla:
LASH: Large-Scale Sequence Mining with Hierarchies. 491-503 - Michael Cochez, Hao Mou:
Twister Tries: Approximate Hierarchical Agglomerative Clustering for Average Distance in Linear Time. 505-517 - Junhao Gan, Yufei Tao:
DBSCAN Revisited: Mis-Claim, Un-Fixability, and Approximation. 519-530 - Azade Nazi, Mahashweta Das, Gautam Das:
The TagAdvisor: Luring the Lurkers to Review Web Items. 531-543
Research Session 8 - Uncertainty and Linking
- Liping Peng, Yanlei Diao:
Supporting Data Uncertainty in Array Databases. 545-560 - Simon Razniewski, Flip Korn, Werner Nutt, Divesh Srivastava:
Identifying the Extent of Completeness of Query Answers over Partially Complete Databases. 561-576 - Peng Peng, Raymond Chi-Wing Wong:
k-Hit Query: Top-k Query with Probabilistic Utility Function. 577-592 - Furong Li, Mong-Li Lee, Wynne Hsu, Wang-Chiew Tan:
Linking Temporal Records for Profiling Entities. 593-605
Industry Session 2 - Applications
- Yiqing Huang, Fangzhou Zhu, Mingxuan Yuan, Ke Deng, Yanhua Li, Bing Ni, Wenyuan Dai, Qiang Yang, Jia Zeng:
Telco Churn Prediction with Big Data. 607-618 - Orri Erling, Alex Averbuch, Josep Lluís Larriba-Pey, Hassan Chafi, Andrey Gubichev, Arnau Prat-Pérez, Minh-Duc Pham, Peter A. Boncz:
The LDBC Social Network Benchmark: Interactive Workload. 619-630 - Frank Austin Nothaft, Matt Massie, Timothy Danford, Zhao Zhang, Uri Laserson, Carl Yeksigian, Jey Kottalam, Arun Ahuja, Jeff Hammerbacher, Michael D. Linderman, Michael J. Franklin, Anthony D. Joseph, David A. Patterson:
Rethinking Data-Intensive Science Using Scalable Analytics Systems. 631-646 - Yue Wang, Yingzhong Xu, Yue Liu, Jian Chen, Songlin Hu:
QMapper for Smart Grid: Migrating SQL-based Application to Hive. 647-658
ACM-W Athena Lecturer Award
- Jennifer Widom:
Three Favorite Results. 659
Keynote 2
- Laura M. Haas:
The Power Behind the Throne: Information Integration in the Age of Data-Driven Discovery. 661
Research Session 9 - Transactional Architectures
- Simon Loesing, Markus Pilman, Thomas Etter, Donald Kossmann:
On the Design and Scalability of Distributed Shared-Data Databases. 663-676 - Thomas Neumann, Tobias Mühlbauer, Alfons Kemper:
Fast Serializable Multi-Version Concurrency Control for Main-Memory Database Systems. 677-689 - Hideaki Kimura:
FOEDUS: OLTP Engine for a Thousand Cores and NVRAM. 691-706 - Joy Arulraj, Andrew Pavlo, Subramanya Dulloor:
Let's Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems. 707-722
Research Session 10 - Privacy
- Jun Zhang, Graham Cormode, Cecilia M. Procopiuc, Divesh Srivastava, Xiaokui Xiao:
Private Release of Graph Statistics using Ladder Functions. 731-745 - Bin Yang, Issei Sato, Hiroshi Nakagawa:
Bayesian Differential Privacy on Correlated Data. 747-762 - Charalampos Mavroforakis, Nathan Chenette, Adam O'Neill, George Kollios, Ran Canetti:
Modular Order-Preserving Encryption, Revisited. 763-777 - Tristan Allard, Georges Hébrail, Florent Masseglia, Esther Pacitti:
Chiaroscuro: Transparency and Privacy for Massive Personal Time-Series Clustering. 779-794
Research Session 11 - Streams
- Zhewei Wei, Ge Luo, Ke Yi, Xiaoyong Du, Ji-Rong Wen:
Persistent Data Sketching. 795-810 - Qian Lin, Beng Chin Ooi, Zhengkui Wang, Cui Yu:
Scalable Distributed Stream Join Processing. 811-825 - Shaoxu Song, Aoqian Zhang, Jianmin Wang, Philip S. Yu:
SCREEN: Stream Data Cleaning under Speed Constraints. 827-841 - Long Guo, Dongxiang Zhang, Guoliang Li, Kian-Lee Tan, Zhifeng Bao:
Location-Aware Pub/Sub System: When Continuous Moving Queries Meet Dynamic Event Streams. 843-857
Demo A
- Nick R. Katsipoulakis, Cory Thoma, Eric A. Gratta, Alexandros Labrinidis, Adam J. Lee, Panos K. Chrysanthis:
CE-Storm: Confidential Elastic Processing of Data Streams. 859-864 - Benjamin Dietrich, Torsten Grust:
A SQL Debugger Built from Spare Parts: Turning a SQL: 1999 Database System into Its Own Debugger. 865-870 - Zhifeng Bao, Yong Zeng, H. V. Jagadish, Tok Wang Ling:
Exploratory Keyword Search with Interactive Input. 871-876 - Daniel Scheibli, Christian Dinse, Alexander Boehm:
QE3D: Interactive Visualization and Exploration of Complex, Distributed Query Plans. 877-881 - John Morcos, Ziawasch Abedjan, Ihab Francis Ilyas, Mourad Ouzzani, Paolo Papotti, Michael Stonebraker:
DataXFormer: An Interactive Data Transformation Tool. 883-888 - Yuanzhen Ji, Hongjin Zhou, Zbigniew Jerzak, Anisoara Nica, Gregor Hackenbroich, Christof Fetzer:
Quality-Driven Continuous Query Execution over Out-of-Order Data Streams. 889-894 - Ioannis Mytilinis, Ioannis Giannakopoulos, Ioannis Konstantinou, Katerina Doka, Dimitrios Tsitsigkos, Manolis Terrovitis, Lampros Giampouras, Nectarios Koziris:
MoDisSENSE: A Distributed Spatio-Temporal and Textual Processing Platform for Social Networking Services. 895-900 - Qiang Hu, Qi Liu, Xiaoli Wang, Anthony K. H. Tung, Shubham Goyal, Jisong Yang:
DocRicher: An Automatic Annotation System for Text Documents Using Social Media. 901-906 - Li-Yan Yuan, Lengdong Wu, Jia-Huai You, Yan Chi:
A Demonstration of Rubato DB: A Highly Scalable NewSQL Database System for OLTP and Big Data Applications. 907-912 - Kai Zeng, Sameer Agarwal, Ankur Dave, Michael Armbrust, Ion Stoica:
G-OLA: Generalized On-Line Aggregation for Interactive Analysis on Big Data. 913-918
Tutorial 2
- Yasushi Sakurai, Yasuko Matsubara, Christos Faloutsos:
Mining and Forecasting of Big Time-series Data. 919-922
Research Session 12 - Spatial data
- Xiaoyang Wang, Ying Zhang, Wenjie Zhang, Xuemin Lin, Muhammad Aamir Cheema:
Optimal Spatial Dominance: An Effective Search of Nearest Neighbor Candidates. 923-938 - Farhan Tauheed, Thomas Heinis, Anastasia Ailamaki:
THERMAL-JOIN: A Scalable Spatial Join for Dynamic Workloads. 939-950 - Lu Chen, Yunjun Gao, Xinhan Li, Christian S. Jensen, Gang Chen, Baihua Zheng:
Indexing Metric Uncertain Data for Range Queries. 951-965 - Sibo Wang, Wenqing Lin, Yi Yang, Xiaokui Xiao, Shuigeng Zhou:
Efficient Route Planning on Public Transportation Networks: A Labelling Approach. 967-982
Research Session 13- Crowdsourcing
- Aris Anagnostopoulos, Luca Becchetti, Adriano Fazzone, Ida Mele, Matteo Riondato:
The Importance of Being Expert: Efficient Max-Finding in Crowdsourcing. 983-998 - Nguyen Quoc Viet Hung, Duong Chi Thang, Matthias Weidlich, Karl Aberer:
Minimizing Efforts in Validating Crowd Answers. 999-1014 - Ju Fan, Guoliang Li, Beng Chin Ooi, Kian-Lee Tan, Jianhua Feng:
iCrowd: An Adaptive Crowdsourcing Framework. 1015-1030 - Yudian Zheng, Jiannan Wang, Guoliang Li, Reynold Cheng, Jianhua Feng:
QASCA: A Quality-Aware Task Assignment System for Crowdsourcing Applications. 1031-1046 - Vasilis Verroios, Peter Lofgren, Hector Garcia-Molina:
tDP: An Optimal-Latency Budget Allocation Strategy for Crowdsourced MAXIMUM Operations. 1047-1062
Demo B
- Petrie Wong, Zhian He, Ziqiang Feng, Wenjian Xu, Eric Lo:
Thrifty: Offering Parallel Database as a Service using the Shared-Process Approach. 1063-1068 - Dana Van Aken, Djellel Eddine Difallah, Andrew Pavlo, Carlo Curino, Philippe Cudré-Mauroux:
BenchPress: Dynamic Workload Control in the OLTP-Bench Testbed. 1069-1073 - V. M. Megler, David Maier:
Demonstrating "Data Near Here": Scientific Data Search. 1075-1080 - Jules Chevalier, Julien Subercaze, Christophe Gravier, Frédérique Laforest:
Slider: An Efficient Incremental Reasoner. 1081-1086 - Ashish Vulimiri, Carlo Curino, Philip Brighten Godfrey, Thomas Jungblut, Konstantinos Karanasos, Jitendra Padhye, George Varghese:
WANalytics: Geo-Distributed Analytics for a Data Intensive World. 1087-1092 - Huayu Wu, Jo-Anne Tan, Wee Siong Ng, Mingqiang Xue, Wei Chen:
FTT: A System for Finding and Tracking Tourists in Public Transport Services. 1093-1098 - Haozhou Wang, Kai Zheng, Xiaofang Zhou, Shazia Wasim Sadiq:
SharkDB: An In-Memory Storage System for Massive Trajectory Data. 1099-1104 - Yonathan Perez, Rok Sosic, Arijit Banerjee, Rohan Puttagunta, Martin Raison, Pararth Shah, Jure Leskovec:
Ringo: Interactive Graph Analytics on Big-Memory Machines. 1105-1110 - Robert Christensen, Lu Wang, Feifei Li, Ke Yi, Jun Tang, Natalee Villa:
STORM: Spatio-Temporal Online Reasoning and Management of Large Spatio-Temporal Data. 1111-1116 - Jesús Camacho-Rodríguez, Dario Colazzo, Ioana Manolescu, Juan Álvaro Muñoz Naranjo:
PAXQuery: Parallel Analytical XML Processing. 1117-1122
Research Session 14 - Indexing & Performance
- Ingo Müller, Peter Sanders, Arnaud Lacurie, Wolfgang Lehner, Franz Färber:
Cache-Efficient Aggregation: Hashing Is Sorting. 1123-1136 - Guoliang Li, Jian He, Dong Deng, Jian Li:
Efficient Similarity Join and Search on Multi-Attribute Data. 1137-1151 - Eleni Petraki, Stratos Idreos, Stefan Manegold:
Holistic Indexing in Main-memory Column-stores. 1153-1166 - Barzan Mozafari, Eugene Zhen Ye Goh, Dong Young Yoon:
CliffGuard: A Principled Framework for Finding Robust Database Designs. 1167-1182 - Manas Joglekar, Hector Garcia-Molina, Aditya G. Parameswaran, Christopher Ré:
Exploiting Correlations for Expensive Predicate Evaluation. 1183-1198
Research Session 15 - Data Cleaning
- Moria Bergman, Tova Milo, Slava Novgorodov, Wang Chiew Tan:
Query-Oriented Data Cleaning with Oracles. 1199-1214 - Zuhair Khayyat, Ihab F. Ilyas, Alekh Jindal, Samuel Madden, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Si Yin:
BigDansing: A System for Big Data Cleansing. 1215-1230 - Xiaolan Wang, Xin Luna Dong, Alexandra Meliou:
Data X-Ray: A Diagnostic Tool for Data Errors. 1231-1245 - Xu Chu, John Morcos, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Nan Tang, Yin Ye:
KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing. 1247-1261 - Sibo Wang, Xiaokui Xiao, Chun-Hee Lee:
Crowd-Based Deduplication: An Adaptive Approach. 1263-1277
Research Session 16- Transactions
- Faisal Nawab, Vaibhav Arora, Divyakant Agrawal, Amr El Abbadi:
Minimizing Commit Latency of Transactions in Geo-Replicated Data Stores. 1279-1294 - Philip A. Bernstein, Sudipto Das, Bailu Ding, Markus Pilman:
Optimizing Optimistic Concurrency Control for Tree-Structured, Log-Structured Databases. 1295-1309 - Sudip Roy, Lucja Kot, Gabriel Bender, Bailu Ding, Hossein Hojjat, Christoph Koch, Nate Foster, Johannes Gehrke:
The Homeostasis Protocol: Avoiding Transaction Coordination Through Program Analysis. 1311-1326 - Peter Bailis, Alan D. Fekete, Michael J. Franklin, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica:
Feral Concurrency Control: An Empirical Investigation of Modern Application Integrity. 1327-1342
Industry Session 3 - Novel Systems
- Markus Weimer, Yingda Chen, Byung-Gon Chun, Tyson Condie, Carlo Curino, Chris Douglas, Yunseong Lee, Tony Majestro, Dahlia Malkhi, Sergiy Matusevych, Brandon Myers, Shravan M. Narayanamurthy, Raghu Ramakrishnan, Sriram Rao, Russell Sears, Beysim Sezgin, Julia Wang:
REEF: Retainable Evaluator Execution Framework. 1343-1355 - Bikas Saha, Hitesh Shah, Siddharth Seth, Gopal Vijayaraghavan, Arun C. Murthy, Carlo Curino:
Apache Tez: A Unifying Framework for Modeling and Building Data Processing Applications. 1357-1369 - Molham Aref, Balder ten Cate, Todd J. Green, Benny Kimelfeld, Dan Olteanu, Emir Pasalic, Todd L. Veldhuizen, Geoffrey Washburn:
Design and Implementation of the LogicBlox System. 1371-1382 - Michael Armbrust, Reynold S. Xin, Cheng Lian, Yin Huai, Davies Liu, Joseph K. Bradley, Xiangrui Meng, Tomer Kaftan, Michael J. Franklin, Ali Ghodsi, Matei Zaharia:
Spark SQL: Relational Data Processing in Spark. 1383-1394
Demo C
- Semih Salihoglu, Jaeho Shin, Vikesh Khanna, Ba Quan Truong, Jennifer Widom:
Graft: A Debugging Tool For Apache Giraph. 1403-1408 - Dongqing Xiao, Armir Bashllari, Tyler Menard, Mohamed Y. Eltabakh:
Even Metadata is Getting Big: Annotation Summarization using InsightNotes. 1409-1414 - Anja Gruenheid, Donald Kossmann, Theodoros Rekatsinas, Divesh Srivastava:
StoryPivot: Comparing and Contrasting Story Evolution. 1415-1420 - Alexander Ulrich, Torsten Grust:
The Flatter, the Better: Query Compilation Based on the Flattening Transformation. 1421-1426 - Martin Jergler, Mohammad Sadoghi, Hans-Arno Jacobsen:
D2WORM: A Management Infrastructure for Distributed Data-centric Workflows. 1427-1432 - Yael Amsterdamer, Anna Kukliansky, Tova Milo:
NL2CM: A Natural Language Interface to Crowd Mining. 1433-1438 - Sergey Dudoladov, Chen Xu, Sebastian Schelter, Asterios Katsifodimos, Stephan Ewen, Kostas Tzoumas, Volker Markl:
Optimistic Recovery for Iterative Dataflows in Action. 1439-1443 - Saliha Lallali, Nicolas Anciaux, Iulian Sandu Popa, Philippe Pucheral:
A Secure Search Engine for the Personal Cloud. 1445-1450 - Katerina Doka, Nikolaos Papailiou, Dimitrios Tsoumakos, Christos Mantas, Nectarios Koziris:
IReS: Intelligent, Multi-Engine Resource Scheduler for Big Data Analytics Workflows. 1451-1456 - Tilmann Rabl, Manuel Danisch, Michael Frank, Sebastian Schindler, Hans-Arno Jacobsen:
Just can't get enough: Synthesizing Big Data. 1457-1462
Research Session 17 - Hardware-Aware Query Processing
- Claude Barthels, Simon Loesing, Gustavo Alonso, Donald Kossmann:
Rack-Scale In-Memory Join Processing using RDMA. 1463-1475 - Max Heimel, Martin Kiefer, Volker Markl:
Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation. 1477-1492 - Orestis Polychroniou, Arun Raghavan, Kenneth A. Ross:
Rethinking SIMD Vectorization for In-Memory Databases. 1493-1508 - Yinan Li, Craig Chasseur, Jignesh M. Patel:
A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew. 1509-1524
Research Session 18 - Graph Propagation, Influence, Mining
- Hui Li, Sourav S. Bhowmick, Jiangtao Cui, Yunjun Gao, Jianfeng Ma:
GetReal: Towards Realistic Selection of Influence Maximization Strategies in Competitive Networks. 1525-1537 - Youze Tang, Yanchen Shi, Xiaokui Xiao:
Influence Maximization in Near-Linear Time: A Martingale Approach. 1539-1554 - Zhiting Hu, Junjie Yao, Bin Cui, Eric P. Xing:
Community Level Diffusion Extraction. 1555-1569 - Kijung Shin, Jinhong Jung, Lee Sael, U Kang:
BEAR: Block Elimination Approach for Random Walk with Restart on Large Graphs. 1571-1585 - Natali Ruchansky, Francesco Bonchi, David García-Soriano, Francesco Gullo, Nicolas Kourtellis:
The Minimum Wiener Connector Problem. 1587-1602
Research Session 19 - Social Networks
- Senjuti Basu Roy, Laks V. S. Lakshmanan, Rui Liu:
From Group Recommendations to Group Formation. 1603-1616 - Nikos Armenatzoglou, Huy Pham, Vasilis Ntranos, Dimitris Papadias, Cyrus Shahabi:
Real-Time Multi-Criteria Social Graph Partitioning: A Game Theoretic Approach. 1617-1628 - Jieying She, Yongxin Tong, Lei Chen:
Utility-Aware Social Event-Participant Planning. 1629-1643 - Xiangmin Zhou, Lei Chen, Yanchun Zhang, Longbing Cao, Guangyan Huang, Chen Wang:
Online Video Recommendation in Sharing Community. 1645-1656
Industry Session 4 - Performance
- Shreya Prasad, Arash Fard, Vishrut Gupta, Jorge Martinez, Jeff LeFevre, Vincent Xu, Meichun Hsu, Indrajit Roy:
Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction. 1657-1668 - Quoc Trung Tran, Konstantinos Morfonios, Neoklis Polyzotis:
Oracle Workload Intelligence. 1669-1681 - John Colgrove, John D. Davis, John Hayes, Ethan L. Miller, Cary Sandvig, Russell Sears, Ari Tamches, Neil Vachharajani, Feng Wang:
Purity: Building Fast, Highly-Available Enterprise Flash Storage from Commodity Components. 1683-1694 - Pawel Terlecki, Fei Xu, Marianne Shaw, Valeri Kim, Richard Michael Grantham Wesley:
On Improving User Response Times in Tableau. 1695-1706
Tutorial 3
- Stratis D. Viglas:
Data Management in Non-Volatile Memory. 1707-1711
Research Session 20 - Information Extraction and Record Linking
- Xu Chu, Yeye He, Kaushik Chakrabarti, Kris Ganjam:
TEGRA: Table Extraction by Global Record Alignment. 1713-1728 - Jialu Liu, Jingbo Shang, Chi Wang, Xiang Ren, Jiawei Han:
Mining Quality Phrases from Massive Text Corpora. 1729-1744 - Immanuel Trummer, Alon Y. Halevy, Hongrae Lee, Sunita Sarawagi, Rahul Gupta:
Mining Subjective Properties on the Web. 1745-1760 - Wen Hua, Kai Zheng, Xiaofang Zhou:
Microblog Entity Linking with Social Temporal Context. 1761-1775
Research Session 21 - RDF and SPARQL
- Nikolaos Papailiou, Dimitrios Tsoumakos, Panagiotis Karras, Nectarios Koziris:
Graph-Aware, Workload-Adaptive SPARQL Query Caching. 1777-1792 - Medha Atre:
Left Bit Right: For SPARQL Join Queries with OPTIONAL Patterns (Left-outer-joins). 1793-1808 - Weiguo Zheng, Lei Zou, Xiang Lian, Jeffrey Xu Yu, Shaoxu Song, Dongyan Zhao:
How to Build Templates for RDF Question/Answering: An Uncertain Graph Similarity Join Approach. 1809-1824 - Shi Qiao, Z. Meral Özsoyoglu:
RBench: Application-Specific RDF Benchmarking. 1825-1838 - Ahmed El-Roby, Ashraf Aboulnaga:
ALEX: Automatic Link Exploration in Linked Data. 1839-1853
Research Session 22 - Time Series & Graph Processing
- John Paparrizos, Luis Gravano:
k-Shape: Efficient and Accurate Clustering of Time Series. 1855-1870 - Jingbo Zhou, Anthony K. H. Tung:
SMiLer: A Semi-Lazy Time Series Prediction System for Sensors. 1871-1886 - Wen Sun, Achille Fokoue, Kavitha Srinivas, Anastasios Kementsietsidis, Gang Hu, Guo Tong Xie:
SQLGraph: An Efficient Relational-Based Property Graph Store. 1887-1901 - Dayu Yuan, Prasenjit Mitra, Huiwen Yu, C. Lee Giles:
Updating Graph Indices with a One-Pass Algorithm. 1903-1916
Industry Session 5 - Usability
- Anurag Gupta, Deepak Agarwal, Derek Tan, Jakub Kulesza, Rahul Pathak, Stefano Stefani, Vidhya Srinivasan:
Amazon Redshift and the Case for Simpler Data Warehouses. 1917-1923 - Mukund Deshpande, Dhruva Ray, Sameer Dixit, Avadhoot Agasti:
ShareInsights: An Unified Approach to Full-stack Data Processing. 1925-1940
Research Session 23 - Advanced Query Processing
- Immanuel Trummer, Christoph Koch:
An Incremental Anytime Algorithm for Multi-Objective Query Optimization. 1941-1953 - Niccolò Meneghetti, Denis Mindolin, Paolo Ciaccia, Jan Chomicki:
Output-sensitive Evaluation of Prioritized Skyline Queries. 1955-1967 - Arun Kumar, Jeffrey F. Naughton, Jignesh M. Patel:
Learning Generalized Linear Models Over Normalized Data. 1969-1984 - Yannis Katsis, Kian Win Ong, Yannis Papakonstantinou, Kevin Keliang Zhao:
Utilizing IDs to Accelerate Incremental View Maintenance. 1985-2000
Research Session 24 - New Models
- Fotis Psallidas, Bolin Ding, Kaushik Chakrabarti, Surajit Chaudhuri:
S4: Top-k Spreadsheet-Style Search for Query Discovery. 2001-2016 - Karim Ibrahim, Xiao Du, Mohamed Y. Eltabakh:
Proactive Annotation Management in Relational Databases. 2017-2030 - Ngai Meng Kou, Leong Hou U, Nikos Mamoulis, Zhiguo Gong:
Weighted Coverage based Reviewer Assignment. 2031-2046 - Mingwang Tang, Feifei Li, Yufei Tao:
Distributed Online Tracking. 2047-2061
Tutorial 4
- Xin Luna Dong, Divesh Srivastava:
Knowledge Curation and Knowledge Fusion: Challenges, Models and Applications. 2063-2066
Undergraduate Abstracts
- Mansheng Yang, Richard T. B. Ma:
Smooth Task Migration in Apache Storm. 2067-2068 - Oreoluwatomiwa O. Babarinsa, Stratos Idreos:
JAFAR: Near-Data Processing for Databases. 2069-2070 - Trevor Clinkenbeard, Anisoara Nica:
Job Scheduling with Minimizing Data Communication Costs. 2071-2072 - Styliani Pantela, Stratos Idreos:
One Loop Does Not Fit All. 2073-2074 - Adam Perelman, Christopher Ré:
DunceCap: Compiling Worst-Case Optimal Query Plans. 2075-2076 - Susan Tu, Christopher Ré:
DunceCap: Query Plans Using Generalized Hypertree Decompositions. 2077-2078
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.