default search action
ACM SIGMOD Conference 2022: Philadelphia, PA, USA
- Zachary G. Ives, Angela Bonifati, Amr El Abbadi:
SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12 - 17, 2022. ACM 2022, ISBN 978-1-4503-9249-5
Keynote Talks
- Barbara Liskov:
Reflections on a Career in Computer Science. 1 - Laks V. S. Lakshmanan:
On a Quest for Combating Filter Bubbles and Misinformation. 2 - Christopher Ré:
Is Data Management the Beating Heart of AI Systems? 3
Session 1: Transaction Processing
- Chuzhe Tang, Zhaoguo Wang, Xiaodong Zhang, Qianmian Yu, Binyu Zang, Haibing Guan, Haibo Chen:
Ad Hoc Transactions in Web Applications: The Good, the Bad, and the Ugly. 4-18 - Youmin Chen, Xiangyao Yu, Paraschos Koutris, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, Jiwu Shu:
Plor: General Transactions with Predictable, Low Tail Latency. 19-33 - Jianqiu Zhang, Kaisong Huang, Tianzheng Wang, King Lv:
Skeena: Efficient and Consistent Cross-Engine Transactions. 34-48 - Jong-Bin Kim, Jaeseon Yu, Jaechan Ahn, Sooyong Kang, Hyungsoo Jung:
Diva: Making MVCC Systems HTAP-Friendly. 49-64 - Yijian Liu, Li Su, Vivek Shah, Yongluan Zhou, Marcos Antonio Vaz Salles:
Hybrid Deterministic and Nondeterministic Execution of Transactions in Actor Systems. 65-78
Session 2: Query Processing and Optimization 1
- Yisu Remy Wang, Mahmoud Abo Khamis, Hung Q. Ngo, Reinhard Pichler, Dan Suciu:
Optimizing Recursive Queries with Progam Synthesis. 79-93 - Zhaoguo Wang, Zhou Zhou, Yicun Yang, Haoran Ding, Gansen Hu, Ding Ding, Chuzhe Tang, Haibo Chen, Jinyang Li:
WeTune: Automatic Discovery and Verification of Query Rewrite Rules. 94-107 - Qichen Wang, Ke Yi:
Conjunctive Queries with Comparisons. 108-121 - Riccardo Mancini, Srinivas Karthik, Bikash Chandra, Vasilis Mageirakos, Anastasia Ailamaki:
Efficient Massively Parallel Join Optimization for Large Queries. 122-135 - Supun Abeysinghe, Qiyang He, Tiark Rompf:
Efficient Incrementialization of Correlated Nested Aggregate Queries using Relative Partial Aggregate Indexes (RPAI). 136-149
Session 3: ML for Data Management 1
- Barrie Kersbergen, Olivier Sprangers, Sebastian Schelter:
Serenade - Low-Latency Session-Based Recommendation in e-Commerce at Scale. 150-159 - Hanchen Wang, Rong Hu, Ying Zhang, Lu Qin, Wei Wang, Wenjie Zhang:
Neural Subgraph Counting with Wasserstein Estimator. 160-175 - Justin Talbot, Daniel Ting:
Statistical Schema Learning with Occam's Razor. 176-189 - Immanuel Trummer:
DB-BERT: A Database Tuning Tool that "Reads the Manual". 190-203 - Xiu Tang, Sai Wu, Mingli Song, Shanshan Ying, Feifei Li, Gang Chen:
PreQR: Pre-training Representation for SQL Understanding. 204-216
Session 4: Responsible Data Management and Fairness
- Sainyam Galhotra, Anna Fariha, Raoni Lourenço, Juliana Freire, Alexandra Meliou, Divesh Srivastava:
DataPrism: Exposing Disconnect between Data and Systems. 217-231 - Maliha Tashfia Islam, Anna Fariha, Alexandra Meliou, Babak Salimi:
Through the Data Management Lens: Experimental Analysis and Evaluation of Fair Classification. 232-246 - Romila Pradhan, Jiongli Zhu, Boris Glavic, Babak Salimi:
Interpretable Data-Based Explanations for Fairness Debugging. 247-261 - Dong Wei, Md Mouinul Islam, Baruch Schieber, Senjuti Basu Roy:
Rank Aggregation with Proportionate Fairness. 262-275 - Sainyam Galhotra, Karthikeyan Shanmugam, Prasanna Sattigeri, Kush R. Varshney:
Causal Feature Selection for Algorithmic Fairness. 276-285
Session 5: Streaming and Sensor Networks 1
- Yunlong Xu, Jinshu Liu, Fatemeh Nargesian:
TSUBASA: Climate Network Construction on Historical and Real-Time Data. 286-295 - Bogyeong Kim, Kyoseung Koo, Undraa Enkhbat, Bongki Moon:
DenForest: Enabling Fast Deletion in Incremental Density-Based Clustering over Sliding Windows. 296-309 - Hadar Sivan, Moshe Gabel, Assaf Schuster:
AutoMon: Automatic Distributed Monitoring for Arbitrary Multivariate Functions. 310-324 - David Tench, Evan West, Victor Zhang, Michael A. Bender, Abiyaz Chowdhury, J. Ahmed Dellas, Martin Farach-Colton, Tyler Seip, Kenny Zhang:
GraphZeppelin: Storage-Friendly Sketching for Connected Components on Dynamic Graph Streams. 325-339 - Adar Amir, Ilya Kolchinsky, Assaf Schuster:
DLACEP: A Deep-Learning Based Framework for Approximate Complex Event Processing. 340-354
Session 6: Data Cleaning and Integration
- Amir Gilad, Zhengjie Miao, Sudeepa Roy, Jun Yang:
Understanding Queries by Conditional Instances. 355-368 - Lampros Flokas, Weiyuan Wu, Yejia Liu, Jiannan Wang, Nakul Verma, Eugene Wu:
Complaint-Driven Training Data Debugging at Interactive Speeds. 369-383 - Wenfei Fan, Ziyan Han, Yaoshu Wang, Min Xie:
Parallel Rule Discovery from Large Datasets by Sampling. 384-398 - Zezhou Huang, Eugene Wu:
Reptile: Aggregation-level Explanations for Hierarchical Data. 399-413 - Sainyam Galhotra, Donatella Firmani, Barna Saha, Divesh Srivastava:
Hierarchical Entity Resolution using an Oracle. 414-428 - Dezhong Yao, Yuhong Gu, Gao Cong, Hai Jin, Xinqiao Lv:
Entity Resolution with Hierarchical Graph Attention Networks. 429-442 - Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, Chengliang Chai, Guoliang Li, Ruixue Fan, Xiaoyong Du:
Domain Adaptation for Deep Entity Resolution. 443-457
Session 7: Data Management for ML 1
- Pei-Yu Hou, Daniel Robert Korn, Cleber C. Melo-Filho, David R. Wright, Alexander Tropsha, Rada Chirkova:
Compact Walks: Taming Knowledge-Graph Embeddings with Domain- and Task-Specific Pathways. 458-469 - Xupeng Miao, Yining Shi, Hailin Zhang, Xin Zhang, Xiaonan Nie, Zhi Yang, Bin Cui:
HET-GMP: A Graph-based System Approach to Scaling Large Embedding Model Training. 470-480 - Alexander Renz-Wieland, Rainer Gemulla, Zoi Kaoudi, Volker Markl:
NuPS: A Parameter Server for Machine Learning with Non-Uniform Parameter Access. 481-495 - Daniel Kang, Nikos Aréchiga, Sudeep Pillai, Peter D. Bailis, Matei Zaharia:
Finding Label and Model Errors in Perception Data With Learned Observation Assertions. 496-505 - Supun Nakandala, Arun Kumar:
Nautilus: An Optimized System for Deep Transfer Learning over Evolving Training Datasets. 506-520 - Chaoji Zuo, Sepehr Assadi, Dong Deng:
Spine: Scaling up Programming-by-Negative-Example for String Filtering and Transformation. 521-530 - Jinglin Peng, Bolin Ding, Jiannan Wang, Kai Zeng, Jingren Zhou:
One Size Does Not Fit All: A Bandit-Based Sampler Combination Framework with Theoretical Guarantees. 531-544
Session 8: Query Processing and Data Management for ML
- Pramod Chunduri, Jaeho Bang, Yao Lu, Joy Arulraj:
Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning. 545-558 - Jiashen Cao, Karan Sarkar, Ramyad Hadidi, Joy Arulraj, Hyesoon Kim:
FiGO: Fine-Grained Query Optimization in Video Analytics. 559-572 - Zihao Chen, Baokun Han, Chen Xu, Weining Qian, Aoying Zhou:
Redundancy Elimination in Distributed Matrix Computation. 573-586 - Kwanghyun Park, Karla Saur, Dalitso Banda, Rathijit Sen, Matteo Interlandi, Konstantinos Karanasos:
End-to-end Optimization of Machine Learning Prediction Queries. 587-601 - Zhuangdi Xu, Gaurav Tarlok Kakkar, Joy Arulraj, Umakishore Ramachandran:
EVA: A Symbolic Approach to Accelerating Exploratory Video Analytics with Materialized Views. 602-616
Session 9: Database Monitoring and Tuning
- Matthew Butrovich, Wan Shen Lim, Lin Ma, John Rollinson, William Zhang, Yu Xia, Andrew Pavlo:
Tastes Great! Less Filling! High Performance and Accurate Training Data Collection for Self-Driving Database Management Systems. 617-630 - Xinyi Zhang, Hong Wu, Yang Li, Jian Tan, Feifei Li, Bin Cui:
Towards Dynamic and Safe Configuration Tuning for Cloud Databases. 631-645 - Baoqing Cai, Yu Liu, Ce Zhang, Guangyu Zhang, Ke Zhou, Li Liu, Chunhua Li, Bin Cheng, Jie Yang, Jiashu Xing:
HUNTER: An Online Cloud Database Hybrid Tuning System for Personalized Requirements. 646-659 - Tarique Siddiqui, Saehan Jo, Wentao Wu, Chi Wang, Vivek R. Narasayya, Surajit Chaudhuri:
ISUM: Efficiently Compressing Large and Complex Workloads for Scalable Index Tuning. 660-673 - Jinhan Xin, Kai Hwang, Zhibin Yu:
LOCAT: Low-Overhead Online Configuration Auto-Tuning of Spark SQL Applications. 674-684
Session 10: Distributed and Parallel Databases
- Tobias Ziegler, Carsten Binnig, Viktor Leis:
ScaleStore: A Fast and Cost-Efficient Storage Engine using DRAM, NVMe, and RDMA. 685-699 - Michael Abebe, Horatiu Lazu, Khuzaima Daudjee:
Proteus: Autonomous Adaptive Storage for Mixed Workloads. 700-714 - Linguan Yang, Xinan Yan, Bernard Wong:
Natto: Providing Distributed Transaction Prioritization for High-Contention Workloads. 715-729 - Yu Sun, Zheng Zheng, Shaoxu Song, Fei Chiang:
Confidence Bounded Replica Currency Estimation. 730-743 - Yikai Zhao, Yinda Zhang, Yuanpeng Li, Yi Zhou, Chunhui Chen, Tong Yang, Bin Cui:
MinMax Sampling: A Near-optimal Global Summary for Aggregation in the Wide Area. 744-758
Session 11: Database Security, Privacy and Control
- Wei Dong, Juanru Fang, Ke Yi, Yuchao Tao, Ashwin Machanavajjhala:
R2T: Instance-optimal Truncation for Differentially Private Query Evaluation with Foreign Keys. 759-772 - Seng Pei Liew, Tsubasa Takahashi, Shun Takagi, Fumiyuki Kato, Yang Cao, Masatoshi Yoshikawa:
Network Shuffling: Privacy Amplification via Random Walks. 773-787 - Sainan Li, Qilei Yin, Guoliang Li, Qi Li, Zhuotao Liu, Jinwei Zhu:
Unsupervised Contextual Anomaly Detection for Database Systems. 788-802 - Zhao Chang, Dong Xie, Sheng Wang, Feifei Li:
Towards Practical Oblivious Join. 803-817 - Chenghong Wang, Johes Bater, Kartik Nayak, Ashwin Machanavajjhala:
IncShrink: Architecting Efficient Outsourced Databases using Incremental MPC and Differential Privacy. 818-832
Session 12: Graph Data Management and Mining
- Chenhao Ma, Yixiang Fang, Reynold Cheng, Laks V. S. Lakshmanan, Xiaolin Han:
A Convex-Programming Approach for Efficient Directed Densest Subgraph Discovery. 845-859 - Kaiqiang Yu, Cheng Long, Shengxin Liu, Da Yan:
Efficient Algorithms for Maximal k-Biplex Enumeration. 860-873 - Yahui Sun, Shuai Ma, Bin Cui:
Hunting Temporal Bumps in Graphs with Dynamic Vertex Properties. 874-888 - Junghoon Kim, Siqiang Luo, Gao Cong, Wenyuan Yu:
DMCS : Density Modularity based Community Search. 889-903 - Wentao Li, Miao Qiao, Lu Qin, Lijun Chang, Ying Zhang, Xuemin Lin:
On Scalable Computation of Graph Eccentricities. 904-916
Session 13: ML for Data Management and Query Processing
- Qiyu Liu, Yanyan Shen, Lei Chen:
HAP: An Efficient Hamming Space Index Based on Augmented Pigeonhole Principle. 917-930 - Zongheng Yang, Wei-Lin Chiang, Sifei Luan, Gautam Mittal, Michael Luo, Ion Stoica:
Balsa: Learning a Query Optimizer Without Expert Demonstrations. 931-944 - Lixi Zhang, Chengliang Chai, Xuanhe Zhou, Guoliang Li:
LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning. 945-958 - Xiao Hu, Yuxi Liu, Haibo Xiu, Pankaj K. Agarwal, Debmalya Panigrahi, Sudeepa Roy, Jun Yang:
Selectivity Functions of Range Queries are Learnable. 959-972 - Kangfei Zhao, Jeffrey Xu Yu, Zongyan He, Rui Li, Hao Zhang:
Lightweight and Accurate Cardinality Estimation by Neural Network Gaussian Process. 973-987
Session 14: Modern Hardware and In-memory DBMS
- Sangjin Lee, Alberto Lerner, André Ryser, Kibin Park, Chanyoung Jeon, Jinsub Park, Yong Ho Song, Philippe Cudré-Mauroux:
X-SSD: A Storage System with Native Support for Database Logging and Replication. 988-1002 - Nils Boeschen, Carsten Binnig:
GaccO - A GPU-accelerated OLTP DBMS. 1003-1016 - Clemens Lutz, Sebastian Breß, Steffen Zeuch, Tilmann Rabl, Volker Markl:
Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects. 1017-1032 - Qing Wang, Youyou Lu, Jiwu Shu:
Sherman: A Write-Optimized Distributed B+Tree Index on Disaggregated Memory. 1033-1048 - Daokun Hu, Zhiwen Chen, Wenkui Che, Jianhua Sun, Hao Chen:
Halo: A Hybrid PMem-DRAM Persistent Hash Index with Fast Recovery. 1049-1063
Session 15: Streaming and Sensor Networks 2
- Xuebin Ren, Liang Shi, Weiren Yu, Shusen Yang, Cong Zhao, Zongben Xu:
LDP-IDS: Local Differential Privacy for Infinite Data Streams. 1064-1077 - Bonaventura Del Monte, Steffen Zeuch, Tilmann Rabl, Volker Markl:
Rethinking Stateful Stream Processing with RDMA. 1078-1092 - Maor Yankovitch, Ilya Kolchinsky, Assaf Schuster:
HYPERSONIC: A Hybrid Parallelization Approach for Scalable Complex Event Processing. 1093-1107 - Zhuo Zhang, Junhao Gan, Zhifeng Bao, Seyed Mohammad Hussein Kazemi, Guangyong Chen, Fengyuan Zhu:
Approximate Range Thresholding. 1108-1121 - Lei Ma, Chuan Lei, Olga Poppe, Elke A. Rundensteiner:
Gloria: Graph-based Sharing Optimizer for Event Trend Aggregation. 1122-1135
Session 16: Knowledge Discovery and Data Mining
- Martino Ciaperoni, Aristides Gionis, Athanasios Katsamanis, Panagiotis Karras:
SIEVE: A Space-Efficient Algorithm for Viterbi Decoding. 1136-1145 - Zhizhi Wang, Chaoji Zuo, Dong Deng:
TxtAlign: Efficient Near-Duplicate Text Alignment Search via Bottom-k Sketches for Plagiarism Detection. 1146-1159 - Shay Gershtein, Tova Milo, Slava Novgorodov, Kathy Razmadze:
Classifier Construction Under Budget Constraints. 1160-1174 - Paul Boniol, Mohammed Meftah, Emmanuel Remy, Themis Palpanas:
dCAM: Dimension-wise Class Activation Map for Explaining Multivariate Data Series Classification. 1175-1189 - Dmitrii Babaev, Nikita Ovsov, Ivan Kireev, Mariya Ivanova, Gleb Gusev, Ivan Nazarov, Alexander Tuzhilin:
CoLES: Contrastive Learning for Event Sequences with Self-Supervision. 1190-1199
Session 17: Query Processing and Optimization 2
- Yizhou Dai, Miao Qiao, Lijun Chang:
Anchored Densest Subgraph. 1200-1213 - Kyoungmin Kim, Jisung Jung, In Seo, Wook-Shin Han, Kangwoo Choi, Jaehyok Chong:
Learned Cardinality Estimation: An In-depth Study. 1214-1227 - Ibrahim Sabek, Tenzin Samten Ukyab, Tim Kraska:
LSched: A Workload-Aware Learned Query Scheduler for Analytical Database Systems. 1228-1242 - Adrian Vogelsgesang, Thomas Neumann, Viktor Leis, Alfons Kemper:
Efficient Evaluation of Arbitrarily-Framed Holistic SQL Aggregates and Window Functions. 1243-1256 - George Christodoulou, Panagiotis Bouros, Nikos Mamoulis:
HINT: A Hierarchical Index for Intervals in Main Memory. 1257-1270
Session 18: Data Management for ML 2
- Yiming Li, Yanyan Shen, Lei Chen:
Camel: Managing Data for Efficient Stream Learning. 1271-1285 - Lijie Xu, Shuang Qiu, Binhang Yuan, Jiawei Jiang, Cédric Renggli, Shaoduo Gan, Kaan Kara, Guoliang Li, Ji Liu, Wentao Wu, Jieping Ye, Ce Zhang:
In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle. 1286-1300 - Qiange Wang, Yanfeng Zhang, Hao Wang, Chaoyi Chen, Xiaodong Zhang, Ge Yu:
NeutronStar: Distributed GNN Training with Hybrid Dependency Management. 1301-1315 - Fangcheng Fu, Huanran Xue, Yong Cheng, Yangyu Tao, Bin Cui:
BlindFL: Vertical Federated Machine Learning without Peeking into Your Data. 1316-1330 - Evgenios M. Kornaropoulos, Silei Ren, Roberto Tamassia:
The Price of Tailoring the Index to Your Data: Poisoning Attacks on Learned Index Structures. 1331-1344
Session 19: Databases for Emerging Hardware
- Qizhen Zhang, Xinyi Chen, Sidharth Sankhe, Zhilei Zheng, Ke Zhong, Sebastian Angel, Ang Chen, Vincent Liu, Boon Thau Loo:
Optimizing Data-intensive Systems in Disaggregated Data Centers with TELEPORT. 1345-1359 - Yu-Ching Hu, Yuliang Li, Hung-Wei Tseng:
TCUDB: Accelerating Database with Tensor Processors. 1360-1374 - Matthias Jasny, Lasse Thostrup, Tobias Ziegler, Carsten Binnig:
P4DB - The Case for In-Network OLTP. 1375-1389 - Anil Shanbhag, Bobbi W. Yogatama, Xiangyao Yu, Samuel Madden:
Tile-based Lightweight Integer Compression in GPU. 1390-1403 - Mijin An, In-Yeong Song, Yong Ho Song, Sang-Won Lee:
Avoiding Read Stalls on Flash Storage. 1404-1417
Session 20: Database Security and Distributed Data Management
- Zhiqi Wang, Zili Shao:
TimeUnion: An Efficient Architecture with Unified Data Model for Timeseries Management Systems on Hybrid Cloud Storage. 1418-1432 - Jiacheng Wu, Jin Wang, Carlo Zaniolo:
Optimizing Parallel Recursive Datalog Evaluation on Multicore Machines. 1433-1446 - Hao Zhang, Jeffrey Xu Yu, Yikai Zhang, Kangfei Zhao:
Parallel Query Processing: To Separate Communication from Computation. 1447-1461 - Harshavardhan Unnibhavi, David Cerdeira, Antonio Barbalace, Nuno Santos, Pramod Bhatotia:
Secure and Policy-Compliant Query Processing on Heterogeneous Computational Storage Architectures. 1462-1477 - Yu Xia, Xiangyao Yu, Matthew Butrovich, Andrew Pavlo, Srinivas Devadas:
Litmus: Towards a Practical Database Management System with Verifiable ACID Properties and Transaction Correctness. 1478-1492
Session 21: ML for Data Management 2
- Yoshihiko Suhara, Jinfeng Li, Yuliang Li, Dan Zhang, Çagatay Demiralp, Chen Chen, Wang-Chiew Tan:
Annotating Columns with Pre-trained Language Models. 1493-1503 - Zixuan Zhao, Raul Castro Fernandez:
Leva: Boosting Machine Learning Performance with Relational Embedding Data Augmentation. 1504-1517 - Sepideh Nikookar, Paras Sakharkar, Sathyanarayanan Somasunder, Senjuti Basu Roy, Adam Bienkowski, Matthew Macesker, Krishna R. Pattipati, David Sidoti:
Cooperative Route Planning Framework for Multiple Distributed Assets in Maritime Applications. 1518-1527 - Wentao Wu, Chi Wang, Tarique Siddiqui, Junxiong Wang, Vivek R. Narasayya, Surajit Chaudhuri, Philip A. Bernstein:
Budget-aware Index Tuning with Reinforcement Learning. 1528-1541 - Jingyi Yang, Peizhi Wu, Gao Cong, Tieying Zhang, Xiao He:
SAM: Database Generation from Query Workloads with Supervised Autoregressive Models. 1542-1555
Session 22: Provenance and Uncertainty
- Felix S. Campbell, Bahareh Sadat Arab, Boris Glavic:
Efficient Answering of Historical What-if Queries. 1556-1569 - Daniel Deutch, Nave Frost, Benny Kimelfeld, Mikaël Monet:
Computing the Shapley Value of Facts in Query Answering. 1570-1583 - Thomas Hütter, Nikolaus Augsten, Christoph M. Kirsch, Michael J. Carey, Chen Li:
JEDI: These aren't the JSON documents you're looking for? 1584-1597 - Sainyam Galhotra, Amir Gilad, Sudeepa Roy, Babak Salimi:
HypeR: Hypothetical Reasoning With What-If and How-To Queries Using a Probabilistic Causal Approach. 1598-1611 - Daniel Ting:
Adaptive Threshold Sampling. 1612-1625
Session 23: Storage and Indexing
- Christoph Anneser, Andreas Kipf, Huanchen Zhang, Thomas Neumann, Alfons Kemper:
Adaptive Hybrid Indexes. 1626-1639 - Brian Hentschel, Utku Sirin, Stratos Idreos:
Entropy-Learned Hashing: Constant Time Hashing with Controllable Uniformity. 1640-1654 - Feng Zhang, Weitao Wan, Chenyang Zhang, Jidong Zhai, Yunpeng Chai, Haixiang Li, Xiaoyong Du:
CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases. 1655-1669 - Eric R. Knorr, Baptiste Lemaire, Andrew Lim, Siqiang Luo, Huanchen Zhang, Stratos Idreos, Michael Mitzenmacher:
Proteus: A Self-Designing Range Filter. 1670-1684 - Noura S. Alghamdi, Liang Zhang, Elke A. Rundensteiner, Mohamed Y. Eltabakh:
Scalable Time Series Compound Infrastructure. 1685-1698
Session 24: Potpourri
- Yiru Chen, Eugene Wu:
PI2: End-to-end Interactive Visualization Interface Generation from Queries. 1711-1725 - Wenfei Fan, Yuanhao Li, Muyang Liu, Can Lu:
A Hierarchical Contraction Scheme for Querying Big Graphs. 1726-1740 - Rachel Behar, Sara Cohen:
Representative Query Results by Voting. 1741-1754 - Raul Castro Fernandez:
Protecting Data Markets from Strategic Buyers. 1755-1769 - Uri Avron, Shay Gershtein, Ido Guy, Tova Milo, Slava Novgorodov:
Automated Category Tree Construction in E-Commerce. 1770-1783
Session 25: Benchmarking and Performance Evaluation
- Naiqing Guan, Nick Koudas:
FILA: Online Auditing of Machine Learning Model Accuracy under Finite Labelling Budget. 1784-1794 - Tobias Maltenberger, Ivan Ilic, Ilin Tolovski, Tilmann Rabl:
Evaluating Multi-GPU Sorting with Modern Interconnects. 1795-1809 - Elena Milkai, Yannis Chronis, Kevin P. Gaffney, Zhihan Guo, Jignesh M. Patel, Xiangyao Yu:
How Good is My HTAP System? 1810-1824 - Alexander Isenko, Ruben Mayer, Jeffrey Jedele, Hans-Arno Jacobsen:
Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines. 1825-1839 - Hani Al-Sayeh, Bunjamin Memishi, Muhammad Attahir Jibril, Marcus Paradies, Kai-Uwe Sattler:
Juggler: Autonomous Cost Optimization and Performance Prediction of Big Data Applications. 1840-1854 - Sarah Alnegheimish, Dongyu Liu, Carles Sala, Laure Berti-Équille, Kalyan Veeramachaneni:
Sintel: A Machine Learning Framework to Extract Insights from Signals. 1855-1865 - Yuncheng Wu, Tien Tuan Anh Dinh, Guoyu Hu, Meihui Zhang, Yeow Meng Chee, Beng Chin Ooi:
Serverless Data Science - Are We There Yet? A Case Study of Model Serving. 1866-1875
Session 26: Data Management for ML 3
- Peizhen Guo, Bo Hu, Wenjun Hu:
Sommelier: Curating DNN Models for the Masses. 1876-1890 - Donghyoung Han, Jongwuk Lee, Min-Soo Kim:
FuseME: Distributed Matrix Computation Engine based on Cuboid-based Fused Operator and Plan Generation. 1891-1904 - Bo Hu, Peizhen Guo, Wenjun Hu:
Video-zilla: An Indexing Layer for Large-Scale Video Analytics. 1905-1919 - Beibin Li, Yao Lu, Srikanth Kandula:
Warper: Efficiently Adapting Learned Cardinality Estimators to Data and Workload Drifts. 1920-1933 - Daniel Kang, John Guibas, Peter D. Bailis, Tatsunori Hashimoto, Matei Zaharia:
TASTI: Semantic Indexes for Machine Learning-based Queries over Unstructured Data. 1934-1947 - Dan Olteanu, Nils Vortmeier, Dorde Zivanovic:
Givens QR Decomposition over Relational Databases. 1948-1961 - Behrouz Derakhshan, Alireza Rezaei Mahdiraji, Zoi Kaoudi, Tilmann Rabl, Volker Markl:
Materialization and Reuse Optimizations for Production Data Science Pipelines. 1962-1976
Session 27: Graph Data Management and Social Networks
- Renchi Yang, Jieming Shi, Keke Huang, Xiaokui Xiao:
Scalable and Effective Bipartite Network Embedding. 1977-1991 - Yikai Zhang, Jeffrey Xu Yu:
Relative Subboundedness of Contraction Hierarchy and Hierarchical 2-Hop Index in Dynamic Road Networks. 1992-2005 - Xiaofan Li, Rui Zhou, Lu Chen, Chengfei Liu, Qiang He, Yun Yang:
One Set to Cover All Maximal Cliques Approximately. 2006-2019 - Muhammad Farhan, Qing Wang, Henning Koehler:
BatchHL: Answering Distance Queries on Batch-Dynamic Networks at Scale. 2020-2033 - Qiangqiang Dai, Rong-Hua Li, Meihao Liao, Hongzhi Chen, Guoren Wang:
Fast Maximal Clique Enumeration on Uncertain Graphs: A Pivot-based Approach. 2034-2047 - Meihao Liao, Rong-Hua Li, Qiangqiang Dai, Guoren Wang:
Efficient Personalized PageRank Computation: A Spanning Forests Sampling Based Approach. 2048-2061 - Andrea Rossi, Donatella Firmani, Paolo Merialdo, Tommaso Teofili:
Explaining Link Prediction Systems based on Knowledge Graph Embeddings. 2062-2075
Session 28: Spatial, Temporal, and Multimedia Databases
- Xiao Hu, Stavros Sintos, Junyang Gao, Pankaj K. Agarwal, Jun Yang:
Computing Complex Temporal Join Queries Efficiently. 2076-2090 - Favyen Bastani, Samuel Madden:
OTIF: Efficient Tracker Pre-processing over Large Video Datasets. 2091-2104 - Wenjia He, Michael J. Cafarella:
Controlled Intentional Degradation in Analytical Video Systems. 2105-2119 - Tsz Nam Chan, Leong Hou U, Byron Choi, Jianliang Xu:
SLAM: Efficient Sweep Line Algorithms for Kernel Density Visualization. 2120-2134 - Yuxiang Zeng, Yongxin Tong, Lei Chen:
Faster and Better Solution to Embed Lp Metrics by Tree Metrics. 2135-2148 - Jiahao Zhang, Bo Tang, Man Lung Yiu, Xiao Yan, Keming Li:
T-LevelIndex: Towards Efficient Query Processing in Continuous Preference Space. 2149-2162
Industrial Track Papers
- Ahmed Metwally:
Scaling Equi-Joins. 2163-2176 - Yunus Ma, Siphrey Xie, Henry Zhong, Leon Lee, King Lv:
HiEngine: How to Architect a Cloud-Native Memory-Optimized Database Engine. 2177-2190 - Konstantin Taranov, Steve Byan, Virendra J. Marathe, Torsten Hoefler:
KafkaDirect: Zero-copy Data Access for Apache Kafka over RDMA Networks. 2191-2204 - Nikos Armenatzoglou, Sanuj Basu, Naga Bhanoori, Mengchu Cai, Naresh Chainani, Kiran Chinta, Venkatraman Govindaraju, Todd J. Green, Monish Gupta, Sebastian Hillig, Eric Hotinger, Yan Leshinksy, Jintian Liang, Michael McCreedy, Fabian Nagel, Ippokratis Pandis, Panos Parchas, Rahul Pathak, Orestis Polychroniou, Foyzur Rahman, Gaurav Saxena, Gokul Soundararajan, Sriram Subramanian, Doug Terry:
Amazon Redshift Re-invented. 2205-2217 - Pingcheng Ruan, Yaron Kanza, Beng Chin Ooi, Divesh Srivastava:
LedgerView: Access-Control Views on Hyperledger Fabric. 2218-2231 - Junbin Kang, Le Cai, Feifei Li, Xingxuan Zhou, Wei Cao, Songlu Cai, Daming Shao:
Remus: Efficient Live Migration for Distributed Databases with Snapshot Isolation. 2232-2245 - Alin Deutsch, Nadime Francis, Alastair Green, Keith Hare, Bei Li, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Wim Martens, Jan Michels, Filip Murlak, Stefan Plantikow, Petra Selmer, Oskar van Rest, Hannes Voigt, Domagoj Vrgoc, Mingxi Wu, Fred Zemke:
Graph Pattern Matching in GQL and SQL/PGQ. 2246-2258 - Ihab F. Ilyas, Theodoros Rekatsinas, Vishnu Konda, Jeffrey Pound, Xiaoguang Qi, Mohamed A. Soliman:
Saga: A Platform for Continuous Construction and Serving of Knowledge at Scale. 2259-2272 - Amirhossein Aleyasen, Mark Morcos, Lyublena Antova, Marc Sugiyama, Dmitri Korablev, Jozsef Patvarczki, Rima Mutreja, Michael Duller, Florian M. Waas, Marianne Winslett:
Intelligent Automated Workload Analysis for Database Replatforming. 2273-2285 - Jiachi Zhang, Shi Cheng, Zhihui Xue, Jianjun Deng, Cuiyun Fu, Wenchao Zhou, Sheng Wang, Changcheng Chen, Feifei Li:
ESDB: Processing Extremely Skewed Workloads in Real-time. 2286-2298 - Wangda Zhang, Matteo Interlandi, Paul Mineiro, Shi Qiao, Nasim Ghazanfari, Karlen Lie, Marc T. Friedman, Rafah Hosn, Hiren Patel, Alekh Jindal:
Deploying a Steered Query Optimizer in Production at Microsoft. 2299-2311 - Nathan VanBenschoten, Arul Ajmani, Marcus Gartner, Andrei Matei, Aayush Shah, Irfan Sharif, Alexander Shraer, Adam Storm, Rebecca Taft, Oliver Tan, Andy Woods, Peyton Walters:
Enabling the Next Generation of Multi-Region Applications with CockroachDB. 2312-2325 - Alexander Behm, Shoumik Palkar, Utkarsh Agarwal, Timothy Armstrong, David Cashman, Ankur Dave, Todd Greenstein, Shant Hovsepian, Ryan Johnson, Arvind Sai Krishnan, Paul Leventis, Ala Luszczak, Prashanth Menon, Mostafa Mokhtar, Gene Pang, Sameer Paranjpye, Greg Rahn, Bart Samwel, Tom van Bussel, Herman Van Hovell, Maryann Xue, Reynold Xin, Matei Zaharia:
Photon: A Fast Query Engine for Lakehouse Systems. 2326-2339 - Adam Prout, Szu-Po Wang, Joseph Victor, Zhou Sun, Yongzhu Li, Jack Chen, Evan Bergeron, Eric N. Hanson, Robert Walzer, Rodrigo Gomes, Nikita Shamgunov:
Cloud-Native Transactions and Analytics in SingleStore. 2340-2352
Demonstrations
- Jiawei Tang, Yuyu Luo, Mourad Ouzzani, Guoliang Li, Hongyang Chen:
Sevi: Speech-to-Visualization through Neural Machine Translation. 2353-2356 - Ziliang Lai, Chris Liu, Chenxia Han, Pengfei Zhang, Eric Lo, Ben Kao:
Everest: A Top-K Deep Video Analytics System. 2357-2360 - Gerardo Vitagliano, Lucas Reisener, Lan Jiang, Mazhar Hameed, Felix Naumann:
Mondrian: Spreadsheet Layout Detection. 2361-2364 - Jeffrey Tao, Yiru Chen, Eugene Wu:
Demonstration of PI2: Interactive Visualization Interface Generation for SQL Analysis in Notebook. 2365-2368 - Kathy Razmadze, Yael Amsterdamer, Amit Somech, Susan B. Davidson, Tova Milo:
SubTab: Data Exploration with Informative Sub-Tables. 2369-2372 - Susan B. Davidson, Daniel Deutch, Nave Frost, Benny Kimelfeld, Omer Koren, Mikaël Monet:
ShapGraph: An Holistic View of Explanations through Provenance Graphs and Shapley Values. 2373-2376 - Sophie Pavia, Rituparna Khan, Anna Pyayt, Michael N. Gubanov:
Simplifying Access to Large-scale Structured Datasets by Meta-Profiling with Scalable Training Set Enrichment. 2377-2380 - Zifeng Yuan, Huey-Eng Chua, Sourav S. Bhowmick, Zekun Ye, Byron Choi, Wook-Shin Han:
PLAYPEN: Plug-and-Play Visual Graph Query Interfaces for Top-down and Bottom-Up Search on Large Networks. 2381-2384 - Yuanfeng Song, Raymond Chi-Wing Wong, Xuefang Zhao, Di Jiang:
VoiceQuerySystem: A Voice-driven Database Querying System Using Natural Language Questions. 2385-2388 - Tim Fischer, Denis Hirn, Torsten Grust:
Snakes on a Plan: Compiling Python Functions into Plain SQL Queries. 2389-2392 - Benjamin Hättasch, Jan-Micha Bodensohn, Carsten Binnig:
Demonstrating ASET: Ad-hoc Structured Exploration of Text Collections. 2393-2396 - Vibhor Porwal, Subrata Mitra, Fan Du, John Anderson, Nikhil Sheoran, Anup B. Rao, Tung Mai, Gautam Kowshik, Sapthotharan Nair, Sameeksha Arora, Saurabh Mahapatra:
Efficient Insights Discovery through Conditional Generative Model based Query Approximation. 2397-2400 - Idan Meyuhas, Aviv Ben-Arie, Yair Horesh, Daniel Deutch:
CFDB: Machine Learning Model Analysis via Databases of CounterFactuals. 2401-2404 - Zihui Gu, Ruixue Fan, Xiaoman Zhao, Meihui Zhang, Ju Fan, Xiaoyong Du:
OpenTFV: An Open Domain Table-Based Fact Verification System. 2405-2408 - Enzo Veltri, Donatello Santoro, Gilbert Badaro, Mohammed Saeed, Paolo Papotti:
Pythia: Unsupervised Generation of Ambiguous Textual Claims from Relational Data. 2409-2412 - Peng Chen, Hui Li, Sourav S. Bhowmick, Shafiq R. Joty, Weiguo Wang:
LANTERN: Boredom-conscious Natural Language Description Generation of Query Execution Plans for Database Education. 2413-2416 - Haotian Liu, Bo Tang, Jiashu Zhang, Yangshen Deng, Xinying Zheng, Qiaomu Shen, Xiao Yan, Dan Zeng, Zunyao Mao, Chaozu Zhang, Zhengxin You, Zhihao Wang, Runzhe Jiang, Fang Wang, Man Lung Yiu, Huan Li, Mingji Han, Qian Li, Zhenghai Luo:
GHive: A Demonstration of GPU-Accelerated Query Processing in Apache Hive. 2417-2420 - Luming Sun, Tao Ji, Cuiping Li, Hong Chen:
DeepO: A Learned Query Optimizer. 2421-2424 - Junran Yang, Hyekang Kevin Joo, Sai S. Yerramreddy, Siyao Li, Dominik Moritz, Leilani Battle:
Demonstration of VegaPlus: Optimizing Declarative Visualization Languages. 2425-2428 - Subhadeep Sarkar, Kaijie Chen, Zichen Zhu, Manos Athanassoulis:
Compactionary: A Dictionary for LSM Compactions. 2429-2432 - Jiongli Zhu, Romila Pradhan, Boris Glavic, Babak Salimi:
Generating Interpretable Data-Based Explanations for Fairness Debugging using Gopher. 2433-2436 - Immanuel Trummer:
Demonstrating DB-BERT: A Database Tuning Tool that "Reads" the Manual. 2437-2440
Tutorials
- Sourav S. Bhowmick, Byron Choi:
Data-driven Visual Query Interfaces for Graphs: Past, Present, and (Near) Future. 2441-2447 - Akash Bharadwaj, Graham Cormode:
An Introduction to Federated Computation. 2448-2451 - Romila Pradhan, Aditya Lahiri, Sainyam Galhotra, Babak Salimi:
Explainable AI: Foundations, Applications, Opportunities for Data Management Research. 2452-2457 - Fatemeh Nargesian, Abolfazl Asudeh, H. V. Jagadish:
Responsible Data Integration: Next-generation Challenges. 2458-2464 - Vivek R. Narasayya, Surajit Chaudhuri:
Multi-Tenant Cloud Data Services: State-of-the-Art, Challenges and Opportunities. 2465-2473 - Huan Li, Bo Tang, Hua Lu, Muhammad Aamir Cheema, Christian S. Jensen:
Spatial Data Quality in the IoT Era: Management and Exploitation. 2474-2482 - Guoliang Li, Chao Zhang:
HTAP Databases: What is New and What is Next. 2483-2488 - Subhadeep Sarkar, Manos Athanassoulis:
Dissecting, Designing, and Optimizing LSM-based Data Stores. 2489-2497
Panels
- Anastasia Ailamaki, Leilani Battle, Johannes Gehrke, Masaru Kitsuregawa, David Maier, Christopher Ré, Meihui Zhang, Magdalena Balazinska:
The DB Community vis-à-vis Environmental, Health, and Societal Grand Challenges: Innovation Engine, Plumber, or Bystander? 2498-2500 - Sihem Amer-Yahia, Sourav S. Bhowmick, Xin Luna Dong, Stratos Idreos, Wolfgang Lehner, Divesh Srivastava:
Publication Culture and Review Processes in the Data Management Community: An Open Discussion. 2501-2502
Student Abstracts
- Mihail Stoian:
Concurrent Link-Cut Trees. 2503-2505 - David Justen:
Cost-efficiency and Performance Robustness in Serverless Data Exchange. 2506-2508 - Plaksin Yaroslav:
An Approach for Unlabeled Tasks Prioritization. 2509-2511 - Manuel Schönberger:
Applicability of Quantum Computing on Database Query Optimization. 2512-2514 - Supawit Chockchowwat:
Tuning Hierarchical Learned Indexes on Disk and Beyond. 2515-2517 - Jiadong Xie:
Hindering Influence Diffusion of Community. 2518-2520 - Ryan Wickman:
SparRL: Graph Sparsification via Deep Reinforcement Learning. 2521-2523 - Michael Fruth:
Live Patching Database Management Systems. 2524-2526 - Nikhil Sheoran:
DeepOLA: Online Aggregation for Deeply Nested Queries. 2527-2529 - Sughosh V. Kaushik:
Lineage Resource Manager. 2530-2532 - Joshua Pan:
Workload-Adaptive Filtering in Storage Engines. 2533-2535 - Alexander Yao:
Interactive Query Explanations Using Fine Grained Provenance. 2536-2538 - Anna Gorb:
A Recommender Algorithm to Automatically Generate Metrics for GQM Models in Software Development. 2539-2541
Workshop Summaries
- Sven Groppe, Le Gruenwald, Ching-Hsien Hsu:
BiDEDE'22: Second International Workshop on Big Data in Emergent Distributed Environments. 2542-2543 - Daniel Deutch, Tanu Malik, Adriane Chapman:
Theory and Practice of Provenance. 2544-2545 - Vasiliki Kalavri, Semih Salihoglu:
GRADES-NDA'22: 5th International Workshop on Graph Data management Experiences and Systems (GRADES) and Network Data Analytics (NDA). 2546-2547 - Matthias Boehm, Paroma Varma, Doris Xin:
DEEM'22: Data Management for End-to-End Machine Learning. 2548-2549 - Rajesh Bordawekar, Yael Amsterdamer, Donatella Firmani, Ryan Marcus, Oded Shmueli:
aiDM'22: Fifth International Workshop on Exploiting Artificial Intelligence Techniques for Data Management. 2550-2551 - Azza Abouzied, Dominik Moritz, Michael J. Cafarella:
HILDA'22: The SIGMOD 2022 Workshop on Human-in-the-Loop Data Analytics. 2552-2553 - Manuel Rigger, Pinar Tözün:
DBTest '22: 9th International Workshop on Testing Database Systems. 2554-2555 - Efthimia Aivaloglou, George Fletcher, Daphne Miedema:
DataEd'22 - 1st International Workshop on Data Systems Education: Bridging Education Practice with Education Research. 2556-2557 - Spyros Blanas, Norman May:
International Workshop on Data Management on New Hardware (DaMoN). 2558-2559
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.