BigData Conference 2016: Washington DC, USA
- James Joshi, George Karypis, Ling Liu, Xiaohua Hu, Ronay Ak, Yinglong Xia, Weijia Xu, Aki-Hiro Sato, Sudarsan Rachuri, Lyle H. Ungar, Philip S. Yu, Rama Govindaraju, Toyotaro Suzumura:
2016 IEEE International Conference on Big Data (IEEE BigData 2016), Washington DC, USA, December 5-8, 2016. IEEE Computer Society 2016, ISBN 978-1-4673-9005-7 - Chaitanya K. Baru:
Harnessing the data revolution: A perspective from the National Science Foundation. 2 - Elisa Bertino:
Big data security and privacy. 3 - Jiawei Han:
On the power of big data: Mining structures from massive, unstructured text data. 4 - Mark Johnson:
Leveraging high performance computing to drive advanced manufacturing R&D at the US Department of Energy. 5-6 - Michael Stonebraker, Dong Deng, Michael L. Brodie:
Database decay and how to avoid it. 7-16 - Christian Böhm, Martin Perdacher, Claudia Plant:
Cache-oblivious loops based on a novel space-filling curve. 17-26 - Jagat Sesh Challa, Poonam Goyal, S. Nikhil, Aditya Mangla, Sundar Balasubramaniam, Navneet Goyal:
DD-Rtree: A dynamic distributed data structure for efficient data distribution among cluster nodes for spatial data mining algorithms. 27-36 - Ravikant Dindokar, Neel Choudhury, Yogesh Simmhan:
A meta-graph approach to analyze subgraph-centric distributed programming models. 37-47 - Subhadeep Karan, Jaroslaw Zola:
Exact structure learning of Bayesian networks by optimal path extension. 48-55 - Walaa Eldin Moustafa, Vicky Papavasileiou, Ken Yocum, Alin Deutsch:
Datalography: Scaling Datalog graph analytics on graph processing systems. 56-65 - Yosuke Oyama, Akihiro Nomura, Ikuro Sato, Hiroki Nishimura, Yukimasa Tamatsu, Satoshi Matsuoka:
Predicting statistics of asynchronous SGD parameters for a large-scale distributed deep learning system on GPU supercomputers. 66-75 - Benjamin Sirb, Xiaojing Ye:
Consensus optimization with delayed and stochastic gradients on decentralized networks. 76-85 - Xiaoli Song, Yan Rui, Xiaohua Hu:
Pairwise topic model and its application to topic transition and evolution. 86-95 - Yuan Yuan, Sihong Xie, Chun-Ta Lu, Jie Tang, Philip S. Yu:
Interpretable and effective opinion spam detection via temporal patterns mining across websites. 96-105 - Fang Zhou, Mohamed F. Ghalwash, Zoran Obradovic:
A fast structured regression for large networks. 106-115 - Adiska Fardani Haryadi, Joris Hulstijn, Agung Wahyudi, Haiko Van Der Voort, Marijn Janssen:
Antecedents of big data quality: An empirical examination in financial service organizations. 116-121 - Joseph Jupin, Justin Y. Shi, Eduard C. Dragut:
PSH: A probabilistic signature hash method with hash neighborhood candidate generation for fast edit-distance string comparison on big data. 122-127 - Rocco Langone, Johan A. K. Suykens:
Efficient multiple scale kernel classifiers. 128-133 - Joaquim F. Silva, Carlos Gonçalves, José C. Cunha:
A theoretical model for n-gram distribution in big data corpora. 134-141 - Jonathan Stokes, Steven Weber:
The self-avoiding walk-jump (SAWJ) algorithm for finding maximum degree nodes in large graphs. 142-149 - Xiaoli Song, Xiaotong Wang, Xiaohua Hu:
Semantic pattern mining for text mining. 150-155 - Kenji Yamanishi, Kohei Miyaguchi:
Detecting gradual changes from data stream using MDL-change statistics. 156-163 - Rongda Zhu, Aston Zhang, Jian Peng, Chengxiang Zhai:
Exploiting temporal divergence of topic distributions for event detection. 164-171 - Timo Bingmann, Michael Axtmann, Emanuel Jöbstl, Sebastian Lamm, Huyen Chau Nguyen, Alexander Noe, Sebastian Schlag, Matthias Stumpp, Tobias Sturm, Peter Sanders:
Thrill: High-performance algorithmic distributed batch data processing with C++. 172-183 - Liuhua Chen, Haiying Shen:
Towards resource-efficient cloud systems: Avoiding over-provisioning in demand-prediction based resource provisioning. 184-193 - Katerina Doka, Nikolaos Papailiou, Victor Giannakouris, Dimitrios Tsoumakos, Nectarios Koziris:
Mix 'n' match multi-engine analytics. 194-203 - Alex Gittens, Aditya Devarakonda, Evan Racah, Michael F. Ringenburg, Lisa Gerhardt, Jey Kottalam, Jialin Liu, Kristyn J. Maschhoff, Shane Canon, Jatin Chhugani, Pramod Sharma, Jiyan Yang, James Demmel, Jim Harrell, Venkat Krishnamurthy, Michael W. Mahoney, Prabhat:
Matrix factorizations at scale: A comparison of scientific data analytics in Spark and C+MPI using three case studies. 204-213 - Yin Huang, Yelena Yesha, Milton Halem, Yaacov Yesha, Shujia Zhou:
YinMem: A distributed parallel indexed in-memory computation system for large scale data analytics. 214-222 - Nusrat Sharmin Islam, Md. Wasi-ur-Rahman, Xiaoyi Lu, Dhabaleswar K. Panda:
Efficient data access strategies for Hadoop and Spark on HPC cluster with heterogeneous storage. 223-232 - Zhuozhao Li, Haiying Shen, Jeffrey Denton, Walter B. Ligon III:
Comparing application performance on HPC-based Hadoop platforms with local storage and dedicated storage. 233-242 - Jinwei Liu, Haiying Shen, Husnu S. Narman:
CCRP: Customized cooperative resource provisioning for high resource utilization in clouds. 243-252 - Xiaoyi Lu, Dipti Shankar, Shashank Gugnani, Dhabaleswar K. Panda:
High-performance design of Apache Spark with RDMA and its benefits on various workloads. 253-262 - Tomoki Yoshihisa, Takahiro Hara:
A low-load stream processing scheme for IoT environments. 263-272 - Yuan Yuan, Meisam Fathi Salmi, Yin Huai, Kaibo Wang, Rubao Lee, Xiaodong Zhang:
Spark-GPU: An accelerated in-memory data processing engine on clusters. 273-283 - Angen Zheng, Alexandros Labrinidis, Panos K. Chrysanthis, Jack Lange:
Argo: Architecture-aware graph partitioning. 284-293 - Kareem S. Aggour, Bülent Yener:
Adapting to data sparsity for efficient parallel PARAFAC tensor decomposition in Hadoop. 294-301 - Yadu N. Babuji, Kyle Chard, Aaron Gerow, Eamon Duede:
Cloud Kotta: Enabling secure and scalable data analytics in the cloud. 302-310 - Chunkun Bo, Ke Wang, Jeffrey J. Fox, Kevin Skadron:
Entity resolution acceleration using the automata processor. 311-318 - Kyle Chard, Mike D'Arcy, Benjamin D. Heavner, Ian T. Foster, Carl Kesselman, Ravi K. Madduri, Alexis A. Rodriguez, Stian Soiland-Reyes, Carole A. Goble, Kristi Clark, Eric W. Deutsch, Ivo D. Dinov, Nathan D. Price, Arthur W. Toga:
I'll take that to go: Big data bags and minimal identifiers for exchange of large, complex datasets. 319-328 - Chun-Chieh Chen, Chih-Ya Shen, Ming-Syan Chen:
Massive parallelism for non-linear and non-stationary data analysis with GPGPU. 329-334 - Stratos Dimopoulos, Chandra Krintz, Rich Wolski:
Big data framework interference in restricted private cloud settings. 335-340 - Khoa D. Doan, Amidu O. Oloso, Kwo-Sen Kuo, Thomas L. Clune, Hongfeng Yu, Brian Nelson, Jian Zhang:
Evaluating the impact of data placement to Spark and SciDB with an Earth Science use case. 341-346 - Saliya Ekanayake, Supun Kamburugamuve, Pulasthi Wickramasinghe, Geoffrey C. Fox:
Java thread and process performance for parallel machine learning on multicore HPC clusters. 347-354 - Gheorghi Guzun, Josiah C. McClurg, Guadalupe Canahuate, Raghuraman Mudumbai:
Power efficient big data analytics algorithms through low-level operations. 355-361 - Satoshi Imamura, Keitaro Oka, Yuichiro Yasui, Yuichi Inadomi, Katsuki Fujisawa, Toshio Endo, Koji Ueno, Keiichiro Fukazawa, Nozomi Hata, Yuta Kakibuka, Koji Inoue, Takatsugu Ono:
Evaluating the impacts of code-level performance tunings on power efficiency. 362-369 - Fan Jiang, Claris Castillo, Charles Schmitt:
RADU: Bridging the divide between data and infrastructure management to support data-driven collaborations. 370-377 - Jinfeng Li, James Cheng, Yunjian Zhao, Fan Yang, Yuzhen Huang, Haipeng Chen, Ruihao Zhao:
A comparison of general-purpose distributed systems for data processing. 378-383 - Jinwei Liu, Haiying Shen:
A popularity-aware cost-effective replication scheme for high data durability in cloud storage. 384-389 - Luis Pineda-Morales, Ji Liu, Alexandru Costan, Esther Pacitti, Gabriel Antoniu, Patrick Valduriez, Marta Mattoso:
Managing hot metadata for scientific workflows on multisite clouds. 390-397 - Hitoshi Sato, Ryo Mizote, Satoshi Matsuoka, Hirotaka Ogawa:
I/O chunking and latency hiding approach for out-of-core sorting acceleration using GPU and flash NVM. 398-403 - Dipti Shankar, Xiaoyi Lu, Dhabaleswar K. Panda:
Boldio: A hybrid and resilient burst-buffer over Lustre for accelerating big data I/O. 404-409 - Christoforos Svingos, Theofilos Mailis, Herald Kllapi, Lefteris Stamatogiannakis, Yannis Kotidis, Yannis E. Ioannidis:
Real time processing of streaming and static information. 410-415 - Hans Vandierendonck, Karen L. Murphy, Mahwish Arif, Dimitrios S. Nikolopoulos:
HPTA: High-performance text analytics. 416-423 - Jorge Veiga, Roberto R. Expósito, Xoán C. Pardo, Guillermo L. Taboada, Juan Touriño:
Performance evaluation of big data frameworks for large-scale data analytics. 424-431 - Yali Zhao, Rodrigo N. Calheiros, James Bailey, Richard O. Sinnott:
SLA-based profit optimization for resource management of big data analytics-as-a-service platforms in cloud computing environments. 432-441 - Kaiji Chen, Yongluan Zhou:
Materialized view selection in feed following systems. 442-451 - Victor Giannakouris, Nikolaos Papailiou, Dimitrios Tsoumakos, Nectarios Koziris:
MuSQLE: Distributed SQL query execution over multiple engine environments. 452-461 - Ahsanul Haque, Zhuoyi Wang, Swarup Chandra, Yupeng Gao, Latifur Khan, Charu C. Aggarwal:
Sampling-based distributed Kernel mean matching using Spark. 462-471 - Yudian Ji, Yuda Zang, Wuman Luo, Xibo Zhou, Ye Ding, Lionel M. Ni:
Clockwise compression for trajectory data under road network constraints. 472-481 - Karuna P. Joshi, Aditi Gupta, Sudip Mittal, Claudia Pearce, Anupam Joshi, Tim Finin:
Semantic approach to automating management of big data privacy policies. 482-491 - Eleazar Leal, Le Gruenwald, Jianting Zhang:
Handling uncertainty in trajectories of moving objects in unconstrained outdoor spaces. 492-501 - Cuong M. Nguyen, Philip J. Rhodes:
Accelerating range queries for large-scale unstructured meshes. 502-511 - Md. Shiblee Sadik, Le Gruenwald, Eleazar Leal:
In pursuit of outliers in multi-dimensional data streams. 512-521 - Jianpeng Xu, Jiayu Zhou, Pang-Ning Tan, Xi Liu, Lifeng Luo:
WISDOM: Weighted incremental spatio-temporal multi-task learning via tensor decomposition. 522-531 - Farrukh Ahmed, Michele Samorani, Colin Bellinger, Osmar R. Zaïane:
Advantage of integration in big data: Feature generation in multi-relational databases for imbalanced learning. 532-539 - Matthew Edwards, Stephen Wattam, Paul Rayson, Awais Rashid:
Sampling labelled profile data for identity resolution. 540-547 - Frank Pallas, Johannes Günther, David Bermbach:
Pick your choice in HBase: Security or performance. 548-554 - Rui Ren, Zhen Jia, Lei Wang, Jianfeng Zhan, Tianxu Yi:
BDTUne: Hierarchical correlation-based performance analysis and rule-based diagnosis for big data systems. 555-562 - Ramyar Saeedi, Hassan Ghasemzadeh, Assefaw Hadish Gebremedhin:
Transfer learning algorithms for autonomous reconfiguration of wearable systems. 563-569 - Mei Saouk, Christos Doulkeridis, Akrivi Vlachou, Kjetil Nørvåg:
Efficient processing of top-k joins in MapReduce. 570-577 - Ting Wu, Chen Jason Zhang, Lei Chen, Pan Hui, Siyuan Liu:
Object identification with Pay-As-You-Go crowdsourcing. 578-585 - Nesreen K. Ahmed, Theodore L. Willke, Ryan A. Rossi:
Estimation of local subgraph counts. 586-595 - Christian Beecks, Alexander Graß:
Multi-step threshold algorithm for efficient feature-based query processing in large-scale multimedia databases. 596-605 - Mansurul Alam Bhuiyan, Mohammad Al Hasan:
PRIIME: A generic framework for interactive personalized interesting pattern discovery. 606-615 - Ngot Bui, Thanh Le, Vasant G. Honavar:
Labeling actors in multi-view social networks by integrating information from within and across multiple views. 616-625 - Hariton Efstathiades, Demetris Antoniades, George Pallis, Marios D. Dikaiakos, Zoltán Szlávik, Robert-Jan Sips:
Online social network evolution: Revisiting the Twitter graph. 626-635 - Jianliang Gao, Bo Song, Ping Liu, Weimao Ke, Jianxin Wang, Xiaohua Hu:
Parallel top-k subgraph query in massive graphs: Computing from the perspective of single vertex. 636-645 - Xiaoyu Ge, Yanbing Xue, Zhipeng Luo, Mohamed A. Sharaf, Panos K. Chrysanthis:
REQUEST: A scalable framework for interactive construction of exploratory queries. 646-655 - Chun Guo, Xiaozhong Liu:
Dynamic feature generation and selection on heterogeneous graph for music recommendation. 656-665 - Nguyen Ho, Huy T. Vo, Mai Vu:
An adaptive information-theoretic approach for identifying temporal correlations in big data sets. 666-675 - Chao Huang, Dong Wang, Shenglong Zhu, Daniel Yue Zhang:
Towards unsupervised home location inference from online social media. 676-685 - Wei Jiang, Juan Rodriguez, Torsten Suel:
Improved methods for static index pruning. 686-695 - Wooyeol Kim, Younghoon Kim, Kyuseok Shim:
Parallel computation of k-nearest neighbor joins using MapReduce. 696-705 - Sarasi Lalithsena, Pavan Kapanipathi, Amit P. Sheth:
Harnessing relationships for domain-specific subgraph extraction: A recommendation use case. 706-715 - Panagiotis Liakos, Alexandros Ntoulas, Alex Delis:
Scalable link community detection: A local dispersion-aware approach. 716-725 - Hongfu Liu, Yuchao Zhang, Bo Deng, Yun Fu:
Outlier detection via sampling ensemble. 726-735 - Athanasios N. Nikolakopoulos, Antonia Korba, John D. Garofalakis:
Random surfing on multipartite graphs. 736-745 - Cheong Hee Park, Youngsoon Kang:
An active learning method for data streams with concept drift. 746-752 - Charles Siegel, Jeff Daily, Abhinav Vishnu:
Adaptive neuron apoptosis for accelerating deep learning on large scale systems. 753-762 - Ata Turk, Hao Chen, Anthony Byrne, John Knollmeyer, Sastry S. Duri, Canturk Isci, Ayse K. Coskun:
DeltaSherlock: Identifying changes in the cloud. 763-772 - Xiaokai Wei, Bokai Cao, Weixiang Shao, Chun-Ta Lu, Philip S. Yu:
Community detection with partially observable links and node attributes. 773-782 - Yongyi Xian, Yan Liu, Chuanfei Xu:
Parallel gathering discovery over big trajectory data. 783-792 - Hu Xu, Sihong Xie, Lei Shu, Philip S. Yu:
CER: Complementary entity recognition via knowledge expansion on large unlabeled product reviews. 793-802 - Jingyuan Zhang, Chun-Ta Lu, Mianwei Zhou, Sihong Xie, Yi Chang, Philip S. Yu:
HEER: Heterogeneous graph embedding for emerging relation detection from news. 803-812 - Hao Zhang, Yuanyuan Zhu, Lu Qin, Hong Cheng, Jeffrey Xu Yu:
Efficient triangle listing for billion-scale graphs. 813-822 - Yating Zhang, Adam Jatowt, Katsumi Tanaka:
Towards understanding word embeddings: Automatically explaining similarity of terms. 823-832 - Kai Zhao, Denis Khryashchev, Juliana Freire, Cláudio T. Silva, Huy T. Vo:
Predicting taxi demand at high spatial resolution: Approaching the limit of predictability. 833-842 - Yixian Zheng, Wenchao Wu, Haipeng Zeng, Nan Cao, Huamin Qu, Mingxuan Yuan, Jia Zeng, Lionel M. Ni:
TelcoFlow: Visual exploration of collective behaviors based on telco data. 843-852 - Morteza Zihayat, Zane Zhenhua Hu, Aijun An, Yonggang Hu:
Distributed and parallel high utility sequential pattern mining. 853-862 - Philip K. Chan, Ebad Ahmadzadeh:
Improving efficiency of maximizing spread in the flow authority model for large sparse networks. 863-868 - Wanying Ding, Yue Zhang, Chaomei Chen, Xiaohua Hu:
Semi-supervised Dirichlet-Hawkes process with applications of topic detection and tracking in Twitter. 869-874 - Ioanna Filippidou, Yannis Kotidis:
Effective and efficient graph augmentation in large graphs. 875-880 - Ville Hyvönen, Teemu Pitkänen, Sotiris K. Tasoulis, Elias Jaasaari, Risto Tuomainen, Liang Wang, Jukka Corander, Teemu Roos:
Fast nearest neighbor search through sparse random projections and voting. 881-888 - Saïd Jabbour, Nizar Mhadhbi, Abdesattar Mhadhbi, Badran Raddaoui, Lakhdar Sais:
Summarizing big graphs by means of pseudo-boolean constraints. 889-894 - Uwe Jugel, Zbigniew Jerzak, Volker Markl:
Big data on a few pixels. 895-900 - Mohammad Mahdi Kamani, Farshid Farhat, Stephen Wistar, James Z. Wang:
Shape matching using skeleton context for automated bow echo detection. 901-908 - Weimao Ke, Javed Mostafa:
Scalability analysis of distributed search in large peer-to-peer networks. 909-914 - Nicolas Kourtellis, Gianmarco De Francisci Morales, Albert Bifet, Arinto Murdopo:
VHT: Vertical Hoeffding tree. 915-922 - Yuh-Jye Lee, Hsing-Kuo Pao, Shueh-Han Shih, Jing-Yao Lin, Xin-Rong Chen:
Compressed learning for time series classification. 923-930 - Xiaopeng Li, Ming Cheung, James She:
Connection discovery using shared images by Gaussian relational topic model. 931-936 - Haofu Liao, Yucheng Li, Tianran Hu, Jiebo Luo:
Inferring restaurant styles by mining crowd sourced photos from user-review websites. 937-944 - Chang Liu, Bin Wu, Yi Yang, Zhihong Guo:
Multiple submodels parallel support vector machine on Spark. 945-950 - Xiang Liu, Torsten Suel:
What makes a group fail: Modeling social group behavior in event-based social networks. 951-956 - Jinna Lv, Bin Wu, Shuai Yang, Bingjing Jia, Peigang Qiu:
Efficient large scale near-duplicate video detection base on Spark. 957-962 - Stathis Maroulis, Ioannis Boutsis, Vana Kalogeraki:
Context-aware point of interest recommendation using tensor factorization. 963-968 - Steven Morse, Marta C. González, Natasha Markuzon:
Persistent cascades: Measuring fundamental communication structure in social networks. 969-975 - Tathagata Mukherjee, Biswas Parajuli, Piyush Kumar, Eduardo L. Pasiliao Jr.:
TruthCore: Non-parametric estimation of truth from a collection of authoritative sources. 976-983 - Sergey Nepomnyachiy, Torsten Suel:
Efficient index updates for mixed update and query loads. 984-991 - Gopi Chand Nutakki, Olfa Nasraoui:
Compartmentalized adaptive topic mining on social media streams. 992-997 - Aduri Pavan, Paul Quint, Stephen D. Scott, N. V. Vinodchandran, J. Smith:
Computing triangle and open-wedge heavy-hitters in large networks. 998-1005 - Michael L. Rilee, Kwo-Sen Kuo, Thomas L. Clune, Amidu Oloso, Paul G. Brown, Hongfeng Yu:
Addressing the big-earth-data variety challenge with the hierarchical triangular mesh. 1006-1011 - Weixiang Shao, Lifang He, Chun-Ta Lu, Philip S. Yu:
Online multi-view clustering with incomplete views. 1012-1017 - Chuan Shi, Bowei He, Menghao Zhang, Fuzhen Zhuang, Philip S. Yu, Naiwang Guo:
Expenditure aware rating prediction for recommendation. 1018-1025 - Sreenivas R. Sukumar, Ramakrishnan Kannan, Seung-Hwan Lim, Michael A. Matheson:
Kernels for scalable data analysis in science: Towards an architecture-portable future. 1026-1031 - Ioanna Tsalouchidou, Gianmarco De Francisci Morales, Francesco Bonchi, Ricardo Baeza-Yates:
Scalable dynamic graph summarization. 1032-1039 - Koji Ueno, Toyotaro Suzumura, Naoya Maruyama, Katsuki Fujisawa, Satoshi Matsuoka:
Extreme scale breadth-first search on supercomputers. 1040-1047 - Pascal Welke, Alexander Markowetz, Torsten Suel, Maria Christoforaki:
Three-hop distance estimation in social graphs. 1048-1055 - Tong Yu, Ole J. Mengshoel, Alvin Jude, Eugen Feller, Julien Forgeat, Nimish Radia:
Incremental learning for matrix factorization in recommender systems. 1056-1063 - Abir Zayani, Chiheb-Eddine Ben N'cir, Nadia Essoussi:
Parallel clustering method for non-disjoint partitioning of large-scale data based on Spark framework. 1064-1069 - Da-Chuan Zhang, Mei Li, Chang-Dong Wang:
Point of interest recommendation with social and geographical influence. 1070-1075 - Daniel Yue Zhang, Rungang Han, Dong Wang, Chao Huang:
On robust truth discovery in sparse social media sensing. 1076-1081 - Rajesh Sankaran, Ricardo A. Calix:
On the feasibility of an embedded machine learning processor for intrusion detection. 1082-1089 - Heqing Huang, Cong Zheng, Junyuan Zeng, Wu Zhou, Sencun Zhu, Peng Liu, Suresh Chari, Ce Zhang:
Android malware development on public malware scanning platforms: A large-scale data-driven study. 1090-1099 - Hui Li, Jiangtao Cui, Xiaobin Lin, Jianfeng Ma:
Improving the utility in differential private histogram publishing: Theoretical study and practice. 1100-1109 - Xiao Pan, Jiawei Zhang, Fengjiao Wang, Philip S. Yu:
DistSD: Distance-based social discovery with personalized posterior screening. 1110-1119 - Quan Zhang, Mu Qiao, Ramani R. Routray, Weisong Shi:
H2O: A hybrid and hierarchical outlier detection method for large scale data protection. 1120-1129 - Ariel Bar, Bracha Shapira, Lior Rokach, Moshe Unger:
Scalable attack propagation model and algorithms for honeypot systems. 1130-1135 - Bas van Stein, Matthijs van Leeuwen, Thomas Bäck:
Local subspace-based outlier detection using global neighbourhoods. 1136-1142 - Shuo Wang, Richard O. Sinnott, Surya Nepal:
Protecting the location privacy of mobile social media users. 1143-1150 - Michael J. Anderson, Mihai Capota, Javier S. Turek, Xia Zhu, Theodore L. Willke, Yida Wang, Po-Hsuan Chen, Jeremy R. Manning, Peter J. Ramadge, Kenneth A. Norman:
Enabling factor analysis on thousand-subject neuroimaging datasets. 1151-1160 - Yanan Bao, Huasen Wu, Tianxiao Zhang, Albara Ah Ramli, Xin Liu:
Shooting a moving target: Motion-prediction-based transmission for 360-degree videos. 1161-1170 - Sayan Goswami, Arghya Kusum Das, Richard Platania, Kisung Lee, Seung-Jong Park:
Lazer: Distributed memory-efficient assembly of large-scale genomes. 1171-1181 - Zhichuan Huang, Ting Zhu:
Leveraging multi-granularity energy data for accurate energy demand forecast in smart grids. 1182-1191 - Xiaowei Jia, Ankush Khandelwal, James Gerber, Kimberly Carlson, Paul C. West, Vipin Kumar:
Learning large-scale plantation mapping from imperfect annotators. 1192-1201 - Darja Krushevskaja, William Simpson, S. Muthukrishnan:
Ad allocation with secondary metrics. 1202-1211 - Azad Naik, Huzefa Rangwala:
Embedding feature selection for large-scale hierarchical classification. 1212-1221 - Naman Shah, Harshil Shah, Matthew Malensek, Sangmi Lee Pallickara, Shrideep Pallickara:
Network analysis for identifying and characterizing disease outbreak influence from voluminous epidemiology data. 1222-1231 - Francesco Versaci, Luca Pireddu, Gianluigi Zanetti:
Scalable genomics: From raw data to aligned reads on Apache YARN. 1232-1241 - Yida Wang, Bryn Keller, Mihai Capota, Michael J. Anderson, Narayanan Sundaram, Jonathan D. Cohen, Kai Li, Nicholas B. Turk-Browne, Theodore L. Willke:
Real-time full correlation matrix analysis of fMRI data. 1242-1251 - Yanan Xu, Yanmin Zhu:
When remote sensing data meet ubiquitous urban data: Fine-grained air quality inference. 1252-1261 - Jingyuan Yang, Chuanren Liu, Mingfei Teng, March Liao, Hui Xiong:
Buyer targeting optimization: A unified customer segmentation perspective. 1262-1271 - Mehrdad Yazdani, Bryn C. Taylor, Justine W. Debelius, Weizhong Li, Rob Knight, Larry Smarr:
Using machine learning to identify major shifts in human gut microbiome protein family abundance in disease. 1272-1280 - Chunqiu Zeng, Qing Wang, Wentao Wang, Tao Li, Larisa Shwartz:
Online inference for time-varying temporal dependency discovery from time series. 1281-1290 - Ke Zhang, Jianwu Xu, Martin Renqiang Min, Guofei Jiang, Konstantinos Pelechrinis, Hui Zhang:
Automated IT system failure prediction: A deep learning approach. 1291-1300 - Hông-Ân Cao, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes:
Estimating human interactions with electrical appliances for activity-based energy savings recommendations. 1301-1308 - Zexi Chen, Ranga Raju Vatsavai, Bharathkumar Ramachandra, Qiang Zhang, Nagendra Singh, Sreenivas R. Sukumar:
Scalable nearest neighbor based hierarchical change detection framework for crop monitoring. 1309-1314 - Aman Gupta, S. Muthukrishnan, Smita Wadhwa:
Optimizing callout in unified ad markets. 1315-1321 - Zhichuan Huang, Tiantian Xie, Ting Zhu, Jianwu Wang, Qingquan Zhang:
Application-driven sensing data reconstruction and selection based on correlation mining and dynamic feedback. 1322-1327 - Xiaowei Jia, Xi C. Chen, Anuj Karpatne, Vipin Kumar:
Identifying dynamic changes with noisy labels in spatial-temporal data: A study on large-scale water monitoring application. 1328-1333 - Mike Lakoju, Alan Serrano:
A strategic approach for visualizing the value of big data (SAVV-BIGD) framework. 1334-1339 - Mai H. Nguyen, Dylan Uys, Daniel Crawl, Charles Cowart, Ilkay Altintas:
A scalable approach for location-specific detection of Santa Ana conditions. 1340-1345 - Susanna Pirttikangas, Ekaterina Gilman, Xiang Su, Teemu Leppänen, Anja Keskinarkaus, Mika Rautiainen, Mikko Pyykkönen, Jukka Riekki:
Experiences with smart city traffic pilot. 1346-1352 - Elyas Sabeti, Anders Høst-Madsen:
How interesting images are: An atypicality approach for social networks. 1353-1358 - Wenzhao Zhang, Houjun Tang, Stephen Ranshous, Surendra Byna, Daniel F. Martin, Kesheng Wu, Bin Dong, Scott Klasky, Nagiza F. Samatova:
Exploring memory hierarchy and network topology for runtime AMR data sharing across scientific applications. 1359-1366 - Pavel A. Dmitriev, Brian Frasca, Somit Gupta, Ron Kohavi, Garnet Jason Vaz:
Pitfalls of long-term online controlled experiments. 1367-1376 - Juergen Heit, Jiayi Liu, Mohak Shah:
An architecture for the deployment of statistical models for the big data era. 1377-1384 - Raya Horesh, Kush R. Varshney, Jinfeng Yi:
Information retrieval, fusion, completion, and clustering for employee expertise estimation. 1385-1393 - Rishi Chhatwal, Nathaniel Huber-Fliflet, Robert Keeling, Jianping Zhang, Haozhen Zhao:
Empirical evaluations of preprocessing parameters' impact on predictive coding's effectiveness. 1394-1401 - Ruoyu Wang, Daniel Sun, Guoqiang Li, Muhammad Atif, Surya Nepal:
LogProv: Logging events as provenance of big data analytics pipelines with trustworthiness. 1402-1411 - Bradford Littooy, Sophie Loire, Michael Georgescu, Igor Mezic:
Pattern recognition and classification of HVAC rule-based faults in commercial buildings. 1412-1421 - Adetokunbo Makanju, Zahra Farzanyar, Aijun An, Nick Cercone, Zane Zhenhua Hu, Yonggang Hu:
Deep parallelization of parallel FP-growth using parent-child MapReduce. 1422-1431 - Nicolás Poggi, Josep Lluis Berral, Thomas Fenech, David Carrera, José A. Blakeley, Umar Farooq Minhas, Nikola Vujic:
The state of SQL-on-Hadoop in the cloud. 1432-1443 - Emily Grace, Ankit Rai, Elissa M. Redmiles, Rayid Ghani:
Detecting fraud, corruption, and collusion in international development contracts: The design of a proof-of-concept automated system. 1444-1453 - Michele Samorani, Farrukh Ahmed, Osmar R. Zaïane:
Automatic generation of relational attributes: An application to product returns. 1454-1463 - Syed Yousaf Shah, Brent Paulovicks, Petros Zerfos:
Data-at-rest security for Spark. 1464-1473 - Mylene Simon, Joe Chalfoun, Mary Brady, Peter Bajcsy:
Do we trust image measurements? Variability, accuracy and traceability of image features. 1474-1482 - Sreenivas R. Sukumar, Michael A. Matheson, Ramakrishnan Kannan, Seung-Hwan Lim:
Mini-apps for high performance data analysis. 1483-1492 - Tomasz Tajmajer, Malwina Splawinska, Piotr Wasilewski, Stan Matwin:
Predicting annual average daily highway traffic from large data and very few measurements. 1493-1501 - Ganesh Venkataraman, Abhimanyu Lad, Lin Guo, Shakti Sinha:
Fast, lenient and accurate: Building personalized instant search experience at LinkedIn. 1502-1511 - Hui Wu, Yi Fang, Huming Wu, Shenhong Zhu:
Diversifying trending topic discovery via Semidefinite Programming. 1512-1521 - Xuchao Zhang, Zhiqian Chen, Weisheng Zhong, Arnold P. Boedihardjo, Chang-Tien Lu:
Storytelling in heterogeneous Twitter entity network based on hierarchical cluster routing. 1522-1531 - Wenjun Zhou, Yun Zhu, Faizan Javed, Mahmudur Rahman, Janani Balaji, Matt McNair:
Quantifying skill relevance to job titles. 1532-1541 - Zhenyun Zhuang, Haricharan Ramachandra, Badri Sridharan, Brandon Duncan, Kishore Gopalakrishna, Jean-Francois Im:
SmartCache: Application layer caching to improve performance of large-scale memory mapping. 1542-1550 - Zahra Zohrevand, Uwe Glässer, Hamed Yaghoubi Shahir, Mohammad A. Tayebi, Robert Costanzo:
Hidden Markov based anomaly detection for water supply systems. 1551-1560 - Ilaria Bordino, Andrea Ferretti, Marco Firrincieli, Francesco Gullo, Marcello Paris, Stefano Pascolutti, Gianluca Sabena:
Advancing NLP via a distributed-messaging approach. 1561-1568 - Luca Cazzanti, Antonio Davoli, Leonardo Maria Millefiori:
Automated port traffic statistics: From raw data to visualisation. 1569-1573 - Hongfeng Chai, Hao Liu, Xibo Zhou, Yanjun Xu, Shuo He, Jinzhi Hua, Dongjie He, Weihuai Liu:
UStore: An optimized storage system for enterprise data warehouses at UnionPay. 1574-1578 - Vinay Deolalikar, Hernan Laffitte:
Extensive large-scale study of error surfaces in sampling-based distinct value estimators for databases. 1579-1586 - Amita Gajewar, Lizhong Wu, Jignesh Parmar, Ramana Yerneni:
Forecasting squatting of demand in display advertising. 1587-1594 - Archana Ganapathi, Yanpei Chen:
Data quality: Experiences and lessons from operationalizing big data. 1595-1602 - Nancy W. Grady:
KDD meets Big Data. 1603-1608 - Rajaraman Kanagasabai, Anitha Veeramani, Shangfeng Hu, Sangaralingam Kajanan, Giuseppe Manai:
Classification of massive mobile web log URLs for customer profiling & analytics. 1609-1614 - Masahiro Kazama, Issei Sato, Haruaki Yatabe, Tairiku Ogihara, Tetsuro Onishi, Hiroshi Nakagawa:
Company recommendation for new graduates via implicit feedback multiple matrix factorization with Bayesian optimization. 1615-1620 - Yiming Kong, Hui Zang, Xiaoli Ma:
Human network usage patterns revealed by telecom data. 1621-1626 - Leonardo Maria Millefiori, Dimitrios Zissis, Luca Cazzanti, Gianfranco Arcieri:
A distributed approach to estimating sea port operational regions from lots of AIS data. 1627-1632 - Thibaud Nesztler, Don Kasper, Michael Georgescu, Sophie Loire, Igor Mezic:
Uniformization, organization, association and use of metadata from multiple content providers and manufacturers: A close look at the Building Automation System (BAS) sector. 1633-1638 - Derrick C. Spell, Ling-Yong Wang, Richard T. Shomer, Bahador Nooraei, Jarrell Waggoner, Xiao-Han T. Zeng, Jae Young Chung, Kai-Chen Cheng, Daniel Kirsche:
QED: Groupon's ETL management and curated feature catalog system for machine learning. 1639-1646 - Ljiljana Stojanovic, Marko Dinic, Nenad Stojanovic, Aleksandar Stojadinovic:
Big-data-driven anomaly detection in industry (4.0): An approach and a case study. 1647-1652 - Jiejun Xu, Samuel D. Johnson, Kang-Yu Ni:
Cross-modal event summarization: A network of networks approach. 1653-1657 - Teruyoshi Zenmyo, Satoshi Iijima, Ichiro Fukuda:
Managing a complicated workflow based on dataflow-based workflow scheduler. 1658-1663 - Li Zhou, Yinglong Xia, Hui Zang, Jian Xu, Mingzhen Xia:
An edge-set based large scale graph processing system. 1664-1669 - Nora Alkhamees, Maria Fasli:
Event detection from social network streams using frequent pattern mining with dynamic support values. 1670-1679 - Victor Perazzolo Barros, Pollyana Notargiacomo:
Big data analytics in cloud gaming: Players' patterns recognition using artificial neural networks. 1680-1689 - Nada Basit, Yutong Zhang, Hao Wu, Haoran Liu, Jieming Bin, Yijun He, Abdeltawab M. Hendawi:
MapReduce-based deep learning with handwritten digit recognition case study. 1690-1699 - Giuseppe Bruno:
Text mining and sentiment extraction in central bank documents. 1700-1708 - Philip Thruesen, Jaroslav Cechák, Blandine Seznec, Roel Castalio, Nattiya Kanhabua:
To link or not to link: Ranking hyperlinks in Wikipedia using collective attention. 1709-1718 - Ismail Duru, Gulustan Dogan, Banu Diri:
An overview of studies about students' performance analysis and learning analytics in MOOCs. 1719-1723 - Brahim Hnich, Faisal R. Al-Osaimi, Ata Sasmaz, Ozkan Sayin, Amine Lamine, Majed AlOtaibi:
Smart online vehicle tracking system for security applications. 1724-1733 - Hsiao-Wei Hu, Hao-Chen Chang, Wen-Shiu Lin:
An optimized frequent pattern mining algorithm with multiple minimum supports. 1734-1741 - Ammar Jabakji, Hasan Dag:
Improving item-based recommendation accuracy with user's preferences on Apache Mahout. 1742-1749 - Sampath Jayarathna, Faryaneh Poursardar:
Change detection and classification of digital collections. 1750-1759 - Yerzhan Kerimbekov, Hasan Sakir Bilge:
A feature selection method based on Lorentzian metric. 1760-1767 - Sercan Külcü, Erdogan Dogdu, A. Murat Ozbayoglu:
A survey on semantic Web and big data technologies for social network analysis. 1768-1777 - Quanzhi Li, Sameena Shah, Rui Fang:
Table classification using both structure and content information: A case study of financial documents. 1778-1783 - Xiao Li, Reza Sharifi Sedeh, Liao Wang, Yang Yang:
Patient-record level integration of de-identified healthcare big databases. 1784-1786 - Bingchuan Liu, Yudong Tan, Huimin Zhou:
A Bayesian predictor of airline class seats based on multinomial event model. 1787-1791 - Busra Mutlu, Merve Mutlu, Kasim Oztoprak, Erdogan Dogdu:
Identifying trolls and determining terror awareness level in social networks using a scalable framework. 1792-1798 - Aparna Oruganti, Fangzhou Sun, Hiba Baroud, Abhishek Dubey:
DelayRadar: A multivariate predictive model for transit systems. 1799-1806 - A. Murat Ozbayoglu, Yusuf Gökhan Küçükayan, Erdogan Dogdu:
A real-time autonomous highway accident detection model based on big data processing and computational intelligence. 1807-1813 - Francisco Padillo, José María Luna, Sebastián Ventura:
Subgroup discovery on big data: Pruning the search space on exhaustive search algorithms. 1814-1823 - Paul Raff, Ze Jin:
The difference-of-datasets framework: A statistical method to discover insight. 1824-1831 - Yehezkel S. Resheff:
Online trajectory segmentation and summary with applications to visualization and retrieval. 1832-1840 - Ali Sekmen, Akram Aldroubi, Ahmet Bugra Koku:
Skeleton decomposition analysis for subspace clustering. 1841-1848 - Omer Berat Sezer, Erdogan Dogdu, A. Murat Ozbayoglu, Aras Onal:
An extended IoT framework with semantics, big data, and analytics. 1849-1856 - M. Omair Shafiq:
Event segmentation using MapReduce based big data clustering. 1857-1866 - Madhu Shashanka, Min-Yi Shen, Jisheng Wang:
User and entity behavior analytics for enterprise security. 1867-1874 - Thamarai Selvi Somasundaram, Kannan Govindarajan, Vivekanandan Suresh Kumar:
Swarm Intelligence (SI) based profiling and scheduling of big data applications. 1875-1880 - Jenq-Haur Wang, Jia-Zhi Lin:
Improving clustering efficiency by SimHash-based K-Means algorithm for big data analytics. 1881-1888 - Yuchen Wu, Jianbo Yuan, Quanzeng You, Jiebo Luo:
The effect of pets on happiness: A data-driven approach via large-scale social media. 1889-1894 - Ozlem Yavanoglu:
Intelligent authorship identification with using Turkish newspapers metadata. 1895-1900 - Jianbo Yuan, Walid Shalaby, Mohammed Korayem, David Lin, Khalifeh AlJadda, Jiebo Luo:
Solving cold-start problem in large-scale recommendation engines: A deep learning approach. 1901-1910 - Kai Zhao, Sasu Tarkoma, Siyuan Liu, Huy T. Vo:
Urban human mobility data mining: An overview. 1911-1920 - Yiheng Zhou, Numair Sani, Jiebo Luo:
Fine-grained mining of illicit drug use patterns using social multimedia data from Instagram. 1921-1930 - Zhenwei Du, Haopeng Chen, Jianwei Jiang:
Research on the big data system of massive open online course. 1931-1936 - Srinivasa Rao Kundeti, J. Vijayananda, Srikanth Mujjiga, M. Kalyan:
Clinical named entity recognition: Challenges and opportunities. 1937-1945 - Tsau-Young Lin:
Very fast frequent itemset mining: Simplicial complex methods (Extended abstract). 1946-1949 - G. S. Smrithy, Sathyan Munirathinam, Ramadoss Balakrishnan:
Online anomaly detection using non-parametric technique for big data streams in cloud collaborative environment. 1950-1955 - Shusaku Tsumoto, Michinori Nakata, Hiroshi Sakai, Chenxi Liu:
A proposal of a privacy-preserving questionnaire by non-deterministic information and its analysis. 1956-1965 - Parul Sharma, Teng-Sheng Moh:
Prediction of Indian election using sentiment analysis on Hindi Twitter. 1966-1971 - Shusaku Tsumoto, Shoji Hirano, Haruko Iwata:
Construction of clinical pathway from histories of clinical actions in hospital information system. 1972-1981 - Shusaku Tsumoto, Shoji Hirano, Haruko Iwata, Norio Yoshimoto, Tomohiro Kimura:
Mining process for improvement of clinical process quality. 1982-1990 - Yan Zhu, Melody Moh, Teng-Sheng Moh:
Multi-layer text classification with voting for consumer reviews. 1991-1999 - Shakti Awaghad:
SCEM: Smart & effective crowd management with a novel scheme of big data analytics. 2000-2003 - Alexander Brodsky, Mohan Krishnamoorthy, William Z. Bernstein, M. Omar Nachawati:
A system and architecture for reusable abstractions of manufacturing processes. 2004-2013 - Max Ferguson, Kincho H. Law, Raunak Bhinge, David Dornfeld, Jinkyoo Park, Yung-Tsun Tina Lee:
Evaluation of a PMML-based GPR scoring engine on a cloud platform and microcomputer board for smart manufacturing. 2014-2023 - Jeff Hebert:
Predicting rare failure events using classification trees on large scale manufacturing data with complex interactions. 2024-2028 - Ankita Mangal, Nishant Kumar:
Using big data to enhance the Bosch production line performance: A Kaggle challenge. 2029-2035 - Abhinav Maurya:
Bayesian optimization for predicting rare internal failures in manufacturing processes. 2036-2045 - Bohdan M. Pavlyshenko:
Machine learning, linear and Bayesian models for logistic regression in failure detection problems. 2046-2050 - Srinivasan Radhakrishnan, Sagar V. Kamarthi:
Convergence and divergence in academic and industrial interests on IOT based manufacturing. 2051-2056 - Srinivasan Radhakrishnan, Sagar V. Kamarthi:
Complexity-entropy feature plane for gear fault detection. 2057-2061 - Dazhong Wu, Connor Jennings, Janis P. Terpenny, Soundar R. T. Kumara:
Cloud-based machine learning for predictive analytics: Tool wear prediction in milling. 2062-2069 - Darui Zhang, Bin Xu, Jasmine Wood:
Predict failures in production lines: A two-stage approach with clustering and supervised learning. 2070-2074 - Aharon Abadi, Ashraf Haib, Roie Melamed, Alaa Nassar, Aidan Shribman, Hisham Yasin:
Holistic disaster recovery approach for big data NoSQL workloads. 2075-2080 - Genady Ya. Grabarnik, Mauro Tortonesi, Larisa Shwartz:
Data-driven cloud-based IT services performance forecasting. 2081-2086 - John Harney, Seung-Hwan Lim, Sreenivas R. Sukumar, Dale Stansberry, Peter Xenopoulos:
On-demand data analytics in HPC environments at leadership computing facilities: Challenges and experiences. 2087-2096 - Katsunori Miura, Tazro Ohta, Courtney Powell, Masaharu Munetomo:
Intercloud brokerages based on PLS method for deploying infrastructures for big data analytics. 2097-2102 - Kayhan Moharreri, Jayashree Ramanathan, Rajiv Ramnath:
Motivating dynamic features for resolution time estimation within IT operations management. 2103-2108 - Alexander C. Shulyak, Lizy K. John:
Identifying performance bottlenecks in Hive: Use of processor counters. 2109-2114 - Alok Singh, Eric G. Stephan, Todd Elsethagen, Matt MacDuff, Bibi Raju, Malachi Schram, Kerstin Kleese van Dam, Darren J. Kerbyson, Ilkay Altintas:
Leveraging large sensor streams for robust cloud control. 2115-2120 - Shuang Song, Xinnian Zheng, Andreas Gerstlauer, Lizy K. John:
Fine-grained power analysis of emerging graph processing workloads for cloud operations management. 2121-2126 - Konstantinos Tsakalozos, Cory Johns, Kevin Monroe, Pete VanderGiessen, Andrew Mcleod, Antonio Rosales:
Open big data infrastructures to everyone. 2127-2129 - Shahbaz Atta, Bilal Sadiq, Akhlaq Ahmad, Sheikh Nasir Saeed, Emad A. Felemban:
Spatial-crowd: A big data framework for efficient data visualization. 2130-2138 - Anne M. Denton, Mostofa Ahsan, David W. Franzen, John Nowatzki:
Multi-scalar analysis of geospatial agricultural data for sustainability. 2139-2146 - Luciano Gervasoni, Martí Bosch, Serge Fenet, Peter F. Sturm:
A framework for evaluating urban land use mix from crowd-sourcing data. 2147-2156 - Thong Hoang, Pei Hua Cher, Philips Kokoh Prasetyo, Ee-Peng Lim:
Crowdsensing and analyzing micro-event tweets for public transportation insights. 2157-2166 - Yu Ichifuji, Yoshihide Matsuo, Noriaki Koide, Nobuhiro Akashi, Yoshitaka Terai, Toru Kobayashi:
A study for understanding of tourist person trip pattern based on log data of Wi-Fi access points. 2167-2174 - Noriaki Koide, Yu Ichifuji, Hideki Yoshii, Noboru Sonehara:
Estimation of national tourism statistics based on Wi-Fi association log data. 2175-2179 - Gaurav Paruthi, Enrique Frías-Martínez, Vanessa Frías-Martínez:
Peer-to-peer microlending platforms: Characterization of online traits. 2180-2189 - Caleb Robinson, Arezoo Shirazi, Mengmeng Liu, Bistra Dilkina:
Network optimization of food flows in the U.S. 2190-2198 - Aki-Hiro Sato, Tsutomu Watanabe:
Measuring activities and values of industrial clusters based on job opportunity data collected from an internet Japanese job matching site. 2199-2208 - Xiaoyan Shao, Siyuan Lu, Theodore G. van Kessel, Hendrik F. Hamann, Leda Daehler, Jeffrey Cwagenberg, Alan Li:
Solar irradiance forecasting by machine learning for solar car races. 2209-2216 - Hiroshi Tsuda, Masakazu Ando, Yu Ichifuji:
Hotel plan popularity factor analysis of hotels in the Keihanshin region. 2217-2224 - Laura L. Tupper, David S. Matteson, John C. Handley:
Mixed data and classification of transit stops. 2225-2232 - Mahwish Arif, Hans Vandierendonck, Dimitrios S. Nikolopoulos, Bronis R. de Supinski:
A scalable and composable map-reduce system. 2233-2242 - Amit Gupta, Weijia Xu, Natalia Ruiz-Juri, Kenneth Perrine:
A workload aware model of computational resource selection for big data applications. 2243-2250 - Sunwoo Lee, Wei-keng Liao, Ankit Agrawal, Nikos Hardavellas, Alok N. Choudhary:
Evaluation of K-means data clustering algorithm on Intel Xeon Phi. 2251-2260 - Ruoqian Liu, Ankit Agrawal, Wei-keng Liao, Alok N. Choudhary, Marc De Graef:
Materials discovery: Understanding polycrystals from large-scale electron patterns. 2261-2269 - Fang (Cherry) Liu, Fu Shen, Duen Horng Chau, Neil Bright, Mehmet Belgin:
Building a research data science platform from industrial machines. 2270-2275 - Lauritz Thamsen, Thomas Renner, Marvin Byfeld, Markus Paeschke, Daniel Schroder, Felix Bohm:
Visually programming dataflows for distributed data analytics. 2276-2285 - Peter Xenopoulos, Jamison Daniel, Michael A. Matheson, Sreenivas R. Sukumar:
Big data analytics on HPC architectures: Performance and cost. 2286-2295 - Weijia Xu, Natalia Ruiz-Juri, Amit Gupta, Amanda Deering, Chandra R. Bhat, James Kuhr, Jackson Archer:
Supporting large scale connected vehicle data analysis using HIVE. 2296-2304 - Lina Yu, Hongfeng Yu:
Legion-based scientific data analytics on heterogeneous processors. 2305-2314 - Juan Lin, Di Zhong, Yiwen Zhong, Hui Zhang:
Accelerating mathematical knot simulations with R on the web. 2315-2321 - Yanfu Zhou, Jieting Wu, Lina Yu, Hongfeng Yu, Zhenghong Tang:
A geohydrologic data visualization framework with an extendable user interface design. 2322-2331 - Jian Zou, Chuqin Huang:
Efficient portfolio allocation with sparse volatility estimation for high-frequency financial data. 2332-2341 - James Crist:
Dask & Numba: Simple libraries for optimizing scientific python code. 2342-2343 - Vishnu Gowda Harish, Vinay Kumar Bingi, John A. Miller:
A big data platform integrating compressed linear algebra with columnar databases. 2344-2352 - Ruoqian Liu, Diana Palsetia, Arindam Paul, Reda Al-Bahrani, Dipendra Jha, Wei-keng Liao, Ankit Agrawal, Alok N. Choudhary:
PinterNet: A thematic label curation tool for large image datasets. 2353-2362 - Geoffrey Mon, Milad Makkie, Xiang Li, Tianming Liu, Shannon Quinn:
Implementing dictionary learning in Apache Flink, Or: How I learned to relax and love iterations. 2363-2367 - Hatef Monajemi, David L. Donoho, Victoria Stodden:
Making massive computational experiments painless. 2368-2373 - Ella Peltonen, Eemil Lagerspetz, Petteri Nurmi, Sasu Tarkoma:
Too big to mail: On the way to publish large-scale mobile analytics data. 2374-2377 - Zhou Xing, Marzieh Parandehgheibi, Fei Xiao, Nilesh Kulkarni, Chris Pouliot:
Content-based recommendation for podcast audio-items using natural language processing techniques. 2378-2383 - Sylvain Hallé, Sébastien Gaboury, Raphaël Khoury:
A glue language for event stream processing. 2384-2391 - Christopher Hillman, Karen E. Petrie, Andrew Cobley, Mark Whitehorn:
Real-time processing of proteomics data: The internet of things and the connected laboratory. 2392-2399 - Yaser Keneshloo, Shuguang Wang, Eui-Hong Sam Han, Naren Ramakrishnan:
Predicting the shape and peak time of news article views. 2400-2409 - Kohei Nakamura, Ami Hayashi, Hiroki Matsutani:
An FPGA-based low-latency network processing for Spark Streaming. 2410-2415 - Joshua Plasse, Niall M. Adams:
Handling delayed labels in temporally evolving data streams. 2416-2424 - Athena Vakali, Paschalis Korosoglou, Pavlos Daoglou:
A multi-layer software architecture framework for adaptive real-time analytics. 2425-2430 - Yongyi Xian, Chuanfei Xu, Yan Liu:
Implementing trajectory data stream analysis in parallel. 2431-2436 - Jaime Alonso-Lorenzo, Enrique Costa-Montenegro, Milagros Fernández Gavilanes:
Language independent big-data system for the prediction of user location on Twitter. 2437-2446 - Linda Camilla Boldt, Vinothan Vinayagamoorthy, Florian Winder, Melanie Schnittger, Mats Ekran, Raghava Rao Mukkamala, Niels Buus Lassen, Benjamin Flesch, Abid Hussain, Ravi Vatrapu:
Forecasting Nike's sales using Facebook data. 2447-2456 - Seung-Woo Choi, Aviv Segev:
Finding informative comments for video viewing. 2457-2465 - Anahita Davoudi, Mainak Chatterjee:
Prediction of information diffusion in social networks using dynamic carrying capacity. 2466-2469 - Yang Feng, Jiebo Luo:
When do luxury cars hit the road? Findings by a big data approach. 2470-2474 - David Watts, K. M. George, Ashwin Kumar T. K, Zenia Arora:
Tweet sentiment as proxy for political campaign momentum. 2475-2484 - Ryohei Hisano:
A new approach to building the interindustry input-output table using block estimation techniques. 2485-2494 - Atushi Ishikawa, Shouji Fujimoto, Takayuki Mizuno:
Nowcast of firm sales using POS data toward stock market stability. 2495-2499 - Yuka Kamiko, Mitsuo Yoshida, Hirotada Ohashi, Fujio Toriumi:
Uncovering information flow among users by time-series retweet data: Who is a friend of whom on Twitter? 2500-2504 - Rishemjit Kaur, Kazutoshi Sasahara:
Quantifying moral foundations from various topics on Twitter conversations. 2505-2512 - Yasuko Kawahata, Tamio Koyama:
Application of an integer-valued autoregressive model to hit phenomena. 2513-2517 - Hirotaka Kawazu, Fujio Toriumi, Masanori Takano, Kazuya Wada, Ichiro Fukuda:
Analytical method of web user behavior using Hidden Markov Model. 2518-2524 - Eyad Makki, Lin-Ching Chang:
Leveraging social big data for performance evaluation of E-commerce websites. 2525-2534 - Rubén Tous, Otto Wüst, Mauro Gomez, Jonatan Poveda, Marc Elena, Jordi Torres, Mouna Makni, Eduard Ayguadé:
User-generated content curation with deep convolutional neural networks. 2535-2540 - Yu Wang, Yang Feng, Jiebo Luo, Xiyang Zhang:
Pricing the woman card: Gender politics between Hillary Clinton and Donald Trump. 2541-2544 - Daniel Xie, Jiejun Xu, Tsai-Ching Lu:
Automated classification of extremist Twitter accounts using content-based and network-based features. 2545-2549 - Edmon Begoli, Derek Kistler, Jack Bates:
Towards a heterogeneous, polystore-like data architecture for the US Department of Veteran Affairs (VA) enterprise analytics. 2550-2554 - Subhasis Dasgupta, Kevin L. Coakley, Amarnath Gupta:
Analytics-driven data ingestion and derivation in the AWESOME polystore. 2555-2564 - Evgeny Kharlamov, Theofilos P. Mailis, Konstantina Bereta, Dimitris Bilidas, Sebastian Brandt, Ernesto Jiménez-Ruiz, Steffen Lamparter, Christian Neuenstadt, Özgür L. Özçep, Ahmet Soylu, Christoforos Svingos, Guohui Xiao, Dmitriy Zheleznyakov, Diego Calvanese, Ian Horrocks, Martin Giese, Yannis E. Ioannidis, Yannis Kotidis, Ralf Möller, Arild Waaler:
A semantic approach to polystores. 2565-2573 - Boyan Kolev, Raquel Pau, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jiménez-Peris, José Pereira:
Benchmarking polystores: The CloudMdsQL experience. 2574-2579 - Vasilis Spyropoulos, Christina Vasilakopoulou, Yannis Kotidis:
Digree: A middleware for a graph databases polystore. 2580-2589 - Abdeltawab M. Hendawi, Fatemah Alali, Xiaoyu Wang, Yunfei Guan, Tianshu Zhou, Xiao Liu, Nada Basit, John A. Stankovic:
Hobbits: Hadoop and Hive based Internet traffic analysis. 2590-2599 - Sangkeun Lee, Liangzhe Chen, Sisi Duan, Supriya Chinthavali, Mallikarjun Shankar, B. Aditya Prakash:
URBAN-NET: A network-based infrastructure monitoring and analysis system for emergency management and public safety. 2600-2609 - Gandhi Sivakumar, Drew Johnson, Rashida Hodge:
Unravelling the myth of big data and artificial intelligence in sustainable natural resource development. 2610-2615 - Joya A. Deri, Franz Franchetti, José M. F. Moura:
Big data computation of taxi movement in New York City. 2616-2625 - Holly Ferguson, Charles Vardeman, Jarek Nabrzyski:
Linked data view methodology and application to BIM alignment and interoperability. 2626-2635 - Rafal A. Angryk, Douglas E. Galarus:
The SMART approach to comprehensive quality assessment of site-based spatial-temporal data. 2636-2645 - Upa Gupta, Kulsawasd Jitkajornwanich, Ramez Elmasri, Leonidas Fegaras:
Adapting K-means clustering to identify spatial patterns in storms. 2646-2654 - Behnam Hedayatnia, Mehrdad Yazdani, Mai H. Nguyen, Jessica Block, Ilkay Altintas:
Determining feature extractors for unsupervised learning on satellite images. 2655-2663 - Andrew Hulbert, Thomas Kunicki, James N. Hughes, Anthony D. Fox, Christopher N. Eichelberger:
An experimental study of big spatial data systems. 2664-2671 - Siyuan Lu, Xiaoyan Shao, Marcus Freitag, Levente J. Klein, Jason D. Renwick, Fernando J. Marianno, Conrad M. Albrecht, Hendrik F. Hamann:
IBM PAIRS curated big data service for accelerated geospatial data analytics and discovery. 2672-2675 - Chengcheng Mou, Shaoping Chen, Yi-Cheng Tu:
A comparative study of dual-tree algorithm implementations for computing 2-body statistics in spatial data. 2676-2685 - Ivens Portugal, Paulo S. C. Alencar, Donald D. Cowan:
Towards a provenance-aware spatial-temporal architectural framework for massive data integration and analysis. 2686-2691 - Alan Woodley, Ling-Xiang Tang, Shlomo Geva, Richi Nayak, Timothy Chappell:
Using parallel hierarchical clustering to address spatial big data challenges. 2692-2698 - Chien-Heng Wu, Franco Lin, Wen-Yi Chang, Whey-Fone Tsai, Hsi-Ching Lin, Chao-Tung Yang:
Big data development platform for engineering applications. 2699-2702 - Jiangye Yuan, Hsiu-Han Lexie Yang, Olufemi A. Omitaomu, Budhendra L. Bhaduri:
Large-scale solar panel mapping from aerial images using deep convolutional networks. 2703-2708 - Yu Zhuang:
Symmetric repositioning of bisecting K-means centers for increased reduction of distance calculations for big data clustering. 2709-2715 - Anton Gulenko, Marcel Wallschläger, Florian Schmidt, Odej Kao, Feng Liu:
Evaluating machine learning algorithms for anomaly detection in clouds. 2716-2721 - Teemu Kanstrén, Jussi Liikka, Jukka Mäkelä, Markus Luoto, Jarmo Prokkola:
Preliminary big data in a 5G test network. 2722-2727 - Yiming Kong, Hui Zang, Xiaoli Ma:
Quick model fitting using a classifying engine. 2728-2733 - Ruilin Liu, Kai Yang, Yanjia Sun, Tao Quan, Jin Yang:
Spark-based rare association rule mining for big datasets. 2734-2739 - Martino Trevisan, Idilio Drago, Marco Mellia, Han Hee Song, Mario Baldi:
WHAT: A big data approach for accounting of modern web services. 2740-2745 - Azadeh Eftekhari, Farhana H. Zulkernine, Patrick Martin:
BINARY: A framework for big data integration for ad-hoc querying. 2746-2753 - Ellis R. Giles:
Container-based virtualization for byte-addressable NVM data storage. 2754-2763 - Meike Klettke, Uta Störl, Manuel Shenavai, Stefanie Scherzinger:
NoSQL schema evolution and big data migration at scale. 2764-2774 - Aravind Mohan, Mahdi Ebrahimi, Shiyong Lu, Alexander Kotov:
Scheduling big data workflows in the cloud under budget constraints. 2775-2784 - Daniel Playfair, Amitabh Trehan, Barry McLarnon, Dimitrios S. Nikolopoulos:
Big data availability: Selective partial checkpointing for in-memory database queries. 2785-2794 - Nico Rödder, David Dauer, Kevin Laubis, Paul Karaenke, Christof Weinhardt:
The digital transformation and smart data analytics: An overview of enabling developments and application areas. 2795-2802 - Matthieu-P. Schapranow, Matthias Uflacker, Murat Sariyar, Sebastian C. Semler, Johannes Klaus Fichte, Dietmar Schielke, Kismet Ekinci, Thomas Zahn:
Towards an integrated health research process: A cloud-based approach. 2813-2818 - Merlijn Sebrechts, Sander Borny, Thomas Vanhove, Gregory van Seghbroeck, Tim Wauters, Bruno Volckaert, Filip De Turck:
Model-driven deployment and management of workflows on analytics frameworks. 2819-2826 - Daniel Seybold, Nicolas Wagner, Benjamin Erb, Jörg Domaschka:
Is elasticity of scalable databases a myth? 2827-2836 - Alexander Stiemer, Ilir Fetai, Heiko Schuldt:
Analyzing the performance of data replication and data partitioning in the cloud: The BEOWULF approach. 2837-2846 - Miguel G. Xavier, Kassiano J. Matteussi, Fabian Lorenzo, César A. F. De Rose:
Understanding performance interference in multi-tenant cloud databases and web applications. 2847-2852 - Bonnie J. Dorr, Peter C. Fontana, Craig S. Greenberg, Marion Le Bras, Mark A. Przybocki:
Evaluation-driven research in data science: Leveraging cross-field methodologies. 2853-2862 - Frank S. Haug:
Bad big data science. 2863-2871 - Jeffrey S. Saltz, Ivan Shamshurin:
Big data team process methodologies: A literature review and the identification of key factors for a project's success. 2872-2879 - Pankush Kalgotra, Ramesh Sharda:
Progression analysis of signals: Extending CRISP-DM to stream analytics. 2880-2885 - Vijay Dipti Kumar, Paulo S. C. Alencar:
Software engineering for big data projects: Domains, methodologies and gaps. 2886-2895 - Sohini Roychowdhury, Johnny Ren:
Non-deep CNN for multi-modal image classification and feature learning: An Azure-based model. 2803-2812 - Jeffrey S. Saltz, Sibel Yilmazel, Özgür Yilmazel:
Not all software engineers can become good data engineers. 2896-2901 - Toshiyuki Shimono:
A hacking toolset for big tabular files (Codenames: Bin4tsv, Kabutomushi). 2902-2910 - Sandro Fiore, Marcin Plóciennik, Charles M. Doutriaux, Cosimo Palazzo, J. Boutte, Tomasz Zok, Donatello Elia, Michal Owsiak, Alessandro D'Anca, Z. Shaheen, Riccardo Bruno, Marco Fargetta, Miguel Caballer, Germán Moltó, Ignacio Blanquer, Roberto Barbera, Mário David, Giacinto Donvito, Dean N. Williams, V. Anantharaj, Davide Salomoni, Giovanni Aloisio:
Distributed and cloud-based multi-model analytics experiments on large volumes of climate change data in the earth system grid federation eco-system. 2911-2918 - Jason Laura, Robin L. Fergason:
Modeling Martian thermal inertia in a distributed memory high performance computing environment. 2919-2928 - Adam M. Leadbetter, Damian Smyth, Robert Fuller, Eoin O'Grady, Adam Shepherd:
Where big data meets linked data: Applying standard data models to environmental data streams. 2929-2937 - Ryuya Mitsuhashi, Hideyuki Kawashima, Takahiro Nishimichi, Osamu Tatebe:
Three-dimensional spatial join count exploiting CPU optimized STR R-tree. 2938-2947 - Amidu Oloso, Kwo-Sen Kuo, Thomas L. Clune, Paul Brown, Alex Poliakov, Hongfeng Yu:
Implementing connected component labeling as a user defined operator for SciDB. 2948-2952 - Kevin Paul, Sheri A. Mickelson, John M. Dennis:
A new parallel python tool for the standardization of earth system model data. 2953-2959 - Michael Requa, Garrison Vaughan, John David, Ben Cotton:
Using cloud bursting to count trees and shrubs in Sub-Saharan Africa. 2960-2963 - Brian Wilson, Rahul Palamuttam, Kim Whitehall, Chris Mattmann, Alex Goodman, Maziyar Boustani, Sujen Shah, Paul Zimdars, Paul M. Ramirez:
SciSpark: Highly interactive in-memory science data analytics. 2964-2973 - Shujia Zhou, Xiaowen Li, Toshihisa Matsui, Wei-Kuo Tao:
Visualization and diagnosis of earth science data through Hadoop and Spark. 2974-2980 - Ellis Giles, Kshitij A. Doshi, Peter J. Varman:
Persisting in-memory databases using SCM. 2981-2990 - Zhihao Huang, Hui Li, Xin Li, Wei He:
SS-dedup: A high throughput stateful data routing algorithm for cluster deduplication system. 2991-2995 - Xin Li, Hui Li, Zhihao Huang, Bing Zhu, Jiawei Cai:
EStore: An effective optimized data placement structure for Hive. 2996-3001 - Si Liu, Eun-Sung Jung, Rajkumar Kettimuthu, Xian-He Sun, Michael E. Papka:
Towards optimizing large-scale data transfers with end-to-end integrity verification. 3002-3007 - Thomas Renner, Lauritz Thamsen, Odej Kao:
CoLoc: Distributed data and container colocation for data-intensive applications. 3008-3015 - Holly Ferguson, Charles Vardeman, Jarek Nabrzyski:
Linked data platform for building cloud-based smart applications and connecting API access points with data discovery techniques. 3016-3025 - Ajinkya Prabhune, Hasebullah Ansari, Anil Keshav, Rainer Stotzka, Michael Gertz, Jürgen Hesser:
MetaStore: A metadata framework for scientific data repositories. 3026-3035 - Ulrich Schwardmann:
Automated schema extraction for PID information types. 3036-3044 - Priyaa Thavasimani, Paolo Missier:
Facilitating reproducible research by investigating computational metadata. 3045-3051 - Sudharshan S. Vazhkudai, John Harney, Raghul Gunasekaran, Dale Stansberry, Seung-Hwan Lim, Tom Barron, Andrew Nash, Arvind Ramanathan:
Constellation: A science graph network for scalable data and knowledge discovery in extreme-scale scientific collaborations. 3052-3061 - Guangxia Xu, Jin Qi, Deling Huang, Mahmoud Daneshmand:
Detecting spammers on social networks based on a hybrid model. 3062-3068 - Liudong Zuo, Mengxia Michelle Zhu:
Bandwidth provision strategies for reliable data movements in dedicated networks. 3069-3078 - Radhakrishnan Angamuthu Chinnathambi, Prakash Ranganathan:
Investigation of forecasting methods for the hourly spot price of the day-ahead electric power markets. 3079-3086 - Hông-Ân Cao, Felix Rauchenstein, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes:
Leveraging user expertise in collaborative systems for annotating energy datasets. 3087-3096 - Hông-Ân Cao, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes:
Temporal association rules for electrical activity detection in residential homes. 3097-3106 - Saman Mostafavi, Benjamin Futrell, John Troxler, Robert W. Cox:
Leveraging cloud computing to convert the non-intrusive load monitor into a powerful framework for grid-responsive buildings. 3107-3114 - Shady S. Refaat, Haitham Abu-Rub, Amira Mohamed:
Big data, better energy management and control decisions for distribution systems in smart grid. 3115-3120 - Viktor Botev, Magnus Almgren, Vincenzo Gulisano, Olaf Landsiedel, Marina Papatriantafilou, Joris van Rooij:
Detecting non-technical energy losses through structural periodic patterns in AMI data. 3121-3130 - Andreas Unterweger, Dominik Engel:
Lossless compression of high-frequency voltage and current data in smart grids. 3131-3139 - Berkay Aydin, Ahmet Küçük, Rafal A. Angryk:
Indexing spatiotemporal relations in solar event datasets. 3140-3148 - Soukaina Filali Boubrahimi, Berkay Aydin, Dustin Kempton, Rafal A. Angryk:
Spatio-temporal interpolation methods for solar events metadata. 3149-3157 - Jon M. Jenkins:
Processing and managing the Kepler mission's treasure trove of stellar and exoplanet data. 3158-3167 - Dustin J. Kempton, Michael A. Schuh, Rafal A. Angryk:
Describing solar images with sparse coding for similarity search. 3168-3176 - Ruizhe Ma, Rafal A. Angryk, Pete Riley:
A data-driven analysis of interplanetary coronal mass ejecta and magnetic flux ropes. 3177-3186 - Simon Marcin, André Csillaghy:
Running scientific algorithms as array database operators: Bringing the processing power to the data. 3187-3193 - Andrés Muñoz-Jaramillo, Z. A. Werginz, J. P. Vargas-Acosta, M. D. DeLuca, J. C. Windmueller, J. Zhang, D. W. Longcope, Derek A. Lamb, C. E. DeForest, S. Vargas-Dominguez, J. W. Harvey, P. C. H. Martens:
The best of both worlds: Using automatic detection and limited human supervision to create a homogenous magnetic catalog spanning four solar cycles. 3194-3203 - Ryan J. Oelkers, Keivan G. Stassun, Joshua A. Pepper, Nathan M. De Lee, Martin A. Paegert:
An input catalog and target selection for the transiting exoplanet survey satellite. 3204-3213 - N. Olspert, M. J. Kapyla, J. Pelti:
Method for estimating cycle lengths from multidimensional time series: Test cases and application to a massive "in silico" dataset. 3214-3223 - Bennett B. Borden, Jason R. Baron:
Opening up dark digital archives through the use of analytics to identify sensitive content. 3224-3229 - Marco Büchler, Greta Franzini, Emily Franzini, Thomas Eckart:
Mining and analysing one billion requests to linguistic services. 3230-3239 - Jenny Bunn:
Mind the explanatory gap: Quality from quantity. 3240-3244 - Simon Hengchen, Mathias Coeckelbergs, Seth van Hooland, Ruben Verborgh, Thomas Steiner:
Exploring archives with probabilistic models: Topic modelling for the valorisation of digitised archives of the European Commission. 3245-3249 - Emily Maemura, Christoph Becker, Ian Milligan:
Understanding computational web archives research methods using research objects. 3250-3259 - Sonia Ranade:
Traces through time: A probabilistic approach to connected archival data. 3260-3265 - Robert J. Sandusky:
Computational provenance: DataONE and implications for cultural heritage institutions. 3266-3271 - Michael Shallcross:
Appraising digital archives with Archivematica. 3272-3276 - Kenneth Thibodeau:
Breaking down the invisible wall to enrich archival science and practice. 3277-3282 - Weijia Xu, Ruizhu Huang, Maria Esteva, Jawon Song, Ramona L. Walls:
Content-based comparison for collections identification. 3283-3289 - Stephen Bonner, John Brennan, Georgios Theodoropoulos, Ibad Kureshi, Andrew Stephen McGough:
Deep topology classification: A new approach for massive graph classification. 3290-3297 - Stephen Bonner, John Brennan, Georgios Theodoropoulos, Ibad Kureshi, Andrew Stephen McGough:
GFP-X: A parallel approach to massive graph comparison using Spark. 3298-3307 - Thibault Debatty, Fabio Pulvirenti, Pietro Michiardi, Wim Mees:
Fast distributed k-nn graph update. 3308-3317 - Hiroki Kanezashi, Toyotaro Suzumura:
An incremental local-first community detection method for dynamic graphs. 3318-3325 - Bryan Rainey, David F. Gleich:
Massive graph processing on nanocomputers. 3326-3335 - Sara Riazi, Boyana Norris:
GraphFlow: Workflow-based big graph processing. 3336-3343 - W. Sean Kennedy, Iraj Saniee, Onuttom Narayan:
On the hyperbolicity of large-scale networks and its estimation. 3344-3351 - Nilothpal Talukder, Mohammed J. Zaki:
Parallel graph mining with dynamic load balancing. 3352-3359 - Charith Wickramaarachchi, Rajgopal Kannan, Charalampos Chelmis, Viktor K. Prasanna:
Distributed exact subgraph matching in small diameter dynamic graphs. 3360-3369 - Duncan Yung, Shi-Kuo Chang:
Fast reachability query computation on big attributed graphs. 3370-3380 - Fang Du, Ting Li, Yingjie Shi, Lijuan Song, Xiaojun Gu:
Drug target path discovery on semantic biomedical big data. 3381-3386 - Muhammad Kamran Lodhi, Rashid Ansari, Yingwei Yao, Gail M. Keenan, Diana J. Wilkie, Ashfaq Khokhar:
A framework to predict outcome for cancer patients using data from a nursing EHR. 3387-3395 - Milad Makkie, Xiang Li, Tianming Liu, Shannon Quinn, Binbin Lin, Jieping Ye:
Distributed rank-1 dictionary learning: Towards fast and scalable solutions for fMRI big data analytics. 3396-3403 - Mohammad Mehedy Masud, Abdel Rahman Al Harahsheh:
Mortality prediction of ICU patients using lab test data by feature vector compaction & classification. 3404-3411 - Vasundhara Misal, Vandana P. Janeja, Sai C. Pallaprolu, Yelena Yesha, Raghu Chintalapati:
Iterative unified clustering in big data. 3412-3421 - Maitham D. Naeemi, Johnny Ren, Nathan Hollcroft, Adam M. Alessio, Sohini Roychowdhury:
Application of big data analytics for automated estimation of CT image quality. 3422-3431 - Jianwu Wang, Zhichuan Huang, Wenbin Zhang, Ankita Patil, Ketan Patil, Ting Zhu, Eric J. Shiroma, Mitchell A. Schepps, Tamara B. Harris:
Wearable sensor based human posture recognition. 3432-3438 - Takuya Yoshida, M. Emre Celebi, Gerald Schaefer, Hitoshi Iyatomi:
Simple and effective pre-processing for automated melanoma discrimination based on cytological findings. 3439-3442 - Weider D. Yu, Jaspal Singh Gill, Maulin Dalal, Piyush Jha, Sajan Shah:
Big data approach in healthcare used for intelligent design - Software as a service. 3443-3449 - Mansurul Alam Bhuiyan, Mohammad Al Hasan:
Interactive personalized interesting pattern discovery. 3450-3456 - Jordan DeLoach, Doina Caragea, Xinming Ou:
Android malware detection with weak ground truth data. 3457-3464 - Chenxiao Dou, Daniel Sun, Yi-Cheng Chen, Guoqiang Li, Jianquan Liu:
Probabilistic parallelisation of blocking non-matched records for big data. 3465-3473 - Anders Høst-Madsen, Elyas Sabeti, Chad Walton, Su Jun Lim:
Universal data discovery using atypicality. 3474-3483 - Elham Sahebkar Khorasani, Zhao Zhenge, John Champaign:
A Markov chain collaborative filtering model for course enrollment recommendations. 3484-3490 - Hsu-Chao Lai, Wen-Yueh Shih, Jiun-Long Huang, Yi-Cheng Chen:
Predicting traffic of online advertising in real-time bidding systems from perspective of demand-side platforms. 3491-3498 - Nicholas A. James, Arun Kejariwal, David S. Matteson:
Leveraging cloud data to mitigate user experience from 'breaking bad'. 3499-3508 - Max Menenberg, Surya Pathak, Hari P. Udyapuram, Srinagesh Gavirneni, Sohini Roychowdhury:
Topic modeling for management sciences: A network-based approach. 3509-3518 - Izabela Moise:
The technical hashtag in Twitter data: A hadoop experience. 3519-3528 - Kingsley Okoye, Abdel-Rahman H. Tawil, Usman Naeem, Syed Islam, Elyes Lamine:
Using semantic-based approach to manage perspectives of process mining: Application on improving learning process domain data. 3529-3538 - Sai C. Pallaprolu, Josephine M. Namayanja, Vandana P. Janeja, C. T. Sai Adithya:
Label propagation in big data to detect remote access Trojans. 3539-3547 - Fuad Rahman, Marvin J. Slepian, Ari Mitra:
A novel big-data processing framework for healthcare applications: Big-data-healthcare-in-a-box. 3548-3555 - Yao-Ming Yang, Chang-Dong Wang, Jian-Huang Lai:
An efficient parallel topic-sensitive expert finding algorithm using spark. 3556-3562 - Linlin You, Bige Tunçer:
Exploring the utilization of places through a scalable "Activities in Places" analysis mechanism. 3563-3572 - Jun He, Yue Zhang, Jiye Wang, Nan Zeng, Hanyong Hao:
Robust K-subspaces recovery with combinatorial initialization. 3573-3582 - Supun Kamburugamuve, Pulasthi Wickramasinghe, Saliya Ekanayake, Chathuri Wimalasena, Milinda Pathirage, Geoffrey C. Fox:
TSmap3D: Browser visualization of high dimensional time series data. 3583-3592 - Michael A. Schuh, Rafal A. Angryk:
On the theory and practice of high-dimensional data indexing with iDistance. 3593-3600 - Michael Wojnowicz, Ben Cruz, Xuan Zhao, Brian Wallace, Matt Wolff, Jay Luan, Caleb Crable:
"Influence sketching": Finding influential samples in large-scale regressions. 3601-3612 - Katie R. Yates, Nicos G. Pavlidis:
Minimum density hyperplanes in the feature space. 3613-3618 - Bo Zhang, Liwei Wang:
Structure preserving dimension reduction with 2D images as predictors. 3619-3624 - Santosh Aditham, Nagarajan Ranganathan, Srinivas Katkoori:
Memory access pattern based insider threat detection in big data systems. 3625-3628 - Khudran Alzhrani, Ethan M. Rudd, C. Edward Chow, Terrance E. Boult:
Automated big security text pruning and classification. 3629-3637 - Claudio A. Ardagna, Paolo Ceravolo, Ernesto Damiani:
Big data analytics as-a-service: Issues and challenges. 3638-3644 - Elisa Bertino:
Data privacy for IoT systems: Concepts, approaches, and research directions. 3645-3647 - Chia-Tien Dan Lo, Pablo Ordóñez, Carlos Cepeda Mora:
Towards an effective and efficient malware detection system. 3648-3655 - Alfredo Cuzzocrea, Carlo Mastroianni, Giorgio Mario Grasso:
Private databases on the cloud: Models, issues and research perspectives. 3656-3661 - Philip Derbeko, Shlomi Dolev, Ehud Gudes, Jeffrey D. Ullman:
Concise essence-preserving big data representation. 3662-3665 - Sushil Jajodia, Witold Litwin, Thomas J. E. Schwarz:
Trusted cloud SQL DBS with on-the-fly AES decryption/encryption. 3666-3675 - Soo-Hyung Kim, Changwook Jung, Yoon-Joon Lee:
An entropy-based analytic model for the privacy-preserving in open data. 3676-3684 - Xueni Li, Guanggang Geng, Zhiwei Yan, Yong Chen, Xiaodong Lee:
Phishing detection based on newly registered domains. 3685-3692 - Boel Nelson, Tomas Olovsson:
Security and privacy for big data: A systematic literature review. 3693-3702 - Mohammad Shafahi, Leon Kempers, Hamideh Afsarmanesh:
Phishing through social bots on Twitter. 3703-3712 - Hippolyte Djonon Tsague, Bheki Twala:
Reverse engineering smart card malware using side channel analysis with machine learning techniques. 3713-3721 - Jason W. Woodworth, Mohsen Amini Salehi, Vijay Raghavan:
S3C: An architecture for space-efficient semantic search over encrypted data in the cloud. 3722-3731 - Tomohiro Fukui:
A systems approach to big data technology applied to supply chain. 3732-3736 - Gary S. W. Goh, Andy J. L. Ang, Allan N. Zhang:
Optimizing performance of sentiment analysis through design of experiments. 3737-3742 - Vahid Kayvanfar, S. M. Moattar Husseini, Behrooz Karimi, Mohsen S. Sajadieh, Tan Wen Jun:
Analysis for supply hub in industrial cluster: Classic vs. new perspective. 3743-3748 - Jasmine J. Lim, Allan N. Zhang:
A DEA approach for Supplier Selection with AHP and risk consideration. 3749-3758 - André Luckow, Matthew Cook, Nathan Ashcraft, Edwin Weill, Emil Djerekarov, Bennie Vorster:
Deep learning in the automotive industry: Applications and tools. 3759-3768 - Kazumasa Mori, Takuya Ohmori:
The Bayesian estimators of polytomous item response theory models with approximated conditional likelihood and their mathematical optimalities. 3769-3772 - B. Y. Ong, Rong Wen, Allan N. Zhang:
Data blending in manufacturing and supply chains. 3773-3778 - Wen Jun Tan, Wentong Cai, Zhengping Li:
Adaptive resilient strategies for supply chain networks. 3779-3784 - Takuya Watanabe, Hiroaki Muroi, Motoki Naruke, Kyoto Yono, Gen Kobayashi, Masanori Yamasaki:
Prediction of regional goods demand incorporating the effect of weather. 3785-3791 - Rong Wen, Wenjing Yan, Allan N. Zhang:
Weighted clustering of spatial pattern for optimal logistics hub deployment. 3792-3797 - Wenjing Yan, Rong Wen, Allan N. Zhang, Dazhi Yang:
Vessel movement analysis and pattern discovery using density-based clustering approach. 3798-3806 - Dazhi Yang, Gary S. W. Goh, Siwei Jiang, Allan N. Zhang:
Spatial data dimension reduction using quadtree: A case study on satellite-derived solar radiation. 3807-3812 - Dazhi Yang, Gary S. W. Goh, Siwei Jiang, Allan N. Zhang:
Forecast UPC-level FMCG demand, Part III: Grouped reconciliation. 3813-3819 - A. Aziz Altowayan, Lixin Tao:
Word embeddings for Arabic sentiment analysis. 3820-3825 - Michael Bentley, Soumya Batra:
Giving voice to office customers: Best practices in how office handles verbatim text feedback. 3826-3832 - Xiangfeng Dai, Robert Prout:
Unlock big data emotions: Weighted word embeddings for sentiment classification. 3833-3838 - Anna Hennig, Anne-Sofie Amodt, Henrik Hernes, Helene Mejer Nygardsmoen, Peter Arenfeldt Larsen, Raghava Rao Mukkamala, Benjamin Flesch, Abid Hussain, Ravi Vatrapu:
Big social data analytics of changes in consumer behaviour and opinion of a TV broadcaster. 3839-3848 - Henrikke Hovda Larsen, Johanna Margareta Forsberg, Sigrid Viken Hemstad, Raghava Rao Mukkamala, Abid Hussain, Ravi Vatrapu:
TV ratings vs. social media engagement: Big social data analytics of the Scandinavian TV talk show Skavlan. 3849-3858 - Tayfun Pay:
Totally automated keyword extraction. 3859-3863 - Belainine Billal, Alexsandro Fonseca, Fatiha Sadat:
Efficient natural language pre-processing for analyzing large data sets. 3864-3871 - Jihun Choi, Jonghem Youn, Sang-goo Lee:
A grapheme-level approach for constructing a Korean morphological analyzer without linguistic knowledge. 3872-3879 - Matthew Coole, Paul Rayson, John A. Mariani:
lexiDB: A scalable corpus database management system. 3880-3884 - Pradipto Das, Yandi Xia, Aaron Levine, Giuseppe Di Fabbrizio, Ankur Datta:
Large-scale taxonomy categorization for noisy product listings. 3885-3894 - Georg Heigold, Josef van Genabith, Günter Neumann:
Scaling character-based morphological tagging to fourteen languages. 3895-3902 - Avinash Kumar, Dhaval Patel, Nikita Jain:
Lightweight system for NE-tagged news headlines corpus creation. 3903-3912 - Yunfei Long, Qin Lu, Yue Xiao, Minglei Li, Chu-Ren Huang:
Domain-specific user preference prediction based on multiple user activities. 3913-3921 - Daiki Shimada, Ryunosuke Kotani, Hitoshi Iyatomi:
Document classification through image-based character embedding and wildcard training. 3922-3927 - Alexey Svyatkovskiy, Kosuke Imai, Mary Kroeger, Yuki Shiraito:
Large-scale text processing pipeline with Apache Spark. 3928-3935 - Hoseong Yang, Hye Jin Lee, Sungzoon Cho, Eugene Cho:
Automatic classification of securities using hierarchical clustering of the 10-Ks. 3936-3943 - Katchaguy Areekijseree, Ricky Laishram, Sucheta Soundarajan:
Max-node sampling: An expansion-densification algorithm for data collection. 3944-3946 - Adel Saad Assiri, Ahmed Z. Emam, Hmood Al-Dossari:
Real-time sentiment analysis of Saudi dialect tweets using SPARK. 3947-3950 - Peter Bajcsy, Soweon Yoon, Mylene Simon, Mary Brady, Ram D. Sriram, Nathan Hotaling, Nicholas Schaub, Carl G. Simon, Piotr M. Szczypinski, Stephen J. Florczyk:
Modeling, validation and verification of cell-scaffold contact measurements over terabyte-sized 3D image collection. 3951-3953 - Raja Sarath Kumar Boddu:
An integrated assessment approach to different collaborative filtering algorithms. 3954-3956 - Shaunak D. Bopardikar, George S. Eskander Ekladious:
Sequential randomized matrix factorization for Gaussian processes. 3957-3959 - Vy Bui, Lin-Ching Chang, Dunling Li, Li-yueh Hsu, Marcus Y. Chen:
Comparison of lossless video and image compression codecs for medical computed tomography datasets. 3960-3962 - Sunghwan Cho, Sunghal Hong, Changsoo Lee:
ORANGE: Spatial big data analysis platform. 3963-3965 - Ranjeet Devarakonda, Yaxing Wei, Michele Thornton:
Accessing and distributing large volumes of NetCDF data. 3966-3967 - Ranjeet Devarakonda, Kyle Dumas, Sheman Beus, Everett Neil Rush, Bhargavi Krishna, Rob Records, Giri Prakash:
Next-gen tools for big scientific data: ARM data center example. 3968-3970 - Srabasti Dutta, Sumantro Ray, S. Roy:
Correlation between weather and weather-related tweets - A preliminary study. 3971-3973 - Austin Harris, Hanna True, Zhen Hu, Jin Cho, Nancy Fell, Mina Sartipi:
Fall recognition using wearable technologies and machine learning algorithms. 3974-3976 - Ling He, Jiebo Luo:
"What makes a pro eating disorder hashtag": Using hashtags to identify pro eating disorder tumblr posts and Twitter users. 3977-3979 - Ayae Ichinose, Masato Oguchi, Atsuko Takefusa, Hidemoto Nakada:
Evaluation of distributed processing of caffe framework using poor performance device. 3980-3982 - Hiroki Imabayashi, Yu Ishimaki, Akira Umayabara, Hayato Yamana:
Fast and space-efficient secure frequent pattern mining by FHE. 3983-3985 - Akira Ishii, Masanori Ajito, Yasuko Kawahata:
Analysis of Pokémon GO using sociophysics approach. 3986-3988 - Yu Ishimaki, Hiroki Imabayashi, Kana Shimizu, Hayato Yamana:
Privacy-preserving string search for genome sequences with FHE bootstrapping optimization. 3989-3991 - Jeffrey Jenkins, Lin-Ching Chang, Elizabeth B. Hutchinson, M. Okan Irfanoglu, Carlo Pierpaoli:
Harmonization of methods to facilitate reproducibility in medical data processing: Applications to diffusion tensor magnetic resonance imaging. 3992-3994 - Seungwoo Jeon, Jaegi Hong, Bonghee Hong, Chumsu Kim:
TPR∗-tree Performance improvement for big tactical moving objects. 3995-3997 - Xiaoxia Jia, Peng Cheng, Jiming Chen:
A data analysis and visualization system for large-scale e-bike data. 3998-4000 - Priyanka Kale, Shilpa Balan:
Big data application in job trend analysis. 4001-4003 - David Kimmey, Jin Soung Yoo:
Nowcasting with social media data. 4004 - Vivian Lai, Kyong Jin Shim, Richard Jayadi Oentaryo, Philips Kokoh Prasetyo, Casey Vu, Ee-Peng Lim, David Lo:
CareerMapper: An automated resume evaluation tool. 4005-4007 - Ricky Laishram, Katchaguy Areekijseree, Sucheta Soundarajan:
Predicted max degree sampling: Sampling in directed networks to maximize node coverage through crawling. 4008-4010 - Jiwan Lee, Jaegi Hong, Bonghee Hong, Jinsu Ahn:
A generator of test data set for tactical moving objects based on velocity. 4011-4013 - Quanzhi Li, Sameena Shah, Mohammad Mahdi Ghassemi, Rui Fang, Armineh Nourbakhsh, Xiaomo Liu:
Using paraphrases to improve tweet classification: Comparing WordNet and word embedding approaches. 4014-4016 - Xiaomeng Liang, Lin-Ching Chang, Arash Massoudieh:
A framework for large-scale bacterial motility behavior analysis. 4017-4019 - Ankur Padia, Konstantinos Kalpakis, Tim Finin:
Inferring relations in knowledge graphs with tensor decompositions. 4020-4022 - Benito O. Perez, Yiwei Ma, Mengran Wang, Xiaomeng Liang, Negin Askarzadeh:
Towards a more meterless parking system: Understanding meter payment behavior and trends in Washington, DC. 4023-4025 - Giri Prakash, Jitendra Kumar, Everett Neil Rush, Robert Records, Anthony Clodfelter, Jimmy W. Voyles:
HPC infrastructure to support the next-generation ARM facility data operations. 4026-4028 - Jonathan M. Rogers, Soumya S. Dey, Richard Retting, Rahul Jain, Xiaomeng Liang, Negin Askarzadeh:
Using automated enforcement data to achieve vision zero goals: A case study. 4029-4031 - Antonette Shibani, Elizabeth Koh, Vivian Lai, Kyong Jin Shim:
Analysis of teamwork dialogue: A data mining approach. 4032-4034 - Kenneth David Strang, Zhaohao Sun:
Meta-analysis of big data security and privacy: Scholarly literature gaps. 4035-4037 - Xingang Wang, Zhigang Gai, Suiping Qi:
An approach for extracting big micro-scale severe weather region trajectories automatically from meteorological radar data. 4038-4039 - Guangxia Xu, Jingteng Zhao, Deling Huang:
An improved social spammer detection based on tri-training. 4040-4042