User profiles for Xubin He

Xubin He

Temple University
Verified email at temple.edu
Cited by 4069

Implementing WebGIS on Hadoop: A case study of improving small file I/O performance on HDFS

…, J Han, Y Zhong, C Han, X He - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
Hadoop framework has been widely used in various clusters to build large scale, high
performance systems. However, Hadoop distributed file system (HDFS) is designed to manage …

MatrixKV: Reducing write stalls and write amplification in LSM-tree based KV stores with matrix container in NVM

…, J Wan, Q Cui, L Tang, H Jiang, C Xie, X He - 2020 USENIX Annual …, 2020 - usenix.org
Popular LSM-tree based key-value stores suffer from suboptimal and unpredictable
performance due to write amplification and write stalls that cause application performance to …

Design tradeoffs for data deduplication performance in backup workloads

M Fu, D Feng, Y Hua, X He, Z Chen, W Xia… - … USENIX Conference on …, 2015 - usenix.org
Data deduplication has become a standard component in modern backup systems. In order
to understand the fundamental tradeoffs in each of its design choices (such as prefetching …

Accelerating restore and garbage collection in deduplication-based backup systems via exploiting historical information

M Fu, D Feng, Y Hua, X He, Z Chen, W Xia… - 2014 USENIX Annual …, 2014 - usenix.org
In deduplication-based backup systems, the chunks of each backup are physically scattered
after deduplication, which causes a challenging fragmentation problem. The fragmentation …

Reducing SSD read latency via NAND flash program and erase suspension

G Wu, X He - FAST, 2012 - usenix.org
In NAND flash memory, once a page program or block erase (P/E) command is issued to a
NAND flash chip, the subsequent read requests have to wait until the time-consuming P/E …

Delta-FTL: Improving SSD lifetime via exploiting content locality

G Wu, X He - Proceedings of the 7th ACM european conference on …, 2012 - dl.acm.org
NAND flash-based SSDs suffer from a limited lifetime because NAND flash can only
be programmed or erased a limited number of times. Among various approaches to address this …

High-ratio lossy compression: Exploring the autoencoder to compress scientific data

…, Q Liu, S Alibhai, T Lu, X He - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Scientific simulations on high-performance computing (HPC) systems can generate large
amounts of floating-point data per run. To mitigate the data storage bottleneck and lower the …

Understanding and modeling lossy compression schemes on HPC scientific data

T Lu, Q Liu, X He, H Luo, E Suchyta… - 2018 IEEE …, 2018 - ieeexplore.ieee.org
Scientific simulations generate large amounts of floating-point data, which are often not very
compressible using the traditional reduction schemes, such as deduplication or lossless …

GearDB: A GC-free key-value store on HM-SMR drives with gear compaction

…, J Wan, P Huang, Y Zhang, C Xie, X He - Proceedings of the ACM …, 2023 - dl.acm.org
Host-managed shingled magnetic recording drives (HM-SMR)[2] give a capacity advantage
to harness the explosive growth of data. Applications where data is sequentially written and …

A light-weight compaction tree to reduce I/O amplification toward efficient key-value stores

T Yao, J Wan, P Huang, X He, Q Gui… - Proc. 33rd Int. Conf …, 2017 - msstconference.org
Log-structured merge tree (LSM-tree) has been one of the mainstream indexes in key-value
systems supporting a variety of write-intensive Internet applications in today’s data centers. …