User profiles for Xubin He
Xubin He, Temple University. Verified email at temple.edu. Cited by 4069
Implementing WebGIS on Hadoop: A case study of improving small file I/O performance on HDFS
The Hadoop framework has been widely used in various clusters to build large-scale, high-
performance systems. However, the Hadoop distributed file system (HDFS) is designed to manage …
MatrixKV: Reducing write stalls and write amplification in LSM-tree based KV stores with matrix container in NVM
Popular LSM-tree based key-value stores suffer from suboptimal and unpredictable
performance due to write amplification and write stalls that cause application performance to …
Design tradeoffs for data deduplication performance in backup workloads
Data deduplication has become a standard component in modern backup systems. In order
to understand the fundamental tradeoffs in each of its design choices (such as prefetching …
Accelerating restore and garbage collection in deduplication-based backup systems via exploiting historical information
In deduplication-based backup systems, the chunks of each backup are physically scattered
after deduplication, which causes a challenging fragmentation problem. The fragmentation …
Reducing SSD read latency via NAND flash program and erase suspension
In NAND flash memory, once a page program or block erase (P/E) command is issued to a
NAND flash chip, the subsequent read requests have to wait until the time-consuming P/E …
Delta-FTL: Improving SSD lifetime via exploiting content locality
NAND flash-based SSDs suffer from limited lifetime because NAND flash can only
be programmed or erased a limited number of times. Among various approaches to address this …
High-ratio lossy compression: Exploring the autoencoder to compress scientific data
Scientific simulations on high-performance computing (HPC) systems can generate large
amounts of floating-point data per run. To mitigate the data storage bottleneck and lower the …
Understanding and modeling lossy compression schemes on HPC scientific data
Scientific simulations generate large amounts of floating-point data, which are often not very
compressible using the traditional reduction schemes, such as deduplication or lossless …
GearDB: A GC-free key-value store on HM-SMR drives with gear compaction
Host-managed shingled magnetic recording drives (HM-SMR)[2] give a capacity advantage
to harness the explosive growth of data. Applications where data is sequentially written and …
A light-weight compaction tree to reduce I/O amplification toward efficient key-value stores
The log-structured merge tree (LSM-tree) has been one of the mainstream indexes in key-value
systems supporting a variety of write-intensive Internet applications in today’s data centers. …