www.ijecs.in
International Journal Of Engineering And Computer Science ISSN:2319-7242
Volume 4 Issue 4 April 2015, Page No. 11212-11214
Improving Accessing Efficiency of Cloud Storage Using De-Duplication and Feedback Schemes
R.K.Saranya1, R.Sanjana2, Steffi Miriam Philip3, Shahana M.S.A4
1 Assistant Professor, Department of Computer Science and Engineering; 2, 3 B.E. Final Year Students,
Jeppiaar Engineering College, Chennai
saranya.rks@gmail.com1, sanjanaramesh15@gmail.com2, miriam.steffi@gmail.com3, shahanamsa@gmail.com4
Abstract: File storage in the cloud is handled by third parties. Files can be integrated so that users are able to access them through centralized management. Because of the great number of users and devices in the cloud network, managers cannot effectively manage the efficiency of the storage nodes; hardware is therefore wasted and the complexity of managing the files increases. In order to reduce the workload caused by duplicate files, we propose the Index Name Server (INS). The INS handles file storage, data de-duplication, optimized node selection, server load balancing, file compression, chunk matching, real-time feedback control, IP information, and busy-level index monitoring, which increases performance. By using the INS, files can be reasonably distributed and the workload can be decreased.
Key Words: de-duplication, load balancing, hash-code function.

I. Introduction

Files stored in the cloud can be accessed at any time from any place as long as we have Internet access. Another benefit is that cloud storage provides organizations with off-site backups of data, which reduces the costs associated with disaster recovery. Cloud storage can provide the benefits of greater accessibility and reliability; rapid deployment; strong protection for backup, archival and disaster recovery purposes; and lower overall storage costs as a result of not having to purchase, manage and maintain expensive hardware. However, cloud storage also has the potential for security and compliance concerns.

Data deduplication is one of the hottest technologies in storage right now because it enables companies to save a lot of money on the storage costs to store the data and on the bandwidth costs to move the data when replicating it offsite for disaster recovery. This is great news for cloud providers, because if you store less, you need less hardware. If you can deduplicate what you store, you can better utilize your existing storage space, which saves money by using what you have more efficiently. If you store less, you also back up less, which again means less hardware and backup media. If you store less, you also send less data over the network in case of a disaster, which means you save money in hardware and network costs over time. These are the business benefits of data deduplication.

Load balancing distributes workloads across multiple computing resources, such as computers, a computer cluster, network links, central processing units or disk drives. Load balancing aims to optimize resource use, maximize throughput, minimize response time, and avoid overload of any single resource. Using multiple components with load balancing instead of a single component may increase reliability through redundancy. Load balancing usually involves dedicated software or hardware, such as a multilayer switch or a Domain Name System server process.
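To make the idea concrete, the short Python sketch below (our illustration, not part of any cited system; the node names and load counters are hypothetical) sends each incoming request to the node that currently reports the smallest workload:

# Illustrative, workload-aware dispatch (hypothetical node names and load counters).
current_load = {"node-A": 12, "node-B": 3, "node-C": 7}   # outstanding requests per node

def pick_least_loaded(loads):
    # Choose the node with the fewest outstanding requests to avoid overloading any single resource.
    return min(loads, key=loads.get)

def dispatch(request_id, loads):
    node = pick_least_loaded(loads)
    loads[node] += 1                     # the chosen node takes on one more request
    return node

for rid in range(5):
    print("request", rid, "->", dispatch(rid, current_load))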
II. Related Work

To decrease the workload caused by duplicated files, this paper proposes a new data management structure, the Index Name Server (INS), which integrates data de-duplication with node optimization mechanisms for cloud storage performance enhancement. The INS can manage and optimize the nodes according to the client-side transmission conditions. Through the INS, each node can be controlled to work in its best status and matched to suitable clients as far as possible. This improves the performance of the cloud storage system and efficiently distributes the files to reduce the load of each storage node. Using techniques such as run-length encoding (RLE), dictionary coding, calculation of the digital fingerprints of data chunks, the distributed hash table (DHT), and the Bloom filter, there have been several investigations into load balancing in cloud computing systems.
A digital fingerprint is the essential feature of a data chunk. Each data chunk has its unique fingerprint, and different chunks have different fingerprints. If two chunks have the same hash value, we can say that they carry the same original data, and data with different hash values must come from different original input data.
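As an illustration of chunk fingerprinting, the following sketch (our own example; the 4 KB chunk size is an assumption, since the paper does not fix one) computes a SHA-1 fingerprint per chunk and stores a chunk only when its fingerprint has not been seen before:

import hashlib

CHUNK_SIZE = 4096                 # assumed fixed chunk size; the paper does not specify one
chunk_store = {}                  # fingerprint (hex SHA-1 digest) -> stored chunk

def fingerprint(chunk: bytes) -> str:
    # The digital fingerprint of a data chunk: here, its SHA-1 digest.
    return hashlib.sha1(chunk).hexdigest()

def deduplicate(data: bytes) -> list:
    # Split the data into chunks and keep only one physical copy of each distinct chunk.
    recipe = []                   # fingerprints needed to rebuild the original data
    for i in range(0, len(data), CHUNK_SIZE):
        chunk = data[i:i + CHUNK_SIZE]
        fp = fingerprint(chunk)
        if fp not in chunk_store: # an already-seen fingerprint means the chunk is already stored
            chunk_store[fp] = chunk
        recipe.append(fp)
    return recipe

recipe = deduplicate(b"A" * 10000)        # identical 4 KB chunks are stored only once
print(len(recipe), "chunks referenced,", len(chunk_store), "chunk(s) actually stored")

Files that share identical chunks are thus stored only once, which is exactly the saving described above.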
The Bloom filter is composed of a long binary vector and a series of random mapping functions. It is used to test whether an element is included in a set. However, as the number of elements in the set increases, more storage space is needed and the retrieval speed slows down.
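A minimal Bloom filter along these lines might look as follows (the vector size, the number of mapping functions, and the use of salted SHA-1 digests are illustrative choices, not taken from the paper):

import hashlib

class BloomFilter:
    # A long binary vector plus several hash-derived mapping functions.
    def __init__(self, size=1024, num_hashes=3):
        self.size = size
        self.num_hashes = num_hashes
        self.bits = [0] * size

    def _positions(self, item):
        # Derive several bit positions from salted SHA-1 digests of the item.
        for i in range(self.num_hashes):
            digest = hashlib.sha1(("%d:%s" % (i, item)).encode()).hexdigest()
            yield int(digest, 16) % self.size

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = 1

    def might_contain(self, item):
        # False means definitely absent; True means possibly present (false positives are allowed).
        return all(self.bits[pos] for pos in self._positions(item))

bf = BloomFilter()
bf.add("fingerprint-1")
print(bf.might_contain("fingerprint-1"), bf.might_contain("fingerprint-2"))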
A DHT node does not maintain and possess all the information in the network, but stores only its own data and those of its neighboring nodes. This greatly reduces hardware and bandwidth consumption. Essentially, the features of DHTs include decentralization, scalability, and fault tolerance, even when the set of participating nodes keeps changing.
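The sketch below (added here for illustration; it is a toy consistent-hashing ring, not the INS implementation) shows how hashing both node addresses and data keys onto the same identifier space lets each node be responsible only for the keys nearest to it:

import bisect
import hashlib

def ring_id(value):
    # Map a string (node address or data key) onto the identifier space with SHA-1.
    return int(hashlib.sha1(value.encode()).hexdigest(), 16)

class ToyDHT:
    # Each key is owned by the first node clockwise from its hash on the ring,
    # so every node only has to track its own keys and those of its neighbours.
    def __init__(self, nodes):
        self.ring = sorted((ring_id(n), n) for n in nodes)
        self.ids = [i for i, _ in self.ring]

    def owner(self, key):
        idx = bisect.bisect(self.ids, ring_id(key)) % len(self.ring)
        return self.ring[idx][1]

dht = ToyDHT(["10.0.0.1", "10.0.0.2", "10.0.0.3"])   # hypothetical node addresses
print(dht.owner("chunk-fingerprint-42"))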
In the existing system, the opportunistic load balancing (OLB) algorithm is used, which tries to keep every node busy. OLB does not consider the current workload of each node, but distributes the unprocessed tasks randomly to the available nodes. Although OLB is simple and direct, this scheduling algorithm does not consider the expected task execution time and therefore cannot achieve a good makespan.
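The weakness described above can be seen in a few lines: an OLB-style dispatcher assigns each unprocessed task to a random available node, so the makespan depends on chance rather than on the nodes' workloads (the node names and task times below are made up for illustration):

import random

queues = {"node-A": [], "node-B": [], "node-C": []}   # per-node task queues (hypothetical)
tasks = [3, 1, 4, 1, 5, 9, 2, 6]                      # made-up task execution times

# OLB-style dispatch: hand each unprocessed task to a random available node,
# ignoring both the node's current workload and the task's expected execution time.
for t in tasks:
    node = random.choice(list(queues))
    queues[node].append(t)

makespan = max(sum(q) for q in queues.values())       # completion time of the busiest node
print(queues, "makespan:", makespan)

A workload-aware scheduler, such as the one sketched in the Introduction, would instead consult the queue lengths before assigning each task.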
III. Present System & Framework

The INS (Index Name Server) uses a complex P2P-like structure to manage the cloud data. The INS principally handles the one-to-many matches between the storage nodes' IP addresses and hash codes. Three main functions of the INS include:
1) Switching the fingerprints to their corresponding storage nodes;
2) Confirming and balancing the load of the storage nodes;
3) Fulfilling user requirements for transmission as far as possible.

In the present work, we implement the SHA-1 function to improve efficiency. SHA-1 is an advanced technique that provides enhanced functionality for cloud storage security and protects the stored files. This novel technique will improve the service of cloud storage: it notifies the user if a duplicate file is present in the cloud and then automatically removes the duplicate files using the hash-code functions. By doing this, the space available in the cloud is increased, duplicate files are reduced, load balancing takes place, and an optimized node is selected.
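A highly simplified view of this behaviour is sketched below; the in-memory dictionaries, the node addresses, and the printed notification are our assumptions, not the actual INS interface:

import hashlib

index = {}                                   # hash code -> IP addresses of nodes holding the data
busy_level = {"10.0.0.1": 0, "10.0.0.2": 0}  # hypothetical busy-level counters per storage node

def store_file(name, data):
    code = hashlib.sha1(data).hexdigest()    # SHA-1 hash code of the file content
    if code in index:
        # Duplicate detected: notify the user and do not store a second copy.
        print(name, "is already stored on", index[code], "- duplicate not uploaded")
        return code
    node = min(busy_level, key=busy_level.get)   # optimized node selection: least busy node
    busy_level[node] += 1
    index[code] = [node]
    return code

store_file("report.doc", b"some file content")
store_file("report copy.doc", b"some file content")  # identical bytes trigger the notification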
IV. Algorithm

STEP 1: R(k): The initial expected value;
STEP 2: F(k): The output feedback;
STEP 3: M(k): The modified feedback;
STEP 4: Fs(k): The modified internal function of the storage node;
STEP 5: D(k): The external random variable;
STEP 6: X(k): The result within the storage node;
STEP 7: Y(k): The actual result;
STEP 8: KINS: The optimal node determined by the SHA-1 based on the feedback.
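The paper lists the variables of the feedback scheme but does not state an explicit update law, so the loop below is only one plausible reading written to show how these quantities could interact; the proportional correction, the saturation used for Fs(k), the noise model, and the node comparison are all our assumptions:

import random

def Fs(u):
    # Fs(k): assumed internal response function of the storage node (a simple saturation here).
    return max(0.0, min(u, 2.0))

def feedback_loop(R, steps=10, gain=0.5):
    # Drive the actual result Y(k) toward the expected value R(k) using the feedback of each round.
    Y = 0.0                                   # Y(k): actual result observed after transmission
    for k in range(steps):
        F = Y                                 # F(k): output feedback from the previous round
        M = gain * (R - F)                    # M(k): modified feedback (assumed proportional form)
        D = random.uniform(-0.05, 0.05)       # D(k): external random variable (assumed noise)
        X = Fs(F + M)                         # X(k): result produced within the storage node
        Y = X + D                             # Y(k): actual result including the disturbance
    return Y

# K_INS: the node whose actual result ends up closest to the expected value (toy comparison).
R = 1.0
results = {node: feedback_loop(R) for node in ["10.0.0.1", "10.0.0.2", "10.0.0.3"]}
K_INS = min(results, key=lambda n: abs(results[n] - R))
print("optimal node:", K_INS)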
V. Conclusion

We proposed the SHA-1 scheme to handle not only file compression, chunk matching, data de-duplication, real-time feedback control, IP information, and busy-level index monitoring, but also file storage, optimized node selection, and server load balancing.

Based on several SHA parameters that monitor the IP information and the busy-level index of each node, our proposed scheme can determine the location of maximum loading and trace back to the source of demands to determine the optimal backup node. According to the transmission states of the storage nodes and clients, the SHA-1 scheme receives the feedback of the previous transmissions and adjusts the transmission parameters to attain the optimal performance for the storage nodes. The files are compressed and partitioned according to the chunk size of the cloud file system.
REFERENCES
[1] Y.-M. Huo, H.-Y. Wang, L.-A. Hu, and H.-G. Yang, "A cloud storage architecture model for data-intensive applications," in Proc. Int. Conf. Comput. Manage., May 2011, pp. 1-4.
[2] L. B. Costa and M. Ripeanu, "Towards automating the configuration of a distributed storage system," in Proc. 11th IEEE/ACM Int. Conf. Grid Comput., Oct. 2010, pp. 201-208.
[3] C.-Y. Chen, K.-D. Chang, and H.-C. Chao, "Transaction pattern based anomaly detection algorithm for IP multimedia subsystem," IEEE Trans. Inform. Forensics Security, vol. 6, no. 1, pp. 152-161, Mar. 2011.
[4] G. Urdaneta, G. Pierre, and M. Van Steen, "A survey of DHT security techniques," ACM Comput. Surveys (CSUR), vol. 43, no. 2, pp. 8:1-8:49, Jan. 2011.
[5] T.-Y. Wu, W.-T. Lee, and C. F. Lin, "Cloud storage performance enhancement by real-time feedback control and de-duplication," in Proc. Wireless Telecommun. Symp., Apr. 2012, pp. 1-5.
[6] H. He and L. Wang, "P&P: A combined push-pull model for resource monitoring in cloud computing environment," in Proc. IEEE 3rd Int. Conf. Cloud Comput., Jul. 2010, pp. 260-267.
[7] R. Tong and X. Zhu, "A load balancing strategy based on the combination of static and dynamic," in Proc. 2nd Int. Workshop Database Technol. Appl., Nov. 2010, pp. 1-4.
[8] T.-Y. Wu, W.-T. Lee, Y.-S. Lin, Y.-S. Lin, H.-L. Chan, and J.-S. Huang, "Dynamic load balancing mechanism based on cloud storage," in Proc. Comput. Commun. Appl. Conf., Jan. 2012, pp. 102-106.
[9] Y. Zhang, C. Zhang, Y. Ji, and W. Mi, "A novel load balancing scheme for DHT-based server farm," in Proc. 3rd IEEE Int. Conf. Comput. Broadband Netw. Multimedia Technol., Oct. 2010, pp. 980-984.