A Similarity based Compression Approach for Efficient Data Processing on Cost Optimized Multi-Cloud.

Deepa, S.T.; Gejalakshmi.L.

UoK Repository Home
→
Computing and Technology
→
Symposia and Conferences
→
International Conference on Advances in Computing and Technology (ICACT)
→
KICACT 2017
→
View Item

dc.contributor.author	Deepa, S.T.
dc.contributor.author	Gejalakshmi.L.
dc.date.accessioned	2017-09-11T06:47:37Z
dc.date.available	2017-09-11T06:47:37Z
dc.date.issued	2017
dc.identifier.citation	Deepa, S.T. and Gejalakshmi.L.2017. A Similarity based Compression Approach for Efficient Data Processing on Cost Optimized Multi-Cloud. Kelaniya International Conference on Advances in Computing and Technology (KICACT - 2017), Faculty of Computing and Technology, University of Kelaniya, Sri Lanka. p 04.	en_US
dc.identifier.uri	http://repository.kln.ac.lk/handle/123456789/17373
dc.description.abstract	Cloud computing provides a promising platform for flexible massive storage of data, computing and software services in a scalable manner. Massive storage sensing is prevalent in both industry and research applications where the data storage consumes volume and high velocity. There are five phases in the compression based cost effective multi cloud architecture. Pre-process phase: In this phase user is intended to upload their local files. The cloud server decides whether these files to be uploaded considering the authenticity of the user and the content priority. Deduplication process: Once the file is approved for uploading, similarity model is used for compression and clustering, Data chunks are created. Similarity model works with text data and multidimensional numerical data is contributes to the majority of the data available. Markov model is used to calculate the similarity in text data and in tree topology, similarity is determined by the number of leaf nodes. The data is checked for duplication in this step. It is done by generating signature for the data chuck, the signature stored in DDB (Deduplication DataBase) is compared with the signature of the chunk to be stored and then data chunk is stored. Upload phase: After approval for uploading and clustering, the data chunks are encrypted and uploaded into multi-cloud where the application and data are fragmented and stored in multicloud to enhance security and protection. In multi cloud architecture, no cloud provider learns the complete application logic and overall application results which leads to data and application confidentiality. Update phase: If user intend to modify, insert or delete some blocks of the existing files, then the corresponding data chunks alone is updated in the cloud. Proof of storage Phase: The user has a meta data file stored locally to identify which data chunks is available in which cloud in our multi cloud architecture. To make cloud storage cost effective, on demand resource provisioning is established and the cost is calculated depending on the number of user and number of resources used. On demand resource planning avoids under and over provisioning of the resource enhancing the resource utilization.	en_US
dc.language.iso	en	en_US
dc.publisher	Faculty of Computing and Technology, University of Kelaniya, Sri Lanka.	en_US
dc.subject	Multicloud	en_US
dc.subject	Data compression	en_US
dc.subject	Resource provisioning	en_US
dc.title	A Similarity based Compression Approach for Efficient Data Processing on Cost Optimized Multi-Cloud.	en_US
dc.type	Article	en_US