Abstract.With the rapid development of information technology, cloud computing, large data, mobile interconnection and other technologies, the number of users is increasing, and the amount of data generated by users is increasing exponentially. How to store large scale and massive data, the traditional distributed storage technology will face challenges. Distributed storage technology can use multiple servers to store data and share storage load. However, in the cloud computing environment, it looks pale. In order to improve the storage efficiency of massive data, data blocking algorithm is studied to shorten the response time of data backup. Effective application of data duplication technology improves the storage performance of data.
IntroductionWith the rapid development of cloud computing, big data and mobile Internet technology, cloud computing technology as the core of the new generation of IT technology [1], it is exerting a subtle influence on people's way of life and learning. The concept of cloud computing was first put forward at the search engine conference held in 2006. So far, it has penetrated into all walks of life. Cloud computing is to automatically split a huge computing process into numerous smaller subroutines through a network, and then make a large system composed of multiple servers to assemble a large number of heterogeneous storage devices in the network and return the results to users after search, calculation and analysis. In this environment, the volume of data generated by enterprises and individual users is increasing exponentially. It is estimated that the total data of 40ZB will be generated worldwide by 2020 [2]. How to store large scale, massive data, data processing and storage technology is facing new challenges. Traditional data storage technologies such as network storage, centralized storage, and distributed file systems can not efficiently complete the task of storing and processing mass data in the cloud computing environment.Therefore, this paper makes detailed research on data storage technology under the cloud computing environment, improves the data partitioning strategy, shortens the response time of data backup, and effectively applicates duplicate data deletion technique.to improve the efficiency of data storage. As the core component of the new generation of IT technology, cloud computing will become a new hot spot of the new revolution of information technology in the world after the personal computer and the Internet [3].