With the development of computer networks, grid computing, Web services, and new Web applications, the World Wide Web has changed from a traditional platform for sharing, exchanging, and accessing HTML files into a platform for sharing arbitrary information resources. Internet-based architectures and the cloud computing model have become the general trend. The most important type of cloud computing is data-intensive computing, whose core is data management. Traditional data management techniques struggle to meet the data management requirements of these applications. Drawing on concepts from traditional data management technology, this article studies and designs a highly scalable and reliable massive image storage and management system based on CouchDB.
1 Definition and Characteristics of Cloud Computing
Cloud computing is a supercomputing model that integrates large-scale, scalable computing, storage, data, applications, and other distributed IT resources so that they work together. It uses the network as its carrier and virtualization technology as its foundation, and it delivers infrastructure, platforms, software, and other offerings as services. Under this model, users no longer need to purchase complex hardware and software. They pay a fee to the cloud computing service provider and obtain the computing and storage resources they need over the network, with more flexible pricing. From a management perspective, cloud computing provides a way to manage a large pool of virtualized resources that can be aggregated automatically to deliver services and can be provisioned and expanded flexibly. As a user experience and business model, cloud computing is characterized by service standardization, rapid deployment, flexible billing, and easy access. As a method of infrastructure management and service provision, it is characterized by virtualized resources, elastic service provision, and management within a large resource pool.
Cloud computing is a computing model that uses Internet technology to provide a large number of scalable (and elastic) IT-related capabilities to multiple users as a service. According to the operating model, cloud computing can be divided into three types: public cloud, private cloud, and hybrid cloud.
Public cloud: The provider delivers services directly to users through its own infrastructure. Users access the services over the Internet and do not own the cloud computing resources.
Private cloud: An enterprise builds its own cloud computing infrastructure and provides cloud computing services to internal users or external customers. The enterprise has full control over its infrastructure and can improve services and innovate independently according to its own needs.
Hybrid cloud: An organization uses both its own cloud computing infrastructure and services provided by external public clouds.
2 Key Technologies of Cloud Computing
Cloud computing is a data-intensive supercomputing model centered on data. It has distinctive technologies in five areas: data storage, data management, programming models, concurrency control, and system management.
1) Massive distributed storage technology
To ensure the reliability, high availability, and economy of stored data, cloud computing stores data in a distributed, redundant manner, using highly reliable software to compensate for unreliable hardware and thereby offering a cheap and reliable system to the outside world. To serve a large number of users, the data storage technology must also provide high throughput and high transfer rates.
The data storage systems used in cloud computing mainly include Google’s GFS (Google File System) and HDFS (Hadoop Distributed File System), the open-source system developed by the Hadoop development team. Most IT vendors, including Yahoo and Intel, adopt HDFS data storage technology in their cloud plans.
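The idea of distributed redundant storage described above can be illustrated with a small in-memory sketch. It is not the GFS or HDFS implementation; the class `TinyCluster`, the helper `place_replicas`, the chunk size, and the replica count of three are all hypothetical choices made for illustration. Each file is split into fixed-size chunks, every chunk is copied to several distinct nodes, and a read succeeds as long as at least one replica of each chunk is on a live node.

```python
import hashlib


def split_into_chunks(data, chunk_size):
    """Cut a byte string into fixed-size chunks (the last one may be shorter)."""
    return [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]


def place_replicas(chunk_id, nodes, replicas=3):
    """Pick `replicas` distinct nodes for a chunk (hypothetical placement rule).

    The starting node is derived from a hash of the chunk id, and the
    remaining replicas go to the following nodes in list order.
    """
    if replicas > len(nodes):
        raise ValueError("not enough nodes for the requested replica count")
    start = int(hashlib.sha256(chunk_id.encode()).hexdigest(), 16) % len(nodes)
    return [nodes[(start + k) % len(nodes)] for k in range(replicas)]


class TinyCluster:
    """Illustrative in-memory cluster; software redundancy masks node loss."""

    def __init__(self, nodes, chunk_size=4, replicas=3):
        self.nodes = list(nodes)
        self.chunk_size = chunk_size
        self.replicas = replicas
        self.storage = {node: {} for node in self.nodes}  # node -> {chunk_id: bytes}
        self.down = set()   # nodes currently considered failed
        self.index = {}     # file name -> ordered list of chunk ids

    def write(self, name, data):
        chunk_ids = []
        for i, chunk in enumerate(split_into_chunks(data, self.chunk_size)):
            chunk_id = f"{name}#{i}"
            # Store one copy of the chunk on every node chosen for it.
            for node in place_replicas(chunk_id, self.nodes, self.replicas):
                self.storage[node][chunk_id] = chunk
            chunk_ids.append(chunk_id)
        self.index[name] = chunk_ids

    def read(self, name):
        parts = []
        for chunk_id in self.index[name]:
            # Try each replica in turn and use the first one on a live node.
            for node in place_replicas(chunk_id, self.nodes, self.replicas):
                if node not in self.down and chunk_id in self.storage[node]:
                    parts.append(self.storage[node][chunk_id])
                    break
            else:
                raise OSError(f"all replicas of {chunk_id} are unavailable")
        return b"".join(parts)
```

With three replicas per chunk on distinct nodes, any two node failures still leave one readable copy of every chunk, which is the sense in which software redundancy compensates for unreliable hardware.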
2) Parallel programming mode
To use cloud computing resources efficiently and make its services easier for users to consume, the programming model of cloud computing must keep the complex parallel execution and task scheduling in the background transparent to users and programmers. Cloud computing adopts the MapReduce programming model, which automatically divides a task into multiple subtasks and schedules and allocates them across large numbers of computing nodes through two steps: Map and Reduce.
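The two steps can be sketched with a word count, the customary introductory example for this model. The sketch below runs on a single machine and uses a thread pool to stand in for computing nodes; the function names `map_phase`, `shuffle`, `reduce_phase`, and `run_job` are hypothetical and do not correspond to the API of Hadoop or any other framework. The programmer writes only the map and reduce functions, while `run_job` plays the role of the framework that splits, schedules, and collects the work.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def map_phase(split):
    """Map: turn one input split into a list of (word, 1) pairs."""
    return [(word.lower(), 1) for word in split.split()]


def shuffle(mapped):
    """Group all intermediate values by key, as the framework would between steps."""
    groups = defaultdict(list)
    for pairs in mapped:
        for key, value in pairs:
            groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def run_job(splits, workers=4):
    """Run map tasks in parallel, group their output, then run reduce tasks in parallel."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        mapped = list(pool.map(map_phase, splits))
        groups = shuffle(mapped)
        reduced = list(pool.map(lambda item: reduce_phase(*item), groups.items()))
    return dict(reduced)
```

Calling `run_job` with a list of text splits returns a dictionary of word counts. The caller never deals with worker threads or task assignment, which is the transparency the programming model is meant to provide.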
3) Data management technology
Cloud computing analyzes and processes large datasets to provide efficient services to users, so its data management technology must be able to manage large datasets efficiently. Finding specific data within large-scale data is another problem that cloud computing data management technology must solve. The data management of cloud computing systems often adopts a column-oriented storage model to ensure storage and analysis performance on massive data.
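A minimal sketch of why a column-oriented layout helps analysis is given below. It is an assumption-laden illustration, not the storage engine of any particular system: the helpers `to_columns`, `column_sum`, and `select_where` and the image metadata fields are hypothetical, and every record is assumed to have the same fields. Values of one attribute are stored together, so an aggregate or a filter touches only the columns it needs instead of reading every full record.

```python
def to_columns(rows):
    """Convert a list of records (dicts with identical keys) into one list per field."""
    columns = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)
    return columns


def column_sum(columns, name):
    """Aggregate a single column without reading any other field."""
    return sum(columns[name])


def select_where(columns, filter_col, predicate, output_col):
    """Return values of `output_col` for positions where `filter_col` satisfies `predicate`.

    Only two columns are scanned, regardless of how many fields each record has.
    """
    return [
        out
        for key, out in zip(columns[filter_col], columns[output_col])
        if predicate(key)
    ]
```

For example, given image metadata records with `image_id`, `size_kb`, and `format` fields, `column_sum(columns, "size_kb")` reads one list of numbers, and `select_where(columns, "format", lambda f: f == "png", "image_id")` reads two lists to find the matching identifiers.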