Two-way data irreversible storage system based on combination of block chain and blue-ray storage

文档序号:34595 发布日期:2021-09-24 浏览:33次 中文

阅读说明:本技术 一种基于区块链和蓝光存储相结合的双路数据不可逆存储系统 (Two-way data irreversible storage system based on combination of block chain and blue-ray storage ) 是由 王大鹏 张海龙 田国刚 姜南雪 张诗琪 金岚 于 2021-06-15 设计创作,主要内容包括:本发明公开了一种基于区块链和蓝光存储相结合的双路数据不可逆存储系统,包括存储模块和检索模块,所述存储模块能够通过数据存储策略将数据存储并进行管理,在区块链节点与用户达成数据存储的合约后,接收用户数据,通过智能算法预测用户数据访问概率,构建用户数据的数据标识,所述数据标识包括用户ID、用户数据索引、数据访问概率,判断数据访问概率是否超过阈值选择数据存储策略;所述检索模块能够通过数据检索策略在存储系统中进行检索,在用户检索数据时,根据数据标识进行检索,判断数据访问概率是否超过阈值选择数据检索策略。本发明提供了根据用户数据标识进行双路存储且操作不可逆的系统,能够满足大数据存储,且提供快速检索的服务。(The invention discloses a two-way data irreversible storage system based on combination of a block chain and blue-ray storage, which comprises a storage module and a retrieval module, wherein the storage module can store and manage data through a data storage strategy, receives user data after a contract of data storage is achieved between a block chain link point and a user, predicts user data access probability through an intelligent algorithm, constructs data identification of the user data, and judges whether the data access probability exceeds a threshold value to select the data storage strategy or not; the retrieval module can retrieve in the storage system through the data retrieval strategy, and when a user retrieves data, the retrieval module retrieves according to the data identification and judges whether the data access probability exceeds a threshold value to select the data retrieval strategy. The invention provides a system which performs double-path storage and is operated irreversibly according to the user data identification, can meet the requirement of large data storage, and provides a service of quick retrieval.)

1. A two-way data irreversible storage system based on the combination of a block chain and blue-ray storage is characterized by comprising a storage module and a retrieval module,

the storage module can store and manage data through a data storage strategy, receives user data after a contract of data storage is achieved between a block link node and a user, predicts the user data access probability through an intelligent algorithm, constructs the data identification of the user data, the data identification comprises a user ID, a user data index and the data access probability, judges whether the data access probability exceeds a threshold value, if so, adopts a data storage strategy a1, otherwise, adopts a data storage strategy a2,

the data storage policy comprises:

a1, verifying whether the block chain storage is saturated, if not, storing the user data in the block chain, and if saturated, transferring the data stored in the block chain to a blue light storage module, and then storing the user data in the block chain;

a2, verifying whether the block chain storage is saturated, if not, storing the user data in the block chain and the blue light storage module at the same time, and if saturated, deleting the data stored in the block chain and then storing the user data in the block chain and the blue light storage module at the same time;

the retrieval module can retrieve in the storage system through the data retrieval strategy, when a user retrieves data, the retrieval is carried out according to the data identification, whether the data access probability exceeds the threshold value is judged, if yes, the data retrieval strategy a3 is adopted, otherwise, the data retrieval strategy a4 is adopted,

the data retrieval policy includes:

a3, searching in the block chain based on the data identification of the user data, if the user data is not searched in the block chain, searching in the blue-ray storage module based on the data identification of the user data, feeding the search result back to the user, and updating the data access probability in the stored user data identification;

a4, directly searching in the blue-ray storage module, feeding back the search result to the user, and updating the data access probability in the data identification of the storage user.

2. The two-way data irreversible storage system based on the combination of the block chain and the blue-ray storage is characterized in that the data stored in the block chain is transferred to the blue-ray storage module by determining a data unit to be transferred, a temporary storage unit and a transfer unit,

determining a data unit to be transferred to be used for screening user data to be transferred from a block chain, when the storage space of the block chain reaches a saturation state, screening the user data in the block chain through data storage duration or data access frequency, and determining the user data meeting a threshold value as the user data to be transferred;

the temporary storage unit is used for creating a temporary storage space and storing the user data to be transferred in the temporary storage space;

the transfer unit is used for transmitting the data in the temporary storage space to the blue-ray storage module, and deleting the data to be transferred in the block chain after the data is successfully transferred.

3. The two-way data irreversible storage system based on the combination of the block chain and the Blu-ray storage as claimed in claim 1, wherein the storing the data in the Blu-ray storage module further comprises performing redundant backup on the data, wherein the redundant backup is to divide the data into a plurality of sub-data, then Reed-solomon encode the sub-data into redundant sub-data, and store the redundant sub-data separately.

4. The two-way data irreversible storage system based on the combination of the block chain and the blue-ray storage as claimed in claim 1, wherein the storing and managing the data means transferring the user data stored in the block chain at regular intervals, the regular intervals mean that the average time required for the block chain storage space to reach saturation is taken as a period, the transferring means deleting the data in the block chain when the data is stored in both the block chain and the blue-ray storage module, and when the data is stored in the block chain only, transferring the data to the blue-ray storage module and then deleting the data.

5. The two-way data irreversible storage system based on blockchain and blu-ray storage combination as claimed in claim 1, wherein the prediction of the user data access probability through an intelligent algorithm means that the probability of all files stored in the system being accessed is predicted through an intelligent algorithm, wherein the intelligent algorithm includes but is not limited to RBF neural network, GRNN neural network, SVR, and prediction is performed through one or more intelligent algorithm combinations.

Technical Field

The invention relates to the field of data storage, in particular to a two-way data irreversible storage system based on combination of a block chain and blue-ray storage.

Background

Nowadays, along with the rapid development of information technologies such as cloud computing, big data, internet of things, artificial intelligence and the like and the transformation of traditional industry digitalization, the data volume shows geometric growth. And there are great differences in access frequency, access times, etc. for different data. The current storage system adopts magnetism and electricity as physical storage media, and magnetism and electricity can keep data online all the time, improves data corresponding speed, but also brings huge energy consumption, short storage life, single storage mode, easy trouble and so on a great deal of problem simultaneously. The block chain storage is a data storage technology, which uses the disk space on each machine through the network, and forms a virtual storage device with these distributed storage resources, the data is distributed and stored in each corner of the network, and the data of a user is split into fragments and stored in different block chain nodes, thereby providing a safe, reliable, fast, flexible, and low-cost storage service for the user. Blockchain storage is needed to provide stable and fast service for users while reducing the cost of storing and retrieving data for nodes. It is challenging to maintain fast service with the need for large amounts of data storage and retrieval.

Disclosure of Invention

The invention provides a two-way data irreversible storage system based on the combination of a block chain and blue-ray storage, which aims to overcome the technical problem.

A two-way data irreversible storage system based on the combination of a block chain and blue-ray storage is characterized by comprising a storage module and a retrieval module,

the storage module can store and manage data through a data storage strategy, receives user data after a contract of data storage is achieved between a block link node and a user, predicts the user data access probability through an intelligent algorithm, constructs the data identification of the user data, the data identification comprises a user ID, a user data index and the data access probability, judges whether the data access probability exceeds a threshold value, if so, adopts a data storage strategy a1, otherwise, adopts a data storage strategy a2,

the data storage policy comprises:

a1, verifying whether the block chain storage is saturated, if not, storing the user data in the block chain, and if saturated, transferring the data stored in the block chain to a blue light storage module, and then storing the user data in the block chain;

a2, verifying whether the block chain storage is saturated, if not, storing the user data in the block chain and the blue light storage module at the same time, and if saturated, deleting the data stored in the block chain and then storing the user data in the block chain and the blue light storage module at the same time;

the retrieval module can retrieve in the storage system through the data retrieval strategy, when a user retrieves data, the retrieval is carried out according to the data identification, whether the data access probability exceeds the threshold value is judged, if yes, the data retrieval strategy a3 is adopted, otherwise, the data retrieval strategy a4 is adopted,

the data retrieval policy includes:

a3, searching in the block chain based on the data identification of the user data, if the user data is not searched in the block chain, searching in the blue-ray storage module based on the data identification of the user data, feeding the search result back to the user, and updating the data access probability in the stored user data identification;

a4, directly searching in the blue-ray storage module, feeding back the search result to the user, and updating the data access probability in the data identification of the storage user.

Preferably, the transferring the data stored in the block chain to the Blu-ray storage module comprises determining a data unit to be transferred, a temporary storage unit, a transfer unit,

determining a data unit to be transferred to be used for screening user data to be transferred from a block chain, when the storage space of the block chain reaches a saturation state, screening the user data in the block chain through data storage duration or data access frequency, and determining the user data meeting a threshold value as the user data to be transferred;

the temporary storage unit is used for creating a temporary storage space and storing the user data to be transferred in the temporary storage space;

the transfer unit is used for transmitting the data in the temporary storage space to the blue-ray storage module, and deleting the data to be transferred in the block chain after the data is successfully transferred.

Preferably, the storing the data in the blu-ray storage module further includes performing redundant backup on the data, where the redundant backup is to divide the data into a plurality of sub-data, then Reed-solomon encode the sub-data into redundant sub-data, and store the redundant sub-data separately.

Preferably, the step of storing and managing the data refers to transferring the user data stored in the block chain at regular intervals, the regular intervals refer to taking average time required for the block chain storage space to reach saturation as a period, the step of transferring refers to deleting the data in the block chain when the data is stored in the block chain and the blue light storage module at the same time, and deleting the data after transferring the data to the blue light storage module when the data is stored in the block chain only.

6. Preferably, predicting the user data access probability through an intelligent algorithm means predicting the probability that all files stored in the system are accessed through an intelligent algorithm, wherein the intelligent algorithm comprises but is not limited to an RBF neural network, a GRNN neural network and an SVR, and the prediction is carried out through one or more intelligent algorithm combinations.

The invention provides a two-way data irreversible storage system based on combination of a block chain and blue-ray storage, which can be used for storing data in the block chain and the blue-ray storage module respectively by adopting different storage strategies, realizes safe storage of operation irreversible data based on the storage characteristics of the block chain and the blue-ray storage, transfers data with user data access probability exceeding a threshold value according to an intelligent algorithm, and improves data access speed.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.

FIG. 1 is a block diagram of the system of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Fig. 1 is a system block diagram of the present invention, and as shown in fig. 1, the method of this embodiment may include:

a two-way data irreversible storage system based on the combination of a block chain and blue-ray storage comprises a storage module and a retrieval module,

the storage module can store and manage data through a data storage strategy, the data storage and management means that user data stored in a block chain is transferred regularly, the regular period means that average time required for a block chain storage space to reach saturation is taken as a period, the transfer means that the data in the block chain is deleted when the data is stored in the block chain and the blue light storage module at the same time, and when the data is stored in the block chain only, the data is transferred to the blue light storage module and then deleted.

The block chain has a timestamp characteristic and a non-tampering characteristic, the two characteristics are used for storing data certificates and comprise a current block chain, an upper block chain, a lower block chain, a block chain service system and a storage unit, the block chain service system comprises a C/S (client/server) architecture mode, a B/S (client/server) architecture mode, a system logic module, a data block layer and a network switching layer which are sequentially connected, the storage unit and the system logic module respectively receive block chain data written by the current block chain, the upper block chain and the lower block chain link point and store the block chain data, and when a node reads the block chain data, the node reads the block chain data from the system logic module and the storage unit;

the data block layer comprises a sequence set based on physical data blocks and is used for providing distributed data storage and data reading of the whole block chain network, when an access terminal needs to access data, the storage unit selects one of the disclosure or confidentiality of the data needing to be accessed, the access terminal is controlled by the storage unit, the input terminal stores the data through the storage unit, and the storage unit discloses a data receiving unit and a confidential data receiving unit.

The blue light storage module comprises a storage array and a control unit; the storage array comprises a plurality of optical disc boxes and an optical disc drive array, the control unit comprises an I/O processing chip and an RAID control chip, the I/O processing chip and the RAID control chip provide performance support for the optical disc drive array in the blue-ray storage module to carry out data recording and reading, RAID redundancy, off-line management and data reading are completed, the blue-ray storage module takes a blue-ray disc as a storage medium, the blue-ray disc has the characteristics of irreversibility, virus prevention, hacker prevention, electromagnetic attack prevention and the like once data is written in, and the integrity of the data is guaranteed.

7. After a contract of data storage is achieved between the block chain link points and the user, user data is received, the user data access probability is predicted through an intelligent algorithm and is recorded as f1The probability of access to all files stored in the system is predicted by intelligent algorithms, including but not limited to RBF neural networks, GRNN neural networks, SVR, in one or more intelligent algorithm combinations.

When a user first saves data in a system, the data is a point by point, so that the data is still in an island form at the beginning. The users can access the data with different probabilities, and the data also has access influence, and the data access probability of the users needs to be updated, and the step of updating the data access probability comprises the following steps:

step 1a, setting initial values of access probabilities of all incidence relations;

step 1b, acquiring access data of a storage system in a set time period;

step 1c, updating the initial value of the access probability according to the access data, and recording the data access probability as f2

The data access probability of a user is calculated by equation (1),

f=f1*k1+f2*k2 (1)

wherein k is1、k2Is a weight, k1+k2F is the user data access probability.

The specific implementation steps include that when a user accesses certain data (marked as A), the data is marked as a root node, another data B accessed by the same user is marked as reference 1, the revisited data C is marked as reference 2, the revisited data D is marked as reference 3, the revisited data E is marked as reference 4, and the reference relations among different nodes are relative reference relations, namely, the reference 1 of B is C, D is the reference 3 of A, and D is the reference 2 of B;

the initial values of the access probabilities set by the references marked for the first time are all 50%, namely the access probability between the root node and the reference 1, the access probability between the reference 1 and the reference 2, and so on, the initial values of the access probabilities between the references are all set to be 50%, and the threshold value of the data access probability is set to be 70%;

according to the constant access of users, the data can be connected from the islands to the land. In the process, the probability updating is mainly judged according to the access condition of the user.

Wherein, the access probability updating rule is as follows:

after the user accesses the B after the A, the data access probability is updated, and the original access probability is improved by 50% to 100%. After the user accesses the A, the B is automatically cached in the high-speed device, and the next time the user accesses the B, the quick response is obtained.

After the user accesses the data A, if the user does not access the data B, updating the data access probability, reducing the original access probability by 50% to 25%, marking the newly accessed data as reference 2, and setting the access probability with the reference 2 to be 25%;

after the client accesses the client A, the client accesses the client B and then accesses the client C, the access probabilities of the client B and the client C are updated simultaneously, and the access probabilities of the client B and the client C are improved by 2 times;

if the D is accessed, then the A is accessed, the A is the reference of the D, and the updating of the access probability of the A is consistent with the updating mode of other nodes;

if the access probabilities of B and C are the same and reach the data access probability threshold, caching B and C at the same time;

and setting a value reduction protection strategy for all references reaching a data access probability threshold, adjusting the reference probability of the references with the number of times of non-access reaching more than 5 times, avoiding repeated fluctuation of the references, and reducing the access probability by 2 times for the references with the number of times of non-access reaching more than 5 times.

Constructing data identification of user data, wherein the data identification comprises a user ID, a user data index and a data access probability, judging whether the data access probability exceeds a threshold value, if so, adopting a data storage strategy a1, otherwise, adopting a data storage strategy a2,

the data storage policy comprises:

a1, verifying whether the block chain storage is saturated, if not, storing the user data in the block chain, and if saturated, transferring the data stored in the block chain to a blue light storage module, and then storing the user data in the block chain; the transferring of the data stored in the block chain to the blue-ray storage module comprises determining a data unit to be transferred, a temporary storage unit and a transfer unit,

determining a data unit to be transferred to be used for screening user data to be transferred from a block chain, when the storage space of the block chain reaches a saturation state, screening the user data in the block chain through data storage duration or data access frequency, and determining the user data meeting a threshold value as the user data to be transferred;

the temporary storage unit is used for creating a temporary storage space and storing the user data to be transferred in the temporary storage space;

the transfer unit is used for transmitting the data in the temporary storage space to the blue-ray storage module, and deleting the data to be transferred in the block chain after the data is successfully transferred.

a2, verifying whether the block chain storage is saturated, if not, storing the user data in the block chain and the blue light storage module at the same time, and if saturated, deleting the data stored in the block chain and then storing the user data in the block chain and the blue light storage module at the same time;

the verification that the block chain is saturated may be performed by testing the block chain to obtain a storage state when the block chain is saturated, and when a block chain link point increases to a certain extent or a block chain storage data reaches a certain amount, a system resource usage rate (e.g., a CPU usage rate or a memory usage rate) of the block chain tends to be stable (e.g., approximately stable at 90%), at this time, the block chain resource usage rate may be considered to be substantially saturated, and the state may be considered as a saturation state of the block chain system. When the block chain storage reaches the bottleneck state, the data is transferred to the blue light storage module.

The retrieval module can retrieve in the storage system through the data retrieval strategy, when a user retrieves data, the retrieval is carried out according to the data identification, whether the data access probability exceeds the threshold value is judged, if yes, the data retrieval strategy a3 is adopted, otherwise, the data retrieval strategy a4 is adopted,

the data retrieval policy includes:

a3, searching in the block chain based on the data identification of the user data, if the user data is not searched in the block chain, searching in the blue-ray storage module based on the data identification of the user data, feeding the search result back to the user, and updating the data access probability in the stored user data identification;

a4, directly searching in the blue-ray storage module, feeding back the search result to the user, and updating the data access probability in the data identification of the storage user.

The data storage in the blue-ray storage module also comprises the step of carrying out redundancy backup on the data, wherein the redundancy backup is to divide the data into a plurality of subdata, then carry out Reed-Solmon coding on the subdata to form redundancy subdata, and store the redundancy subdata separately.

The beneficial effects of the whole are as follows: the data access method and the data access system can store data in the block chain and the blue light storage module respectively by adopting different storage strategies, realize the safe storage of the data which is operated irreversibly based on the storage characteristics of the block chain and the blue light storage, transfer the data of which the user data access probability exceeds a threshold value according to an intelligent algorithm, and improve the data access speed.

Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.

9页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:一种分布式图片存储转为视频并转码展示方法

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!