Equipment operation control method, device and equipment

文档序号：1686841 发布日期：2020-01-03 浏览：24次中文

阅读说明：本技术 设备运行控制方法、装置及设备 (Equipment operation control method, device and equipment ) 是由孙健杨龙梁国栋张振国赵兴海于 2019-09-30 设计创作，主要内容包括：本申请公开了设备运行控制方法、装置及设备。该方法包括：获取设备运行指令；利用神经网络根据所述设备运行指令确定设备需要达到的目标运行状态；采集所述设备进入t时刻后,能够反映所述设备在所述t时刻运行状态的多模态数据,基于所述多模态数据和所述目标运行状态,对所述设备的包括所述t时刻在内的后续运行时段的运行轨迹进行预测,得到预测运行轨迹,控制所述设备按照所述预测运行轨迹运行。设备结合多模态数据进行轨迹预测,提高了预测结果的准确性,确保了设备安全运行以及顺利达到目标运行状态。(The application discloses a device operation control method, a device and equipment. The method comprises the following steps: acquiring an equipment operation instruction; determining a target operation state which needs to be reached by the equipment according to the equipment operation instruction by utilizing a neural network; and acquiring multi-modal data capable of reflecting the running state of the equipment at the time t after the equipment enters the time t, predicting the running track of the equipment in a subsequent running period including the time t based on the multi-modal data and the target running state to obtain a predicted running track, and controlling the equipment to run according to the predicted running track. The device carries out the track prediction by combining the multi-mode data, improves the accuracy of the prediction result, and ensures the safe operation of the device and the smooth achievement of the target operation state.)

1. A method for controlling operation of a plant, the method comprising:

acquiring an equipment operation instruction;

determining a target operation state which needs to be reached by the equipment according to the equipment operation instruction by utilizing a neural network;

acquiring multi-mode data capable of reflecting the running state of the equipment at the time t after the equipment enters the time t;

predicting the running track of the equipment in the subsequent running time period including the t moment based on the multi-modal data and the target running state to obtain a predicted running track;

and controlling the equipment to operate according to the predicted operation track.

2. The method of claim 1, wherein the number of predicted operational trajectories is plural, and wherein the controlling the device to operate according to the predicted operational trajectories comprises:

screening out an optimal predicted operation track from the plurality of predicted operation tracks according to a preset screening rule;

and controlling the equipment to operate according to the optimal predicted operation track.

3. The method of claim 2, wherein the screening the optimal predicted operation trajectory from the plurality of predicted operation trajectories according to a preset screening rule comprises:

for each predicted operation track, acquiring predicted behavior data of the equipment at the t moment from the predicted operation track, wherein the predicted behavior data comprises predicted component behavior data of all components of the equipment, and weighting and processing all the predicted component behavior data to obtain weighted sum;

acquiring a historical behavior identifier of the device at the t-1 moment, wherein the historical behavior identifier comprises: historical component behavior identification of the total components;

carrying out LSTM coding processing on the first history context, the history behavior identifier and the weighted sum at the time t-1 to obtain a second history context at the time t;

and processing the equipment operation instruction, the second historical context corresponding to each predicted operation track and all predicted component behavior data according to an attention mechanism to determine the optimal predicted operation track, wherein the equipment operation instruction comprises a plurality of component operation instructions.

4. The method of claim 3, wherein said processing said device operating instructions, said second historical context corresponding to each of said predicted operational trajectories, and all predicted component behavior data in accordance with an attention mechanism to determine said optimal predicted operational trajectory comprises:

processing the corresponding second historical context and all component operation instructions according to an attention mechanism aiming at each predicted operation track to obtain a text context at the time t;

processing the text context and the corresponding behavior data of all the prediction components according to an attention mechanism to obtain a state context at the time t;

carrying out bilinear dot product calculation on the corresponding second historical context, the corresponding text context and the corresponding state context to obtain evaluation data of the predicted running track;

and determining the predicted operation track corresponding to the optimal evaluation data as the optimal predicted operation track.

5. The method of claim 2, wherein the plant predicts predicted operational data for the current time at each operational time and operates in accordance with the predicted operational data; the controlling the device to operate according to the optimal predicted operation track comprises the following steps:

acquiring the optimal predicted behavior data of the equipment at the t moment from the optimal predicted running track;

and controlling the equipment to operate according to the optimal predicted behavior data at the time t.

6. The method of claim 5, further comprising:

acquiring the actual running state of the equipment after the equipment runs according to the optimal predicted behavior data at all running moments;

determining a first network loss of the neural network according to a difference between the actual operating state and the target operating state when the actual operating state and the target operating state satisfy a difference range;

adjusting network parameters of the neural network based on the first network loss.

7. The method of claim 2, further comprising:

simulating the operation of the equipment according to a plurality of predicted operation tracks;

determining performance evaluation data of the equipment after the execution of each predicted running track is finished;

determining a predicted operation track corresponding to the optimal performance evaluation data as an optimal predicted operation track, and determining the residual predicted operation track as a non-optimal predicted operation track;

determining a second network loss value of the neural network based on a difference between the optimal predicted operational trajectory and the non-optimal predicted operational trajectory;

adjusting a network parameter of the neural network based on the second network loss value.

8. The method of claim 1, wherein controlling the device to operate in accordance with the predicted operational trajectory comprises:

acquiring the predicted behavior data of the predicted running track at the t moment,

judging whether the predicted behavior data meet preset operation data conditions or not, wherein the preset behavior data conditions limit the operation data range of the equipment in normal operation;

and when the predicted behavior data meet the preset operation data condition, controlling the equipment to operate according to the predicted behavior data.

9. The method of claim 8, further comprising:

when the predicted behavior data does not meet the preset operation data condition, outputting failure prediction prompt information; and/or the presence of a gas in the gas,

and when the predicted behavior data does not meet the preset operation data condition, stopping the operation of the equipment.

10. The method of claim 1, wherein the multi-modal data comprises at least one of: the device comprises a component operation data, a device performance index, information generated based on the currently collected data and combined information of correction information, a correction log, an operation log and an environmental parameter.

11. An apparatus operation control method device, characterized in that the device comprises:

the acquisition module is configured to acquire a device operation instruction;

the determining module is configured to determine a target operation state which needs to be reached by the equipment according to the equipment operation instruction by utilizing a neural network;

the acquisition module is configured to acquire multi-mode data capable of reflecting the running state of the equipment at the time t after the equipment enters the time t;

the prediction module is configured to predict the running locus of the equipment in a subsequent running period including the t moment based on the multi-modal data and the target running state to obtain a predicted running locus;

a control module configured to control the device to operate according to the predicted operation trajectory.

12. An apparatus, comprising: the system comprises an internal bus, a memory, a processor and an external interface which are connected through the internal bus; wherein the content of the first and second substances,

the external interface is used for acquiring data;

the memory is used for storing machine readable instructions corresponding to the running control of the equipment;

the processor is configured to read the machine-readable instructions on the memory and perform the following operations:

acquiring an equipment operation instruction;

determining a target operation state which needs to be reached by the equipment according to the equipment operation instruction by utilizing a neural network;

acquiring multi-mode data capable of reflecting the running state of the equipment at the time t after the equipment enters the time t;

and controlling the equipment to operate according to the predicted operation track.

Technical Field

The present disclosure relates to the field of device technologies, and in particular, to a method, an apparatus, and a device for controlling device operation.

Background

The equipment stores running programs written aiming at different equipment running instructions, and the running programs are used for controlling the equipment to run according to the specified running track. After receiving a certain device operation instruction, the device acquires an operation program written according to the device operation instruction and operates according to the operation program. For example, the medical device receives the initialization instruction, acquires an operation program for device initialization, and controls the operation of the device according to the operation program, thereby completing the device initialization.

When the performance of the device changes, for example, the performance of the device changes after the device is used for a long time, which may cause the performance of the device to change, or the performance of the device changes due to the component update of the device, the device may run using a pre-programmed fixed running program, which may cause undesirable phenomena such as failure to reach a target running state and damage to the component.

Disclosure of Invention

In order to overcome the problems in the related art, the specification provides a method, a device and equipment for controlling the operation of the equipment.

Specifically, the method is realized through the following technical scheme:

in a first aspect, a method for controlling operation of a device is provided, the method including:

acquiring an equipment operation instruction;

determining a target operation state which needs to be reached by the equipment according to the equipment operation instruction by utilizing a neural network;

acquiring multi-mode data capable of reflecting the running state of the equipment at the time t after the equipment enters the time t;

and controlling the equipment to operate according to the predicted operation track.

In a second aspect, there is provided an apparatus for controlling operation of a device, the apparatus including:

the acquisition module is configured to acquire a device operation instruction;

the determining module is configured to determine a target operation state which needs to be reached by the equipment according to the equipment operation instruction by utilizing a neural network;

the acquisition module is configured to acquire multi-mode data capable of reflecting the running state of the equipment at the time t after the equipment enters the time t;

a control module configured to control the device to operate according to the predicted operation trajectory.

In a third aspect, an apparatus is provided, comprising: the system comprises an internal bus, a memory, a processor and an external interface which are connected through the internal bus; wherein the content of the first and second substances,

the external interface is used for acquiring data;

the memory is used for storing machine readable instructions corresponding to the running control of the equipment;

the processor is configured to read the machine-readable instructions on the memory and perform the following operations:

acquiring an equipment operation instruction;

determining a target operation state which needs to be reached by the equipment according to the equipment operation instruction by utilizing a neural network;

acquiring multi-mode data capable of reflecting the running state of the equipment at the time t after the equipment enters the time t;

and controlling the equipment to operate according to the predicted operation track.

The technical scheme provided by the embodiment of the specification can have the following beneficial effects:

in the embodiment of the specification, after the device obtains a device operation instruction, the device determines a target operation state which the device needs to reach according to the device operation instruction by using a neural network, acquires multi-mode data which can reflect the current operation state of the device after the device enters the current time t, predicts the operation track of the device in a subsequent operation period including the time t based on the multi-mode data and the target operation state, and predicts the track by combining the multi-mode data, so that the accuracy of a prediction result is improved, and the safe operation of the device and the smooth reaching of the target operation state are ensured.

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the specification.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present specification and together with the description, serve to explain the principles of the specification.

FIG. 1 is a flow chart illustrating a method for controlling operation of a device according to an exemplary embodiment of the present application;

FIG. 2 is a flow chart illustrating another method of controlling operation of a device according to an exemplary embodiment of the present application;

FIG. 3 is a flow chart illustrating another method of controlling operation of a device according to an exemplary embodiment of the present application;

FIG. 4 is a schematic diagram of an embodiment of a method for controlling the operation of the apparatus of the present application;

FIG. 5 is a schematic diagram of another embodiment of the apparatus operation control method of the present application;

fig. 6 is a block diagram illustrating an apparatus operation control device according to an exemplary embodiment of the present application;

fig. 7 is a schematic diagram of an apparatus according to an exemplary embodiment of the present application.

Detailed Description

Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present specification. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the specification, as detailed in the appended claims.

The application provides an equipment operation control method, which is applied to equipment, wherein the equipment can be medical equipment, processing equipment in a factory and the like, and the medical equipment can be PET equipment, CT equipment, PET/CT equipment and the like. The application does not limit the type of the device.

Referring to fig. 1, a flowchart of an embodiment of an apparatus operation control method according to the present application may include the following steps:

in step 101, a device operation instruction is obtained.

And in the using process of the equipment, receiving an equipment operation instruction, and controlling the equipment to operate according to the equipment operation instruction.

The device operation instructions are different for different devices, for example, for a PET/CT device, the device operation instructions may be initialization instructions, image scanning instructions, and the like.

There are various ways to obtain the device operation instruction, for example, receiving a pressing operation of a user on a physical key on the device, receiving a selection operation of a user on a virtual key displayed on a device interface, receiving a device operation instruction sent by the control device, and the like.

In step 102, a target operation state which needs to be reached by the equipment is determined according to the equipment operation instruction by using the neural network.

In this embodiment, the device is provided with a neural network trained in advance, and the running trajectory prediction is performed by using the neural network.

The device operating instructions indicate a target operating state that the device needs to achieve and may include a target operating state that a plurality of components in the device need to achieve. For example, the initialization instruction of the PET/CT apparatus indicates a target operation state that the PET/CT apparatus needs to reach after the initialization is finished, including a target operation state that each component in the PET/CT apparatus needs to reach, such as a temperature after the temperature of the bulb tube is raised, a pressure value after the pressure of the high-pressure generator is raised, and the like.

In step 103, multi-modal data capable of reflecting the running state of the device at the time t after the device enters the time t is collected.

In the operation process of the control device, the time t can be preset time, such as time designated by a user, default time of the device and the like, and the device only collects multimodal data and executes subsequent operations at the preset time t; or, the time t may be the current time in the operation process of the device, and may be understood as each operation time, at which the device acquires multimodal data and executes subsequent operations.

The multi-modal data can reflect the running state of the equipment at the time t, and the multi-modal data comprises data of a plurality of modes, wherein the modes can be understood as view angles and angles. The multimodal data includes a variety of data types, for example, the multimodal data can include at least one of: the system comprises a combination of component operating data of the equipment, equipment performance indexes, information generated based on currently acquired data and correction information, a correction log, an operating log and environmental parameters.

For example, for a PET/CT device, the multi-modality data may include at least one of: various image data and correction image data in the PET/CT apparatus, and the like; voltage, current, temperature, etc. of each electronic component in the PET/CT apparatus; the rotating speed, the position, the movement times and the like of each mechanical part in the PET/CT equipment; performance indexes such as time resolution, spatial resolution, sensitivity and counting rate in the PET/CT equipment; data transmission indexes of detector data acquisition channels in the PET/CT equipment; a correction log of the PET/CT device; a maintenance log of the PET/CT device; a log of the operation of the PET/CT system; operating climate data and seasonal data of the PET/CT equipment.

In step 104, based on the multi-modal data and the target operation state, the operation trajectory of the device in the subsequent operation period including the time t is predicted to obtain a predicted operation trajectory.

After the device obtains the multi-modal data capable of reflecting the running state of the device at the time t, the device predicts the running track of the device in the subsequent running period including the time t based on the multi-modal data and the target running state to obtain the predicted running track.

The obtained predicted operation track comprises: a predicted component behavior trace for each of all components of the device. The predicted component behavior trajectory for each component includes: predicted component operational data for the component from time t to each operational time at the end of the operation.

For example, the PET/CT device obtains a predicted trajectory based on the initialization instructions and the multi-modal data, including: the predicted operation track of the bulb, the predicted operation track of the motor, the predicted operation track of the boosting component and the like, wherein the predicted operation track of the bulb comprises the following steps: the applied voltage of the bulb tube from the time t to each operation time in the operation end and the like, and the predicted operation track of the motor comprises the following steps: and the rotating speed and the rotating direction of the motor from the time t to each operating time in the operation end, and the like.

Based on the structure of the neural network in the device, the neural network can predict one predicted operation trajectory or a plurality of predicted operation trajectories based on the multi-modal data and the target operation state.

In step 105, the control device operates according to the predicted operation trajectory.

After the equipment predicts the running track by using the neural network, the equipment is controlled to run according to the predicted running track. The predicted operation locus comprises a predicted component operation locus of each component in the whole components, and the equipment controls the operation of the corresponding component by using the predicted component operation locus.

In the embodiment, the device performs the track prediction by combining the multi-mode data, so that the accuracy of the prediction result is improved, and the safe operation of the device and the smooth achievement of the target operation state are ensured.

In an alternative embodiment, the number of predicted operation trajectories predicted by the neural network is multiple, see fig. 2, which is a flowchart of another embodiment of the apparatus operation control method of the present application, and step 105 may be implemented by: in step 1051, screening an optimal predicted operation trajectory from the plurality of predicted operation trajectories according to a preset screening rule; in step 1052, the control device operates in accordance with the optimal predicted operational trajectory.

Based on the settings of the steps 1051 and 1052, the neural network is enabled to predict a plurality of predicted operation tracks and screen out an optimal predicted operation track from the plurality of predicted operation tracks.

Referring to step 1051 and referring to fig. 3, a neural network comprises a trajectory prediction sub-network, a trajectory evaluation sub-network, and a behavior determination sub-network, wherein the trajectory prediction sub-network is used for trajectory prediction; the track evaluation sub-network is used for track evaluation; the behavior determination sub-network is configured to determine predicted behavior data from the predicted trajectory. The trajectory prediction sub-network, the trajectory evaluation sub-network and the behavior determination sub-network are combined to obtain a behavior prediction sub-network, i.e., the behavior predictor in fig. 3.

Based on the above neural network setting, referring to fig. 4, which is a flowchart of another embodiment of the apparatus operation control method of the present application, step 1051 may be implemented by the following steps 1051-1 to 1051-4:

in step 1051-1, for each predicted operation trajectory, predicted behavior data of the device at time t is acquired from the predicted operation trajectory, the predicted behavior data includes predicted component behavior data of all components of the device, and all the predicted component behavior data are weighted and processed to obtain a weighted sum.

The predicted behavior data can be all predicted data corresponding to the t moment in the predicted running track; alternatively, the predicted behavior data may be data screened from all predicted data corresponding to the time t in the predicted operation trajectory, and the predicted behavior data may be feature data of the predicted behavior, for example, for a motor component, the rotation speed and the rotation direction of the motor at the time t are set in the predicted operation trajectory, and the predicted behavior data may be the rotation speed of the motor.

All predicted component behavior data may be weighted and processed by equation one below.

υ_t＝a_t，1υ_t，1+a_t，2υ_t，2+…+a_t，mυ_t，mFormula one

Wherein upsilon is_tIs the weighted sum at time t; upsilon is_t，1First predicted component behavior data for time t; a is_t，1Is upsilon_t，1A corresponding weight value; upsilon is_t，2Second predicted component behavior data for time t; a is_t，2Is upsilon_t,2A corresponding weight value; upsilon is_t，mPredicting component behavior data for the mth at time t; a is_t，mIs upsilon_t，mAnd (4) corresponding weight values.

In step 1051-2, historical behavior identification a of the device at time t-1 is obtained_t-1Historical behavior flag a_t-1The method comprises the following steps: historical component behavior identification for all components.

The component behavior identifier is used for indicating component behaviors, and the component behavior identifiers corresponding to different component behaviors are different. For example, the motor rotation is represented by the number 1 and the motor stop is represented by the number 2.

In step 1051-3, a first history context h at time t-1 is processed_t-1The historical behavior mark a obtained in the step 1051-2_t-1And the weighted sum obtained in the step 1051-1 is processed by LSTM coding to obtain a second history context h at the time t_t。

The LSTM is called Long Short-Term Memory, which is one of RNN (Current Neural network), and based on the design characteristics, the LSTM is particularly suitable for modeling time sequence data, such as text data. The second history context h at the time t can be obtained by the following formula two_t。

h_t＝LSTM([υ_t，a_t-1]，h_t-1) Formula two

Wherein h is_tA second historical context at time t; h is_t-1A first history context at time t-1; upsilon is_tIs the weighted sum at time t; a is_t-1And identifying historical behaviors of the equipment at the time t-1.

LSTM will be v_t、a_t-1And h_t-1Is coded as h_t. Iterative counting by using formula twoCalculating to obtain h₀To h_t。

In step 1051-4, the device operation instructions, the second historical context corresponding to each predicted operation trajectory, and all predicted component behavior data are processed according to an attention mechanism to determine an optimal predicted operation trajectory, wherein the device operation instructions include a plurality of component operation instructions.

Steps 1051-4 may be implemented as follows:

and a first step of processing the corresponding second historical context and all the part operation instructions according to an attention mechanism aiming at each predicted operation track to obtain a text context at the time t.

Encoding device operational instructions X into a set of textual features using a speech coder LSTM

The Attention model in deep learning actually simulates the Attention model of human brain, and the step learns the history context h_tWhich sub-instructions should be focused on next step is known through the attention model for the text of the condition. The text context at time t can be obtained by the following formula three.

Wherein the content of the first and second substances,

is the text context at time t; h is_tA second historical context at time t;

a text set of component operation instructions w included for the equipment operation instructions; i is the identification of the component operation instruction; n is the number of component execution instructions.

And secondly, processing the text context at the time t and all corresponding prediction component behavior data according to an attention mechanism to obtain a state context at the time t.

The state context at time t can be obtained by the following formula four.

Wherein the content of the first and second substances,state context at time t;

is the text context at time t;

a set of predicted component behavior data for all components at time t, j being an identification of the component; m is the number of parts.

And thirdly, carrying out bilinear dot product calculation on the corresponding second history context, text context and state context to obtain evaluation data of the predicted running track.

The evaluation data of the predicted running locus can be obtained by the following formula five.

Wherein, P_kEvaluating data for the equipment according to the running direction when the running track is predicted to run, and expressing the data in a probability form; h is_tA second historical context at time t;

is the text context at time t;

is the state context at time t.

The operation direction evaluation data may be a probability of normal operation of the device, a probability of failure of the device, a probability that the device can reach a target operation state, a probability that the device cannot reach the target operation state, or the like.

And step four, determining the predicted operation track corresponding to the optimal evaluation data as the optimal predicted operation track.

For example, when the operation direction evaluation data is the probability that the equipment can reach the target operation state, the predicted operation trajectory having the highest probability is determined as the optimal predicted operation trajectory. And when the operation direction evaluation data is the probability of equipment failure, determining the predicted operation track with the minimum probability as the optimal predicted operation track.

In an alternative embodiment, the plant may predict and operate according to predicted operating data at the current time using the prediction method described above at each operating time. The predicted operational data includes predicted component operational data for the global component.

Step 1052 may then be implemented as follows: obtaining the optimal predicted behavior data a of the equipment at the time t from the optimal predicted operation track_tThe control device follows the best predicted behavior data a at time t_tAnd (5) operating. When the time t +1 is reached, the equipment acquires the optimal predicted behavior data of the equipment at the time t +1 from the optimal predicted operation track predicted at the time t +1, and the equipment is controlled to operate according to the optimal predicted behavior data at the time t + 1.

The equipment predicts the predicted operation data of the current time at each operation time and operates according to the predicted operation data, and the operation track can be adjusted in time according to the operation condition of the equipment per se, so that the equipment can operate towards the direction of the target operation state more favorably.

In an optional embodiment, in order to improve the generalization of the learned strategy, an automatic supervision learning method is provided, and the state of the equipment without prior knowledge is predicted by simulating the previous good decision-making mode, so that a better and more efficient strategy is obtained, the performance difference of the prediction success rate in the environment with prior knowledge and the environment without prior knowledge is greatly reduced, and the prediction accuracy is improved.

Specifically, the device predicts the operation track by using the neural network, and adjusts network parameters of the neural network by using the predicted operation track and the situation that the device operates according to the predicted operation track, so that the prediction result of the operation track by the neural network is more accurate.

There are various ways of adjusting network parameters of the neural network by using the predicted operation trajectory and the condition that the device operates according to the predicted operation trajectory. For example, the first adjustment method: when the equipment predicts the predicted operation data of the current moment at each operation moment and operates according to the predicted operation data, the actual operation state of the equipment is obtained after the equipment operates according to the optimal predicted behavior data of all operation moments, namely after the equipment operates according to the predicted operation track; when the actual operation state and the target operation state meet the difference range, determining a first network loss of the neural network according to the difference between the actual operation state and the target operation state; network parameters of the neural network are adjusted based on the first network loss. And when the actual running state and the target running state do not meet the difference range, judging that the predicted running track does not meet the requirement, and not optimizing the neural network in the actual running state.

The first adjustment may be referred to as an external reward adjustment, which measures the success signal and prediction error of the action of each component of the device.

The second adjustment mode is as follows: referring to fig. 5, a schematic diagram of another embodiment of the device operation control method of the present application is shown, in which a neural network includes: trajectory prediction subnetwork pi_θTrajectory evaluation subnetwork V_βSimulating a sub-network; wherein the trajectory prediction sub-network pi_θFor predicting the track to obtain a plurality of predicted running tracks { tau₁,τ₂,…,τ_KK is the number of the predicted operation tracks, and a plurality of predicted operation tracks are transmitted to a track evaluation sub-network V_β(ii) a Trajectory evaluation subnetwork V_βThe simulation sub-network is used for performing track evaluation on the plurality of predicted running tracks, determining an optimal predicted running track and a non-optimal predicted running track, and transmitting the optimal predicted running track and the non-optimal predicted running track to the simulation sub-network; simulation subnetwork according to optimal predicted operation track and non-optimalAnd (4) well predicting the difference between the running tracks, and adjusting the network parameters of the neural network.

Specifically, the device may further perform the following operations: simulating the running equipment according to the plurality of predicted running tracks; determining performance evaluation data of the equipment after the execution of each predicted running track is finished; determining a predicted operation track corresponding to the optimal performance evaluation data as an optimal predicted operation track, and determining the residual predicted operation track as a non-optimal predicted operation track; determining a second network loss value of the neural network according to a difference between the optimal predicted operation trajectory and the non-optimal predicted operation trajectory; adjusting a network parameter of the neural network based on a second network loss value.

The performance assessment data may include at least one of: the target state is reached, the target state is not reached, the matching degree with the target state, the heating value, the use power and other parameters are obtained. The optimal performance assessment data includes: the target state is reached, the matching degree with the target state is maximum, the heating value is minimum and the like.

The second adjustment mode can be called an internal reward optimization mode, and the internal reward measurement is the matching condition between the device operation instruction X and the predicted operation track τ.

Pre-training a trajectory evaluation sub-network V_βTo calculate a loop reconstruction internal reward Rintr that facilitates matching alignment of the device operating instructions X with the trajectory τ. It encourages to respect the instructions and penalize tracks that deviate from the instructions.

The internal award Rintr can be calculated by the following equation six.

R_intr＝V_β(X，τ)＝V_β(X，π_θ(X)) formula six

One way to achieve a match evaluation is to measure the recurring reconstruction reward P ═ (X ═ X | pi)_θ(X)), i.e. the probability of a reconstruction device operating an instruction X under a given trajectory is given by τ ═ pi_θ(X) executing.

The internal award Rintr can be calculated by the following formula seven.

R_intr＝P_β(X，π_θ(X))＝P_β(X, τ) formula seven

The RL loss and the gradient of the immutable loss function can be determined from the internal reward Rintr, based on which network parameters of the neural network are adjusted.

In an alternative embodiment, step 105 may be implemented by: firstly, acquiring predicted behavior data of a predicted running track at time t; secondly, judging whether the predicted behavior data meet preset operation data conditions or not, wherein the preset behavior data conditions limit the operation data range of the equipment in normal operation; and finally, when the predicted behavior data meet the preset operation data conditions, the control equipment operates according to the predicted behavior data. When the predicted behavior data do not meet the preset operation data conditions, fault prediction prompt information is output; and/or stopping the equipment when the predicted behavior data does not meet the preset operation data condition.

The preset operation data conditions include component operation data conditions set for different components. For example, for a preset rotation speed range set for the motor, when the rotation speed of the motor exceeds the preset rotation speed range, fault prompt information of the motor is output; and aiming at the preset temperature range set by the bulb tube, when the temperature in the temperature rising process of the bulb tube exceeds the preset temperature range, outputting the fault prompt information of the bulb tube.

Based on the setting of the operation, the equipment has a fault prediction function and only operates under normal prediction operation data, so that safe use is realized.

The execution sequence of each step in the flow shown in fig. 1 to 3 is not limited to the sequence in the flow chart. Furthermore, the description of each step may be implemented in software, hardware or a combination thereof, for example, a person skilled in the art may implement it in the form of software code, and may be a computer executable instruction capable of implementing the corresponding logical function of the step. When implemented in software, the executable instructions may be stored in a memory and executed by a processor in the system.

Corresponding to the embodiment of the equipment operation control method, the application also provides an equipment operation control method device and an equipment embodiment.

Referring to fig. 6, a block diagram of an embodiment of an apparatus for controlling device operation according to the present application may include: the device comprises an acquisition module 21, a determination module 22, an acquisition module 23, a prediction module 24 and a control module 25; wherein the content of the first and second substances,

the obtaining module 21 is configured to obtain a device operation instruction;

the determining module 22 is configured to determine a target operation state that the device needs to reach according to the device operation instruction by using a neural network;

the acquisition module 23 is configured to acquire multi-modal data capable of reflecting the running state of the device at the time t after the device enters the time t;

the prediction module 24 is configured to predict a running trajectory of the device in a subsequent running period including the time t based on the multi-modal data and the target running state, so as to obtain a predicted running trajectory;

the control module 25 is configured to control the device to operate according to the predicted operation trajectory.

Referring to fig. 7, for purposes of one embodiment of the apparatus of the present application, the apparatus may comprise: a memory 320, a processor 330, and an external interface 340 connected by an internal bus 310.

The external interface 340 is configured to obtain data;

a memory 320 for storing machine readable instructions corresponding to device operational control;

a processor 330 configured to read the machine-readable instructions on the memory 320 and execute the instructions to:

acquiring an equipment operation instruction;

determining a target operation state which needs to be reached by the equipment according to the equipment operation instruction by utilizing a neural network;

acquiring multi-mode data capable of reflecting the running state of the equipment at the time t after the equipment enters the time t;

and controlling the equipment to operate according to the predicted operation track.

In the embodiments of the present application, the computer readable storage medium may be in various forms, such as, in different examples: a RAM (random Access Memory), a volatile Memory, a non-volatile Memory, a flash Memory, a storage drive (e.g., a hard drive), a solid state drive, any type of storage disk (e.g., an optical disk, a dvd, etc.), or similar storage medium, or a combination thereof. In particular, the computer readable medium may be paper or another suitable medium upon which the program is printed. Using these media, the programs can be electronically captured (e.g., optically scanned), compiled, interpreted, and processed in a suitable manner, and then stored in a computer medium.

The above description is only exemplary of the present application and should not be taken as limiting the present application, as any modification, equivalent replacement, or improvement made within the spirit and principle of the present application should be included in the scope of protection of the present application.

16页详细技术资料下载

上一篇：一种医用注射器针头装配设备

下一篇：数据处理器和数据处理方法

Equipment operation control method, device and equipment

相关技术

网友询问留言