Video processing method and terminal

文档序号：1889362 发布日期：2021-11-26 浏览：10次中文

阅读说明：本技术 一种视频处理方法及终端 (Video processing method and terminal ) 是由林宝勇郭华陈金林邰锋刘朝强于 2021-08-09 设计创作，主要内容包括：本发明公开一种视频处理方法及终端,在视频录制过程中判断是否接收到预设标签,若是,则获取所述预设标签对应的描述信息；根据所述预设标签以及所述描述信息对所述视频进行打点,生成打点后的视频；将所述预设标签以及所述预设标签对应的描述信息保存至预设视频内容检索库,提高了视频打点效率的同时,保证了视频打点的准确度,同时,将预设标签以及所述预设标签对应的描述信息保存至预设视频内容检索库,预设标签和描述信息能够标记并描述对应的视频内容,当视频录制完成后,与其对应的视频内容检索信息也已建立完成,提高了建立预设视频内容检索库的效率,从而在提高视频处理效率的同时,保证视频处理的准确度。(The invention discloses a video processing method and a terminal, wherein whether a preset label is received or not is judged in the video recording process, and if yes, description information corresponding to the preset label is obtained; dotting the video according to the preset label and the description information to generate a dotted video; the method comprises the steps of storing the preset label and the description information corresponding to the preset label to a preset video content retrieval library, ensuring the accuracy of video dotting while improving the efficiency of video dotting, storing the preset label and the description information corresponding to the preset label to the preset video content retrieval library, marking and describing the corresponding video content by the preset label and the description information, and improving the efficiency of establishing the preset video content retrieval library after the video recording is completed, thereby ensuring the accuracy of video processing while improving the efficiency of video processing.)

1. A video processing method, comprising:

judging whether a preset label is received or not in the video recording process, and if so, acquiring description information corresponding to the preset label;

dotting the video according to the preset label and the description information to generate a dotted video;

and storing the preset label and the description information corresponding to the preset label to a preset video content retrieval library.

2. The video processing method according to claim 1, wherein the preset tag comprises a preset voice tag;

judging whether a preset label is received or not in the video recording process, if so, acquiring description information corresponding to the preset label, wherein the description information comprises:

acquiring audio data of a video in real time in a video recording process;

and judging whether a preset voice tag exists in the audio data through voice recognition, and if so, acquiring description information corresponding to the preset voice tag.

3. The video processing method according to claim 2, wherein the obtaining the description information corresponding to the preset voice tag comprises:

acquiring initial content information and time point information corresponding to the preset voice tag;

judging whether the initial content information has a preset ending mark within preset time from the time point information, if so, determining the initial content information as final content information, otherwise, judging whether the initial content information accords with a preset rule corresponding to the preset voice tag, if so, determining the initial content information as final content information, and if not, sending a content information determining instruction;

receiving modified content information corresponding to the determined content information instruction;

determining the modified content information as final content information;

and determining the final content information as the description information corresponding to the preset voice tag.

4. The video processing method according to claim 3, wherein the dotting the video according to the preset tag and the description information, and the generating the dotted video comprises:

determining a key frame corresponding to the time point information in the video according to the time point information;

and dotting the key frames according to the preset voice tags and the final content information to generate a video after dotting.

5. The video processing method according to claim 3, wherein the storing the preset tag and the description information corresponding to the preset tag in a preset video content search library comprises:

obtaining a preset hierarchical relationship corresponding to the preset voice tag according to the preset voice tag;

and storing the preset voice tag, and the final content information and the time point information corresponding to the preset voice tag to a preset video content retrieval library according to the preset hierarchical relationship based on a key value storage mode.

6. The video processing method according to claim 3, wherein after storing the preset tag and the description information corresponding to the preset tag in a preset video content search library, the method comprises:

and generating a plurality of data tables based on the preset voice tag in the preset video content search library and the final content information, and establishing an association relation between the plurality of data tables and the preset video content search library.

7. The video processing method according to claim 6, wherein said establishing the association relationship between the plurality of data tables and the video content search library comprises:

receiving a video content retrieval request, wherein the video content retrieval request comprises retrieval keywords;

determining a retrieval result corresponding to the retrieval key word from the plurality of data tables according to the retrieval key word;

determining a target preset voice tag, target final content information and target time point information corresponding to the retrieval result according to the preset video content retrieval library;

and acquiring a video clip corresponding to the target preset voice tag, the target final content information and the target time point information from the video.

8. The method according to claim 7, wherein said obtaining a video segment corresponding to the target preset voice tag, the target final content information and the target time point information from the video comprises:

and judging whether a plurality of video clips exist, if so, splicing and displaying the plurality of video clips, and if not, displaying the video clips.

9. The video processing method according to claim 6, wherein said splicing and presenting the plurality of video segments comprises:

and acquiring and displaying the video in sequence according to the time sequence of the video clips.

10. A video processing terminal comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of a video processing method according to any of claims 1 to 9 when executing the computer program.

Technical Field

The present invention relates to the field of video processing technologies, and in particular, to a video processing method and a terminal.

Background

With the rapid development of multimedia technology and networks, multimedia information such as images and videos has not been increased at an unprecedented speed, wherein the videos are important expression forms of the multimedia information and comprise rich contents and semantics. How people describe video contents correctly and find needed contents quickly and effectively becomes an increasingly important problem.

In the prior art, video data is generally analyzed, features representing video content are extracted, the video content is accurately described, and then retrieval is performed based on the described video content. The following methods are commonly used:

1. and a manual marking method is adopted, namely, the video is manually checked, the time and the occurrence event are recorded, then the content information is manually edited, the time point is marked, and then the retrieval is carried out according to the manually increased content information and the time point.

2. The audio information is extracted from the video, then the audio information is converted into characters, different paragraphs and paragraph titles are formed through arrangement of the characters, meanwhile, the association relation between the different paragraphs and the time line is formed, dotting processing is carried out in the video according to the paragraph titles, and therefore positioning and retrieval can be carried out according to the dotting information and the character information.

3. Analyzing the video content, and detecting a video image and an audio frame by using a preset detection algorithm to acquire face information, character information, pattern information and sound information of the video content; acquiring face similar information, character similar information, pattern similar information and sound similar information; generating a content retrieval report according to the face similar information, the character similar information, the pattern similar information and the sound similar information; and searching the face information, the character information, the pattern information and the sound information in the video content.

According to the three methods, the content retrieval information for establishing the video is extracted, analyzed and established by adopting a manual or computer automatic method for the content of the existing video; the manual labeling method has high precision but low efficiency, and the method for extracting audio and video contents for analysis has improved efficiency compared with the manual labeling method, but has larger error and avoids manual participation.

Disclosure of Invention

The technical problem to be solved by the invention is as follows: the video processing method and the terminal can improve the video processing efficiency and ensure the accuracy of video processing.

In order to solve the technical problems, the invention adopts a technical scheme that:

a video processing method, comprising:

judging whether a preset label is received or not in the video recording process, and if so, acquiring description information corresponding to the preset label;

dotting the video according to the preset label and the description information to generate a dotted video;

and storing the preset label and the description information corresponding to the preset label to a preset video content retrieval library.

In order to solve the technical problem, the invention adopts another technical scheme as follows:

a video processing terminal comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the following steps when executing the computer program:

judging whether a preset label is received or not in the video recording process, and if so, acquiring description information corresponding to the preset label;

dotting the video according to the preset label and the description information to generate a dotted video;

and storing the preset label and the description information corresponding to the preset label to a preset video content retrieval library.

The invention has the beneficial effects that: unlike the prior art, the dotting processing is not performed on the recorded video, but in the video recording process, when a preset label is received, the video is dotted according to the preset label and the description information, the dotting efficiency of the video is improved, the dotting accuracy of the video is guaranteed, meanwhile, the preset label and the description information corresponding to the preset label are stored in a preset video content retrieval library, the preset label and the description information can mark and describe corresponding video content, after the video recording is completed, the video content retrieval information corresponding to the preset label is established, the efficiency of establishing the preset video content retrieval library is improved, and the video processing accuracy is guaranteed while the video processing efficiency is improved.

Drawings

FIG. 1 is a flowchart illustrating steps of a video processing method according to an embodiment of the present invention;

fig. 2 is a schematic structural diagram of a video processing terminal according to an embodiment of the present invention;

FIG. 3 is a flowchart illustrating a video processing method according to an embodiment of the invention;

fig. 4 is a schematic storage diagram of a preset video content search library in the video processing method according to the embodiment of the present invention;

fig. 5 is a schematic diagram of a plurality of data tables and a predetermined video content search library in the video processing method according to the embodiment of the invention.

Detailed Description

In order to explain technical contents, achieved objects, and effects of the present invention in detail, the following description is made with reference to the accompanying drawings in combination with the embodiments.

Referring to fig. 1, an embodiment of the present invention provides a video processing method, including:

judging whether a preset label is received or not in the video recording process, and if so, acquiring description information corresponding to the preset label;

dotting the video according to the preset label and the description information to generate a dotted video;

and storing the preset label and the description information corresponding to the preset label to a preset video content retrieval library.

From the above description, the beneficial effects of the present invention are: unlike the prior art, the dotting processing is not performed on the recorded video, but in the video recording process, when a preset label is received, the video is dotted according to the preset label and the description information, the dotting efficiency of the video is improved, the dotting accuracy of the video is guaranteed, meanwhile, the preset label and the description information corresponding to the preset label are stored in a preset video content retrieval library, the preset label and the description information can mark and describe corresponding video content, after the video recording is completed, the video content retrieval information corresponding to the preset label is established, the efficiency of establishing the preset video content retrieval library is improved, and the video processing accuracy is guaranteed while the video processing efficiency is improved.

Further, the preset tag comprises a preset voice tag;

acquiring audio data of a video in real time in a video recording process;

and judging whether a preset voice tag exists in the audio data through voice recognition, and if so, acquiring description information corresponding to the preset voice tag.

According to the description, the preset labels comprise the preset voice labels, in the video recording process, a user can directly dictate the preset voice labels and the corresponding description information, when the voice identification is used for judging that the preset voice labels exist in the audio data, the corresponding description information can be automatically acquired, the preset voice labels and the description information can describe the information needing to be dotted in the video most intuitively, the video dotting efficiency can be improved, and therefore the video processing efficiency is improved.

Further, the obtaining of the description information corresponding to the preset voice tag includes:

acquiring initial content information and time point information corresponding to the preset voice tag;

receiving modified content information corresponding to the determined content information instruction;

determining the modified content information as final content information;

and determining the final content information as the description information corresponding to the preset voice tag.

According to the description, the initial content information and the time point information corresponding to the preset voice tag are obtained, when the initial content information does not have the preset ending mark in the preset time from the time point information and the initial content information does not accord with the preset rule, it is indicated that a plurality of content information are input by the user voice in one preset voice tag or the content information is obviously wrong, a content information determining instruction needs to be sent for the user to determine the content information, the modified content information is determined as the final content information, the video dotting accuracy is further improved, and meanwhile, the reliability of subsequently established video content retrieval information is also guaranteed.

Further, the dotting the video according to the preset tag and the description information, and the generating the dotted video includes:

determining a key frame corresponding to the time point information in the video according to the time point information;

and dotting the key frames according to the preset voice tags and the final content information to generate a video after dotting.

According to the description, the key frame corresponding to the time point information in the video is determined according to the time point information, the key frame is dotted according to the preset voice tag and the final content information, the video after dotting is generated, the key element of the dotting information is directly formed in the video recording process, the video can be dotted in real time, the video dotting efficiency is improved, and the video processing efficiency is further improved.

Further, the storing the preset tag and the description information corresponding to the preset tag to a preset video content search library includes:

obtaining a preset hierarchical relationship corresponding to the preset voice tag according to the preset voice tag;

According to the description, the preset voice tag, the final content information and the time point information are stored in the preset video content retrieval library according to the preset hierarchical relationship based on the key value storage mode, so that the preset video content retrieval library forms a directory tree structure, and the key value storage mode is adopted to store data, so that distributed storage and retrieval can be supported, and efficient retrieval is facilitated.

Further, after the storing the preset tag and the description information corresponding to the preset tag to a preset video content search library, the method includes:

According to the description, the plurality of data tables are generated based on the preset voice tags in the preset video content search library and the final content information, the association relation between the plurality of data tables and the video content search library is established, during subsequent search, the search result can be obtained based on the plurality of data tables, and then the video content is searched from the preset video content search library, so that the video search efficiency is improved.

Further, the establishing the association relationship between the plurality of data tables and the video content search library comprises:

receiving a video content retrieval request, wherein the video content retrieval request comprises retrieval keywords;

determining a retrieval result corresponding to the retrieval key word from the plurality of data tables according to the retrieval key word;

and acquiring a video clip corresponding to the target preset voice tag, the target final content information and the target time point information from the video.

According to the above description, the retrieval result is determined from the plurality of data tables according to the retrieval keyword, the target preset voice tag, the target final content information and the target time point information corresponding to the retrieval result are determined from the preset video content retrieval library, and then the corresponding video clip is obtained from the video.

Further, after acquiring the video segment corresponding to the target preset voice tag, the target final content information and the target time point information from the video, the method includes:

and judging whether a plurality of video clips exist, if so, splicing and displaying the plurality of video clips, and if not, displaying the video clips.

Further, the splicing and displaying the plurality of video segments comprises:

and acquiring and displaying the video in sequence according to the time sequence of the video clips.

According to the above description, after the plurality of video segments are obtained through retrieval, the plurality of video segments are directly displayed according to the time sequence, the original video is not segmented or integrated, and the spliced plurality of videos are not stored, so that the dynamic splicing display of the videos is realized, and the storage space is saved.

Referring to fig. 2, another embodiment of the present invention provides a video processing terminal, which includes a memory, a processor, and a computer program stored in the memory and running on the processor, wherein the processor implements the steps of the video processing method when executing the computer program.

The video processing method and the terminal of the present invention can be applied to videos that require dotting and content retrieval, and are described below by specific embodiments:

example one

Referring to fig. 1 and 3, a video processing method of the present embodiment includes:

s1, judging whether a preset label is received or not in the video recording process, and if so, acquiring description information corresponding to the preset label;

specifically, the preset voice tag comprises a preset voice tag, the preset voice tag can be set according to actual needs, for example, in a fire-fighting facility maintenance inspection scene, a video file needs to be recorded in the maintenance or inspection process to keep a work trace, and the video file is used for confirming work after the fact and analyzing and positioning problems, and the preset voice tag can be set to be a project number, a project name, a detection position, a detection person, a detection detailed position, a fire-fighting facility number, a fire-fighting facility name and the like;

acquiring audio data of a video in real time in a video recording process;

judging whether a preset voice tag exists in the audio data through voice recognition, and if so, acquiring description information corresponding to the preset voice tag;

as shown in fig. 3, in the video recording process, a video can be output to the streaming media processing engine in real time, and meanwhile, the streaming media processing engine shunts audio data to the voice recognition engine, and judges whether a preset voice tag exists in the audio data through voice recognition;

in another optional embodiment, the preset tag further includes a preset key tag;

when the preset label is a preset key label, judging whether the preset key label exists in the video in real time in the video recording process, and if so, acquiring description information corresponding to the preset key label;

keys can be arranged on a screen or a terminal main body of the mobile terminal, corresponding preset key labels are arranged for the keys, and when a user needs to perform dotting in the process of recording a video, the user can touch the keys so as to trigger the preset key labels corresponding to the keys;

specifically, the obtaining of the description information corresponding to the preset voice tag includes:

acquiring initial content information and time point information corresponding to the preset voice tag;

receiving modified content information corresponding to the determined content information instruction;

determining the modified content information as final content information;

determining the final content information as description information corresponding to the preset voice tag;

the time point information is a video time point when a preset voice tag appears, and the preset time can be set according to actual needs;

the preset end mark can be set according to actual needs, such as 'end', 'OVER' or 'finish' of voice input;

the preset rule corresponding to the preset voice tag can also be set according to actual needs, for example, when the preset voice tag is a fire-fighting equipment number, and the fire-fighting equipment numbers are all 4 digits, the preset rule corresponding to the preset voice tag is that the input information must be 4 digits;

for example, the preset time is 20 seconds, the preset end mark is "end", whether the initial content information has the "end" mark within 20 seconds from the time point information is judged, if yes, the initial content information is determined as final content information, otherwise, whether the initial content information meets a preset rule corresponding to a preset voice tag is judged, if yes, the initial content information is determined as final content information, and if not, a content information determining instruction is sent, and the content information determining instruction can remind a user to determine the content information in a voice output or text output mode;

in another optional implementation, a part of key preset voice tags which are easy to make mistakes can be marked as key tags, when the preset voice tags marked as the key tags are detected, a content information determining instruction is directly sent, confirmed content information corresponding to the content information determining instruction is received, whether the confirmed content information is consistent with initial content information or not is judged, if yes, the initial content information is determined as final content information, and if not, the confirmed content information is determined as final content information;

s2, dotting the video according to the preset label and the description information, and generating a dotted video;

and S3, storing the preset label and the description information corresponding to the preset label to a preset video content retrieval library.

Example two

Referring to fig. 3 and 4, the present embodiment further defines how to dotte the video and establish video content retrieval information on the basis of the first embodiment, and specifically includes:

the S2 includes:

determining a key frame corresponding to the time point information in the video according to the time point information;

dotting the key frames according to the preset voice tags and the final content information to generate a video after dotting, as shown in fig. 3;

specifically, after the dotted video is generated, corresponding dotting information on a specific key frame can be seen on a progress bar during video playing, and the dotting information can be a preset voice tag and final content information or only final content information;

the S3 includes:

obtaining a preset hierarchical relationship corresponding to the preset voice tag according to the preset voice tag;

storing the preset voice tag, and the final content information and the time point information corresponding to the preset voice tag to a preset video content retrieval library according to the preset hierarchical relationship based on a key value storage mode;

the preset hierarchical relationship can be set according to actual needs, and in a fire protection facility maintenance inspection scene, the preset hierarchical relationship can be divided from the project perspective, so that the project is divided into which stages, the work content in the specific stages and the like; dividing from the geographic position, such as which building, which floor, which fire-proof area and the like; dividing according to the time dimension, such as the work content of the last week, the work content of the current week, the work content of the next week and the like; the fire-fighting facilities are divided into levels, such as ventilation and smoke exhaust, fire-fighting water supply, smoke sensing and the like, and the fire-fighting water supply system comprises a fire-fighting water pump, a fire-fighting pipeline, an outdoor fire hydrant, an indoor fire hydrant and the like;

for example, the preset voice tag is a detection position, a detection person, a detection detailed position, a fire fighting equipment number and a fire fighting equipment name, the corresponding final content information is a, zhang san, a first layer, 1234, a fire hydrant, the time point information is 1 minute 03 second, 1 minute 05 second, 1 minute 07 second, 1 minute 10 second and 1 minute 14 second, then the division is performed by geographic positions, the corresponding preset hierarchical relationship can be set that the detection person and the detection position are in the same level, the detection detailed position is the next level of the detection person and the detection position, and the fire fighting equipment number and the fire fighting equipment name are the next level of the detection detailed position, as shown in fig. 4;

the storage method uses a Key Value storage method, i.e., Key-Value, for example, the detected location is used as Key, the corresponding Value is (a, 1 min 03 sec), the detected detailed location is used as Key, and the corresponding Value is (first layer, 1 min 07 sec).

EXAMPLE III

Referring to fig. 4 and 5, the embodiment further defines how to perform video content retrieval based on the preset video content retrieval library on the basis of the first or second embodiment, specifically:

s4, generating a plurality of data tables based on the preset voice tags in the preset video content search library and the final content information, and establishing an association relation between the data tables and the preset video content search library;

assuming that the preset video content search library is shown in fig. 4, a plurality of data tables are generated based on the preset voice tag and the final content information in the preset video content search library, such as a key unit data table, a building information data table, a fire protection facility state data table, a fire protection detection project data table, a fire protection detection task data table, and the like, and the data tables may be stored in XML, JSON, YML, TXT, or other formats, and the association relationship between the data tables and the preset video content search library is established, as shown in fig. 5;

the key unit data table can store relevant information of key units, including information of geographic positions, detection persons and the like; the building information data table can store the building information managed by key units, such as building, floor, fire-proof area and other information; the fire-fighting facility data table can store the fire-fighting systems, the facilities and the equipment conditions of different buildings, floors or fire-proof areas in a specific building, such as the number, the name, the model, the existing positions, the detection persons, the installation time and other information of the fire-fighting water supply system, the facilities and the equipment; the fire fighting equipment state data table can store the state information of each detection of specific fire fighting equipment; the fire-fighting detection project data table can store project information of fire-fighting detection performed by key units, such as project numbers, project names and the like; the fire detection task data table can store detailed tasks set in specific projects, such as task conditions for detecting certain fire-fighting facilities in a certain building, a certain floor and a certain fire-fighting area, including detection execution time, detection people, detection results, abnormal condition description and the like;

s5, receiving a video content retrieval request, wherein the video content retrieval request comprises a retrieval keyword;

for example, the fire department needs to retrieve: in the fire fighting annual detection carried out by key fire fighting units in the southern mountainous area from 4 months to 6 months in 2021, a detection record list of fire fighting water supply system abnormity, detection video content, a subsequent processing method, a re-detection result, re-detection video content, the running state conditions of the water supply systems up to now, the contact way of a responsible person who leaves problems and the like;

in this search requirement, three searches are included: the first search is a list of water supply problems in the fire detection of the southern mountain area in 4-6 months; the second search is for a record of subsequent detections in a problematic list; the third search is the current status in these lists and the associated responsible person; the second and third retrieval needs to extract relevant information from the first retrieval record for retrieval again;

s6, determining a retrieval result corresponding to the retrieval key word from the plurality of data tables according to the retrieval key word;

s7, determining a target preset voice tag, target final content information and target time point information corresponding to the retrieval result according to the preset video content retrieval library;

s8, acquiring video clips corresponding to the target preset voice tag, the target final content information and the target time point information from the video;

specifically, for the first retrieval, the first retrieval is refined into a retrieval, b retrieval, c retrieval and d retrieval, and for the a retrieval, a key unit information list of which the geographic position comprises a key Shenzhen Nanshan region is obtained from a key unit data table and comprises information such as a unit name, a geographic position and a detector; the search formula is: select unit name, geographic location, detecting human from focus unit where geographic location like '% Shenzhen City southern mountain area';

for the retrieval of b, acquiring item information of which the detection time is between 4 months and 6 months in 2021 and the detection type is fire fighting annual detection in a fire fighting detection item data table according to the retrieved unit information of a; the search formula is: a unit name, a geographic location, a fire principal, b.an item name from a unit of emphasis a inner join fire detection item b ON a.a.unit name b.a.geographic location like% nanshan district city shenzhen and b.a detection type of "fire year detection" and b.a detection start time > '2021-04-01' and b.a detection end time < '2021-07-01';

for the retrieval of c, acquiring task information of which the fire fighting system is a fire fighting water supply system and the detection result is abnormal in a fire fighting detection task data table according to the item information retrieved in the step b; the search formula is: a, unit name, a, geographical location, a, fire-fighting principal, b, item name, c, detection task, c, fire-fighting equipment name from (unit of emphasis a inner join fire-fighting detection item b ON a. unit name b. unit name) inner join fire-fighting detection task c where a, geographical location like% southern mountain area% and b, detection type ═ fire-fighting age detection "and b, detection start time > '2021-04-01' and b, detection end time [ '2021-07-01' and c, item name b, item name and c, fire-fighting system ═ fire-fighting water supply system 'and c, detection result ═ anomaly';

for the d retrieval, according to the task information of the c retrieval, video information corresponding to the tasks is obtained in a preset video content retrieval library, wherein the video information comprises time point information for detecting the occurrence and the end of abnormal points and the like; the search formula is: a unit name, a geographic location, a fire-fighting responsible person, b item name, c detection task, c fire-fighting equipment name, d video name, d abnormality start time, d abnormality end time from ((emphasis unit a inner join fire-fighting detection item b ON a unit name b unit name) inner join fire-fighting detection task c) inner join fire-fighting detection video d where a geographic location like 'southern mountain area%' and b.

For the second retrieval, acquiring the c-equipment name, the b-item name and the c-detection end time retrieved in the c-retrieval of the first retrieval; inquiring information such as task names and detection results, wherein the task names are that the item names are 1c.b. item names, the equipment names are 1c.c. equipment names, and the detection end time is larger than the 1c.c. detection end time in a fire protection detection task data table; according to the retrieval result of the second retrieval b, inquiring a corresponding video segment in a preset video content retrieval library;

for the third retrieval, acquiring a key unit and a device name retrieved in the first retrieval; inquiring the state of the equipment, the equipment name of which is 1c.c. and the state updating time of which is the latest, in a fire fighting equipment state data table; inquiring the name and the contact way of the fire safety responsible person of the unit with the key unit of 1c.a. key unit in the key unit data table;

the text information in the retrieval results of the first retrieval, the second retrieval and the third retrieval can be integrated together and displayed in a list mode;

s9, judging whether a plurality of video clips exist, if so, splicing and displaying the plurality of video clips, and if not, displaying the video clips;

specifically, the splicing and displaying the plurality of video segments includes:

after a plurality of video clips are obtained, displaying the video clips according to a time sequence;

the method comprises the steps of directly obtaining a plurality of video clips from storage positions of the video clips, displaying the video clips according to a time sequence, and not separately storing the video clips.

Example four

Referring to fig. 2, a video processing terminal includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor executes the computer program to implement the steps of the video processing method in the first embodiment, the second embodiment, or the third embodiment.

To sum up, in the video recording process, the video processing method and the terminal provided by the invention acquire the audio data of the video in real time, when the audio data is judged to have the preset voice tag through voice recognition, the initial content information and the time point information corresponding to the preset voice tag are acquired, when the initial content information has no preset ending mark in the preset time from the time point information and the initial content information does not accord with the preset rule, a content information determining instruction is sent for a user to determine the content information, the modified content information is determined as the final content information, the video dotting accuracy is further improved, the user can directly input the preset voice tag and the corresponding description information through voice, the corresponding description information can be automatically acquired through voice recognition, and the preset voice tag and the description information can most intuitively describe the information required to be dotted in the video, the video dotting efficiency can be improved; determining a key frame corresponding to the time point information in the video according to the time point information, dotting the key frame according to a preset voice tag and final content information to generate a dotted video, directly forming key elements of the dotting information in the video recording process, and dotting the video in real time; the preset voice tag and the description information corresponding to the preset voice tag are stored in the preset video content search library, so that the efficiency of establishing the preset video content search library is improved; the method comprises the steps of determining retrieval results corresponding to retrieval keywords from a plurality of data tables according to the retrieval keywords, determining target preset voice tags, target final content information and target time point information corresponding to the retrieval results according to a preset video content retrieval library, and acquiring corresponding video clips from videos, so that the method is suitable for simple video content retrieval and complex video content retrieval, and efficient video content retrieval is realized; after the plurality of video clips are obtained through retrieval, the plurality of video clips are directly displayed according to a time sequence, the original video is not segmented or integrated, and the spliced videos are not stored, so that the dynamic splicing display of the videos is realized, the storage space is saved, and the accuracy of video processing is ensured while the video processing efficiency is improved.

The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent changes made by using the contents of the present specification and the drawings, or applied directly or indirectly to the related technical fields, are included in the scope of the present invention.

15页详细技术资料下载

Video processing method and terminal

相关技术

网友询问留言