Divorce dispute referee document label extraction method and device

文档序号:1043316 发布日期:2020-10-09 浏览:8次 中文

阅读说明:本技术 离婚纠纷裁判文书标签提取方法及装置 (Divorce dispute referee document label extraction method and device ) 是由 刘大双 晋耀红 李德彦 张志一 于 2020-06-12 设计创作,主要内容包括:本申请公开了离婚纠纷裁判文书标签提取方法及装置,所述方法包括:所述方法包括:首先对获取到的离婚纠纷裁判文书进行结构化,生成至少一个数据对象集合,数据对象集合包括至少一个数据对象,所述数据对象中封装有所述离婚纠纷裁判文书的文书要素;然后从预设标签体系中获取待提取的目标文书标签的父标签,并确定父标签对应的数据对象集合;从父标签对应的数据对象集合中提取目标文书标签。本申请方法实现了对离婚纠纷裁判文书多层级、多维度的标签提取,为文书内容标注、案件检索和查询提供帮助。(The application discloses a divorce officer document label extraction method and a divorce officer document label extraction device, wherein the method comprises the following steps: the method comprises the following steps: firstly, structuring an acquired divorce officer document to generate at least one data object set, wherein the data object set comprises at least one data object, and document elements of the divorce officer document are packaged in the data object; then, acquiring a parent tag of a target document tag to be extracted from a preset tag system, and determining a data object set corresponding to the parent tag; and extracting the target document label from the data object set corresponding to the parent label. The method realizes multi-level and multi-dimensional label extraction of the divorce referee document, and provides help for document content marking, case retrieval and query.)

1. A divorce officer document label extraction method is characterized by comprising the following steps:

acquiring a divorce referee document;

structuring the divorce officer document to generate at least one data object set, wherein the data object set comprises at least one data object, and document elements of the divorce officer document are packaged in the data object;

acquiring a parent tag from a preset tag system of a divorce referee document, and determining a data object set corresponding to the parent tag, wherein the parent tag is a parent tag of a target document tag to be extracted;

and extracting the target document label from the data object set corresponding to the parent label.

2. The method of claim 1, wherein the structuring of the divorce officials' paperwork to generate at least one set of data objects comprises:

according to a preset referee document directory structure, cutting the divorce referee document into a plurality of text blocks, wherein the referee document directory structure comprises a plurality of directory titles, and each text block corresponds to one directory title;

selecting a text block corresponding to the specified element to be extracted from the plurality of text blocks according to the directory title corresponding to the text block;

extracting an element value of the specified element from the text block corresponding to the specified element by using an element tree corresponding to the specified element, wherein the element tree comprises an element node corresponding to each specified element and an extraction rule of the element node;

and generating a data object and a data object set comprising the data object by taking the element node name corresponding to the specified element as a field name and taking the element value of the specified element as a field value.

3. The method of claim 1 or 2, wherein said extracting the target document tag from the set of data objects corresponding to the parent tag comprises:

extracting the field value in the data object;

and splicing the extracted field values with corresponding parent labels respectively to obtain the target document label.

4. The method of claim 1 or 2, wherein said extracting the target document tag from the set of data objects corresponding to the parent tag comprises:

extracting the field name in the data object;

and splicing the extracted field names with corresponding parent labels respectively to obtain the target document label.

5. The method according to claim 4, wherein the element node corresponding to the specified element comprises at least two sub-nodes, one of the sub-nodes corresponds to a class label of one of the specified elements, a field name in the data object is a sub-node name, a field value corresponding to the sub-node name is a first predetermined value or a second predetermined value, the first predetermined value indicates that the class label corresponding to the sub-node name is hit, and the second predetermined value indicates that the class label corresponding to the sub-node name is not hit;

the extracting the field names in the data objects, and splicing the extracted field names with corresponding father labels respectively to obtain the target document labels, including:

extracting the names of child nodes with the field values of the first preset value in the data objects;

and splicing the extracted child node name and the corresponding parent label to obtain the target document label.

6. The method of claim 1 or 2, wherein extracting the target document tag from the set of data objects corresponding to the parent tag comprises:

extracting field values in the data objects;

acquiring a category label hit by the field value according to a preset classification rule;

and splicing the category label hit by the field value with the corresponding father label to obtain a target document label.

7. The method of claim 1, wherein said extracting the target document tag from the set of data objects corresponding to the parent tag comprises:

judging whether a field value corresponding to the field name in the data object is empty;

when the field value of the field name in the data object is null, acquiring a preset label of the field name;

and splicing the preset label with the corresponding father label to obtain the target document label.

8. The method of claim 1, wherein after extracting the target document tag from the set of data objects corresponding to the parent tag, the method further comprises:

and acquiring the positioning information of the target document label and storing the positioning information.

9. The method of claim 8, wherein the location information of the target document tag is:

the field value contained in the target document label is positioning information in the divorce officer document;

or, the element value corresponding to the element node name included in the target document label is positioning information in the divorce officer document;

or, the element value of the category label corresponding to the child node name in the target document label is hit, and the positioning information in the divorce officer document is obtained.

10. A divorce officer's document label extraction element, its characterized in that, the device includes:

the document acquisition module is used for acquiring divorce dispute referee documents;

the document structuring module is used for structuring the divorce officer document to generate at least one data object set, wherein the data object set comprises at least one data object, and document elements of the divorce officer document are packaged in the data object;

the parent tag acquisition module is used for acquiring a parent tag from a preset tag system of the divorce officer document and determining a data object set corresponding to the parent tag, wherein the parent tag is a parent tag of a target document tag to be extracted;

and the document tag extraction module is used for extracting the target document tag from the data object set corresponding to the parent tag.

Technical Field

The application relates to the technical field of text processing, in particular to a divorce officer document label extraction method and device.

Background

The referee document is a carrier for recording the result of litigation activities such as the trial process and the result of the people's court, and is also a unique certificate for the people's court to determine and distribute the entity right obligation of the party. Official documents usually have a regular structural framework and writing format, which may be slightly different for different types of official documents. Common types of documents include civil referee documents (e.g., civil adjudication documents), criminal referee documents (e.g., criminal adjudication documents), administrative referee documents (e.g., administrative adjudication documents), and other general litigation documents, among others.

Since the official documents are described with important information such as the trial and error process and the decision result, which has important values for analysis and attention, for example, performing a case analysis, a case search, etc. based on the information, extracting valuable information (e.g., document elements, document labels) from the official documents is a basic requirement of practitioners in the relevant fields.

The divorce officer document is one of civil officials documents, and how to extract document labels from the divorce officials document in order to comprehensively understand the essential features of the divorce officials document becomes a technical problem to be urgently solved by technical personnel in the field.

Disclosure of Invention

The application provides a divorce officer document label extraction method and device, which are used for solving the problem of how to extract document labels from divorce officer documents.

In a first aspect, the present application provides a divorce officer document label extraction method, including:

acquiring a divorce referee document;

structuring the divorce officer document to generate at least one data object set, wherein the data object set comprises at least one data object, and document elements of the divorce officer document are packaged in the data object;

acquiring a parent tag from a preset tag system of a divorce referee document, and determining a data object set corresponding to the parent tag, wherein the parent tag is a parent tag of a target document tag to be extracted;

and extracting the target document label from the data object set corresponding to the parent label.

In a second aspect, the present application further provides a divorce officer's document label extraction device, the device includes:

the document acquisition module is used for acquiring divorce dispute referee documents;

the document structuring module is used for structuring the divorce officer document to generate at least one data object set, wherein the data object set comprises at least one data object, and document elements of the divorce officer document are packaged in the data object;

the parent tag acquisition module is used for acquiring a parent tag from a preset tag system of the divorce officer document and determining a data object set corresponding to the parent tag, wherein the parent tag is a parent tag of a target document tag to be extracted;

and the document tag extraction module is used for extracting the target document tag from the data object set corresponding to the parent tag.

According to the technical scheme, the application provides a divorce officer document label extraction method and device, and the method comprises the following steps: firstly, structuring an acquired divorce officer document to generate at least one data object set, wherein the data object set comprises at least one data object, and document elements of the divorce officer document are packaged in the data object; then, acquiring a parent tag of a target document tag to be extracted from a preset tag system, and determining a data object set corresponding to the parent tag; and extracting the target document label from the data object set corresponding to the parent label. The method realizes multi-level and multi-dimensional label extraction of the divorce referee document, and provides help for document content marking, case retrieval and query.

Drawings

In order to more clearly explain the technical solution of the present application, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious to those skilled in the art that other drawings can be obtained according to the drawings without any creative effort.

FIG. 1 is a flow chart illustrating a method for extracting divorce officer's document elements according to an exemplary embodiment of the present application;

FIG. 2 is a flow chart of a method for extracting labels of documents from a divorce officer according to an exemplary embodiment of the present application;

fig. 3 is a block diagram of a divorce officer's document tag extraction apparatus according to an exemplary embodiment of the present application.

Detailed Description

In order to make those skilled in the art better understand the technical solutions in the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

In the judicial field, a referee document is a special document for recording the result of litigation activities such as the trial process and the result of the national court, and generally has a uniform structural composition and writing format, and each part of the composition (i.e. text block) corresponds to a content subject for representing the subject matter of the content covered by the part.

Taking the civil judgment as an example, the civil judgment is composed of head information, party information, trial process, litigant request, debate by litigant, dispute focus, evidence catalogue, trial finding, court view, judgment result and tail information, the corresponding components of each topic have a specific writing format or description mode, and each component contains established elements, for example, the head information necessarily contains the name of the trial court, the case number and the like.

Since the official document records important information such as the trial process and the result, which has important values for analysis and attention, the official document can be comprehensively known by extracting valuable information from the official document. For example, case type, case number, trial court name, trial court hierarchy, territory, conference members, and time elements such as acceptance time, trial time, etc. are known.

The document elements and the document tags are important information extracted from the document for understanding the document. The document elements are direct extraction results of document contents, such as case types, case numbers, trial court names, trial court levels, regions, conference members, acceptance time, trial time and the like, and the document labels are used for labeling the documents, not only label the document contents, but also label the classification of the document contents. The document tags may be used for document retrieval or querying, such as retrieving case documents with a particular tag, or retrieving similar cases with the same particular tag for case analysis.

The embodiment of the application provides a divorce referee document element extraction method and a divorce referee document label extraction method, and a specific implementation mode of the divorce referee document element extraction method is introduced in the following first in order to be capable of comprehensively knowing the content and/or the classification of the content of the divorce referee document.

Fig. 1 is a flowchart illustrating the method according to an exemplary embodiment of the present application, and as shown in fig. 1, the method may include:

and step 110, acquiring divorce referee documents.

In the present application, the divorce officer document may be a divorce dispute first-pass judgment book, such as a divorce dispute first-pass judgment book with some plum.

And step 120, cutting the marriage dispute official document into a plurality of text blocks according to a preset official document directory structure, wherein the official document directory structure comprises a plurality of directory titles, and each text block corresponds to one directory title.

In some embodiments, a data set of divorce referee documents of a certain scale is collected, the directory structure composition of the divorce referee documents and the compiling characteristics of each part of the divorce referee documents are learned and mined, a directory tree is built by using the directory structure composition characteristics of the referee documents and taking directory titles in the directory structure composition as directory nodes, and an extraction rule, such as at least one extraction expression, is designed for each directory node by using the compiling characteristics of each part of the composition. By using the extraction rule corresponding to the directory node, the text block corresponding to the directory node can be extracted from the divorce judge document, namely the text block corresponding to the directory title in the directory structure of the judge document.

In some embodiments, the directory node sequentially lists the content topics of text blocks that may exist in the umpire document, and the extraction rules under the directory node are used to extract text blocks from the umpire document that correspond to the directory node or content topic, a text block comprising one or more paragraphs.

Civil judgment book

Header information-extraction expression

Party information- -extraction expression >

The trial process is through an-extraction expression

Litigant request- - - - - < extraction expression >

Is resolved by litigation party- - - - - - - - - - < extraction expression >

Checking-out- -finding- -extracting expression >

The focus of dispute- - - - - - - - - - - < extraction expression >

Court view-of-extraction expression

Decision result-extraction expression

Tail information-extraction expression

Wherein, the "civil judgment" is the name of the directory tree selected according to the document type, and the "header information" and the like are the directory nodes included in the directory tree.

In some embodiments, the extraction expression corresponding to each directory node is used to extract the block header information of each text block, so that the start position of each text block can be determined according to the block header information, and paragraph contents between two adjacent start positions are extracted to obtain the corresponding text block.

By the method, a divorce officer document is cut into pieces, and an exemplary cutting result is obtained as follows:

< header information >/

People's court in XX district of Beijing City

Civil judgment book

(2018) Jing 0105 Minjun 77967 No.

< party information >/</su

Original notice: liu Gong, male, born in 31 th of 10 th of 1983, Han nationality, in the rising area of Beijing.

Entrusting litigation attorneys: zhao, Beijing xx law firm lawyer.

Is informed: luo Gong, female, born in 25.4.1985, Han nationality, located in the sunny region of Beijing.

Entrusting litigation attorneys: xian, the lawyer of Beijing xx law firm.

< audit pass >/(R)

When a case of property disputes after a divorce is reported to Yu in original Liu, … is carried out after the case is accepted and the case is settled. The present application has been examined and finalized.

< litigation-party request >/</

Liu somewhere asks the court for litigation: 1. luo somebody pays me 309678 yuan; 2. litigation fees are borne by the defendant. Facts and reasons: i together with Luo-Chi … 2012 in 11 months of 2012, I applied for buying two rooms located in the area facing the sun and at the arm X-yard X-building X-number, … I thought that Luo-Chi should pay me the portion that I has paid back. If Lu and a certain requirement divide the share of the house, I require Lu and a certain requirement pay tax and loan-half cost and decorate half of the cost.

< solicited by litigation party >/</H

Some dialects of Luo are written in divorce judgment to solve house division and debt together, and I require that houses are divided according to shares and two thirds of houses are divided.

< evidence directory >/(X >

The plaintiff submitted evidence around lawsuit requests legislation:

evidence 1, list 1, evidence 1;

evidence 2, manifest 2, proof 2;

the following evidence is verified by the court justice and certification:

evidence 1, identifying result 1;

evidence 2, affirming result 2;

< audit found >/broken

The hospital is approved to recognize the fact as follows: … are provided.

< focus of dispute >/</su

Lu Chi advocates dividing the concerned houses according to shares. Lu Di should have ownership for two thirds of the way that Lu Di is considered as a family name when applying for a house and considering the daughter of both parties. Liu Gong believes that although the house related to the case is applied in the name of a family, the qualification that the house is the identity of a transfer soldier to obtain the shaking number is mainly considered. Liu somebody advocates that Lu somebody paid the loan and duty after the Lu certain repayment marriage and requires Lu somebody to pay half of the decoration cost.

< court opinion >/</>/broken

This hospital deems, according to the relevant legal provisions, ….

< decision result >/

The decision is as follows:

from the month this decision was in effect, ….

< trailer information >/

Trial height XX

People's accompany person's pipe XX

People accompany and examine person congratulate XX

Two good quality one nine year twelve month one day

Proxy bookmarker LuXX

In the above example, since the directory node composition of the directory tree is designed according to the block composition of the referee document, the directory node composition of the directory tree corresponds to the block composition of the referee document, each text block obtained by splitting the directory tree corresponds to one directory node in the directory tree, and the directory title of the corresponding text block can be obtained by the directory node name.

And step 130, selecting a text block corresponding to the specified element to be extracted from the plurality of text blocks according to the directory title corresponding to the text block.

In some embodiments, by learning the data structure characteristics of the divorce officer document, a professional term knowledge system of the divorce case is mined, such as child foster distribution, common property distribution, emotional breaking affairs and the like, and an element system structure of the divorce officer document is designed according to the mined professional term knowledge system, wherein the element system structure comprises multi-level and multi-dimensional document element information, such as litigation requests, case facts, dispute focuses, judgment results and the like.

In the embodiment of the present application, document elements to be extracted, that is, specified elements are selected from a preset element architecture. In some embodiments, the specified element to be extracted is also referred to as a target element.

An exemplary element architecture is established by learning and mining the data structure features of the divorce officer's paperwork as follows:

Figure BDA0002537361830000061

the above exemplary element architecture includes a root node, intermediate nodes, and leaf nodes, the root node at the highest level may only have one or more child nodes, but no parent nodes; an intermediate node may have one or more child nodes and may only have one parent node; the leaf nodes at the lowest level have only parent nodes and no child nodes.

For example, in the above exemplary element architecture, the "litigation request", "party information", "case fact", "evidence class element", "dispute focus", "decision result" and the like are the highest-ranked root nodes, the "litigation request list" and the like are the child nodes of the "litigation request", the "child case element", "marriage type" and the like are the child nodes of the "case fact", the "number of children", "child name" and the like are the child nodes of the "child case element", and the like.

It will be appreciated that different document elements may be included in different text blocks, for example, a litigation request element is included in a text block with a directory title of litigant requests, and a child case element is included in one or more text blocks with a directory title of party information, litigant requests, debates by litigants, trial ascertainments, and court opinions. Therefore, the text block including the target element can be selected as the analysis target according to the specific target element, and for example, when the litigation requesting element needs to be extracted, the text block whose directory title is requested by the litigant can be selected as the structured object.

For the sake of distinction and explanation, a text block selected according to a target element and including the target element is defined as a target text block corresponding to the target element.

In some embodiments, a preset corresponding relationship between the catalog title and the document element is established in advance, and under the condition that the target element to be extracted is known, the catalog title corresponding to the target element can be selected according to the preset corresponding relationship, so that the target text block containing the target element is determined.

In this embodiment, the preset corresponding relationship between the catalog title and the document element can be shown in the following table:

it should be noted that the preset correspondence is only an exemplary representation, and does not include elements of all levels and dimensions in the element architecture, and those skilled in the art can further refine and refine the preset correspondence according to the exemplary representation.

Step 140, extracting the element value of the specified element from the text block corresponding to the specified element by using an element tree corresponding to the specified element, wherein the element tree comprises an element node corresponding to each specified element and an extraction rule of the element node.

In order to extract the document elements included in a given target text block from the target text block, an element tree structure matched with the specific text block is created in advance so as to extract different document elements from different text blocks by using different element trees. Each element tree comprises at least one element node, each element node corresponds to at least one extraction rule, and the extraction rules are used for extracting the document elements corresponding to the element nodes from the target blocks.

Based on this, in step 140, an element tree to be used is selected from the target text block and/or the target element, and the element value of the target element corresponding to each element node is extracted from the corresponding target text block using the element node in the element tree.

Illustratively, for child case elements, the pre-created element tree is as follows:

in some embodiments, since different element nodes are used for extracting different target elements, the extraction rule corresponding to each element node is different, and the extraction rule may be: a positioning rule, a time extraction rule, or a normalized element matching rule.

The positioning rule comprises a front positioning rule and a rear positioning rule based on a regular expression, and the main principle of the positioning rule is to determine the starting position of the target element in the target text block by using the front positioning rule and determine the ending position of the target document element in the target text block by using the rear positioning rule.

In some embodiments, determining the starting position of the target element in the target text block using the prepositioning rule includes: identifying prepositioning information of the target element by using a prepositioning rule; and determining the initial position of the target element in the target text block according to the preposed positioning information. The prepositioning information can be a specific Chinese word or a specific Chinese context, such as a role tag in front of the name of a conference room member, and can also be a Chinese character or a non-Chinese character of a specific position index, such as the prepositioning information taking the Chinese character at the first position in the header block information as the name of the trial court.

In some embodiments, determining the end position of the target element in the target text block by using the post-positioning rule comprises: identifying the post-positioning information of the target element by using a post-positioning rule; and determining the end position of the target element in the target text block according to the post-positioning information. The post-positioning information can be a specific suffix characteristic word, such as that of the trial court name, namely the court or the division, and can also be a non-Chinese character indexed by a specific position, such as a line-feed symbol.

In some embodiments, the extraction rule corresponding to one or more element nodes in the element tree is a time extraction rule, where the time extraction rule is specifically at least one time extraction expression for extracting a time element from a text block such as an audit pass block, where the time element is: the "appellation time", "acceptance time", "plan setting time", "trial time", and "trial period" included in the trial pass block include "living time" and the like.

Specifically, the time extraction expression is a regular expression supporting various date structure types, and supports identification of Chinese, Arabic number and full/half angle type numerical information.

In order to improve the accuracy of the extracted time class elements, in some embodiments, the time information in the text block corresponding to the time class elements is subjected to a reference resolution process, and then the element values of the time class elements are extracted from the text block after the reference resolution process.

In some embodiments, the extraction rule corresponding to one or more element nodes in the element tree is a normalized element matching rule. Here, the normalized element may be understood as a document element that must be expressed by a standard word/word in the official document, such as a sex element of a principal or a child that must be expressed by a standard word such as "male" or "female".

During specific implementation, a standard word set is preset according to the target normalized element, and the standard word set comprises at least one standard word. Furthermore, the standard words in the standard word set can be matched with the target text block, and then the document elements can be extracted from the target block according to the matching result.

In some embodiments, the element node corresponding to the target element has at least two sub-nodes, and each sub-node corresponds to a class label of the target element, which is substantially a class label of the extraction result of the element node. The extraction results of part of element nodes are classified in a targeted manner, so that element extraction and classification with finer granularity are realized, and the accuracy of element identification and extraction results is improved.

In some embodiments, the target element may be a litigation request element, and the target title of the text block corresponding to the litigation request element may be a litigation party request. With reference to the above exemplary element architecture, the litigation-request elements may specifically include a litigation-request list and whether the story is to be agreed to the divorce, and accordingly, the element tree for extracting the litigation-request elements includes at least an element node for extracting the litigation-request list and an element node for extracting "whether the story is to be agreed to the divorce", where the element node corresponding to "whether the story is to be agreed to the divorce" may have three child nodes, which are "yes", "no" and "no", respectively, which are category labels of the node extraction result of "whether the story is to be agreed to the divorce".

In the above embodiment, first, corresponding element nodes are used to extract litigation request information texts from corresponding target text blocks, and then the extracted litigation request information texts are segmented to obtain one or more independent litigation request items, where the one or more independent litigation request items are element values of a litigation request list. When cutting, firstly, judging whether the litigation request information text has item symbols such as '1', '2', and the like, if so, identifying the existing item symbols, separating the independent litigation request items from the litigation request information text by using a character string segmentation method, and if not, directly cutting the litigation request information text according to the separation symbols such as periods, semicolons, and the like.

For example, the litigation request information text extracted from the aforementioned liu and luo dispute referee documents and the independent litigation request items separated from the litigation request information text are as follows:

litigation request information text:

original report certain 1 makes litigation request to the institute: 1. is advised to pay me 309678 dollars; 2. litigation fees are borne by the defendant.

Independent litigation request items:

is advised to pay me 309678 meta @

The litigation fee is borne by the notice

In the above example, the independent litigation request item is the result corresponding to the litigation request list node.

In addition, it is also necessary to extract an expression text indicating whether the target text block is divorced or not from the corresponding target text block by using the corresponding element node, classify the extracted expression text, use a first predetermined value as an element value of a child node corresponding to a category tag hit by the expression text, use a second predetermined value as an element value of a child node corresponding to a category tag missed by the expression text, for example, if the expression text hits "yes", then assign "True" by using "True", and assign "no" and "none" by using "False".

In some embodiments, the target element may be a case fact element, the case fact element further includes a child condition element and other fact elements, wherein the child condition element further includes a child basic condition element and a child other condition element, and since the child other condition element belongs to an element needing to be classified, the element node corresponding to the child other condition element includes at least two child nodes, each child node corresponds to a category label of the child other condition element, for example, the child other condition element includes category labels that may indicate a relationship between the child and an original/informed subject, whether the child is underage/lactation, will follow a party, and the like, wherein the child nodes under the element node corresponding to the original/informed relationship respectively correspond to a child, a parent child, a nurse and a wedding child, and whether the child nodes under the element node corresponding to the underage/lactation respectively correspond to a child, and the like, And if not, the child nodes under the element nodes corresponding to the willing following the principal correspond to the original report and the reported report respectively. The other fact elements include common property, marital type, presence or absence of participation, residence time and the like, it should be noted that the other fact elements also include document elements to be classified, for such other fact elements, at least two sub-nodes respectively corresponding to the category labels are also arranged below the corresponding element node, and whether two sub-nodes are arranged below the element node corresponding to the presence or absence of the participation.

In some embodiments, child condition elements are extracted from the corresponding target text block according to the following steps:

firstly, identifying a sentence where child information is located from a text block corresponding to the child condition element;

secondly, analyzing the sentence where each piece of child information is located item by item, and extracting the child condition elements from the sentence where the piece of child information is located by using corresponding element nodes. Specifically, the child basic situation element is extracted using the element node corresponding to the child basic situation element, and the child other situation is extracted using the element node corresponding to the child other situation element.

Then, for the child basic situation elements, the data object generation and storage are directly performed by taking the extraction result of the corresponding element node as an element value, such as the name, sex, age and the like of the child. For other elements of children, the extraction result of the corresponding element node needs to be classified to obtain the hit category label, and the first predetermined value is used as the element value of the child node corresponding to the hit category label, and the second predetermined value is used as the element value of the child node corresponding to the missed category label. For example, the age of a child is classified to determine whether the child belongs to the immature period or the lactation period, and the expression text extracted from the element node relating to the original relationship is classified to determine whether the child is a married child or a foster child or a relay child or a parent child, and if no determination result is obtained, the child is defaulted to be a married child.

From the above embodiment, it can be seen that multi-dimensional and multi-level extraction of the condition elements of the children is realized by further classifying the other condition elements of the children, for example, the age of the children and whether the minor is an element with different levels and dimensions, are the result obtained by classifying the age of the children. Thus, the relevant personnel can know the age of the child and can directly know whether the child is underage.

For other case factual elements, element nodes in the element tree are used to extract element results according to the element tree corresponding to the specific target element.

Illustratively, for other case fact elements, the pre-created element tree structure is as follows:

Figure BDA0002537361830000121

in some embodiments, the target elements further include evidence class elements corresponding to text blocks titled evidence directory (in other directory structure compositions, text blocks corresponding to evidence class elements may be text blocks titled proof, and court approval results).

In some embodiments, the evidence class elements include proof elements including provenance proof elements and defendant proof elements, and forensic elements including provenance proof elements and defendant proof elements.

The evidence element comprises an evidence number, an evidence list, an evidence fact and an evidence detail rule, the evidence detail rule comprises at least one evidence item, and each evidence item comprises an evidence list element arranged according to a bit sequence and an evidence fact element adjacent to the evidence list element. Specifically, proof-proof numbers, proof lists and proof facts are respectively extracted from corresponding target text blocks through corresponding element nodes, then according to the proof-proof numbers, the proof lists and position indexes of the proof facts in the text blocks, the 'proof facts' are searched backwards from the 'proof lists', and until the next 'proof list', the proof list elements arranged in the order of bits and the proof fact elements adjacent to the positions of the proof list elements are respectively used as a proof-proof item.

The quality certification elements comprise quality certification evidence numbers, quality certification results, quality certification opinions and quality certification rules, the quality certification rules comprise at least one quality certification item, and each quality certification item comprises an evidence number element, a quality certification result element and a quality certification opinion element, wherein the evidence number elements, the quality certification result elements and the quality certification opinion elements are arranged in a bit sequence; specifically, based on the quality certificate result, the evidence number is searched forward, the quality certificate opinions are searched backward until the previous quality certificate result, and the evidence number elements arranged in order, the quality certificate result elements adjacent to the evidence number elements and the quality certificate opinion elements adjacent to the quality certificate result elements are used as a quality certificate item until the next quality certificate result.

The court certification elements comprise evidence numbers, evidence providers, court certification results and court certification rules, the court certification rules comprise at least one certification basis item, and each certification basis item comprises the evidence provider elements arranged in a bit sequence, evidence number elements adjacent to the evidence provider elements and court certification result elements adjacent to the evidence number elements. Specifically, based on the 'court approval result', the contents of the 'evidence provider' and the 'evidence number' are searched forward, and until the previous 'court approval result', the evidence provider element, the evidence number element adjacent to the position of the evidence provider element, and the court approval result element adjacent to the position of the evidence number element, which are arranged in a bit order, are used as an authentication basis item.

In some embodiments, the target element may be a dispute focus element. With reference to the above exemplary element architecture, the dispute focus element may further comprise a list of dispute focus points and at least one dispute focus category name, the list of dispute nodes comprising one or more independent dispute focus items, each independent dispute focus item, after being sorted, hitting one or more dispute focus category tags. Correspondingly, the element tree for the dispute focus element at least comprises an element node corresponding to the dispute focus list and an element node corresponding to the dispute focus category, wherein one or more child nodes corresponding to the dispute focus category label are arranged under the element node corresponding to the dispute focus category.

In some embodiments, the dispute focus information text is first identified from the corresponding target text block by using the element nodes specified in the corresponding element tree, and the dispute focus information text is segmented to obtain a dispute focus list including one or more independent dispute focus items. And then analyzing each independent dispute focus item one by one to obtain a hit dispute focus category label, taking a first preset value as an element value of a child node corresponding to the hit dispute focus category label, and taking a second preset value as an element value of a child node corresponding to the missed dispute focus category label.

In some embodiments, text data of the independent dispute focus items of the known class is used as a training sample, a certain amount of the training sample is used for training to obtain a classification model based on a neural network model, and the trained classification model is used for classifying the independent dispute focus items of the unknown class.

Illustratively, the dispute focus information text extracted from the divorce dispute referee documents of the luga and the liu, and the independent dispute focus items separated from the dispute focus information text are as follows:

dispute focus information text:

lu Chi advocates dividing the concerned houses according to shares. Lu Di should have ownership for two thirds of the way that Lu Di is considered as a family name when applying for a house and considering the daughter of both parties. Liu Gong believes that although the house related to the case is applied in the name of a family, the qualification that the house is the identity of a transfer soldier to obtain the shaking number is mainly considered. Liu somebody advocates that Lu somebody paid the loan and duty after the Lu certain repayment marriage and requires Lu somebody to pay half of the decoration cost.

Independent dispute focus items:

some claim of/Lu divides into the case house according to the share

The Luo should have two thirds of ownership based on the fact that the Luo applies on the name of the family when the Luo applies for the house in the case of the project and considers the daughter of the two parties

Liu et al thought that although the related housing system applies in the name of a family, the qualification that the house is the identity of a transfer military person to be signed is mainly considered

Liu claims loan and duty paid by Liu after Lu paying marriage and requires Lu paying half of decoration fee

In an exemplary divorce factor architecture, the partial dispute focus category names are as follows:

divorce dispute focus category

Whether the emotion is disrupted

-whether there is a family violence

By whom the child is fostered

-property segmentation

Debt split

-credit split

How common property is handled

How common debts are handled

How common claims are handled

……

House repayment

Housing discount money

How houses are handled

Illustratively, the above-mentioned individual dispute focus item "/some claim division by shares related to the case/" hit dispute focus category label includes at least "how the house is handled", the/Liu some claims another loan and tax paid by Liu some after marriage, and the claim that Lu some pays half of the decoration fee/"hit dispute focus category label includes at least" house loan ". And the first predetermined value is respectively stored as the element values of the child nodes corresponding to the 'how house is processed' and the 'house loan'.

As can be seen from the above embodiments, the dispute focus elements include not only the dispute focus list composed of the independent dispute focus items, but also one or more category labels of each independent dispute focus item, and the extracted dispute focus elements have a multi-level and multi-dimensional characteristic, for example, the dispute focus list and the dispute focus category belong to document elements of different levels, and document elements of different dimensions between the dispute focus categories belong to document elements of different dimensions.

In some embodiments, the target elements further include decision result elements including whether divorce is granted, child foster allocation elements, common property allocation, and the like, wherein whether divorce is granted further includes two category labels of "yes" and "no", the child foster allocation elements further include underlying level elements of foster, year of burden, proportion of burden, amount of money, payment method, and the like, and the common property allocation further includes home affiliation, vehicle affiliation, and the like.

And during specific implementation, analyzing the text information of the judgment result extracted by the corresponding directory node to obtain the classification result of whether divorce is granted or not. And if the classification result of whether divorce is granted is not obtained, judging whether the child fostering distribution node has child nodes with the extraction result, and if so, judging that divorce is granted.

Illustratively, the element nodes for extracting the child fostering allocation elements are composed as follows:

Figure BDA0002537361830000151

Figure BDA0002537361830000161

and screening minor information from the minor condition element result, and performing character string matching on the minor information (such as minor names) and the judgment result information text to obtain the position indexes of all minor information in the judgment result text information.

And distributing the fostering elements for each child, and extracting corresponding information by using the extraction rules corresponding to the corresponding element nodes. Judging whether the foster party node has an extraction result, if so, indicating that the foster party information exists in the judgment result, acquiring the extraction result of the foster party node, and extracting a role label or a name of the party from the extraction result, such as an original notice or a defended notice, as an element value of the foster party.

Step 150, generating a data object and a data object set including the data object by using the element node name corresponding to the specified element as a field name and the element value of the specified element as a field value.

In the embodiment of the present application, the element node name includes a name of an element node corresponding to the specified element, that is, an element node name, and also includes a name of a child node under the element node corresponding to the specified element, that is, a child node name.

For an element node and/or a child node using the first predetermined value or the second predetermined value as an element value, for example, a child node at a dispute focus, a child node under an element node corresponding to an element in other situations of children, or a child node under an element node corresponding to a decision result element, a data object is generated by using a name of the child node as a field name and using the first predetermined value or the second predetermined value as a field value.

In this embodiment, the data objects are structured results of dispute official documents, and each data object includes one or more information pairs, i.e. keys (field names): value field attribute), such as "child gender: female "," minor "," whether or not: yes, will foster the prescription: original notice "" house attribution is notified: yes, etc.

In some embodiments, the field value in the data object corresponding to the field name may be a null value.

It can be seen from the above embodiments that one data object set includes at least one data object, and one data object includes one or more information pairs, i.e. keys: value, where, Key is an element node name or a child node name, and the corresponding Value is an element Value corresponding to the element node or the child node.

In some embodiments, the data object further includes element node name/child node name and/or location information of the corresponding element value.

Based on the foregoing embodiment, the present application further provides a divorce officer document tag extraction method, fig. 2 is a flowchart of the method according to an exemplary embodiment of the present application, and as shown in fig. 2, the method may include:

step 210, acquiring divorce referee documents.

In the present application, the divorce officer document may be a divorce dispute first-pass judgment book, such as a divorce dispute first-pass judgment book with some plum.

Step 220, structuring the divorce officer document to generate at least one data object set, wherein the data object set comprises at least one data object, and document elements of the divorce officer document are packaged in the data object.

In some embodiments, an element tree for extracting document elements from a divorce officer document is constructed from the data structure characteristics of the divorce officer document, from the constructed element architecture, and at least one set of data objects is generated. The specific implementation process can be seen in the embodiment shown in fig. 1, and is not described herein again.

In some embodiments, a root node in the element tree corresponds to a data object set, the data object set corresponding to the root node includes data objects corresponding to intermediate nodes and leaf nodes under the root node, for example, "litigation request" corresponds to a data object set, the data object set includes data objects encapsulating "litigation request list" and/or "whether the notice agrees to divorce", and for example, "case fact" corresponds to a data object set, and the data object set includes data objects encapsulating element information such as "child case element" and/or "marital type".

In another embodiment, an intermediate node in the element tree corresponds to a data object set, the data object set corresponding to the intermediate node includes data objects corresponding to leaf nodes below the intermediate node, for example, "child condition element" corresponds to a data object set, and the data object set includes data objects in which element information such as "number of children" and/or "name of children" is encapsulated.

In some embodiments, the data object further encapsulates location information of the element node name and/or the corresponding element value in the document, and the location information includes a prefix position index and/or a position index.

In some embodiments, each data object is encapsulated with an element node name or a child node name as a field name and an element value corresponding to the element node name or the child node name as a field value in a key-value pair format, and an exemplary structure of the data object is as follows:

Figure BDA0002537361830000171

the data objects in the data object set corresponding to the party information are as follows:

Figure BDA0002537361830000181

in some embodiments, the value of the element corresponding to the name of the element node in the data object may be null, for example, if the value of the element for "job" is null, it indicates no job. The value of the element corresponding to the name of the element node in the data object may be a first predetermined value (e.g., True) or a second predetermined value (e.g., False).

And 230, acquiring a parent tag from a preset tag system of the divorce referee document, and determining a data object set corresponding to the parent tag, wherein the parent tag is a tag of a target document tag to be extracted.

In some embodiments, a professional term knowledge system of the divorce officials is mined by learning the data structure features of the divorce officials document, a label system of the divorce officials document is designed according to the mined professional term knowledge system, and the preset label system comprises multi-level and multi-dimensional document labels.

In some embodiments, the preset tag hierarchy includes multiple hierarchical tags, where at least one low-level tag is provided under a tag of a highest hierarchical level, a tag of a middle hierarchical level corresponds to a tag of a highest hierarchical level, and at least one lowest hierarchical level tag, where the tag of the lowest hierarchical level corresponds to a parent tag path, and the parent tag path is a path from the tag of the highest hierarchical level corresponding to the lowest hierarchical level to a tag of a previous hierarchical level corresponding to the lowest hierarchical level. And when the upper-level label corresponding to the lowest-level label is the label of the highest level, the parent label is the label of the highest level, and when the upper-level label corresponding to the lowest-level label is the label of an intermediate level, the parent label is the path from the highest-level label corresponding to the lowest-level label to the intermediate-level label.

An exemplary default label system is as follows:

Figure BDA0002537361830000191

in the label system, the "principal" has the highest hierarchy and has a plurality of intermediate labels, such as "original" and "announced", and the two intermediate labels respectively have a plurality of lowest-level labels, such as "name" and "gender", and the like. In this label system, "principal-source" and "principal-defender" can be used as parent labels of a plurality of bottom labels, such as "principal-source-gender" and "principal-defender-name" for the parent label, and "principal-defender-name" for the parent label.

In step 230, a parent tag of the target document tag is obtained from a preset tag system, and a data object set corresponding to the parent tag is determined.

In the present application, since the target document tag is extracted from the data object and the target document tag is included in the data object in the designated data object set, that is, the data object set corresponding to the parent tag of the target document tag identified in advance, it is necessary to determine the corresponding data object set according to the parent tag of the target document tag, so as to extract the target document tag based on the parent tag from the data object set corresponding to the parent tag.

In some embodiments, a correspondence between the parent tag and the data object set is established in advance, and the data object set corresponding to the parent tag can be determined according to the correspondence when the parent tag of the target document tag is acquired.

For example, the set of data objects corresponding to the "party-original" is a set of data objects including the party element information, and the set of data objects corresponding to the "litigation request" is a set of data objects including the litigation request element information.

Step 240, extracting the target document tag from the data object set corresponding to the parent tag.

In some embodiments, field values in the data objects (i.e., element values of the document elements of the dispute officials document) are extracted, and the extracted field value or values are respectively spliced with corresponding parent tags to obtain the target document tag.

For example, the preset tag system includes a principal role tag, such as "principal-role", for example, "principal-source", "principal-destination", "principal-source agent", or "principal-destination agent", which can be used as a parent tag, and the element data object set corresponding to the parent tag is a data object set encapsulating principal element information, wherein the principal data object includes a field describing a role type and a field describing a picture of the principal. When a target document tag taking a role tag of a party as a parent tag needs to be extracted, according to a specified role type (such as a notice) represented by the role tag of the party, acquiring a data object with a field value in a role type field being the specified role type (such as the notice) from a corresponding data object set, and extracting field values of various portrait fields, such as a field value of a name, a field value of a gender and the like from the data object with the role type attribute being the specified role type; the extracted portrait field value is concatenated with the party role tag to obtain the target document tag with the parent tag being the party role tag, for example, the extracted portrait element "woman" is concatenated with the party role tag to obtain "party-notice-woman".

In the preset label system of the divorce officer document, the document labels taking the role labels of the parties as parent labels include but are not limited to: party-source/defendant-male/female/active military/during pregnancy/delivery within 1 year/termination of pregnancy within 6 months/psychotic patient, source agent/defendant-law/civilian group/unit recommender/guardian/relatives/legal/assigned/entrusted.

In some embodiments, field names in the data object are extracted, and the extracted field name or field names are spliced with the corresponding parent tag to obtain the target document tag.

For example, the preset tag system includes a "directory" which can be used as a parent tag, and the data object set corresponding to the parent tag is a data object encapsulated with directory element information, where the directory element information includes a directory title as an element node name and text block content corresponding to the directory title as a corresponding element value. When a target document tag with a directory as a parent tag needs to be extracted, the field names are extracted from the corresponding data objects, namely, the directory titles are extracted, such as a litigant request, a litigant requested, a dispute focus and the like, and then the extracted field names are spliced with the directory tags to obtain the target document tag, such as a directory-litigant request, a directory-litigant requested, a directory-dispute focus and the like.

In the preset label system of the divorce officer document, document labels with the 'directory' as a parent label include but are not limited to: directory-prosecution appeal/defendant/forensics/demonstratement/defendant/demonstratement/primordial testimony/defendant/court certification/focus of dispute/decision results, etc.

As can be seen from the embodiment shown in fig. 1, some element nodes for extracting a specified element include at least two sub-nodes, one sub-node corresponds to a category label of the specified element, a field name in a data object is a sub-node name, and a field value corresponding to the sub-node name is a first predetermined value or a second predetermined value, where the first predetermined value indicates that the category label corresponding to the sub-node name is hit, and the second predetermined value indicates that the category label corresponding to the sub-node name is not hit.

Based on this, in some embodiments, a child node name having a field value of a first predetermined value in the data object is extracted; and splicing the extracted child node name and the corresponding parent label to obtain the target document label.

For example, the preset tag system includes a "dispute focus" which may be a parent tag, and the data object set corresponding to the parent tag is a data object set in which dispute focus element information is encapsulated, where each data object includes a child node name serving as a field name, the child node name corresponds to a dispute focus category tag, and the field value corresponding to the child node name is a first predetermined value or a second predetermined value. If the field value corresponding to a field name is a first predetermined value, it indicates that the dispute focus category tag corresponding to the field name is hit, and if the field value corresponding to a field name is a second predetermined value, it indicates that the dispute focus category tag corresponding to the field name is not hit. When a document label based on a dispute focus needs to be extracted, a field name with a field value of a first preset value is extracted from a corresponding data object, and the extracted field name is spliced with the dispute focus to obtain a target document label, such as a dispute focus-tending right and a dispute focus-common property processing, wherein the tending right and the common property processing are the field name with the field value of the first preset value in the data object and are also hit dispute focus category labels.

In the default label system of the divorce officer document, the document label with the "dispute focus" as the parent label includes but is not limited to "dispute focus-whether emotion breaks/nursing right problem/nursing fee problem/visiting right problem/common property handling/common property identification/common property discount/personal property identification/mental consolation/salutation/color gift/fitting/legal request has fact and law basis/house initial payment composition/house repayment/house discount price/house handling/alternative settlement personal debt identification problem/one party is left unidentified".

In some embodiments, a field value is extracted from a data object corresponding to a parent tag, and a category tag corresponding to the field value is obtained according to a preset classification rule; and splicing the category label corresponding to the field value with the corresponding parent label to obtain the target document label.

For example, the preset tag system includes a "litigation request" which may be a parent tag, and the set of data objects corresponding to the parent tag is a set of data objects encapsulating litigation request element information, where the data objects include an element node name "litigation request list" as a field name and one or more independent litigation request items as corresponding field values. When a target document tag taking a litigation request as a father tag needs to be extracted, extracting field values, namely one or more independent litigation request items, from corresponding data objects, and performing text classification on each independent litigation request item one by using a pre-constructed litigation request classification model so as to obtain litigation request category tags hit by each independent litigation request item; and splicing the litigation request type tags hit by the independent litigation request items with the litigation request to obtain the document tags of which the father tags are the litigation requests, such as litigation request-divorce and litigation request-foster rights.

In the default label system of the divorce officer document, the document label with "litigation request" as the parent label includes but is not limited to "litigation request-divorce/tending right/tending fee/visiting right/common property/personal property/soothing money/ceremony/makeup".

Similarly, when a document tag with a "reported dialect" (or "requested by a litigator") as a parent tag needs to be extracted, the reported dialect text information is extracted from the corresponding data object, the reported dialect text information is classified by using a pre-constructed classification model, the hit reported dialect category tag is obtained, and the hit reported dialect category tag is spliced with the "reported dialect".

In the preset label system of the divorce officer document, the document label taking the "defended debt" as the parent label includes but is not limited to "defended debt-agreeing to divorce/disapproving divorce/requiring nursing right/requiring nursing fee/requiring visiting right" and the like.

For another example, the parent tag may also be "case fact-child case", a corresponding data object set of which is a set of data objects in which child case element information is encapsulated, for a child case element including the number of children, an element value of "number of children" is extracted from the corresponding data object, and a hit category tag is determined by classification, for example, if the element value of the number of children is 0, then "no children" is hit, if the element value of the number of children is 1, then "solitary children" is hit, etc., and the category tag hit by the element value of the number of children is spliced with the parent tag "case fact-child case", so as to obtain a target document tag, such as "case fact-child case-no children" or "case fact-child case-solitary children", etc.

In the preset label system of the divorce officer document, the document label taking "case fact-child condition" as a parent label includes but is not limited to "case fact-child condition-no-child/solitary-child/multiple children/married-child/nurseries/secondary/immature/lactation/past following the original/past following the announcement/past following the original/will follow the announcement" and the like.

In some embodiments, if the field value of the data object corresponding to the parent tag is null, the preset tag of the field name corresponding to the null value is obtained, and the preset tag is spliced with the corresponding parent tag to obtain the target document tag based on the parent tag.

For example, when a document tag taking a principal role tag as a parent tag is extracted, if a field value of a field name "occupation" in a corresponding data object is null, then "no business" is extracted and spliced with the principal role tag, such as "principal-being-no business", where "no business" is a preset tag of "occupation".

The parent tags in the divorce tag system also include "case facts-property situation", "court opinions", "judgment results", and the like.

When a document label taking case fact-property condition as a father label needs to be extracted, acquiring a data object set encapsulated with property condition element information, judging whether field values of field names in the data object are empty, such as common property, common debt, house information, vehicle information and the like, extracting field names with field values not empty, and splicing the field names with the father label "case fact-property condition" to obtain a target document label. In addition, a keyword matching mode is used, and whether the contents of the directory contain the keywords of the personal property or not is matched and checked. If the case fact-property condition-personal property condition exists, the 'personal property' label is extracted and spliced with the 'case fact-property condition' to obtain 'case fact-property condition-personal property'.

When a document label taking a 'court view' as a parent label needs to be extracted, acquiring a data object set encapsulated with court view element information, correspondingly extracting the field values of field names such as 'emotion break', 'emotion not break', 'emotion base', 'non-emotion base', and the like according to whether the field values of the field names such as 'whether emotion break is confirmed' and 'emotion base' in the data object are a first preset value or a second preset value, and splicing the extracted field values with the 'court view' to obtain a target document label.

In some embodiments, after extracting the target document tag from the data object set corresponding to the parent tag, the method further includes: and acquiring the positioning information of the target document label and storing the positioning information.

In some embodiments, for a target document tag containing a field value, the location information of the target document tag is the location information of the field value contained by the target document tag in the divorce officer document. For example, for the tag "principal-source-girl", the location information of the tag is the location information of "girl" in the document.

In some embodiments, for a target document tag containing an element node name, the positioning information of the target document tag is the positioning information in the divorce officer document for the element value corresponding to the element node name contained in the target document tag. For example, the location information of the document tag "directory-litigation request" is the extraction result (i.e., litigation request text) corresponding to the element node with the element node name "litigation request", i.e., the location information of the text block in the document with the directory tag "litigation request".

In some embodiments, for a target document tag containing a child node name, the location information of the target document tag, the element value of the category tag corresponding to the element node name contained in the target document tag is hit, and the location information in the divorce officer document. For example, the location information of the tag "dispute focus-common property assignment" is the location information of the individual dispute focus item in the text that hits the tag of the category "common property assignment".

As can be seen from the above embodiments, the present application provides a divorce officer document label extraction method, including: acquiring a divorce referee document; structuring a divorce referee document to generate at least one data object set, wherein the data object set comprises at least one data object, and document elements of the divorce referee document are packaged in each data object; acquiring a parent tag from a preset tag system of a divorce referee document, and determining a data object set corresponding to the parent tag; and extracting the target document label from the data object set corresponding to the parent label. The method realizes multi-level and multi-dimensional label extraction of the divorce referee document, and provides help for document content marking, case retrieval and query.

According to the above embodiments, the present application also provides a divorce officer document tag extraction device, as shown in fig. 3, the device may include:

the document acquisition module 310 is used for acquiring divorce dispute referee documents;

a document structuring module 320, configured to structure the divorce officer document to generate at least one data object set, where the data object set includes at least one data object, and document elements of the divorce officer document are encapsulated in the data object;

a parent tag obtaining module 330, configured to obtain a parent tag from a preset tag system of a dispute referee document, and determine a data object set corresponding to the parent tag, where the parent tag is a parent tag of a target document tag to be extracted;

document tag extraction module 340, configured to extract the target document tag from the data object set corresponding to the parent tag.

In a specific implementation, the present invention further provides a computer storage medium, where the computer storage medium may store a program, and the program may include some or all of the steps in each embodiment of the divorce officials document tag extraction method provided by the present invention when executed. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM) or a Random Access Memory (RAM).

Those skilled in the art will readily appreciate that the techniques of the embodiments of the present invention may be implemented as software plus a required general purpose hardware platform. Based on such understanding, the technical solutions in the embodiments of the present invention may be essentially or partially implemented in the form of a software product, which may be stored in a storage medium, such as ROM/RAM, magnetic disk, optical disk, etc., and includes several instructions for enabling a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the method according to the embodiments or some parts of the embodiments.

The same and similar parts in the various embodiments in this specification may be referred to each other. In particular, as for the apparatus embodiment, since it is substantially similar to the method embodiment, the description is simple, and the relevant points can be referred to the description in the method embodiment.

The above-described embodiments of the present invention should not be construed as limiting the scope of the present invention.

27页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:离婚纠纷裁判文书要素提取方法及装置

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!