Method for obtaining case number of legal document and related equipment

Document No.: 830120 Publication date: 2021-03-30

Abstract: Method for obtaining case number of legal document and related equipment (一种获得法律文书的案号的方法及相关设备), created by 徐宏 (Xu Hong) on 2019-09-27. The invention discloses a method for obtaining the case number of a legal document, and related equipment. The method parses the legal document to obtain multiple types of case information carried in it; determines the case number characters corresponding to each type of case information obtained; and, according to the case number compilation rule that matches the filing date of the legal document, combines at least some of the determined case number characters into the case number of the legal document. Because the case number is derived from the case information in the legal document and from the compilation rule that applies to that document, it can be obtained automatically and accurately, which improves the efficiency of entering case numbers of legal documents.

1. A method of obtaining a case number of a legal document, comprising:

parsing a legal document to obtain multiple types of case information carried in the legal document;

determining case number characters corresponding to each type of case information obtained;

and combining at least some of the determined case number characters into the case number of the legal document according to a case number compilation rule that matches the filing date of the legal document.

2. The method of claim 1, wherein parsing the legal document to obtain multiple types of case information carried in the legal document comprises:

determining the filing date of the legal document from the case file of the legal document;

determining whether the filing date of the legal document is earlier than a preset date; if so, parsing the legal document to obtain case information carried in the legal document that comprises an accepting court, a trial division, a trial procedure and a case serial number; otherwise, parsing the legal document to obtain case information carried in the legal document that comprises an accepting court, a case type and a case serial number.

3. The method of claim 1, wherein determining the case number characters corresponding to each type of case information obtained comprises:

determining the case number characters corresponding to the obtained case information from a preset standard dictionary table, wherein the preset standard dictionary table records corresponding case information and case number characters.

4. The method of claim 1, wherein parsing the legal document to obtain a plurality of types of case information carried in the legal document comprises:

and obtaining a plurality of types of case information from the legal documents by using the regular expressions matched with the case information.

5. The method of any of claims 1 to 4, further comprising:

comparing the combined case number of the legal document with the initial case number corresponding to the legal document, and, when the two are inconsistent, changing the initial case number to the combined case number.

6. An apparatus for obtaining a case number of a legal document, comprising: a case information obtaining unit, a case number character determining unit and a case number obtaining unit, wherein

the case information obtaining unit is configured to parse a legal document to obtain multiple types of case information carried in the legal document;

the case number character determining unit is configured to determine case number characters corresponding to each type of case information obtained;

the case number obtaining unit is configured to combine at least some of the determined case number characters into the case number of the legal document according to a case number compilation rule that matches the filing date of the legal document.

7. The apparatus of claim 6, wherein the case information obtaining unit comprises: a filing date determining subunit, a filing date comparison subunit, a first case information obtaining subunit and a second case information obtaining subunit, wherein

the filing date determining subunit is configured to determine the filing date of the legal document from the case file of the legal document;

the filing date comparison subunit is configured to determine whether the filing date of the legal document is earlier than a preset date and, if so, trigger the first case information obtaining subunit; otherwise, trigger the second case information obtaining subunit;

the first case information obtaining subunit is configured to parse the legal document to obtain case information carried in the legal document that comprises an accepting court, a trial division, a trial procedure and a case serial number;

and the second case information obtaining subunit is configured to parse the legal document to obtain case information carried in the legal document that comprises an accepting court, a case type and a case serial number.

8. The apparatus of claim 6 or 7, further comprising a case number modification unit,

wherein the case number modification unit is configured to compare the combined case number of the legal document with the initial case number corresponding to the legal document, and, when the two are inconsistent, change the initial case number to the combined case number.

9. A storage medium having a program stored thereon, wherein the program, when executed by a processor, implements a method of obtaining a case number of a legal document as recited in any one of claims 1 to 5.

10. An electronic device comprising at least one processor, at least one memory connected to the processor, and a bus; the processor and the memory communicate with each other through the bus; the processor is configured to invoke program instructions in the memory to perform the method of obtaining a case number of a legal document of any one of claims 1 to 5.

Technical Field

The invention relates to the field of text processing, in particular to a method for obtaining a case number of a legal document and related equipment.

Background

As information systems are built out, legal documents can be uploaded to a database, where each legal document is stored together with its case number, so that a user can retrieve the legal document corresponding to a case number stored in the database.

However, the case numbers stored in the database are entered manually from the case information in the legal documents, and the entered case numbers are often wrong because of unclear writing in the legal documents, carelessness of the entry personnel, and similar causes.

Disclosure of Invention

In view of the above problems, the present invention provides a method and related equipment for obtaining a case number of a legal document that overcome or at least partially solve the above problems. The technical solution is as follows:

a method of obtaining a case number of a legal document, comprising:

parsing a legal document to obtain multiple types of case information carried in the legal document;

determining case number characters corresponding to each type of case information obtained;

and combining at least some of the determined case number characters into the case number of the legal document according to a case number compilation rule that matches the filing date of the legal document.

Optionally, parsing the legal document to obtain multiple types of case information carried in the legal document includes:

determining the filing date of the legal document from the case file of the legal document;

determining whether the filing date of the legal document is earlier than a preset date; if so, parsing the legal document to obtain case information carried in the legal document that comprises an accepting court, a trial division, a trial procedure and a case serial number; otherwise, parsing the legal document to obtain case information carried in the legal document that comprises an accepting court, a case type and a case serial number.

Optionally, the determining the case number characters corresponding to the obtained case information includes:

and determining case number characters corresponding to the obtained case information from a preset standard dictionary table, wherein the preset standard dictionary table records corresponding case information and case number characters.

Optionally, the parsing the legal document to obtain various case information carried in the legal document includes:

and obtaining a plurality of types of case information from the legal documents by using the regular expressions matched with the case information.

Optionally, the method further includes:

comparing the combined case number of the legal document with the initial case number corresponding to the legal document, and, when the two are inconsistent, changing the initial case number to the combined case number.

An apparatus for obtaining a case number of a legal document, comprising: a case information obtaining unit, a case number character determining unit and a case number obtaining unit, wherein

the case information obtaining unit is configured to parse a legal document to obtain multiple types of case information carried in the legal document;

the case number character determining unit is configured to determine case number characters corresponding to each type of case information obtained;

the case number obtaining unit is configured to combine at least some of the determined case number characters into the case number of the legal document according to a case number compilation rule that matches the filing date of the legal document.

Optionally, the case information obtaining unit includes: a filing date determining subunit, a filing date comparison subunit, a first case information obtaining subunit and a second case information obtaining subunit, wherein

the filing date determining subunit is configured to determine the filing date of the legal document from the case file of the legal document;

the filing date comparison subunit is configured to determine whether the filing date of the legal document is earlier than a preset date and, if so, trigger the first case information obtaining subunit; otherwise, trigger the second case information obtaining subunit;

the first case information obtaining subunit is configured to parse the legal document to obtain case information carried in the legal document that comprises an accepting court, a trial division, a trial procedure and a case serial number;

and the second case information obtaining subunit is configured to parse the legal document to obtain case information carried in the legal document that comprises an accepting court, a case type and a case serial number.

Optionally, the apparatus further comprises a case number modification unit,

wherein the case number modification unit is configured to compare the combined case number of the legal document with the initial case number corresponding to the legal document, and, when the two are inconsistent, change the initial case number to the combined case number.

A storage medium having stored thereon a program which, when executed by a processor, implements the method of obtaining a case number of a legal document of any one of the above.

An electronic device comprising at least one processor, at least one memory connected to the processor, and a bus; the processor and the memory communicate with each other through the bus; the processor is configured to invoke the program instructions in the memory to execute any one of the above methods for obtaining the case number of a legal document.

By means of the above technical solution, the method and related equipment provided by the invention can parse a legal document to obtain multiple types of case information carried in it; determine the case number characters corresponding to each type of case information obtained; and combine at least some of the determined case number characters into the case number of the legal document according to the case number compilation rule that matches the filing date of the legal document. Because the case number is derived from the case information in the legal document and from the compilation rule that applies to that document, it can be obtained automatically and accurately, which improves the efficiency of entering case numbers of legal documents.

The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.

Drawings

Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:

FIG. 1 is a flow chart illustrating a method for obtaining a case number of a legal document according to an embodiment of the present invention;

FIG. 2 is a flow chart illustrating another method for obtaining a case number of a legal document according to an embodiment of the present invention;

FIG. 3 is a flowchart illustrating a method for generating a case information acquisition model according to an embodiment of the present invention;

FIG. 4 is a flow chart illustrating another method for obtaining a case number of a legal document according to an embodiment of the present invention;

FIG. 5 is a flow chart illustrating another method for obtaining a case number of a legal document according to an embodiment of the present invention;

FIG. 6 is a flow chart illustrating another method for obtaining a case number of a legal document according to an embodiment of the present invention;

FIG. 7 is a schematic structural diagram of an apparatus for obtaining the case number of a legal document according to an embodiment of the present invention;

FIG. 8 is a schematic structural diagram of a device for generating a case information acquisition model according to an embodiment of the present invention;

FIG. 9 is a schematic structural diagram of another apparatus for obtaining the case number of a legal document according to an embodiment of the present invention;

FIG. 10 is a schematic structural diagram of an electronic device provided in an embodiment of the present invention.

Detailed Description

Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

As shown in fig. 1, a method for obtaining a case number of a legal document according to an embodiment of the present invention may include:

S100, parsing the legal document to obtain multiple types of case information carried in the legal document.

The legal documents may include civil judgment documents, administrative judgment documents, criminal judgment documents, and arbitration documents. The embodiment of the invention can parse the legal document using a text parsing technique to obtain multiple types of case information from it, where the case information may include at least one of the accepting court, the trial division, the trial procedure, the case serial number and the case type.

The text parsing technique may include at least one of regular expressions, machine learning, and natural language processing. It is understood that any technique capable of parsing a legal document to obtain the case information in it can be regarded as a text parsing technique in the embodiment of the present invention.

Alternatively, as shown in fig. 2, in another method for obtaining a case number of a legal document according to an embodiment of the present invention, step S100 may include:

S110, obtaining multiple types of case information from the legal document by using regular expressions that match the case information.

Optionally, in the embodiment of the present invention, a regular expression that matches a given type of case information may be preset, and the case information that satisfies the regular expression is then extracted from the legal document.
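As a minimal sketch of step S110, the following Python snippet extracts a few types of case information with preset regular expressions. The patterns, the field names and the sample sentence are all assumptions made for illustration; the source does not specify the actual expressions, and real documents would need patterns tuned to their layout.

```python
import re

# Hypothetical patterns (assumptions): each one captures the value in group 1.
PATTERNS = {
    "accepting_court": re.compile(r"([\u4e00-\u9fa5]+?人民法院)"),
    "case_type": re.compile(r"(民事|刑事|行政)(?:判决书|裁定书)"),
    "filing_date": re.compile(r"(\d{4}年\d{1,2}月\d{1,2}日)立案"),
}


def extract_case_info(text):
    """Return the first match of each pattern, or None if a pattern does not match."""
    info = {}
    for name, pattern in PATTERNS.items():
        match = pattern.search(text)
        info[name] = match.group(1) if match else None
    return info


# Made-up sample sentence, not taken from a real document.
sample = "广州市白云区人民法院民事判决书。本院于2014年7月4日立案后,依法进行了审理。"
info = extract_case_info(sample)
print(info)
```

Keeping one pattern per field makes it easy to add or replace an expression without touching the extraction loop.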

Optionally, the embodiment of the present invention may label the case information in a plurality of legal texts and use the labeled legal texts as training texts. Machine learning is then performed on the training texts using a convolutional neural network to obtain a trained case information acquisition model. The embodiment of the invention can parse a target legal text with the trained case information acquisition model to obtain the case information of the target legal text. Specifically, an embodiment of the present invention provides a method for generating a case information acquisition model, as shown in fig. 3, where the method may include:

S010, obtaining a training text labeled with case information, wherein the training text is a legal document.

S020, performing word segmentation on the training text to obtain a word sequence.

Specifically, the embodiment of the present invention may use at least one of a plurality of word segmentation tools to obtain the word sequence. The word segmentation tools may include the Harbin Institute of Technology LTP, jieba, and the like. In addition to the word sequence, the embodiment of the invention can identify the part of speech of each word to obtain a part-of-speech sequence.

S030, obtaining a matrix formed by the vocabulary vectors of the words in the word sequence.

Specifically, for each word in the word sequence, the embodiment of the present invention may obtain the word vector and the part-of-speech vector of the word and concatenate them into the vocabulary vector of the word; the vocabulary vectors are then arranged in the order in which the words appear in the word sequence to obtain the matrix.

The embodiment of the invention can obtain the word vector using a word vector technique. If a word is not in the vocabulary of the word vectors, it is represented by a designated preset word vector.

The embodiment of the invention can represent each part of speech with a random vector of a fixed dimensionality, so that the part of speech is turned into a feature. For example, for a total of 30 parts of speech [a1, a2, …, a30], a1 may be represented by vector A1, a2 by vector A2, and so on. The dimensionality of A1, A2, etc. is a specified fixed value, for example 20, and each component is a randomly generated decimal close to 0.

After the word vector and the part-of-speech vector are obtained, they are concatenated to form the vectorized representation of the word, namely the vocabulary vector. The dimension of the vocabulary vector is the dimension of the word vector plus the dimension of the part-of-speech vector. A vocabulary vector is obtained for each word in the legal text to be processed, and the vocabulary vectors of all the words are then stacked together to form a matrix.
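The construction of the matrix can be sketched as follows, using only the Python standard library. The dimensions, the tiny lookup tables and the zero vector used for out-of-vocabulary words are assumptions chosen to keep the example small; in practice the word vectors would come from a trained embedding and the word list and tags from a segmentation tool.

```python
import random

WORD_DIM = 4  # assumed word-vector dimension (real embeddings are much larger)
POS_DIM = 3   # assumed part-of-speech dimension (the text gives 20 as an example)

rng = random.Random(0)


def small_random_vector(dim):
    """Random vector whose components are decimals close to 0."""
    return [rng.uniform(-0.05, 0.05) for _ in range(dim)]


# Hypothetical lookup tables (assumptions for illustration).
word_vectors = {
    "法院": small_random_vector(WORD_DIM),
    "受理": small_random_vector(WORD_DIM),
}
pos_vectors = {"n": small_random_vector(POS_DIM), "v": small_random_vector(POS_DIM)}
UNKNOWN_WORD = [0.0] * WORD_DIM  # preset vector for words missing from the vocabulary


def build_matrix(words, tags):
    """Concatenate word and part-of-speech vectors per word, keeping word order."""
    rows = []
    for word, tag in zip(words, tags):
        rows.append(word_vectors.get(word, UNKNOWN_WORD) + pos_vectors[tag])
    return rows


matrix = build_matrix(["法院", "受理", "案件"], ["n", "v", "n"])
print(len(matrix), len(matrix[0]))  # rows = number of words, columns = WORD_DIM + POS_DIM
```

Each row has WORD_DIM + POS_DIM components, matching the statement that the vocabulary vector's dimension is the sum of the two dimensions.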

S040, performing machine learning on the matrix and the labeled case information to generate a case information acquisition model, wherein the input of the case information acquisition model is a legal document and the output of the case information acquisition model is case information.

In practical applications, the embodiment of the invention can train the case information acquisition model using deep learning frameworks such as TensorFlow, MXNet and PyTorch.

Optionally, as shown in fig. 4, in another method for obtaining a case number of a legal document according to an embodiment of the present invention, step S100 may include:

S120, determining the filing date of the legal document from the case file of the legal document.

In practice, the case number compilation rule that applies to a legal document may differ depending on its filing date. For example, legal documents filed after January 1, 2016 follow a different case number compilation rule from those filed before January 1, 2016. The rule for legal documents filed before January 1, 2016 is "filing year + accepting court + trial division + trial procedure + case serial number", and the rule for legal documents filed after January 1, 2016 is "filing year + accepting court + case type + case serial number".

To illustrate the difference between the two rules: under the rule for legal documents filed before January 1, 2016, the case number of the first first-instance civil case accepted in 2016 by the Baiyun District People's Court of Guangzhou would be (2016)穗云法行初字第1号. Under the rule for legal documents filed after January 1, 2016, the case number of the same case would be (2016)粤0111民初1号.

The embodiment of the invention can determine the filing date of the legal document from the case file of the legal document using a text parsing technique.

S130, determining whether the filing date of the legal document is earlier than a preset date, and if so, executing the step S131; otherwise, step S132 is executed.

The preset date may be January 1, 2016. Of course, other dates are also possible.

The case number compilation rule for legal documents filed before the preset date differs from the rule for legal documents filed after the preset date. Therefore, the embodiment of the invention can determine the case number compilation rule that applies to a legal document from its filing date, and then determine from that rule which case information needs to be obtained.

S131, parsing the legal document to obtain case information carried in the legal document that comprises the accepting court, the trial division, the trial procedure and the case serial number.

S132, parsing the legal document to obtain case information carried in the legal document that comprises the accepting court, the case type and the case serial number.

The embodiment of the invention can determine the filing year of the legal document from the determined filing date, and can determine from the filing date the case number compilation rule that applies to the legal document. According to that rule, the case information other than the filing date that the rule requires can be obtained from the legal document. For example, when the filing date of a legal document is determined to be July 4, 2014, the embodiment of the present invention may determine the accepting court, the trial division, the trial procedure and the case serial number of the legal document. When the filing date of a legal document is determined to be July 4, 2018, the embodiment of the invention may determine the accepting court, the case type and the case serial number of the legal document.
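The branch in steps S130 to S132 can be sketched in Python as below. The field names are hypothetical labels introduced for this example; the preset date is the January 1, 2016 example given in the text.

```python
from datetime import date

PRESET_DATE = date(2016, 1, 1)  # example preset date from the text; other dates are possible

# Hypothetical field names (assumptions) for the two sets of case information.
FIELDS_BEFORE = ("accepting_court", "trial_division", "trial_procedure", "serial_number")
FIELDS_FROM = ("accepting_court", "case_type", "serial_number")


def required_fields(filing_date, preset_date=PRESET_DATE):
    """Return the case information to extract for a document with this filing date."""
    if filing_date < preset_date:  # S130: is the filing date earlier than the preset date?
        return FIELDS_BEFORE       # S131
    return FIELDS_FROM             # S132


print(required_fields(date(2014, 7, 4)))
print(required_fields(date(2018, 7, 4)))
```

Because the comparison is "earlier than", a document filed exactly on the preset date falls into the second branch.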

S200, determining the case number characters corresponding to each type of case information obtained.

It is understood that the case number, as the identifier of the legal document, is not a direct concatenation of the case information but a combination of the case number characters corresponding to the case information. The case number characters corresponding to a piece of case information may be a recognizable abbreviation of that information. For example, if the accepting court obtained from the legal document is the Baiyun District People's Court of Guangzhou, the corresponding case number characters may be 穗云法. It is understood that the case number characters may be the characters actually used by the court, or characters defined by those skilled in the art according to their own needs.

Optionally, as shown in fig. 5, in another method for obtaining a case number of a legal document according to an embodiment of the present invention, step S200 may include:

S210, determining the case number characters corresponding to the obtained case information from a preset standard dictionary table, wherein the preset standard dictionary table records corresponding case information and case number characters.

Specifically, legal documents filed after January 1, 2016 and legal documents filed before January 1, 2016 follow different case number compilation rules. Therefore, in the embodiment of the present invention, different standard dictionary tables can be preset for the two groups of legal documents.

The embodiment of the invention can determine, according to the determined filing date of the legal document, the preset standard dictionary table that applies to the legal document, and then look up in that table the case number characters corresponding to each piece of case information. The standard dictionary table may be as shown in Table 1, which gives example entries for the legal document of the first first-instance civil case accepted in 2014 by the Baiyun District People's Court of Guangzhou.

TABLE 1

It should be understood that Table 1 shows only part of the case information and case number characters in the preset standard dictionary table for legal documents filed before January 1, 2016; in practical applications, all corresponding case information and case number characters may be recorded in the preset standard dictionary table.
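A dictionary-table lookup of this kind can be sketched in Python with nested dictionaries. The table contents below are illustrative assumptions, not a reproduction of Table 1, and the field names are hypothetical; a real table would record every court, division, procedure and case type.

```python
from datetime import date

PRESET_DATE = date(2016, 1, 1)

# Illustrative entries only (assumptions).
TABLE_BEFORE = {
    "accepting_court": {"广州市白云区人民法院": "穗云法"},
    "trial_division": {"民事审判庭": "民", "行政审判庭": "行"},
    "trial_procedure": {"一审": "初"},
}
TABLE_AFTER = {
    "accepting_court": {"广州市白云区人民法院": "粤0111"},
    "case_type": {"民事一审": "民初"},
}


def select_table(filing_date):
    """Pick the dictionary table that applies to the filing date."""
    return TABLE_BEFORE if filing_date < PRESET_DATE else TABLE_AFTER


def lookup_characters(case_info, table):
    """Map each piece of case information to its case number characters."""
    chars = {}
    for field, value in case_info.items():
        if field not in table or value not in table[field]:
            raise KeyError(f"no case number characters recorded for {field}={value}")
        chars[field] = table[field][value]
    return chars


info = {
    "accepting_court": "广州市白云区人民法院",
    "trial_division": "行政审判庭",
    "trial_procedure": "一审",
}
chars = lookup_characters(info, select_table(date(2014, 7, 4)))
print(chars)
```

Raising an error on a missing entry, rather than guessing, keeps an incomplete table from silently producing a wrong case number.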

S300, combining at least some of the determined case number characters into the case number of the legal document according to the case number compilation rule that matches the filing date of the legal document.

The embodiment of the invention can combine at least some of the determined case number characters into the case number of the legal document in the order specified by the case number compilation rule that matches the filing date of the legal document. For example, if the case number characters determined for the legal document of the first first-instance civil case accepted in 2014 by the Baiyun District People's Court of Guangzhou are 2014, 穗云法, 行, 初 and 字第1号, and the case number compilation rule that applies to the legal document is "filing year + accepting court + trial division + trial procedure + case serial number", then the case number of the legal document is (2014)穗云法行初字第1号.
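Step S300 can be sketched in Python as a function that picks the template matching the filing date and fills it with the determined characters. The field names, the literal markers 字第 and 号 in the templates, and the sample values are assumptions for illustration; the source only fixes the order of the components.

```python
from datetime import date

PRESET_DATE = date(2016, 1, 1)  # example preset date from the text


def compose_case_number(filing_date, chars):
    """Combine case number characters in the order required by the matching rule."""
    year = filing_date.year
    if filing_date < PRESET_DATE:
        # filing year + accepting court + trial division + trial procedure + serial number
        return "({}){}{}{}字第{}号".format(
            year,
            chars["accepting_court"],
            chars["trial_division"],
            chars["trial_procedure"],
            chars["serial_number"],
        )
    # filing year + accepting court + case type + serial number
    return "({}){}{}{}号".format(
        year, chars["accepting_court"], chars["case_type"], chars["serial_number"]
    )


old_style = compose_case_number(
    date(2014, 7, 4),
    {"accepting_court": "穗云法", "trial_division": "行",
     "trial_procedure": "初", "serial_number": "1"},
)
new_style = compose_case_number(
    date(2016, 3, 1),
    {"accepting_court": "粤0111", "case_type": "民初", "serial_number": "1",
     "trial_division": "民"},  # extra entries are ignored: only some characters are used
)
print(old_style)
print(new_style)
```

The second call passes one more entry than the rule needs, which mirrors the wording that at least some, not necessarily all, of the determined characters are combined.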

The method for obtaining the case number of a legal document provided by the embodiment of the invention can parse the legal document to obtain multiple types of case information carried in it; determine the case number characters corresponding to each type of case information obtained; and combine at least some of the determined case number characters into the case number of the legal document according to the case number compilation rule that matches the filing date of the legal document. Because the case number is derived from the case information in the legal document and from the compilation rule that applies to that document, it can be obtained automatically and accurately, which improves the efficiency of entering case numbers of legal documents.

Optionally, as shown in fig. 6, another method for obtaining a case number of a legal document provided in an embodiment of the present invention may further include:

S400, comparing the combined case number of the legal document with the initial case number corresponding to the legal document, and, when the two are inconsistent, changing the initial case number to the combined case number.

The initial case number can be stored in a database, and the user can retrieve the legal document corresponding to the initial case number according to the initial case number.

In the database, the embodiment of the invention can use the case number as the identifier of the legal document and establish a correspondence between the case number and the legal document, so that a user can look up the legal document by its case number. For ease of distinction, the embodiment of the present invention refers to the case number stored in the database as the identifier of a legal document as the initial case number.

It should be noted that, in the embodiment of the present invention, changing the initial case number to the combined case number means changing the initial case number stored in the database for the legal document, not the case number written on the legal document itself.

The embodiment of the invention can obtain, from the database storing the legal documents, the initial case number corresponding to a given legal document and compare it with the case number obtained for that document by the method provided by the embodiment of the invention, in order to judge whether the initial case number is wrong. When the two are inconsistent, the initial case number is determined to be wrong and is changed to the case number obtained by the method.

It can be understood that the initial case number of a legal document in the database may have been lost, in which case it may be empty. The embodiment of the invention can still compare the initial case number with the case number obtained by the method provided by the embodiment of the invention and, if the two are inconsistent, change the initial case number to the case number obtained by the method.

Optionally, in the embodiment of the present invention, when it is determined that the initial case number corresponding to a certain legal document in the database is empty, the case number of the legal document obtained by the method for obtaining the case number of the legal document provided by the embodiment of the present invention is directly stored in the database as the case number corresponding to the legal document.
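The comparison and correction described above might be sketched as follows. This is only an illustration under stated assumptions: the database is modeled as an in-memory dictionary, and the function name `correct_initial_case_number`, the document identifiers, and the placeholder case numbers are all hypothetical.

```python
from typing import Dict, Optional


def correct_initial_case_number(
    database: Dict[str, Optional[str]], document_id: str, combined: str
) -> bool:
    """Overwrite the stored initial case number when it differs from `combined`.

    Returns True if the database entry was modified.
    """
    initial = database.get(document_id)  # None models a lost (empty) case number
    if initial == combined:
        return False  # the stored value already agrees; nothing to correct
    # Only the database entry changes; the text of the document is untouched.
    database[document_id] = combined
    return True


# Hypothetical records: one consistent, one lost, one wrong.
db = {"doc-1": "CASE-001", "doc-2": None, "doc-3": "CASE-003"}
changed = [
    correct_initial_case_number(db, "doc-1", "CASE-001"),  # consistent
    correct_initial_case_number(db, "doc-2", "CASE-002"),  # lost, filled in
    correct_initial_case_number(db, "doc-3", "CASE-030"),  # wrong, corrected
]
```

In this sketch an empty initial case number needs no special branch, because an empty value never equals a combined case number and is therefore overwritten like any other inconsistency.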

The method provided by the embodiment of the invention can verify whether the initial case number stored in the database for a legal document is correct, and also makes it convenient for relevant personnel to correct the initial case numbers of legal documents whose case numbers are lost or wrong, thereby improving the efficiency of correcting initial case numbers in the database.

Corresponding to the above method embodiment, the embodiment of the invention provides an apparatus for obtaining the case number of a legal document. Its structure is shown in Fig. 7, and it may include: a case information obtaining unit 100, a case number character determining unit 200, and a case number obtaining unit 300.

The case information obtaining unit 100 is configured to parse a legal document to obtain a plurality of types of case information carried in the legal document.

The legal documents may include civil, administrative, and criminal judgment documents, as well as arbitration legal documents. The case information obtaining unit 100 may parse the legal document with a text parsing technique to obtain a plurality of types of case information in the legal document, where the case information may include at least one of a case-handling court, a trial department, a trial procedure, a case number, and a case type.

The text parsing technique may include at least one of regular expressions, machine learning, and natural language processing. It can be understood that any technique capable of parsing a legal document to obtain the case information in it can be regarded as a text parsing technique in the embodiment of the invention.

Optionally, the case information obtaining unit 100 may be specifically configured to obtain a plurality of kinds of case information from the legal documents by using regular expressions matched with the case information.

Optionally, for each type of case information, the case information obtaining unit 100 may use a preset regular expression matched with that type to obtain, from the legal document, the text that satisfies the regular expression.
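As a rough illustration of regular-expression extraction, the sketch below applies one preset pattern per type of case information. The two patterns, the field names `court` and `filing_date`, and the sample text are assumptions made for this example; the source does not give the actual expressions, and real documents would need more robust ones.

```python
import re
from typing import Dict

# Hypothetical patterns, one per type of case information.
CASE_INFO_PATTERNS = {
    # A line consisting only of a court name ending in "人民法院".
    "court": re.compile(r"^(\S+人民法院)$", re.MULTILINE),
    # A date immediately followed by "立案" (case filing).
    "filing_date": re.compile(r"(\d{4})年(\d{1,2})月(\d{1,2})日立案"),
}


def extract_case_info(text: str) -> Dict[str, str]:
    """Return the first match of each preset pattern found in `text`."""
    info = {}
    for name, pattern in CASE_INFO_PATTERNS.items():
        match = pattern.search(text)
        if match:
            # Several groups (year, month, day) are joined with "-";
            # a single group is returned unchanged.
            info[name] = "-".join(match.groups())
    return info


# Made-up sample text, for illustration only.
sample = "北京市海淀区人民法院\n民事判决书\n本院于2015年3月2日立案后,依法进行了审理。"
case_info = extract_case_info(sample)
```

Types of case information whose pattern finds no match are simply absent from the result, so a caller can detect which fields still need another parsing technique.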

Optionally, the embodiment of the invention may label the case information in a plurality of legal texts and use the labeled legal texts as training texts. Machine learning is then performed on the training texts with a convolutional neural network to obtain a trained case information obtaining model. The embodiment of the invention can parse a target legal text with the trained case information obtaining model to obtain the case information of the target legal text. Specifically, the embodiment of the invention provides an apparatus for generating a case information obtaining model, whose structure may be as shown in Fig. 8, including: a training text obtaining unit 10, a vocabulary sequence obtaining unit 20, a matrix obtaining unit 30, and a model generating unit 40.

The training text obtaining unit 10 is used for obtaining a training text marked with case information, and the training text is a legal document.

The vocabulary sequence obtaining unit 20 is configured to perform word segmentation on the training text to obtain a vocabulary sequence.

The matrix obtaining unit 30 is configured to obtain a matrix formed by the vocabulary vectors of the words in the vocabulary sequence.

The model generating unit 40 is configured to perform machine learning on the matrix and the labeled case information to generate a case information obtaining model, where the input of the case information obtaining model is a legal document and its output is case information.
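The data preparation performed by units 20 and 30 might look like the sketch below. It is a simplified illustration, not the patented implementation: whitespace splitting stands in for a real word segmenter, the three-dimensional `VOCAB_VECTORS` table is a toy assumption in place of trained vocabulary vectors, and the convolutional network training of unit 40 is omitted.

```python
from typing import Dict, List

# Toy vocabulary vectors (assumed); a real system would use trained embeddings.
VOCAB_VECTORS: Dict[str, List[float]] = {
    "civil": [1.0, 0.0, 0.0],
    "first": [0.0, 1.0, 0.0],
    "instance": [0.0, 0.0, 1.0],
}
UNKNOWN = [0.0, 0.0, 0.0]  # vector used for words missing from the table


def to_vocabulary_sequence(text: str) -> List[str]:
    """Whitespace split, used here as a stand-in for word segmentation."""
    return text.lower().split()


def to_matrix(words: List[str]) -> List[List[float]]:
    """Stack one vocabulary vector per word, in sequence order."""
    return [VOCAB_VECTORS.get(word, UNKNOWN) for word in words]


words = to_vocabulary_sequence("Civil first instance judgment")
matrix = to_matrix(words)  # 4 rows (one per word), 3 columns
```

The resulting matrix, together with the labeled case information, is what a model generating step would consume as training input.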

Optionally, the case information obtaining unit 100 may include: a filing date determining subunit, a filing date comparing subunit, a first case information obtaining subunit, and a second case information obtaining subunit.

The filing date determining subunit is configured to determine the filing date of the legal document from the case file of the legal document.

The filing date determining subunit may determine the filing date from the case file of the legal document with a text parsing technique.

The filing date comparing subunit is configured to determine whether the filing date of the legal document is earlier than a preset date; if so, it triggers the first case information obtaining subunit; otherwise, it triggers the second case information obtaining subunit.

The first case information obtaining subunit is configured to parse the legal document to obtain the case information carried in it, including the case-handling court, the trial department, the trial procedure, and the case number.

The second case information obtaining subunit is configured to parse the legal document to obtain the case information carried in it, including the case-handling court, the case type, and the case number.

The case number compiling rule for legal documents filed before the preset date differs from the rule for legal documents filed after it. Therefore, the embodiment of the invention can determine the case number compiling rule corresponding to a legal document from its filing date, and then determine from that rule which case information needs to be obtained.

The embodiment of the invention can also determine the filing year of the legal document from the determined filing date. Once the case number compiling rule has been determined from the filing date, the case information required by that rule, other than the filing date itself, can be obtained from the legal document.
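The date-based dispatch between the two case information obtaining subunits can be sketched as below. The concrete value of `PRESET_DATE` is an assumption chosen for illustration (the text only speaks of "a preset date"), and the field names are hypothetical labels for the case information listed above.

```python
from datetime import date
from typing import Tuple

# Assumed cut-over date; the source only refers to "a preset date".
PRESET_DATE = date(2016, 1, 1)

# Case information required under each compiling rule (hypothetical labels).
FIELDS_BEFORE_PRESET = ("court", "trial_department", "trial_procedure", "case_number")
FIELDS_FROM_PRESET = ("court", "case_type", "case_number")


def required_case_info(filing_date: date) -> Tuple[str, ...]:
    """Select which case information to parse, based on the filing date."""
    if filing_date < PRESET_DATE:
        return FIELDS_BEFORE_PRESET  # first case information obtaining subunit
    return FIELDS_FROM_PRESET  # second case information obtaining subunit


earlier = required_case_info(date(2015, 3, 2))
later = required_case_info(date(2019, 9, 27))
```

A document filed exactly on the preset date is not "earlier than" it, so in this sketch it is handled by the second subunit.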

The case number character determining unit 200 is configured to determine the case number characters corresponding to each item of obtained case information.

It can be understood that the case number, which serves as the identifier of the legal document, is not a direct concatenation of the case information but a combination of the case number characters corresponding to the case information. The case number characters corresponding to an item of case information may be a recognizable abbreviation of that information.

Optionally, the case number character determining unit 200 is specifically configured to determine, from a preset standard dictionary table, case number characters corresponding to the obtained case information, where the preset standard dictionary table records corresponding case information and case number characters.

The embodiment of the invention can determine the preset standard dictionary table corresponding to the legal document according to the determined filing date of the legal document, and then inquire the case number characters corresponding to the case information in the preset standard dictionary table according to the case information.
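A dictionary-table lookup of this kind might be sketched as follows. The table contents, the rule names `old_rule` and `new_rule`, and the court and department names are invented placeholders; passing through values that have no table entry (such as a numeric case number) is likewise an assumption of this sketch.

```python
from typing import Dict

# Hypothetical standard dictionary tables, one per case number compiling rule.
STANDARD_DICTIONARY_TABLES: Dict[str, Dict[str, str]] = {
    "old_rule": {
        "Example District Court": "EX",
        "Civil Division": "C",
        "First instance": "F",
    },
    "new_rule": {
        "Example District Court": "0101",
        "Civil first instance": "CF",
    },
}


def lookup_case_number_characters(
    table_name: str, case_info: Dict[str, str]
) -> Dict[str, str]:
    """Map each item of case information to its case number characters."""
    table = STANDARD_DICTIONARY_TABLES[table_name]
    # Assumption: values without a table entry are kept unchanged.
    return {field: table.get(value, value) for field, value in case_info.items()}


characters = lookup_case_number_characters(
    "old_rule",
    {
        "court": "Example District Court",
        "trial_department": "Civil Division",
        "case_number": "123",
    },
)
```

Keeping one table per compiling rule mirrors the text above, where the table is selected by the filing date before any lookup takes place.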

The case number obtaining unit 300 is configured to combine at least part of the determined case number characters into the case number of the legal document according to a case number compiling rule matched with the filing date of the legal document.

The case number obtaining unit 300 may combine at least part of the determined case number characters into the case number of the legal document in the order specified by the case number compiling rule matched with the filing date of the legal document.
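One way to express "combining in the order specified by the rule" is a format template per rule, as in the sketch below. The two templates and all character values are hypothetical and do not reproduce any official case number format; they only show how a rule can fix the order and use just part of the available characters.

```python
from typing import Dict

# Hypothetical compiling rules: each template fixes the order of the parts.
CASE_NUMBER_RULES: Dict[str, str] = {
    "old_rule": "({year}){court}{trial_department}{trial_procedure}-{case_number}",
    "new_rule": "({year}){court}{case_type}{case_number}",
}


def combine_case_number(rule_name: str, characters: Dict[str, str]) -> str:
    """Combine case number characters in the order the rule specifies.

    Characters the rule does not mention are ignored, so only part of the
    determined characters may end up in the case number.
    """
    return CASE_NUMBER_RULES[rule_name].format(**characters)


combined = combine_case_number(
    "new_rule",
    {
        "year": "2019",
        "court": "0101",
        "case_type": "CF",
        "case_number": "123",
        "trial_department": "C",  # not used by the new rule
    },
)
```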

The apparatus for obtaining the case number of a legal document provided by the embodiment of the invention can parse the legal document to obtain a plurality of types of case information carried in it; determine the case number characters corresponding to each item of obtained case information; and combine at least part of the determined case number characters into the case number of the legal document according to a case number compiling rule matched with the filing date of the legal document. Using the case information in the legal document and the case number compiling rule corresponding to the legal document, the embodiment of the invention can obtain the case number automatically and accurately, which improves the efficiency of entering case numbers of legal documents.

Optionally, as shown in Fig. 9, another apparatus for obtaining the case number of a legal document according to the embodiment of the invention may further include: a case number modification unit 400.

The case number modification unit 400 is configured to compare the case number obtained by combination with the initial case number corresponding to the legal document, and to modify the initial case number into the case number obtained by combination when the two are inconsistent.

The initial case number may be stored in a database, and a user may retrieve the corresponding legal document by its initial case number.

The embodiment of the invention can use the case number as the identifier of a legal document in the database, thereby establishing a correspondence between the case number and the legal document. A user can then search for a legal document by its case number. For ease of distinction, the embodiment of the invention refers to the identifier stored in the database for a legal document as the initial case number.

It should be noted that, in the embodiment of the invention, modifying the initial case number into the case number obtained by combination means modifying the initial case number stored in the database for the legal document, not the case number printed on the legal document itself.

An embodiment of the present invention provides a storage medium, on which a program is stored, the program implementing any one of the above-mentioned methods for obtaining a case number of a legal document when the program is executed by a processor.

As shown in fig. 10, an electronic device 500 according to an embodiment of the present invention includes at least one processor 510, at least one memory 520 connected to the processor 510, and a bus 530; the processor 510 and the memory 520 are in communication with each other through the bus 530; the processor 510 is configured to call program instructions in the memory 520 to perform any of the above-described methods of obtaining a case number of a legal document.

The device for obtaining the case number of the legal document comprises a processor and a memory, wherein the case information obtaining unit 100, the case number character determining unit 200, the case number obtaining unit 300 and the like are stored in the memory as program units, and the processor executes the program units stored in the memory to realize corresponding functions.
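The arrangement of units 100, 200, and 300 as program units executed in sequence could be sketched as below. The class name `CaseNumberApparatus` and the three toy callables passed in at the end are hypothetical stand-ins; they only demonstrate how the units chain together, not how each unit works internally.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class CaseNumberApparatus:
    """Chains three program units, mirroring units 100, 200, and 300."""

    obtain_case_info: Callable[[str], Dict[str, str]]  # unit 100
    determine_characters: Callable[[Dict[str, str]], Dict[str, str]]  # unit 200
    obtain_case_number: Callable[[Dict[str, str]], str]  # unit 300

    def run(self, document_text: str) -> str:
        info = self.obtain_case_info(document_text)
        characters = self.determine_characters(info)
        return self.obtain_case_number(characters)


# Toy stand-ins for the three units, for demonstration only.
apparatus = CaseNumberApparatus(
    obtain_case_info=lambda text: dict(part.split("=") for part in text.split(";")),
    determine_characters=lambda info: {k: v.upper() for k, v in info.items()},
    obtain_case_number=lambda chars: chars["court"] + chars["type"] + chars["serial"],
)
result = apparatus.run("court=ab;type=c;serial=7")
```

Injecting the units as callables keeps each one replaceable, for example swapping a regular-expression parser for a trained model without touching the other two units.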

The processor includes a kernel, and the kernel calls the corresponding program unit from the memory. One or more kernels may be provided, and the case number of the legal document is obtained automatically and accurately by adjusting kernel parameters.

The electronic device 500 herein may be a server, a PC, a tablet (PAD), a mobile phone, or the like.

The present application further provides a computer program product which, when executed on a data processing device, is adapted to execute a program initialized with the following method steps:

analyzing a legal document to obtain a plurality of case information carried in the legal document;

determining case number characters corresponding to the obtained case information;

and combining at least part of the determined case number characters into the case number of the legal document according to a case number compiling rule matched with the filing date of the legal document.

Optionally, the parsing the legal document to obtain various case information carried in the legal document includes:

determining the filing date of the legal document from the case file of the legal document;

determining whether the filing date of the legal document is earlier than a preset date; if so, parsing the legal document to obtain the case information carried in it, including the case-handling court, the trial department, the trial procedure, and the case number; otherwise, parsing the legal document to obtain the case information carried in it, including the case-handling court, the case type, and the case number.

Optionally, the determining the case number characters corresponding to the obtained case information includes:

and determining case number characters corresponding to the obtained case information from a preset standard dictionary table, wherein the preset standard dictionary table records corresponding case information and case number characters.

Optionally, the parsing the legal document to obtain various case information carried in the legal document includes:

and obtaining a plurality of types of case information from the legal documents by using the regular expressions matched with the case information.

Optionally, the method further includes:

and comparing the case number of the legal document obtained by combination with the initial case number corresponding to the legal document, and modifying the initial case number into the case number of the legal document obtained by combination when the case number of the legal document is inconsistent with the initial case number.

The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

In a typical configuration, a device includes one or more processors (CPUs), memory, and a bus. The device may also include input/output interfaces, network interfaces, and the like.

The memory may include computer-readable media in the form of volatile memory, such as random access memory (RAM), and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM), and the memory includes at least one memory chip. The memory is an example of a computer-readable medium.

Computer-readable media, including both permanent and non-permanent, removable and non-removable media, may implement information storage by any method or technology. The information may be computer-readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, compact disc read-only memory (CD-ROM), digital versatile discs (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information accessible by a computing device. As defined herein, computer-readable media do not include transitory computer-readable media such as modulated data signals and carrier waves.

It should also be noted that the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.

As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

The above are merely examples of the present application and are not intended to limit the present application. Various modifications and changes may occur to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.
