Method for constructing virtual area for intellectual property statistical data

文档序号:190514 发布日期:2021-11-02 浏览:31次 中文

阅读说明:本技术 一种面向知识产权统计数据构建虚拟区域的方法 (Method for constructing virtual area for intellectual property statistical data ) 是由 李娜 杨洪志 于 2021-07-30 设计创作,主要内容包括:本发明涉及一种面向知识产权统计数据构建虚拟区域的方法,属于数据分析技术领域,本方法基于地理坐标的动态微分录入方法,首先通过将区域的数据录入系统;其次利用电子地图技术根据提供的区域四至范围,把区域范围转化为电子地图经纬度坐标;再次利用地理坐标解析技术解析专利地址得到专利地址的经纬度信息;再次遍历专利数据利用算法计算专利的经纬度是否处于区域经纬度范围内,如果在,标识专利属于该区域。最后标识完成后,再汇总计算虚拟区域内的专利量,具有统计准确,效率高的特点。(The invention relates to a method for constructing a virtual area by statistical data facing intellectual property, which belongs to the technical field of data analysis; secondly, converting the area range into longitude and latitude coordinates of the electronic map by utilizing the electronic map technology according to the provided area range from four to four; analyzing the patent address by using a geographic coordinate analysis technology to obtain longitude and latitude information of the patent address; and traversing the patent data again, and calculating whether the longitude and latitude of the patent are in the longitude and latitude range of the area by using an algorithm, and if so, identifying that the patent belongs to the area. And finally, after the identification is finished, summarizing and calculating the patent amount in the virtual area, and the method has the characteristics of accurate statistics and high efficiency.)

1. A method for constructing a virtual area facing intellectual property statistical data is a dynamic differential input method based on geographic coordinates, and is characterized in that: the method comprises the steps of regional data entry, regional latitude and longitude conversion, patent coordinate analysis, regional range confirmation and quantity summarization, and specifically comprises the following steps:

analyzing the geographic coordinates of the patent, namely inputting basic data of a region, wherein the basic data comprises the contents of a region name, a region type, a name of a city to which the region belongs, a name of a district and a county, a provincial level four-to-range, a city level four-to-range and the like;

a step of converting the longitude and latitude of the area, which is to convert the area range into the longitude and latitude coordinates of the electronic map by utilizing the electronic map technology according to the range from four to four provided by the electronic map;

and a patent coordinate analyzing step, namely analyzing the address information of the patent into specific longitude and latitude information.

Confirming the area range, traversing patent data and calculating whether the longitude and latitude of the patent are in the area longitude and latitude range;

quantity summarizing step: and summarizing and calculating the patent amount in the designated area, and counting.

2. The method of claim 1, wherein the method comprises the following steps: the geographic coordinate analyzing step comprises the following steps:

s1-1: a region entry system is constructed, the region entry system comprises a name storage unit, a four-to-range storage unit and a region data storage unit, region names, region types, city names and district name information are entered in the name storage unit, and provincial four-to-range and city four-to-range information are entered in the four-to-range storage unit;

s1-2: and saving the information in the S1-1, corresponding the area name and the city name to the provincial level four-to-range, and saving the area name and the city name into an area data storage unit to be used as an area data set M.

3. The method of claim 2, wherein the method comprises the following steps: the step of converting the longitude and latitude of the area comprises the following steps of S2-1: coordinate point marking, namely taking an electronic map as a substrate, establishing a coordinate system according to longitude and latitude, selecting coordinate points pairwise according to the electronic map, connecting the coordinate points to form straight line segments, establishing a plurality of connecting line segments which are connected end to end until a closed area is formed, recording Z (Z1, Z2, Z3 and …) for an area containing a plurality of closed area subsets and having the same name, and storing the information into an area data set M;

s2-2: combining the set Z with information in the region data set M to realize the combination of the range from four to four and coordinates;

s2-3: and saving the data in the S2-2 in a large text field.

4. The method of claim 3, wherein the method comprises the following steps: where latitude and longitude are represented by "X, Y" and each enclosed area is represented by a set of vertices of line segments.

5. The method of claim 3, wherein the method comprises the following steps: the patent coordinate analyzing step comprises the following steps:

s3-1: setting a patent data storage unit, inquiring patent data to be analyzed from a patent database, and storing the patent data into the patent data storage unit as a patent data set Ps;

s3-2: extracting address information of each patent from the patent data set Ps, calling geocoding of an electronic map through WebAPI, and combining coordinate information of the area data set M to obtain longitude and latitude information (X) corresponding to each patentP,YP);

S3-3: and setting a patent coordinate storage unit, and adding the longitude and latitude information corresponding to each patent in the S3-2 into the patent coordinate storage unit.

6. The method of claim 5, wherein the method comprises the following steps: the area range confirming step includes:

s4-1: extracting coordinate information of each patent from the patent coordinate storage unit;

s4-2: and calculating whether the corresponding coordinate information of each patent is in the range of the designated area, and if so, marking the area to uniquely identify the patent P.

7. The method of claim 6, wherein the method comprises the following steps: in the S4-2, the specific calculation method includes:

step 1: creating a counter C for recording the number of intersections;

step 2: traversing each coordinate point of the counted area, and taking two coordinate points A (X)1,Y1) And B (X)2,Y2);

step 3: for the patent coordinate P (X)P,YP) If Y is judgedP< min (Y1, Y2) or YPMax (Y1, Y2), then the patent does not belong to the specified region; is judged asFruit XP< min (X1, X2) or XPMax (X1, X2), then the patent does not belong to the specified region, otherwise the following is performed;

step 4: point P (X)P,YP) And A (X)1,Y1)、B(X2,Y2) The two points are respectively connected, if the slopes of the two line segments are equal, the point P is on the AB boundary line and is in the area needing to be counted;

step 5-calculate the X coordinate $ X of the intersection point,

$X=[(YP-Y1)×(X1-X2)/(Y1-Y2)]+X1

if $ X>XPIf the intersection point is judged to be on the right side of the patent coordinate P, the counter C executes +1 operation;

step 6: repeating step2-step5, operating the intersection points of all the connecting line segments, if C is an even number, the patent is not in the statistical region, otherwise, the patent is in the region;

step 7: and storing the patent with the unique area identifier into the area storage unit by combining the area data set M.

8. The method of claim 6, wherein the method comprises the following steps: the quantity summarizing step comprises:

s5-1: inquiring patent data with unique identification of the zone;

s5-2: counting the number of patents uniquely represented by each region;

s5-3: the number of patents in each associated area is obtained.

Technical Field

The invention belongs to the technical field of data analysis, and relates to a method for constructing a virtual area, in particular to a method for constructing a virtual area by using intellectual property statistical data.

Background

In recent years, the protection of intellectual property rights is continuously promoted in China, but the method is not used for division of administrative areas, and no special administrative division exists in a specific area to set the range of the area, such as high and new areas in some places, industrial parks, scientific and technological development areas and the like. The intellectual property quantity can not be counted when the intellectual property quantity needs to be counted in the specific areas, the areas are used as key patent output areas, managers need to know the counting results of the patent quantity, the patent types and the like in detail, and an existing administrative area dividing or manual overlapping counting method is used, so that on one hand, data statistics is fuzzy, and reliability is low; on the one hand, the efficiency is underground, the labor cost is high, and time and labor are wasted.

Disclosure of Invention

In order to solve the problems, the invention designs a method for constructing the virtual area by aiming at the intellectual property statistical data, and the method has the characteristics of accurate statistics and high efficiency.

The specific technical scheme of the invention is as follows:

a method for constructing a virtual area facing intellectual property statistical data comprises a regional data entry step, a regional longitude and latitude conversion step, a patent coordinate analysis step, a regional range confirmation step and a quantity summarization step, and specifically comprises the following steps:

analyzing the geographic coordinates of the patent, namely inputting basic data of a region, wherein the basic data comprises the contents of a region name, a region type, a name of a city to which the region belongs, a name of a district and a county, a provincial level four-to-range, a city level four-to-range and the like;

a step of converting the longitude and latitude of the area, which is to convert the area range into the longitude and latitude coordinates of the electronic map by utilizing the electronic map technology according to the range from four to four provided by the electronic map;

and a patent coordinate analyzing step, namely analyzing the address information of the patent into specific longitude and latitude information.

Confirming the area range, traversing patent data and calculating whether the longitude and latitude of the patent are in the area longitude and latitude range;

quantity summarizing step: and summarizing and calculating the patent amount in the designated area, and counting.

The geographic coordinate analyzing step comprises the following steps:

s1-1: a region entry system is constructed, the region entry system comprises a name storage unit, a four-to-range storage unit and a region data storage unit, region names, region types, city names and district name information are entered in the name storage unit, and provincial four-to-range and city four-to-range information are entered in the four-to-range storage unit;

s1-2: and saving the information in the S1-1, corresponding the area name and the city name to the provincial level four-to-range, and saving the area name and the city name into an area data storage unit to be used as an area data set M.

The step of converting the longitude and latitude of the area comprises the steps of,

s2-1: coordinate point marking, namely taking an electronic map as a substrate, establishing a coordinate system according to longitude and latitude, selecting coordinate points pairwise according to the electronic map, connecting the coordinate points to form straight line segments, establishing a plurality of connecting line segments which are connected end to end until a closed area is formed, recording Z (Z1, Z2, Z3 and …) for an area containing a plurality of closed area subsets and having the same name, and storing the information into an area data set M;

s2-2: combining the set Z with information in the region data set M to realize the combination of the range from four to four and coordinates;

s2-3: and saving the data in the S2-2 in a large text field.

Where latitude and longitude are represented by "X, Y" and each enclosed area is represented by a set of vertices of line segments.

The patent coordinate analyzing step comprises the following steps:

s3-1: setting a patent data storage unit, inquiring patent data to be analyzed from a patent database, and storing the patent data into the patent data storage unit as a patent data set Ps;

s3-2: extracting the address information of each patent from the patent data set Ps, calling the geocode of the electronic map through a Web API (application program interface), and simultaneously combining the coordinate information of the regional data set M to obtain the longitude and latitude information (X) corresponding to each patentP,YP);

S3-3: and setting a patent coordinate storage unit, and adding the longitude and latitude information corresponding to each patent in the S3-2 into the patent coordinate storage unit.

The area range confirming step includes:

s4-1: extracting coordinate information of each patent from the patent coordinate storage unit;

s4-2: and calculating whether the corresponding coordinate information of each patent is in the range of the designated area, and if so, marking the area to uniquely identify the patent P.

In the S4-2, the specific calculation method includes:

step 1: creating a counter C for recording the number of intersections;

step 2: traversing each coordinate point of the counted area, and taking two coordinate points A (X)1,Y1) And B (X)2,Y2);

step 3: for the patent coordinate P (X)P,YP) If Y is judgedP< min (Y1, Y2) or YPMax (Y1, Y2), then the patent does not belong to the specified region; judging if XP< min (X1, X2) or XPMax (X1, X2), then the patent does not belong to the specified region, otherwise the following is performed;

step 4: point P (X)P,YP) And A (X)1,Y1)、B(X2,Y2) The two points are respectively connected, if the slopes of the two line segments are equal, the point P is on the AB boundary line and is in the area needing to be counted;

step 5-calculate the X coordinate $ X of the intersection point,

$X=[(YP-Y1)×(X1-X2)/(Y1-Y2)]+X1

if $ X>XPIf the intersection point is judged to be on the right side of the patent coordinate P, the counter C executes +1 operation;

step 6: repeating step2-step5, operating the intersection points of all the connecting line segments, if C is an even number, the patent is not in the statistical region, otherwise, the patent is in the region;

step 7: and storing the patent with the unique area identifier into the area storage unit by combining the area data set M.

The quantity summarizing step comprises:

s5-1: inquiring patent data with unique identification of the zone;

s5-2: counting the number of patents uniquely represented by each region;

s5-3: the number of patents in each associated area is obtained.

The invention has the beneficial effects that:

the method comprises the steps of firstly, inputting data of an area into a system; secondly, converting the area range into longitude and latitude coordinates of the electronic map by utilizing the electronic map technology according to the provided area range from four to four; analyzing the patent address by using a geographic coordinate analysis technology to obtain longitude and latitude information of the patent address; and traversing the patent data again, and calculating whether the longitude and latitude of the patent are in the longitude and latitude range of the area by using an algorithm, and if so, identifying that the patent belongs to the area. And finally, summarizing and calculating the patent amount in the virtual area after the identification is finished.

The method for constructing the virtual area facing to the intellectual property statistical data has the following beneficial effects:

1. the method has the advantages that the regional information is electronized, the regional electronic map system is established, the text description is converted into more visual electronic map data, and the data base is made for the region to which the patent belongs while the display is convenient.

2. The patent address is analyzed into longitude and latitude data of the electronic map, and patent distribution can be displayed on the electronic map more visually.

3. The number of patents in the area is calculated according to the area electronic map coordinates and the patent electronic map coordinates, so that the problem of calculating the number of patents in the area is solved.

Drawings

FIG. 1 is a block flow diagram of the present invention.

Detailed Description

The technical solutions of the present invention are further described in detail below with reference to specific examples and drawings, but the scope and implementation of the present invention are not limited thereto.

In a specific embodiment, as shown in fig. 1 of the specification, a method for constructing a virtual area for intellectual property statistical data includes a dynamic differential entry method based on geographic coordinates, including an area data entry step, an area longitude and latitude conversion step, a patent coordinate analysis step, an area range confirmation step and a quantity summarization step, and specifically includes:

analyzing the geographic coordinates of the patent, namely inputting basic data of a region, wherein the basic data comprises the contents of a region name, a region type, a name of a city to which the region belongs, a name of a district and a county, a provincial level four-to-range, a city level four-to-range and the like;

a step of converting the longitude and latitude of the area, which is to convert the area range into the longitude and latitude coordinates of the electronic map by utilizing the electronic map technology according to the range from four to four provided by the electronic map;

and a patent coordinate analyzing step, namely analyzing the address information of the patent into specific longitude and latitude information.

The invention aims at the condition that the number of intellectual property rights is required to be counted in the area lacking in the non-administrative division, and establishes a geographic coordinate analysis model to solve the problem of the virtual area.

Confirming the area range, traversing patent data and calculating whether the longitude and latitude of the patent are in the area longitude and latitude range;

quantity summarizing step: and summarizing and calculating the patent amount in the designated area, and counting.

The geographic coordinate analyzing step comprises the following steps:

s1-1: a region entry system is constructed, the region entry system comprises a name storage unit, a four-to-range storage unit and a region data storage unit, region names, region types, city names and district name information are entered in the name storage unit, and provincial four-to-range and city four-to-range information are entered in the four-to-range storage unit;

s1-2: and saving the information in the S1-1, corresponding the area name and the city name to the provincial level four-to-range, and saving the area name and the city name into an area data storage unit to be used as an area data set M.

The step of converting the longitude and latitude of the area comprises the steps of,

s2-1: coordinate point marking, namely taking an electronic map as a substrate, establishing a coordinate system according to longitude and latitude, selecting coordinate points pairwise according to the electronic map, connecting the coordinate points to form straight line segments, establishing a plurality of connecting line segments which are connected end to end until a closed area is formed, recording Z (Z1, Z2, Z3 and …) for an area containing a plurality of closed area subsets and having the same name, and storing the information into an area data set M;

s2-2: combining the set Z with information in the region data set M to realize the combination of the range from four to four and coordinates;

s2-3: and saving the data in the S2-2 in a large text field.

By the method, corresponding geographic information can be obtained to prepare for subsequent processes.

Where latitude and longitude are represented by "X, Y" and each enclosed area is represented by a set of vertices of line segments. The specified virtual area is enclosed by a plurality of small line segments, and when the small line segments are enough and short enough, the area of the straight line segment is continuously moved to the real area. The longer the system is in use, the higher the density of the dots being drawn, and the more accurate the area coverage.

The patent coordinate analyzing step comprises the following steps:

s3-1: setting a patent data storage unit, inquiring patent data to be analyzed from a patent database, and storing the patent data into the patent data storage unit as a patent data set Ps;

s3-2: extracting address information of each patent from the patent data set Ps, calling geocoding of an electronic map through a Web API (application program interface), and combining the coordinates of the regional data set MInformation, obtaining the longitude and latitude information (X) corresponding to each patentP,YP);

S3-3: and setting a patent coordinate storage unit, and adding the longitude and latitude information corresponding to each patent in the S3-2 into the patent coordinate storage unit.

The position and position information of each patent is obtained through the steps and can be matched with the map information in the patent coordinate analysis step.

The area range confirming step includes:

s4-1: extracting coordinate information of each patent from the patent coordinate storage unit;

s4-2: and calculating whether the corresponding coordinate information of each patent is in the range of the designated area, and if so, marking the area to uniquely identify the patent P.

In the S4-2, the specific calculation method includes:

step 1: creating a counter C for recording the number of intersections;

step 2: traversing each coordinate point of the counted area, and taking two coordinate points A (X)1,Y1) And B (X)2,Y2);

step 3: for the patent coordinate P (X)P,YP) If Y is judgedP< min (Y1, Y2) or YPMax (Y1, Y2), then the patent does not belong to the specified region; judging if XP< min (X1, X2) or XPMax (X1, X2), then the patent does not belong to the specified region, otherwise the following is performed;

step 4: point P (X)P,YP) And A (X)1,Y1)、B(X2,Y2) The two points are respectively connected, if the slopes of the two line segments are equal, the point P is on the AB boundary line and is in the area needing to be counted;

step 5-calculate the X coordinate $ X of the intersection point,

$X=[(YP-Y1)×(X1-X2)/(Y1-Y2)]+X1

if $ X>XPThen the intersection point is judged to be in the patentTo the right of coordinate P, counter C performs +1 operation;

step 6: repeating step2-step5, operating the intersection points of all the connecting line segments, if C is an even number, the patent is not in the statistical region, otherwise, the patent is in the region;

step 7: and storing the patent with the unique area identifier into the area storage unit by combining the area data set M.

The quantity summarizing step comprises:

s5-1: inquiring patent data with unique identification of the zone;

s5-2: counting the number of patents uniquely represented by each region;

s5-3: the number of patents in each associated area is obtained.

The region is uniquely represented, so that the problem of repeated statistics or missing statistics of the counted data can be avoided, and the reliability of the data is ensured.

9页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:遥感解译样本点的显示方法、装置、电子设备及存储介质

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!