Multi-die FPGA designed based on signal delay balancing technology

文档序号：1100439 发布日期：2020-09-25 浏览：8次中文

阅读说明：本技术 基于信号延迟平衡技术设计的多裸片fpga (Multi-die FPGA designed based on signal delay balancing technology ) 是由单悦尔徐彦峰范继聪张艳飞闫华于 2020-07-01 设计创作，主要内容包括：本申请公开了一种基于信号延迟平衡技术设计的多裸片FPGA,涉及FPGA技术领域,该多裸片FPGA包括基板及其上的硅连接层以及按二维堆叠方式层叠排布在硅连接层上的若干个FPGA裸片,裸片内置的硅堆叠连接点通过RDL层具有预定绕线结构的顶层金属线连接到连接点引出端,连接点引出端再通过硅连接层内的两个方向的跨裸片连线即可连接到其他裸片,实现裸片之间的二维互连通信,这种级联结构支持由多个小规模小面积的裸片级联实现大规模大面积的FPGA产品,减少加工难度,提高芯片生产良率；通过调整顶层金属线的绕线距离即能高效的实现跨裸片信号延迟平衡,加速设计时序收敛,有利于提高应用中的资源布局灵活性。(The application discloses a multi-die FPGA designed based on a signal delay balancing technology, and relates to the technical field of FPGAs, wherein the multi-die FPGA comprises a substrate, a silicon connecting layer on the substrate, and a plurality of FPGA dies which are stacked on the silicon connecting layer in a two-dimensional stacking mode, a built-in silicon stacking connection point of the dies is connected to a connection point leading-out end through a top layer metal wire of a RDL layer with a preset winding structure, the connection point leading-out end can be connected to other dies through cross-die connection wires in two directions in the silicon connecting layer, and two-dimensional interconnection communication between the dies is realized; through adjusting the winding distance of the top layer metal wire, the cross-die signal delay balance can be efficiently realized, the design time sequence convergence is accelerated, and the resource layout flexibility in application is favorably improved.)

1. The multi-die FPGA is designed based on a signal delay balancing technology and comprises a substrate, a silicon connection layer and a plurality of FPGA dies, wherein the silicon connection layer is arranged on the substrate in a stacked mode, the FPGA dies are arranged on the silicon connection layer in a stacked mode in two dimensions, and the silicon connection layer covers all the FPGA dies;

each FPGA die comprises a plurality of configurable function modules, interconnection resource modules distributed around the configurable function modules and connection point terminals, the configurable function module in the FPGA bare chip at least comprises a programmable logic unit, a silicon stacking connection module and an input/output port, the silicon stacking connection module comprises a plurality of silicon stacking connection points, the programmable logic unit in the FPGA bare chip is respectively connected with the silicon stacking connection points and the input/output port through the interconnection resource module, the silicon stacking connection point in the FPGA bare chip is connected with the corresponding connection point leading-out end through a top layer metal wire in the rewiring layer, the top layer metal wire has a preset winding structure, the winding distance corresponds to the signal delay on the signal transmission path, and at least two top layer metal wires with different winding distances exist in the rewiring layer;

the connection point leading-out terminal in each FPGA bare chip is connected with the corresponding connection point leading-out terminal in other FPGA bare chips through a cross bare chip connection wire in the silicon connection layer, and each FPGA bare chip is connected with any other FPGA bare chip through the cross bare chip connection wire in the silicon connection layer; cross-die connections connecting the FPGA dies are arranged crosswise along a first direction and a second direction in the silicon connection layer, and the first direction and the second direction are vertical to each other in the horizontal direction; and the input/output port in the FPGA bare chip is connected to the substrate through the silicon through hole on the silicon connecting layer.

2. The multi-die FPGA of claim 1 wherein the cross-die connections within said silicon connection layer have a predetermined routing structure and routing distance corresponds to signal delay on the signal transmission path.

3. The multi-die FPGA of claim 1 wherein said silicon stack connection points are directly connected to interconnect switches in said interconnect resource module, and wherein said silicon stack connection points are fully or partially interconnected to said interconnect switches.

4. The multi-die FPGA of any one of claims 1-3, wherein each of the silicon stack connection modules includes N rows and M columns of silicon stack connection points inside, each of the silicon stack connection modules includes three types, i.e., a high-delay silicon stack connection point, a medium-delay silicon stack connection point, and a low-delay silicon stack connection point, and a routing distance of a top metal line connected to the high-delay silicon stack connection point, a routing distance of a top metal line connected to the medium-delay silicon stack connection point, and a routing distance of a top metal line connected to the low-delay silicon stack connection point are sequentially decreased in the rewiring layer.

5. The multi-die FPGA of any one of claims 1-3, wherein the configurable functional modules in the FPGA die further include other functional modules, the other functional modules include DSP modules and/or BRAM modules, the other functional modules are respectively connected to the silicon stacking connection points and the input/output ports through the interconnection resource modules, the programmable logic units, the silicon stacking connection modules and the other functional modules are arranged to form a two-dimensional array, and the silicon stacking connection modules are arranged in a row-column structure where the programmable logic units are located and in a row-column structure where the other functional modules are located.

6. The multi-die FPGA of claim 5, wherein the size of the silicon stacked connection module is smaller than the size of the other functional modules, and the row-column structure of the other functional modules forms a vacant area at the silicon stacked connection module, and the vacant area is provided with at least one of a capacitor, a test circuit, a noise reduction circuit and a monitoring circuit.

7. A multi-die FPGA of any one of claims 1-3 wherein the connection point terminals in each FPGA die are arranged in a row-column configuration along said first direction and said second direction, and wherein a plurality of rows of connection point terminals are arranged along said first direction and/or a plurality of columns of connection point terminals are arranged along said second direction in each FPGA die.

8. The multi-die FPGA of claim 7, wherein each FPGA die has rows of connection point terminals evenly spaced along said first direction; or, a plurality of rows of connection point terminals are randomly distributed in each FPGA bare chip along the first direction.

9. The multi-die FPGA of any one of claims 1-3, wherein cross-die connections connecting individual FPGA dies are arranged hierarchically within said silicon connection layer.

10. The multi-die FPGA of any one of claims 1-3, wherein a plurality of FPGA dies are arranged on said silicon connection layer according to a shape and an area of each FPGA die.

11. The multi-die FPGA of any one of claims 1-3 further comprising other dies, said other dies being stacked on said silicon connection layer, wherein connection point terminals in an FPGA die are connected to corresponding connection point terminals in said other dies by cross-die connections within said silicon connection layer; wherein:

at least one other die is a Processor chip including at least one of a Processor chip such as an ARM chip or a RISC-V chip;

and/or at least one other bare chip is a DSP chip;

and/or, at least one other die is an AI chip;

and/or, at least one other die is a memory chip comprising at least one of SRAM, DRAM, ROM, FLASH, MRAM, and RRAM;

and/or at least one other bare chip is a data conversion chip, and the data conversion chip comprises at least one of an analog-to-digital conversion chip and a digital-to-analog conversion chip;

and/or, at least one other die is a radio frequency chip;

and/or, at least one other die comprises an HBM, RAMBUS or NOC interface;

and/or, at least one other die includes a PCIE, Ethernet MAC, XUAI, SONET/SDH, or INTERLAKEN interface.

12. The multi-die FPGA of claim 2 wherein a first top level metal line between a first silicon stack connection point on any first FPGA die and a corresponding first connection point lead-out, a cross-die connection line between the first connection point lead-out and a second connection point lead-out on any second FPGA die, and a second top level metal line between a second connection point lead-out on the second FPGA die and a corresponding second silicon stack connection point form a signal transmission path; the signal delay of the signal transmission path corresponds to a total wiring distance of the signal transmission path, the total wiring distance is the sum of the winding distance of the first top-layer metal wire, the winding distance of the cross-die connecting wire and the winding distance of the second top-layer metal wire, the longer the total wiring distance is, the higher the signal delay of the signal transmission path is, and when the total wiring distances of the two signal transmission paths are adjusted to be equal, the signal delays of the two signal transmission paths are equal.

Technical Field

The invention relates to the technical field of semiconductors, in particular to a multi-die FPGA designed based on a signal delay balancing technology.

Background

An FPGA (Field Programmable Gate Array) is a Programmable logic device of hardware, and is widely applied to prototype verification in integrated circuit design besides the fields of mobile communication, data center, etc., so as to effectively verify the correctness of circuit functions and accelerate the circuit design speed. The prototype verification needs to realize circuit design by using programmable logic resources inside the FPGA, the demand for the number of the programmable logic resources of the FPGA is continuously increased along with the continuous increase of the scale of the integrated circuit and the realization of complex functions, the number of the programmable resources of the FPGA becomes a larger bottleneck due to the continuous increase of the subsequent technical development and demand, and a larger challenge is provided for the development of the industry. The increase in the FPGA scale represents an increasing chip area, which leads to an increase in chip processing difficulty and a decrease in chip production yield.

Some patents have proposed a method for designing chip interconnects by using silicon-on-chip (SSI), for example, the patent with application number 2016800598883 proposes a stacked die interconnect without interposer, which uses a plurality of discrete interconnect dies to implement signal interconnects between two adjacent dies; another example is patent application No. 2017800501825, which proposes a separate interface for silicon stacked interconnect technology integration that uses an entire interposer as an interconnect carrier to achieve signal interconnection of two adjacent IC dies. However, the above two patents are limited by the defects of the chip itself and the structure design, and both can only interconnect two adjacent IC dies arranged side by side, and can only implement signal interconnection in a single direction, so that the structure limitation is large in practical application, and the signal delay of the device is difficult to design and adjust due to the structure limitation, and the complex circuit requirement of the large-scale integrated circuit is difficult to meet in practice.

Disclosure of Invention

The present invention provides a multi-die FPGA designed based on a signal delay balancing technique, aiming at the above problems and technical requirements, and the technical scheme of the present invention is as follows:

a multi-die FPGA designed based on a signal delay balancing technology comprises a substrate, a silicon connecting layer arranged on the substrate in a stacked mode and a plurality of FPGA dies arranged on the silicon connecting layer in a stacked mode, wherein the FPGA dies are arranged on the silicon connecting layer in a two-dimensional stacking mode, and the silicon connecting layer covers all the FPGA dies;

each FPGA bare chip comprises a plurality of configurable function modules, interconnection resource modules distributed around each configurable function module and a connection point leading-out end, each configurable function module in the FPGA bare chip at least comprises a programmable logic unit, a silicon stacking connection module and an input/output port, each silicon stacking connection module comprises a plurality of silicon stacking connection points, each programmable logic unit in the FPGA bare chip is respectively connected with the silicon stacking connection point and the input/output port through the interconnection resource modules, each silicon stacking connection point in the FPGA bare chip is connected with the corresponding connection point leading-out end through a top layer metal wire in a rewiring layer, each top layer metal wire has a preset winding structure, the winding distance corresponds to the signal delay on the signal transmission path, and at least two top layer metal wires with different winding distances exist in the rewiring layer;

the connection point leading-out end in each FPGA bare chip is connected with the corresponding connection point leading-out end in other FPGA bare chips through a cross bare chip connection wire in the silicon connection layer, and each FPGA bare chip is connected with any other FPGA bare chip through the cross bare chip connection wire in the silicon connection layer; the cross-die connecting lines which are communicated with the FPGA dies are arranged in the silicon connecting layer along a first direction and a second direction in a crossed mode, and the first direction and the second direction are vertical to each other in the horizontal direction; and the input/output port in the FPGA bare chip is connected to the substrate through the silicon through hole on the silicon connecting layer.

The further technical scheme is that the cross-bare chip connecting wire in the silicon connecting layer has a preset winding structure, and the winding distance corresponds to the signal delay on the signal transmission path.

The further technical scheme is that the silicon stacking connection point is directly connected with an interconnection switch in the interconnection resource module, and the silicon stacking connection point and the interconnection switch are fully or partially interconnected.

The further technical scheme is that each silicon stacking connection module comprises N rows and M columns of silicon stacking connection points, each silicon stacking connection module comprises a high-delay silicon stacking connection point, a middle-delay silicon stacking connection point and a low-delay silicon stacking connection point, and in a rewiring layer, the winding distance of a top layer metal wire connected with the high-delay silicon stacking connection point, the winding distance of a top layer metal wire connected with the middle-delay silicon stacking connection point and the winding distance of a top layer metal wire connected with the low-delay silicon stacking connection point are sequentially reduced.

The FPGA chip further comprises other function modules, wherein the other function modules comprise a DSP module and/or a BRAM module, the other function modules are respectively connected with the silicon stacking connection point and the input/output port through the interconnection resource module, the programmable logic units, the silicon stacking connection module and the other function modules are arranged to form a two-dimensional array, and the silicon stacking connection module is arranged in a row-column structure where the programmable logic units are located and in a row-column structure where the other function modules are located.

The further technical scheme is that the size of the silicon stacking connection module is smaller than that of other functional modules, a row and column structure where the other functional modules are located forms a vacant area at the silicon stacking connection module, and at least one of a capacitor, a test circuit, a noise reduction circuit and a monitoring circuit is arranged at the vacant area.

The further technical scheme is that the connection point leading-out ends in each FPGA bare chip are distributed according to a row-column structure along a first direction and a second direction, and a plurality of rows of connection point leading-out ends are distributed in each FPGA bare chip along the first direction and/or a plurality of columns of connection point leading-out ends are distributed along the second direction.

The technical scheme is that a plurality of rows of connection point leading-out ends are uniformly distributed in each FPGA bare chip along a first direction at the same intervals; or, a plurality of rows of connection point terminals are randomly distributed in each FPGA bare chip along the first direction.

The further technical scheme is that the cross-die connecting lines which are communicated with the FPGA dies are arranged in a silicon connecting layer in a layered mode.

The further technical scheme is that the plurality of FPGA bare chips are arranged on the silicon connecting layer according to the shape and the area of each FPGA bare chip.

The multi-die FPGA further comprises other dies, the other dies are stacked on the silicon connection layer, and the connection point leading-out end in the FPGA die is connected with the corresponding connection point leading-out end in the other dies through a cross-die connection line in the silicon connection layer; wherein:

the at least one other die is a Processor chip including at least one of a Processor chip such as an ARM chip or a RISC-V chip;

and/or at least one other bare chip is a DSP chip;

and/or, at least one other die is an AI chip;

and/or, the at least one other bare chip is a memory chip, and the memory chip comprises at least one of SRAM, DRAM, ROM, FLASH, MRAM and RRAM;

and/or at least one other bare chip is a data conversion chip, and the data conversion chip comprises at least one of an analog-to-digital conversion chip and a digital-to-analog conversion chip;

and/or, at least one other die is a radio frequency chip;

and/or, at least one other die comprises an HBM, RAMBUS or NOC interface;

and/or, at least one other die includes a PCIE, Ethernet MAC, XUAI, SONET/SDH, or INTERLAKEN interface.

A first top layer metal wire between a first silicon stacking connection point in any first FPGA bare chip and a corresponding first connection point leading-out end, a cross-bare chip connection wire between the first connection point leading-out end and a second connection point leading-out end on any second FPGA bare chip, and a second top layer metal wire between a second connection point leading-out end on the second FPGA bare chip and a corresponding second silicon stacking connection point form a signal transmission path; the signal delay of the signal transmission path corresponds to the total wiring distance of the signal transmission path, the total wiring distance is the sum of the winding distance of the first top-layer metal wire, the winding distance of the cross-die connecting wire and the winding distance of the second top-layer metal wire, the longer the total wiring distance is, the higher the signal delay of the signal transmission path is, and when the total wiring distance of the two signal transmission paths is adjusted to be equal, the signal delays of the two signal transmission paths are equal.

The beneficial technical effects of the invention are as follows:

1. the multi-die FPGA comprises a substrate, a silicon connection layer and a plurality of FPGA dies, wherein a silicon stacking connection point arranged in each FPGA die is connected to a connection point leading-out end through a top layer metal wire of a preset winding structure in an RDL layer, and the connection end leading-out end is connected with any other FPGA die through a die crossing connection line of the silicon connection layer, so that the plurality of FPGA dies realize two-dimensional interconnection communication through the die crossing connection line arranged in the silicon connection layer along two directions, and can realize die crossing signal delay balance efficiently by adjusting the winding distance of the top layer metal wire, and design timing convergence is accelerated. The cascade structure of the multiple FPGA bare chips supports the realization of large-scale and large-area FPGA chips by cascading the multiple small-scale and small-area FPGA bare chips, not only can reduce the processing difficulty, improve the production yield of the chips and accelerate the design speed, but also can improve the flexibility of resource layout in application and accelerate the time sequence convergence of the design by the cross-bare chip signal delay balancing technology.

2. The cross-bare-chip connecting line in the silicon connecting layer also has a preset winding structure, and when the winding distances are different, the signal transmission delays are different, so after the FPGA bare chips are designed, the signal delay among the FPGA bare chips can be further adjusted through the winding distances of the cross-bare-chip connecting line in the silicon connecting layer, and the cross-bare-chip signal delay is simple, feasible and low in cost.

3. Besides the conventional configurable functional module, each FPGA bare chip also comprises a newly added configurable functional module special for bare chip signal extraction, namely a silicon stacking connection point, and the FPGA bare chips carry out signal extraction and interconnection through the special silicon stacking connection point, so that the FPGA bare chip has the advantages of higher bandwidth, lower delay, lower power consumption and the like.

And 4, various circuit structures can be arranged in the vacant areas at the silicon stacking connection points inside each FPGA bare chip, so that the FPGA bare chips have better performance and stronger function.

5. The multi-bare-chip FPGA can also be provided with other bare chips of various types on the silicon connecting layer, and the other bare chips can also realize two-dimensional interconnection communication with the FPGA bare chips through the cross bare chip connecting line arranged in the silicon connecting layer along two directions, so that various types of bare chips can be flexibly integrated, FPGA chips with rich varieties and powerful functions facing different applications can be quickly realized, and the market is occupied.

Drawings

FIG. 1 is a schematic cross-sectional view of a multi-die FPGA of the present application.

Fig. 2 is an enlarged view of a portion of the structure in fig. 1.

FIG. 3 is a top down two-dimensional schematic of one configuration of the FPGA of the present application.

Fig. 4 is a schematic diagram of an internal structure of a conventional FPGA.

FIG. 5 is a schematic diagram of an internal structure of a conventional FPGA adopting a Column-Based architecture.

FIG. 6 is a block diagram of a conventional FPGA adopting the Column-Based architecture.

Fig. 7 is a schematic block diagram of an FPGA die in the present application when the block structure shown in fig. 6 is adopted.

Fig. 8 is a schematic diagram of connections between a silicon stacked connection module and an interconnection resource module inside an FPGA die in the present application.

FIG. 9 is a schematic diagram of the routing structure of the top metal lines within the RDL layer of the FPGA die and the routing structure of the cross-die wires in the silicon connection layer in the present application.

Fig. 10 is a schematic diagram of a connection point terminal on an FPGA die in the present application.

Fig. 11 is a two-dimensional schematic diagram of an interconnect structure formed between FPGA dies through connection point leadouts in the present application.

Detailed Description

The following further describes the embodiments of the present invention with reference to the drawings.

The application provides a multi-die FPGA designed based on a signal delay balancing technology, fig. 1 is a schematic cross-sectional view of a packaging structure of the FPGA, fig. 2 is an enlarged view of a part of the structure in fig. 1, and fig. 3 is a corresponding schematic top view of fig. 1. The FPGA comprises a substrate 1, a silicon connection layer 2 and a plurality of FPGA dies, which are stacked in sequence from bottom to top, for example, the structure shown in fig. 1 to 3 includes 6 FPGA dies, which are respectively represented by dies 1 to 6. In practical implementation, the FPGA further comprises a package housing for protecting the various components, packaged outside the substrate 1, the silicon connection layer 2 and the FPGA die, and further comprises pins for signal extraction, etc. connected to the substrate, which conventional structures are not shown in detail in fig. 1 and 2.

The FPGA of the application does not adopt a single FPGA bare chip structure, but comprises a plurality of FPGA bare chips, the plurality of FPGA bare chips are all stacked and arranged on the same silicon connection layer 2, and the plurality of FPGA bare chips are arranged on the silicon connection layer 2 in a two-dimensional stacking mode, namely are arranged along the horizontal direction and the vertical direction on the horizontal plane, as shown in FIG. 3. The plurality of FPGA bare chips can be reasonably arranged on the silicon connection layer 2, and are compactly arranged on the silicon connection layer 2 according to the shape and the area of each FPGA bare chip, so that the whole area of the whole FPGA is smaller, and the interconnection performance between the bare chips is better.

The internal structure of the FPGA bare chip and the connection mode of the FPGA bare chip and the silicon connection layer 2 are adjusted and designed elaborately. Next, the present application introduces specific connection structures and implementations between the FPGA die and the silicon connection layer 2:

the FPGA bare chip in the present application is different from a conventional FPGA bare chip, and first, the structure of the conventional FPGA bare chip is described as follows, please refer to the schematic structure shown in fig. 4, where the conventional FPGA bare chip is composed of configurable functional modules with multiple functions, and a common configurable functional module mainly includes a programmable logic unit (CLB or PLBs) and an input/output port (IOB), and sometimes includes some other functional modules, such as BRAM, DSP, PC, and the like. Each configurable functional module has an interconnection resource module (INT) with the same structure distributed around the configurable functional module, and horizontal or vertical connecting lines among the configurable functional modules are connected through the INT module. On the basis of the conventional structure, the FPGA bare chip in the application also comprises a silicon stacking connection module which is specially designed in the bare chip according to the signal interconnection requirement between bare chips, besides the conventional configurable function modules comprising the CLB, the IOB and other function modules, each silicon stacking connection module comprises a plurality of silicon stacking connection points, the silicon stacking connection module is a newly-added configurable function module which is specially used for leading out bare chip signals, and the FPGA bare chip in the application replaces some conventional configurable function modules in the conventional FPGA bare chip into the silicon stacking connection module. And the conventional configurable function module at any position can be replaced according to the signal interconnection requirement, for example, the conventional Column-Based FPGA architecture shown in fig. 5 is taken as an example, wherein each Column is the same module, for example, CLBcolumn is a chip Column filled with CLB modules arranged from top to bottom. The height of each module is aligned with INT, the height of the CLB is equal to the INT height, the heights of other functional modules can be equal to a plurality of INT heights, the structure enables the whole FPGA to look like a two-dimensional array formed by INT, the heights of the modules in the array are consistent, the width of the same column is the same, but the widths of different columns can be different. Based on the two-dimensional array architecture, the silicon stacking connection module can be arranged in a row-column structure where the programmable logic unit is located, and can also be arranged in a row-column structure where other functional modules are located to obtain the FPGA bare chip in the application. The module size of the silicon stacking connection module is small, the height of the module is equal to the height of the CLB and INT, the width of the module is slightly smaller than the CLB, and the height and the width of other functional modules such as the DSP and the BRAM are larger than the CLB, so that an empty area is formed when the silicon stacking connection module is placed at the other functional modules (the DSP and the BRAM) with larger size, and at least one conventional circuit structure of a capacitor, a test circuit, a noise reduction circuit and a monitoring circuit can be arranged at the empty area to improve the circuit performance and enrich the functions.

The following examples are given: a schematic diagram of a conventional FPGA module using the architecture of fig. 5 is shown in fig. 6, which includes two other functional modules, namely, a DSP and a BRAM, in addition to a CLB, where the height of the CLB is equal to the height of INT, the height of the DSP is equal to 2 INT heights, and the height of the BRAM is equal to 4 INT heights. Based on the module structure shown in fig. 6, the CLB, the DSP, and the BRAM at any position may be replaced and set as the silicon stacking connection module to obtain the FPGA bare chip in the present application, for example, in fig. 7, the CLB column and the DSP column are both provided with the silicon stacking connection module LNK, and the CLB and one of the DSPs at the four corners are replaced and set as the silicon stacking connection module LNK. As can be seen from fig. 7, since the height of the DSP is equal to 2 INT heights, two silicon stack connection modules LNK are disposed in the height direction in the area where the DSP is originally arranged, and since the width of the DSP is greater than the width of the CLB, other conventional circuits, such as the TEST circuit TEST shown in fig. 7, may also be disposed in the vacant area in the width direction. Actual FPGA dies are also not necessarily limited to the Column-Based architecture illustrated in this application, and there is no fixed location for the silicon stack connection modules inside the FPGA dies.

Each silicon stacking connection module in the FPGA bare chip in the application also has an interconnection resource module distributed around the silicon stacking connection point, so that the winding structure of the FPGA bare chip in the application can be consistent with that of a conventional FPGA bare chip without changing. The horizontal or vertical connection lines between the silicon stacking connection module and each other configurable functional module are all connected via the INT module, and the silicon stacking connection module LNK is directly connected to the interconnection switch in the INT module corresponding to the silicon stacking connection module, and is a part of the interconnection line, please refer to fig. 8. The silicon stacking connection module LNK and the interconnection switch can be fully interconnected or partially interconnected according to the requirement of connectivity.

The FPGA die in the present application further includes a connection point terminal 4 corresponding to the internal silicon stacking connection point 3, and the silicon stacking connection point 3 in the FPGA die is connected to the corresponding connection point terminal 4 through a top layer metal wire 5 in a redistribution layer (RDL layer), as shown in fig. 8, fig. 8 does not show a winding manner of the top layer metal wire 5, and only shows a connection relationship between the silicon stacking connection point 3 and the connection point terminal 4. In the present application, the top metal line 5 in the RDL layer has a predetermined routing structure and the routing distance corresponds to the signal delay on the signal transmission path, and the longer the routing distance of the top metal line 5, the larger the time delay between the silicon stacking connection point 3 and the connection point lead-out 4, so the cross-die signal delay can be balanced by adjusting the routing distance of the top metal line 5.

At least two top metal lines 5 with different winding distances exist in the RDL layer, and the method comprises the following steps: each silicon stacking connection module LNK includes N rows and M columns of silicon stacking connection points 3 therein, as shown in fig. 8, each silicon stacking connection module LNK includes three types, i.e., a high delay silicon stacking connection point, a middle delay silicon stacking connection point, and a low delay silicon stacking connection point, respectively, and in the redistribution layer, the winding distance of the top layer metal line 5 connected to the high delay silicon stacking connection point, the winding distance of the top layer metal line 5 connected to the middle delay silicon stacking connection point, and the winding distance of the top layer metal line 5 connected to the low delay silicon stacking connection point are sequentially decreased, as shown in fig. 9, the high delay silicon stacking connection point forms a high delay area a, and the winding distance of the top layer metal line 5 connected thereto is the longest; the middle delay silicon stacking connection point forms a middle delay area B, and the routing distance of the top layer metal wire 5 connected with the middle delay silicon stacking connection point is the second order; the low-delay silicon stack connection point forms a low-delay region C, which connects the shortest routing distance of the top metal lines 5.

The connection point terminals 4 are generally arranged in a row-column structure along the first direction and the second direction according to the requirement of stacking interconnection, and the structure diagram can be referred to as fig. 10. With this structure, the die signals of the FPGA die have been connected by the silicon stack connection points 3 to connection point terminals 4. The connection point leading-out end 4 can be connected to the corresponding connection point leading-out end 4 in other FPGA bare chips through a cross bare chip connection wire 6 in the silicon connection layer 2, interconnection between the FPGA bare chips is achieved, specifically, a micro convex ball grows on the FPGA bare chips, the connection point leading-out end 4 is connected with the silicon connection layer 2 through the micro convex ball and is connected to other FPGA bare chips through the cross bare chip connection wire inside the silicon connection layer 2, the micro convex ball structure at the bottom of the FPGA bare chips can be seen in figure 2, and detailed marking is omitted in the application.

The silicon connection layer 2 is internally provided with the cross-die connecting wire 6, the cross-die connecting wire 6 is distributed in the whole area or partial area of the silicon connection layer 2, and meanwhile, the silicon connection layer 2 covers all the FPGA dies, so each FPGA die can be connected to any other FPGA die through the cross-die connecting wire 6 according to the circuit requirement, the circuit interconnection between the dies is almost not limited in space, and the flexibility is far better than the structures of the two patents mentioned in the background technology. The inter-die connection lines 6 arranged inside the silicon connection layer 2 are arranged crosswise along a first direction and a second direction, the first direction and the second direction are mutually vertical in the horizontal direction, namely, the first direction and the second direction are two directions of the horizontal direction and the vertical direction matched with the die arrangement structure. Therefore, each FPGA die can be simultaneously connected with other FPGA dies in the first direction and the second direction through the cross-die connection 6 in the two directions, so that a two-dimensional interconnection structure is formed among the plurality of FPGA dies, as shown in fig. 3, the die 1 can be interconnected with the die 2 through the cross-die connection 6 in the first direction, and can also be interconnected with the die 4 through the cross-die connection 6 in the second direction, and the same is true for the rest of the dies. In each direction, each FPGA die can be interconnected not only with other adjacent FPGA dies by the cross-die connection 6, but also with other spaced FPGA dies by the cross-die connection 6, for example, as shown in fig. 1 to 3, the die 1 can be interconnected not only with the adjacent die 2 by the cross-die connection 6, but also with the die 3 by the cross-die connection 6, or even with other spaced dies. In addition, the FPGA die can be connected to different rows of FPGA dies simultaneously through the cross-die connection 6 in two directions, for example, in fig. 3, the die 1 can be connected to the die 6 through the cross-die connection 6. It should be noted that fig. 3 shows the cross-die connections between the spaced FPGA dies as crossing the surface of the intermediate FPGA dies, such as the cross-die connections between die 1 and die 3 crossing the surface of die 2, but this is merely for convenience of illustration of the connections, and practically all of the cross-die connections 6 are within the silicon connection layer 2, as shown in fig. 1 and 2. The cross-die connecting lines 6 are arranged in the silicon connecting layer 2 in a layered and crossed mode, and the cross-die connecting lines 6 in the same direction and the cross-die connecting lines 6 in different directions can be arranged in a layered and crossed mode, so that the cross-die connecting lines 6 are not influenced with each other. It should be noted that although the present application claims such a two-dimensional stacking scheme, the technical solution is also applicable to a one-dimensional stacking scheme, but only there is a die-crossing wire 6 in only one direction (lateral or longitudinal) inside the silicon connection layer 2. The manufacturing process of the silicon connection layer 2 can be different from that of an FPGA (field programmable gate array) die, and only the cross-die connection line 6 consisting of a plurality of layers of metal lines is arranged inside the silicon connection layer 2 without active devices, so that the manufacturing is easy and the cost is low.

Fig. 3 illustrates the connection between the cross-die connection 3 and the connection point terminals 4 of two FPGA dies, but in this application, the cross-die connection 6 also has a predetermined routing structure and the routing distance corresponds to the signal delay on the signal transmission path. The longer the distance of the windings of the cross-die wire 6, the greater the signal delay between the connection point outlets 4 of the two FPGA dies, so the cross-die signal delay can be balanced by adjusting the distance of the windings of the cross-die wire 6. As shown in fig. 9, taking the cross-die wires 6 with three different routing distances as an example, the routing distances of the cross-die wires 6 of the three routing structures from left to right in fig. 9 are sequentially shorter and the signal delays are sequentially smaller.

Therefore, in the present application, a signal transmission path is formed by a first top layer metal wire between a first silicon stacked connection point in any first FPGA die and a corresponding first connection point lead-out, a cross-die connection wire between the first connection point lead-out and a second connection point lead-out on any second FPGA die, and a second top layer metal wire between a second connection point lead-out on the second FPGA die and a corresponding second silicon stacked connection point. The signal delay of the signal transmission path corresponds to the total wiring distance of the signal transmission path, the total wiring distance is the sum of the winding distance of the first top-layer metal wire, the winding distance of the cross-die connecting wire and the winding distance of the second top-layer metal wire, and the longer the total wiring distance is, the higher the signal delay of the signal transmission path is. The Delay of the signal transmission path is T1a + T2a + T3a, where T1a is the signal Delay on the first top-level metal line, T2a is the signal Delay on the die-crossing line, and T3a is the signal Delay on the second top-level metal line, so to adjust the Delay of the signal transmission path, Delay _ path can be implemented by adjusting one or more of T1a, T2a, and T3a, and each signal Delay corresponds to a respective routing distance, so that the signal Delay can be adjusted by adjusting the routing distances of the first top-level metal line, the die-crossing line, and the second top-level metal line. By adjusting the total wiring distances of the two signal transmission paths to be equal, the signal delays of the two signal transmission paths can be made equal.

Since the FPGA die is provided with the connection point terminals 4 along both the first direction and the second direction, the FPGA die can be connected to other FPGA dies in a two-dimensional direction by using the connection point terminals 4 in the two directions and combining the cross-die connection lines 6 in the two directions, please refer to the schematic diagram shown in fig. 11.

In addition, in order to achieve higher communication bandwidth, multiple rows/multiple columns of connection point leading-out terminals 4 can be arranged, that is, multiple rows of connection point leading-out terminals 4 are arranged in each FPGA die along the first direction, and/or multiple columns of connection point leading-out terminals 4 are arranged along the second direction, so that efficient two-dimensional cascade of multiple rows and multiple columns is achieved. When the plurality of rows/columns of the connection point leading-out terminals 4 are arranged along each direction, they may be arranged at regular intervals or at random.

It should be noted that the multi-die FPGA of the present application employs the FPGA die with the built-in dedicated silicon stack connection point, and signal extraction and die interconnection are performed through the independent silicon stack connection point. But the multi-die FPGA is also compatible with a conventional FPGA die, and die signals of the conventional FPGA die are directly connected to the cross-die connecting line 6 in the silicon connecting layer 2 via the IOB to achieve die interconnection, but compared with a structure directly adopting IOB interconnection, the structure adopting silicon stacking connection point interconnection has the advantages of higher bandwidth, lower delay, lower power consumption and the like.

Referring to fig. 1 and 2, a silicon connection layer 2 is stacked on a substrate 1, specifically, micro-bumps are grown on a side of the silicon connection layer 2 away from the FPGA die, and the silicon connection layer 2 is connected to the substrate 1 through the micro-bumps. Silicon through holes 7 are further formed in the silicon connecting layer 2, and IOBs in the FPGA bare chip are connected to the substrate 1 through the silicon through holes 7 in the silicon connecting layer 2 so as to finally lead out signals.

The multi-die FPGA realizes the interconnection communication among the multiple FPGA dies, so that when a large-scale FGPA needs to be designed, the multi-die FPGA can be formed by cascading multiple smaller-scale FPGA dies, and the processing difficulty is greatly reduced. In addition, the multi-die FPGA can be further expanded according to usage requirements, that is, other dies are stacked on the silicon connection layer 2 in addition to the plurality of FPGA dies, and the arrangement of the other dies and the connection with the silicon connection layer are similar to the FPGA, which is not described in detail herein. The connection point leading-out end in the FPGA bare chip is connected with the corresponding connection point leading-out end in other bare chips through a cross bare chip connecting wire in the silicon connecting layer, so that the FPGA with more varieties and powerful functions can be realized, and the FPGA has the advantages of high speed, low power consumption and more connection channels. Wherein:

the at least one other die is a Processor chip including at least one of a Processor chip such as an ARM chip or a RISC-V chip;

and/or at least one other bare chip is a DSP chip;

and/or, at least one other die is an AI chip;

and/or, the at least one other bare chip is a memory chip, and the memory chip comprises at least one of SRAM, DRAM, ROM, FLASH, MRAM and RRAM;

and/or at least one other bare chip is a data conversion chip, and the data conversion chip comprises at least one of an analog-to-digital conversion chip and a digital-to-analog conversion chip;

and/or, at least one other die is a radio frequency chip;

and/or, at least one other die comprises an HBM, RAMBUS or NOC interface;

and/or, at least one other die includes a PCIE, Ethernet MAC, XUAI, SONET/SDH, or INTERLAKEN interface.

What has been described above is only a preferred embodiment of the present application, and the present invention is not limited to the above embodiment. It is to be understood that other modifications and variations directly derivable or suggested by those skilled in the art without departing from the spirit and concept of the present invention are to be considered as included within the scope of the present invention.

18页详细技术资料下载

上一篇：一种医用注射器针头装配设备

下一篇：一种通用多裸片硅堆叠互连结构

Multi-die FPGA designed based on signal delay balancing technology

相关技术

网友询问留言