PCIE equipment management system in server and server

文档序号:19979 发布日期:2021-09-21 浏览:28次 中文

阅读说明:本技术 一种服务器中pcie设备管理系统和服务器 (PCIE equipment management system in server and server ) 是由 郭艳杰 于 2021-05-28 设计创作,主要内容包括:本发明提供了一种服务器中PCIE设备管理系统和服务器,PCIE设备管理系统包括:管理设备,管理设备通过第一总线连接到PCIE设备中独立供电的状态缓存单元,配置用于从所述状态缓存单元获取PCIE设备的状态信息;处理器,处理器通过第二总线连接到PCIE设备,配置用于获取PCIE设备的状态信息,并通过KCS板载接口将获取到的状态信息发送到管理设备中,其中,管理设备配置为响应于无法通过处理器获取PCIE设备的状态信息,则转为从状态缓存单元获取PCIE设备的状态信息。通过使用本发明的方案,能够快速定位服务器的PCIE设备故障,能够建立独立于处理器外的PCIE监控系统,提供了设备的稳定性,提高了设备的可服务性和可靠性。(The invention provides a PCIE equipment management system in a server and the server, wherein the PCIE equipment management system comprises: the management device is connected to a state cache unit independently powered in the PCIE device through a first bus and is configured to acquire state information of the PCIE device from the state cache unit; and the processor is connected to the PCIE device through the second bus, configured to acquire state information of the PCIE device, and send the acquired state information to the management device through the KCS onboard interface, where the management device is configured to respond that the state information of the PCIE device cannot be acquired through the processor, and then acquire the state information of the PCIE device from the state cache unit instead. By using the scheme of the invention, the failure of the PCIE equipment of the server can be quickly positioned, a PCIE monitoring system independent of the processor can be established, the stability of the equipment is provided, and the serviceability and the reliability of the equipment are improved.)

1. A PCIE device management system in a server is characterized by comprising:

the management device is connected to a state cache unit independently powered in the PCIE device through a first bus and configured to acquire state information of the PCIE device from the state cache unit;

a processor connected to the PCIE device through a second bus, configured to acquire status information of the PCIE device and send the acquired status information to the management device through a KCS onboard interface,

the management device is configured to respond to the situation that the state information of the PCIE device cannot be acquired by the processor, and then switch to acquire the state information of the PCIE device from the state cache unit.

2. The PCIE device management system of claim 1, wherein the state cache unit in the PCIE device is an ram memory, and the state cache unit in the PCIE device is connected to an independent power supply.

3. The PCIE device management system of claim 1, wherein the first bus is a smbus bus and the second bus is a PCIE bus.

4. The PCIE device management system of claim 1, wherein the management device is a baseboard management controller in a server.

5. The PCIE device management system of claim 1, further comprising:

a power supply connected to the processor, the management device, and the PCIE device.

6. A server, comprising a PCIE device management system, wherein the PCIE device management system includes:

the management device is connected to a state cache unit independently powered in the PCIE device through a first bus and configured to acquire state information of the PCIE device from the state cache unit;

a processor connected to the PCIE device through a second bus, configured to acquire status information of the PCIE device and send the acquired status information to the management device through a KCS onboard interface,

the management device is configured to respond to the situation that the state information of the PCIE device cannot be acquired by the processor, and then switch to acquire the state information of the PCIE device from the state cache unit.

7. The server according to claim 6, wherein the state cache unit in the PCIE device is an ram memory, and the state cache unit in the PCIE device is connected to an independent power supply.

8. The server of claim 6, wherein the first bus is a smbus bus and the second bus is a PCIE bus.

9. The server according to claim 6, wherein the management device is a baseboard management controller in the server.

10. The server of claim 6, further comprising:

a power supply connected to the processor, the management device, and the PCIE device.

Technical Field

The field relates to the field of computers, and more particularly to a PCIE device management system in a server and a server.

Background

The server is a product with extremely high requirement on reliability, and the PCIE equipment is an expansion interface of standard configuration equipment. Management of PCIE devices is very important to the stability of servers.

The traditional PCIE device management method for the server is to directly read PCIE device information by a processor of the server to perform device management. An onboard management system of the server receives information of the PCIE equipment through an interface between the processor and the bmc, but the onboard management system cannot manage the condition of the PCIE equipment when the processor fails or is not started.

Disclosure of Invention

In view of this, an embodiment of the present invention provides a PCIE device management system in a server and a server, and by using the technical solution of the present invention, a PCIE device failure of the server can be quickly located, a PCIE monitoring system independent of a processor can be established, stability of the device is provided, and serviceability and reliability of the device are improved.

Based on the above object, an aspect of an embodiment of the present invention provides a PCIE device management system in a server, including:

the management device is connected to a state cache unit independently powered in the PCIE device through a first bus and is configured to acquire state information of the PCIE device from the state cache unit;

a processor connected to the PCIE device through a second bus, configured to acquire status information of the PCIE device and send the acquired status information to the management device through a KCS onboard interface,

the management device is configured to respond to the situation that the state information of the PCIE device cannot be acquired by the processor, and then switch to acquire the state information of the PCIE device from the state cache unit.

According to an embodiment of the present invention, the state cache unit in the PCIE device is an ram memory, and the state cache unit in the PCIE device is connected to an independent power supply.

According to one embodiment of the invention, the first bus is a smbus bus and the second bus is a PCIE bus.

According to one embodiment of the invention, the management device is a baseboard management controller in a server.

According to an embodiment of the present invention, further comprising:

and the power supply is connected to the processor, the management equipment and the PCIE equipment.

In another aspect of the embodiments of the present invention, a server is further provided, where the server includes a PCIE device management system, and the PCIE device management system includes:

the management device is connected to a state cache unit independently powered in the PCIE device through a first bus and is configured to acquire state information of the PCIE device from the state cache unit;

a processor connected to the PCIE device through a second bus, configured to acquire status information of the PCIE device and send the acquired status information to the management device through a KCS onboard interface,

the management device is configured to respond to the situation that the state information of the PCIE device cannot be acquired by the processor, and then switch to acquire the state information of the PCIE device from the state cache unit.

According to an embodiment of the present invention, the state cache unit in the PCIE device is an ram memory, and the state cache unit in the PCIE device is connected to an independent power supply.

According to one embodiment of the invention, the first bus is a smbus bus and the second bus is a PCIE bus.

According to one embodiment of the invention, the management device is a baseboard management controller in a server.

According to an embodiment of the present invention, further comprising:

and the power supply is connected to the processor, the management equipment and the PCIE equipment.

The invention has the following beneficial technical effects: in the PCIE device management system in the server provided in the embodiment of the present invention, by setting the management device, the management device is connected to a state cache unit independently powered in the PCIE device through the first bus, and is configured to acquire state information of the PCIE device from the state cache unit; the management device is configured to respond to the situation that the processor cannot acquire the state information of the PCIE device, and then the management device changes to a technical scheme of acquiring the state information of the PCIE device from the state cache unit, so that the failure of the PCIE device of the server can be quickly positioned, a PCIE monitoring system independent of the processor can be established, the stability of the device is improved, and the serviceability and the reliability of the device are improved.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other embodiments can be obtained by using the drawings without creative efforts.

Fig. 1 is a schematic diagram of a PCIE device management system according to an embodiment of the present invention;

fig. 2 is a schematic diagram of a server according to one embodiment of the invention.

Detailed Description

Embodiments of the present disclosure are described below. However, it is to be understood that the disclosed embodiments are merely examples and that other embodiments may take various and alternative forms. The figures are not necessarily to scale; certain features may be exaggerated or minimized to show details of particular components. Therefore, specific structural and functional details disclosed herein are not to be interpreted as limiting, but merely as a representative basis for teaching one skilled in the art to variously employ the present invention. As one of ordinary skill in the art will appreciate, various features illustrated and described with reference to any one of the figures may be combined with features illustrated in one or more other figures to produce embodiments that are not explicitly illustrated or described. The combination of features shown provides a representative embodiment for a typical application. However, various combinations and modifications of the features consistent with the teachings of the present disclosure may be desirable for certain specific applications or implementations.

Based on the foregoing objective, a first aspect of the embodiments of the present invention provides an embodiment of a PCIE device management system in a server. Fig. 1 is a schematic diagram of the PCIE device management system.

As shown in fig. 1, the PCIE device management system may include:

the management device is connected to the state cache unit independently powered in the PCIE device through a first bus, and is configured to acquire state information of the PCIE device from the state cache unit.

The management device may be a BMC (baseboard management controller) in a server, the first bus is a Smbus bus, the Smbus is a low-speed and low-cost bus suitable for being used by a BMC low-speed platform, and an independent status buffer unit is required to be arranged in the PCIE device, the status buffer unit is an ram memory, the status buffer unit is required to be provided with an independent power supply, for example, the independent power supply may be a storage battery, and a system power supply cannot be used as the independent power supply, so as to prevent the status buffer unit from being powered off at the same time when the system is powered off and being unable to read data in the status buffer unit, when the PCIE device is in operation, the operation information and the status information of the PCIE device are sent to the status buffer unit for storage in real time, and the status buffer unit is required to send the received operation information and the status information of the PCIE device to the management device for storage at the same time, in a normal operation state of the server, the operation information and the state information of the PCIE device may be directly obtained in the management device (the baseboard management controller), and when the server is abnormal or is turned off, or the baseboard management controller is abnormal, the operation information and the state information of the PCIE device may be obtained by reading data in a state cache unit provided in the PCIE device, which can ensure that a failure of the PCIE device of the server can be quickly located when the server is abnormal.

In another embodiment, a module of a status buffer unit that is independently powered may be provided, where the module of the status buffer unit is composed of an ram memory, and the module of the status buffer unit needs to be provided with an independent power supply, for example, the independent power supply may be a storage battery, and a system power supply cannot be used as the independent power supply, so as to prevent the status buffer unit from being powered off simultaneously and being unable to read data in the status buffer unit when the system is powered off, when a PCIE device is running, the module of the status buffer unit needs to send running information and status information of each PCIE device to the module of the independent status buffer unit for storage in real time, and at the same time, the module of the status buffer unit needs to send running information and status information of the PCIE device received to a management device for storage, and in a state where a server is running normally, the running information and status information of the PCIE device can be directly obtained in the management device (substrate management controller), when the server is abnormal or closed, or the baseboard management controller is abnormal, the running information and the state information of the PCIE device can be obtained by reading the data in the module of the state cache unit that is independently arranged, so that it can be ensured that the failure of the PCIE device of the server can be quickly located when the server is abnormal.

In another embodiment, an independent status buffer unit may be disposed in the BMC, where the status buffer unit is composed of an ram memory, and the status buffer unit needs to be disposed with an independent power supply, for example, the independent power supply may be a storage battery, and a system power supply cannot be used as the independent power supply, so as to prevent the status buffer unit from being powered off at the same time and being unable to read data in the status buffer unit when the system is powered off, when a PCIE device is running, the status buffer unit sends running information and status information of each PCIE device to the status buffer unit in the BMC in real time for storage, and at the same time, the status buffer unit needs to send the received running information and status information of the PCIE device to the management device for storage, and in a normal running state of the server, the running information and status information of the PCIE device may be directly acquired in the management device (substrate management controller), when the server is abnormal or closed, or the baseboard management controller is abnormal, the running information and the state information of the PCIE device can be obtained by reading data in the module of the state cache unit in the BMC, so that it can be ensured that a PCIE device failure of the server can be quickly located when the server is abnormal.

The PCIE equipment management system further comprises a processor, wherein the processor is connected to the PCIE equipment through a second bus, configured to acquire state information of the PCIE equipment, and sends the acquired state information to the management equipment through a KCS onboard interface, wherein the management equipment is configured to respond that the state information of the PCIE equipment cannot be acquired through the processor, and then the state information of the PCIE equipment is acquired from a state cache unit.

The processor is a CPU of the system, the second bus is a PCIE bus, the CPU reads the operation information and the state information of the PCIE device through the PCIE bus, and then sends the operation information and the state information to the substrate management Controller through a KCS onboard interface (Keyboard Controller mode, a transport protocol), that is, the substrate management Controller can simultaneously receive the operation information and the state information of the PCIE device sent by the CPU and the operation information and the state information of the PCIE device sent by the state cache unit, and the two pieces of information received at the same time should be the same because both are sent by the PCIE device, so that when the substrate management Controller does not receive two pieces of the same information at the same time, it indicates that one of the lines has a fault, and the substrate management Controller can generate a corresponding alarm to notify an administrator to perform an inspection. In addition, the baseboard management controller receives two pieces of same information, the two pieces of same information are not both stored, the two pieces of information are compared, if the two pieces of information are the same PCIE equipment and the time is the same, one piece of information is deleted, and the other piece of information is stored, so that the baseboard management controller is prevented from storing more repeated data.

The running information and the state information of the PCIE device in the state cache unit may be circularly stored in the form of a log, and the log that is stored first is deleted after a certain time elapses, so as to ensure that the state cache unit has a sufficient storage space to store data.

Through the technical scheme of the invention, the failure of the PCIE equipment of the server can be quickly positioned, a PCIE monitoring system independent of a processor can be established, the stability of the equipment is provided, and the serviceability and the reliability of the equipment are improved.

In a preferred embodiment of the present invention, the state cache unit in the PCIE device is an ram memory, and the state cache unit in the PCIE device is connected to an independent power supply. The memory needs to have enough storage space to be able to store the operation information and the status information of a sufficient number of PCIE devices, and the independent power supply may be a storage battery.

In a preferred embodiment of the present invention, the first bus is a smbus bus and the second bus is a PCIE bus.

In a preferred embodiment of the present invention, the management device is a baseboard management controller in a server.

In a preferred embodiment of the present invention, the method further comprises:

and the power supply is connected to the processor, the management equipment and the PCIE equipment.

Through the technical scheme of the invention, the failure of the PCIE equipment of the server can be quickly positioned, a PCIE monitoring system independent of a processor can be established, the stability of the equipment is provided, and the serviceability and the reliability of the equipment are improved.

Based on the above object, a second aspect of the embodiment of the present invention provides a server 1, as shown in fig. 2, where the server 1 includes a PCIE device management system, and the PCIE device management system includes:

the management device is connected to a state cache unit independently powered in the PCIE device through a first bus, and is configured to acquire state information of the PCIE device from the state cache unit.

The management device may be a BMC (baseboard management controller) in a server, the first bus is a Smbus bus, the Smbus is a low-speed and low-cost bus suitable for being used by a BMC low-speed platform, and an independent status buffer unit is required to be arranged in the PCIE device, the status buffer unit is an ram memory, the status buffer unit is required to be provided with an independent power supply, for example, the independent power supply may be a storage battery, and a system power supply cannot be used as the independent power supply, so as to prevent the status buffer unit from being powered off at the same time when the system is powered off and being unable to read data in the status buffer unit, when the PCIE device is in operation, the operation information and the status information of the PCIE device are sent to the status buffer unit for storage in real time, and the status buffer unit is required to send the received operation information and the status information of the PCIE device to the management device for storage at the same time, in a normal operation state of the server, the operation information and the state information of the PCIE device may be directly obtained in the management device (the baseboard management controller), and when the server is abnormal or is turned off, or the baseboard management controller is abnormal, the operation information and the state information of the PCIE device may be obtained by reading data in a state cache unit provided in the PCIE device, which can ensure that a failure of the PCIE device of the server can be quickly located when the server is abnormal.

In another embodiment, an independent module of a state cache unit may be provided, where the module of the state cache unit is composed of an ram memory, and the state cache unit module needs to be provided with an independent power supply, for example, the independent power supply may be a storage battery, and a system power supply cannot be used as the independent power supply, so as to prevent the state cache unit from being powered off simultaneously and being unable to read data in the state cache unit when the system is powered off, when a PCIE device is running, the module of the state cache unit needs to send running information and state information of each PCIE device to the module of the independent state cache unit for storage in real time, and send the received running information and state information of the PCIE device to a management device for storage at the same time, and in a state where a server is running normally, the running information and state information of the PCIE device can be directly obtained in the management device (substrate management controller), when the server is abnormal or closed, or the baseboard management controller is abnormal, the running information and the state information of the PCIE device can be obtained by reading the data in the module of the state cache unit that is independently arranged, so that it can be ensured that the failure of the PCIE device of the server can be quickly located when the server is abnormal.

In another embodiment, an independent status buffer unit may be disposed in the BMC, where the status buffer unit is composed of an ram memory, and the status buffer unit needs to be disposed with an independent power supply, for example, the independent power supply may be a storage battery, and a system power supply cannot be used as the independent power supply, so as to prevent the status buffer unit from being powered off at the same time and being unable to read data in the status buffer unit when the system is powered off, when a PCIE device is running, the status buffer unit sends running information and status information of each PCIE device to the status buffer unit in the BMC in real time for storage, and at the same time, the status buffer unit needs to send the received running information and status information of the PCIE device to the management device for storage, and in a normal running state of the server, the running information and status information of the PCIE device may be directly acquired in the management device (substrate management controller), when the server is abnormal or closed, or the baseboard management controller is abnormal, the running information and the state information of the PCIE device can be obtained by reading data in the module of the state cache unit in the BMC, so that it can be ensured that a PCIE device failure of the server can be quickly located when the server is abnormal.

The management system of the PCIE equipment further comprises a processor, wherein the processor is connected to the PCIE equipment through a second bus, configured to acquire state information of the PCIE equipment, and sends the acquired state information to the management equipment through a KCS onboard interface, wherein the management equipment is configured to respond to the situation that the state information of the PCIE equipment cannot be acquired through the processor, and then the state information of the PCIE equipment is acquired from the state cache unit instead.

The processor is a CPU of the system, the second bus is a PCIE bus, the CPU reads the operation information and the state information of the PCIE device through the PCIE bus, and then sends the operation information and the state information to the substrate management Controller through a KCS onboard interface (Keyboard Controller mode, a transport protocol), that is, the substrate management Controller can simultaneously receive the operation information and the state information of the PCIE device sent by the CPU and the operation information and the state information of the PCIE device sent by the state cache unit, and the two pieces of information received at the same time should be the same because both are sent by the PCIE device, so that when the substrate management Controller does not receive two pieces of the same information at the same time, it indicates that one of the lines has a fault, and the substrate management Controller can generate a corresponding alarm to notify an administrator to perform an inspection. In addition, the baseboard management controller receives two pieces of same information, the two pieces of same information are not both stored, the two pieces of information are compared, if the two pieces of information are the same PCIE equipment and the time is the same, one piece of information is deleted, and the other piece of information is stored, so that the baseboard management controller is prevented from storing more repeated data.

The running information and the state information of the PCIE device in the state cache unit may be circularly stored in the form of a log, and the log that is stored first is deleted after a certain time elapses, so as to ensure that the state cache unit has a sufficient storage space to store data.

In a preferred embodiment of the present invention, the state cache unit in the PCIE device is an ram memory, and the state cache unit in the PCIE device is connected to an independent power supply.

In a preferred embodiment of the present invention, the first bus is a smbus bus and the second bus is a PCIE bus.

In a preferred embodiment of the present invention, the management device is a baseboard management controller in a server.

In a preferred embodiment of the present invention, the method further comprises:

and the power supply is connected to the processor, the management equipment and the PCIE equipment.

Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

The embodiments described above, particularly any "preferred" embodiments, are possible examples of implementations and are presented merely to clearly understand the principles of the invention. Many variations and modifications may be made to the above-described embodiments without departing from the spirit and principles of the technology described herein. All such modifications are intended to be included within the scope of this disclosure and protected by the following claims.

9页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:一种通信设备、表项更新方法及存储介质

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!