Next Patent: Integrated circuit and related improvements
Next Patent: Integrated circuit and related improvements
The present system also comprises a method for correcting errors in a programmable logic device having configuration data to program the programmable logic device. The method comprises a background reading of the configuration data. Next, the configuration data is analyzed for errors. Finally, the configuration data is then corrected and the configuration data is rewritten if errors are located.
Plaque It!
|
[0001] 1. Field of the System
[0002] The present invention relates to integrated circuits. In particular, the present invention relates to a method for error detection and correction in a radiation tolerant static random access memory (SRAM) for a field programmable gate array (FPGA).
[0003] 2. Background
[0004] A major concern in building a radiation-hardened SRAM-based FPGA for a space application is the reliability of the configuration memory. Memory devices used in satellites and in other computer equipment, can be placed in environments that are highly susceptible to radiation. A satellite memory cell in a space environment can be exposed to a radiation-induced soft error, commonly called a single event upset (SEU), when a cell is struck by high-energy particles. Electron-hole pairs created by, and along the path of, a single energetic particle as it passes through an integrated circuit such as a memory typically cause a soft error or SEU. An SEU typically results from alpha particles (helium nuclei), beta particles or other ionized nuclei rays impacting a low-capacitance node of a semiconductor circuit. Should the energetic particle generate the critical charge in the critical volume of the memory cell, the logic state of the memory is upset. This critical charge, by definition, is the minimum amount of electrical charge required to change the logic state of the memory cell. It is commonly called Q Critical (Q
[0005] An SEU can change the contents of any volatile memory cell. If that bit of memory is doing something besides merely storing data, such as controlling the logic functionality of an FPGA, the results can be catastrophic. While other technologies may be better suited for the most sensitive control functions of a spacecraft, there is a significant advantage to be had by being able to change a portion of the spacecraft's functionality remotely, either during prototyping on the ground or later during the mission. Spacecraft designers accept the idea that SEUs will inevitably occur. Based on the inevitable, they are willing to use SRAM-based FPGAs in non-critical portions of the vehicle provided the error rate is reasonable, sufficient error trapping is available and the recovery time is reasonable.
[0006] When a heavy ion traverses a node within a memory storage cell, the ion can force the node from its original state to an opposite state for a period of time. This change of state is due to the charge that the heavy ion deposits as it passes through the silicon of the Metal Oxide Semiconductor (MOS) transistor of the memory cell. If this node is held in the opposite state for a period of time longer than the delay around the feed back loop of the memory cell, the cell can switch states and the stored data can be lost. The period of time the node is held in the opposite state can depend on several factors. The most critical being the charge deposited.
[0007]
[0008]
[0009] In such an occurrence, there are two sources of current vying for control of the node Q: the CMOS p-channel device
[0010] Unfortunately, it takes time for a small CMOS device to regain control against a high-energy strike. In the case, for example, of a victimized gate being part of the feedback path in a sequential (i.e. memory) element with the incorrect logic level propagating around the loop, the CMOS device gets shut off and is never able to make the needed correction and the memory element loses state. If the memory element controls something important, system or subsystem failure can result.
[0011]
[0012]
[0013] SRAM in an FPGA may also be specified as CSRAM or USRAM. CSRAM is Configuration SRAM. This CSRAM is used to hold the configuration bits for the FPGA. It is physically spread out over the entire die and is interspersed with the rest of the FPGA circuitry. At least one of the two nodes in the static latch that make up the SRAM cell can be connected to the FPGA circuitry that controls it. When the contents of the CSRAM change, the logic function implemented by the FPGA changes. What is needed is a solution to insure the data integrity is maintained.
[0014] USRAM is the abbreviation for user SRAM. This is memory that is part of a user logic design and is concentrated inside a functional block dedicated to the purpose. What is needed is a solution to insure the data integrity of an USRAM is maintained.
[0015] In an SRAM based FPGA, there are a variety of separate elements that go into the making of a useful product. There are configuration memory bits in the CSRAM, which allow the user to impose his/her design on the uncommitted resources available. There are the combinational and sequential modules that do the user's logic. There are the configurable switches, signal lines, and buffers that allow the modules to be connected together. There are support circuits like clocks and other global signals like enables and resets, which allow the building of one or more subsystems in different time domains. There are blocks like the SRAM and DLL that allow the user access to more highly integrated functions than can be built out of an array of logic modules and interconnect.
[0016] Hence, there is a need for an apparatus and method of providing error detection and correction in a radiation-hardened SRAM based FPGA, which can easily be implemented using conventional CMOS processes, and which has performance speed comparable to an SRAM based FPGA that has not been radiation-hardened.
[0017] The present system comprises a radiation tolerant programmable logic device having logic modules and routing resources coupling together the logic modules. Configuration data lines providing configuration data control the programming of the logic modules and the routing resources. Error correction circuitry coupled to the configuration data lines analyzes and corrects any errors in the configuration data that may occur due to a single event upset (SEU).
[0018] The present system also comprises a method for correcting errors in a programmable logic device having configuration data to program the programmable logic device. The method comprises a background reading of the configuration data. Next, the configuration data is analyzed for errors. Finally, the configuration data is then corrected and the configuration data is rewritten if errors are located.
[0019] A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description of the invention and accompanying drawings which set forth an illustrative embodiment in which the principles of the invention are utilized.
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034] The preferred embodiment of the invention is discussed in detail below. While specific implementations are discussed, it should be understood that this is done for illustration purposes only. A person skilled in the relevant art will recognize that other components and configurations may be used without parting from the spirit and scope of the invention.
[0035] In this disclosure, various circuits and logical functions are described. It is to be understood that designations such as “1” and “0” in these descriptions are arbitrary logical designations. In a first implementation of the invention, “1” may correspond to a voltage high, while “0” corresponds to a voltage low or ground, while in a second implementation, “0” may correspond to a voltage high, while “1” corresponds to a voltage low or ground. Likewise, where signals are described, a “signal” as used in this disclosure may represent the application, or pulling “high” of a voltage to a node in a circuit where there was low or no voltage before, or it may represent the termination, or the bringing “low” of a voltage to the node, depending on the particular implementation of the invention.
[0036] The disclosed invention relates to a method for designing a radiation-hardened FPGA and the required circuit designs for conversion from a commercial Static Random Access Memory (SRAM) based Field Programmable Gate Array (FPGA) to a radiation-hardened version. The radiation-hardened FPGA described herein greatly reduces the (Single Event Upset) SEU issues associated with prior-art devices.
[0037] Since radiation-hardened circuits tend to be rather large relative to their non-radiation hardened counterparts, making all parts of the circuit hard is not practical due to area considerations. A method is needed to prioritize the need for radiation hardness of the various items and only implement the essential items radiation hardened circuits. Radiation-hardened design is very much a statistical approach. Described below is the approach used to create a radiation-hardened FPGA.
[0038] The CSRAM must be hardened since the product may not be commercially viable unless the part can reliably store the logic design. Also, it should be done in an extremely area efficient way since there are millions of configuration bits which comprise about 25% of the core area. A background READ\READ\READ\WRITE on error scheme with the Error Correction Code (ECC) bits to ensure the data is kept accurate is employed.
[0039] The configuration data input circuitry, the read or write systems, and the CSRAM wordline driver/decoder and associated control logic will be required to be hardened, since they control the memory loading and background checking.
[0040]
[0041] Column counter
[0042] Row counter
[0043]
[0044] ECC latches
[0045]
[0046] ECC latches
[0047] In one embodiment of the present invention, only a single error correction scheme may be used, thus it is important that the error correction code circuit is designed such that there will be one error to correct in any ECC word line. However in some cases, errors referred to as “double strike” errors occur. These errors occur when a particle hits a circuit at a relatively shallow angle, upsetting two or more programming bits in a single word line simultaneously. In the ECC circuit of the present invention, the “double strike” problem is solved by physically separating the bits in any ECC word line by a distance larger than the “double strike” distance. Though the “double strike” distance is an estimate, it is believed to be approximately 20 um in a 0.25 mm CMOS process. In the present embodiment, the memory cell size in a first dimension (parallel to the word lines) may be approximately 7.66 um. Thus, three memory cells span a greater distance than the 20 um in a first dimension and that memory cells four or more places apart on a word line are insulated from “double strikes.” In addition, there is almost always FPGA circuitry distributed among the memory columns such that the distance is usually greater than the above distances. Thus, in one embodiment of the present invention, four ECC word lines are interdigiated at one so that all the bits on any single word line are guaranteed to be more than the “double strike” distance apart. Because space on any integrated circuit is crucial, it is desirable to implement the ECC scheme of the present invention using data word lines as wide as possible. An example of an embodiment of this scheme is shown in
[0048]
[0049]
[0050] As stated above, in one embodiment of the present invention, four ECC words are interdigiated on each word line so that all the bits in any single word line are guaranteed to be more than the “double strike” distance apart. In one illustrative embodiment, the total number of memory columns is 488, thus, in this embodiment, four ECC decoder/encoders that can accept a 122-bit data word line (488/4=122). As is well known to those of ordinary skill in the art, ECC uses hamming encode/decode with parity. In one example, to implement a single error correction/double error detection (SECDED) scheme (as shown in
[0051] To spread out the delays for the FPGA routing resources, it is desirable to distribute the 36 check bits in groups of four (one bit for each ECC word line) over the width of FPGA core
[0052]
[0053] Referring still to
[0054] Precharge circuit
[0055] Precharge periods occur between all read and write operations. For example, precharge input
[0056] Precharge circuit
[0057] Sense amplifier
[0058]
[0059] Referring still to
[0060] Precharge circuit
[0061] Precharge periods occur between all read and write operations. For example, precharge input
[0062] MASKB input
[0063] Precharge circuit
[0064] Sense amplifier
[0065]
[0066] In static applications (AROM, etc.) masking is unnecessary. In dynamic applications (RAM. FIFO, etc.) masking is necessary.
[0067] From this disclosure, it will be apparent to persons of ordinary skill in the art that various alternatives to the embodiments of the disclosed system described herein may be employed in practicing the disclosed system. It is intended that the following claims define the scope of the disclosed system and that structures and methods within the scope of these claims and their equivalents be covered thereby.