Title:
Memory controller method and system compensating for memory cell data losses
Document Type and Number:
United States Patent 7447974

Abstract:
A computer system includes a memory controller coupled to a memory module containing several DRAMs. The memory module also includes a non-volatile memory storing row addresses identifying rows containing DRAM memory cells that are likely to lose data during normal refresh of the memory cells. Upon power-up, the data from the non-volatile memory are transferred to a comparator in the memory controller. The comparator compares the row addresses to row addresses from a refresh shadow counter that identify the rows in the DRAMs being refreshed. When a row of memory cells is being refreshed that is located one-half of the rows away from a row that is likely to lose data, the memory controller causes the row that is likely to lose data to be refreshed. The memory controller also includes error checking circuitry for identifying the rows of memory cells that are likely to lose data during refresh.

Inventors:
Klein, Dean A. (Eagle, ID, US)

Application Number:
11/269403
Publication Date:
11/04/2008
Filing Date:
11/07/2005
Assignee:
Micron Technology, Inc. (Boise, ID, US)
Primary Class:
Other Classes:
714/754, 714/765
International Classes:
H03M13/00; G11C29/00
Field of Search:
714/773, 714/754, 714/765
Primary Examiner:
Torres, Joseph D.
Attorney, Agent or Firm:
Dorsey & Whitney LLP
Parent Case Data:

CROSS-REFERENCE TO RELATED APPLICATION

This application is a divisional of pending U.S. patent application Ser. No. 10/839,942, filed May 6, 2004.

Claims:
I claim:

1. A memory controller, comprising: a refresh shadow counter that is operable to output a row address; at least one inverter coupled to receive at least one bit of the row address from the refresh shadow counter, the at least one inverter being operable to invert the at least one bit of the row address from the refresh shadow counter to provide at least one inverted bit; a failing address comparator storing row addresses corresponding to rows of memory cells in a memory device that may contain at least one operational memory cell that is prone to error, the failing address comparator being coupled to the refresh shadow counter and to an output of the inverter to receive the row address from the refresh shadow counter and the at least one inverted bit, the failing address comparator being operable to substitute the at least one inverted bit for at least one corresponding bit in the row address from the refresh shadow counter to provide a comparison row address and to compare the comparison row address to the stored row addresses and to generate an indicating signal responsive to a predetermined relationship between the comparison row address and one of the stored row addresses; and a memory control circuit coupled to receive the indicating signal from the failing address comparator, the memory control circuit being operable to output the stored row address having the predetermined relationship with the comparison row address, the memory control circuit further being operable to output refresh command signals responsive to the indicating signal.

2. The memory controller of claim 1, further comprising: an ECC generator coupled to receive write data and to generate respective ECC syndrome bits corresponding to the write data; and an ECC checker coupled to receive read data along with corresponding stored ECC syndrome bits, the ECC checker being operable to detect if the read data are in error based on the ECC syndrome bits corresponding to the read data, the ECC checker being operable to output an error signal responsive to detecting a read data error.

3. The memory controller of claim 1, further comprising a refresh timer coupled to the memory control circuit, the refresh timer being operable to periodically generate a trigger signal that causes the memory control circuit to output command signals corresponding to an auto-refresh command.

Description:

TECHNICAL FIELD

This invention relates to dynamic random access memory (“DRAM”) devices and controllers for such memory devices, and, more particularly, to a method and system for controlling the operation of a memory controller, a memory module or a DRAM to manage the rate at which data bits stored in the DRAM are lost during refresh.

BACKGROUND OF THE INVENTION

As the use of electronic devices, such as personal computers, continues to increase, it is becoming ever more important to make such devices portable. The usefulness of portable electronic devices, such as notebook computers, is limited by the length of time batteries are capable of powering the device before needing to be recharged. This problem has been addressed by attempts to increase battery life and attempts to reduce the rate at which such electronic devices consume power.

Various techniques have been used to reduce power consumption in electronic devices, the nature of which often depends upon the type of power consuming electronic circuits that are in the device. For example, electronic devices, such as notebook computers, typically include dynamic random access memory (“DRAM”) devices that consume a substantial amount of power. As the data storage capacity and operating speeds of DRAM devices continue to increase, the power consumed by such devices has continued to increase in a corresponding manner.

In general, the power consumed by a DRAM increases with both the capacity and the operating speed of the DRAM devices. The power consumed by DRAM devices is also affected by their operating mode. A DRAM, for example, will generally consume a relatively large amount of power when the memory cells of the DRAM are being refreshed. As is well-known in the art, DRAM memory cells, each of which essentially consists of a capacitor, must be periodically refreshed to retain data stored in the DRAM device. Refresh is typically performed by essentially reading data bits from the memory cells in each row of a memory cell array and then writing those same data bits back to the same cells in the row. A relatively large amount of power is consumed when refreshing a DRAM because rows of memory cells in a memory cell array are being actuated in rapid sequence. Each time a row of memory cells is actuated, a pair of digit lines for each memory cell is switched to complementary voltages and then equilibrated. As a result, DRAM refreshes tend to be particularly power-hungry operations. Further, since refreshing memory cells must be accomplished even when the DRAM is not being used and is thus inactive, the amount of power consumed by refresh is a critical determinant of the amount of power consumed by the DRAM over an extended period. Thus many attempts to reduce power consumption in DRAM devices have focused on reducing the rate at which power is consumed during refresh.

Refresh power can, of course, be reduced by reducing the rate at which the memory cells in a DRAM are being refreshed. However, reducing the refresh rate increases the risk of data stored in the DRAM memory cells being lost. More specifically, since, as mentioned above, DRAM memory cells are essentially capacitors, charge inherently leaks from the memory cell capacitors, which can change the value of a data bit stored in the memory cell over time. However, current leaks from capacitors at varying rates. Some capacitors are essentially short-circuited and are thus incapable of storing charge indicative of a data bit. These defective memory cells can be detected during production testing, and can then be repaired by substituting non-defective memory cells using conventional redundancy circuitry. On the other hand, current leaks from most DRAM memory cells at much slower rates that span a wide range. A DRAM refresh rate is chosen to ensure that all but a few memory cells can store data bits without data loss. This refresh rate is typically once every 64 ms. The memory cells that cannot reliably retain data bits at this refresh rate are detected during production testing and replaced by redundant memory cells. However, the rate of current leakage from DRAM memory cells can change after production testing, both as a matter of time and from subsequent production steps, such as in packaging DRAM chips. Current leakage, and hence the rate of data loss, can also be affected by environmental factors, such as the temperature of DRAM devices. Therefore, despite production testing, a few memory cells will typically be unable to retain stored data bits at normal refresh rates.

One technique that has been used to reduce or prevent data errors during refresh is to generate an error correcting code (“ECC”) from each item of stored data, and then store the ECC along with the data. A computer system 10 employing typical ECC techniques is shown in FIG. 1. The computer system 10 includes a central processor unit (“CPU”) 14 coupled to a system controller 16 through a processor bus 18. The system controller 16 is coupled to input/output (“I/O”) devices (not shown) through a peripheral bus 20 and to an I/O controller 24 through an expansion bus 26. The I/O controller 24 is also connected to various peripheral devices (not shown) through an I/O bus 28.

The system controller 16 includes a memory controller 30 that is coupled to several memory modules 32a-c through an address bus 36, a control bus 38, a syndrome bus 40, and a data bus 42. Each of the memory modules 32a-c includes several DRAM devices (not shown) that store data and an ECC. The data are coupled through the data bus 42 to and from the memory controller 30 and locations in the DRAM devices mounted on the modules 32a-c. The locations in the DRAM devices to which data are written and from which data are read are designated by addresses coupled to the memory modules 32a-c on the address bus 36. The operation of the DRAM devices in the memory modules 32a-c is controlled by control signals coupled to the memory modules 32a-c on the control bus 38.

In operation, when data are to be written to the DRAM devices in the memory modules 32a-c, the memory controller 30 generates an ECC, and then couples the ECC and the write data to the memory modules 32a-c through the syndrome bus 40 and the data bus 42, respectively, along with control signals coupled through the control bus 38 and a memory address coupled through the address bus 36. When the stored data are to be read from the DRAM devices in the memory modules 32a-c, the memory controller 30 applies to the memory modules 32a-c control signals through the control bus 38 and a memory address through the address bus 36. Read data and the corresponding syndrome are then coupled from the memory modules 32a-c to the memory controller 30 through the data bus 42 and syndrome bus 40, respectively. The memory controller 30 then uses the ECC to determine if any bits of the read data are in error, and, if not too many bits are in error, to correct the read data.

One example of a conventional memory controller 50 is shown in FIG. 2. The operation of the memory controller 50 is controlled by a memory control state machine 54, which outputs control signals on the control bus 38. The state machine 54 also outputs a control signal to an address multiplexer 56 that outputs an address on the address bus 36. The most significant or upper bits of an address are coupled to a first port of the multiplexer 56 on an upper address bus 60, and the least significant or lower bits of an address are coupled to a second port of the multiplexer 56 on a lower address bus 62. The upper and lower address buses 60, 62, respectively, are coupled to an address bus 18A portion of the processor bus 18 (FIG. 1).

A data bus portion 18 D of the processor bus 18 on which write data are coupled is connected to a buffer/transceiver 70 and to an ECC generator 72 . A data bus portion 18 D′ on which read data are coupled is connected to an ECC check/correct circuit 76 . In practice, both data bus portions 18 D and 18 D′ comprise a common portion of the processor bus 18 , but they are illustrated as being separate in FIG. 2 for purposes of clarity. The ECC generator 72 generates an ECC from the write data on bus 18 D, and couples the syndrome to the buffer/transceiver 70 through an internal ECC syndrome bus 74 . The ECC check/correct circuit 76 receives read data from the buffer/transceiver 70 through an internal read bus 78 and a syndrome through an internal ECC syndrome bus 80 . The buffer/transceiver 70 applies the syndrome received from the ECC generator 72 to the memory modules 32 a - c (FIG. 1) through the syndrome bus 40 along with the write data, which are coupled through the data bus 42 . The buffer/transceiver 70 also couples read data from the data bus 42 and a syndrome from the syndrome bus 40 to the ECC check/correct circuit 76 . The ECC check/correct circuit 76 then determines whether or not any of the bits of the read data are in error. If the ECC check/correct circuit 76 determines that any of the bits of the read data are in error, it corrects those bits as long as a sufficiently low number of bits are in error that they can be corrected. As is well-known in the art, the number of bits in the syndrome determines the number of bits of data that can be corrected. The uncorrected read data, if no error was detected, or the corrected read data, if an error was detected, are then coupled through the data bus 18 D′. In the event a correctable error was found, the ECC check/correct circuit 76 generates a read error R_ERROR signal, which is coupled to the memory control state machine 54 . If, however, too many bits of the read data were in error to be corrected, the ECC check/correct circuit 76 generates a fatal error F_ERROR signal, which is coupled to the CPU 14 (FIG. 1).
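The generate, check, and correct roles described above can be illustrated with a small single-error-correcting code. The sketch below uses a Hamming(7,4) code purely as an assumption for illustration; the text does not say which ECC the ECC generator 72 and the ECC check/correct circuit 76 use, and the function names are hypothetical. Here the three parity bits play the role of the stored "syndrome", and a nonzero recomputed syndrome corresponds to the R_ERROR condition.

```python
# Illustrative only: Hamming(7,4) is an assumed code, not one named in the text.

def encode(data):
    """Encode 4 data bits into a 7-bit codeword (bit positions 1..7)."""
    c = [0] * 8                      # index 0 unused so indices match positions
    c[3], c[5], c[6], c[7] = data    # data bits sit at non-power-of-two positions
    c[1] = c[3] ^ c[5] ^ c[7]        # parity over positions with bit 0 set
    c[2] = c[3] ^ c[6] ^ c[7]        # parity over positions with bit 1 set
    c[4] = c[5] ^ c[6] ^ c[7]        # parity over positions with bit 2 set
    return c[1:]

def syndrome(word):
    """Recompute parity on a read word; 0 means no single-bit error detected."""
    c = [0] + list(word)
    s1 = c[1] ^ c[3] ^ c[5] ^ c[7]
    s2 = c[2] ^ c[3] ^ c[6] ^ c[7]
    s4 = c[4] ^ c[5] ^ c[6] ^ c[7]
    return s1 + 2 * s2 + 4 * s4      # equals the position of a single flipped bit

def check_correct(word):
    """Return (data_bits, r_error), correcting at most one flipped bit."""
    word = list(word)
    s = syndrome(word)
    if s:
        word[s - 1] ^= 1             # flip the bit the syndrome points at
    return [word[2], word[4], word[5], word[6]], s != 0
```

This small code cannot recognize a two-bit error, so it has no counterpart to the F_ERROR signal; a code with more check bits would be needed for that.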

The memory controller 50 also includes a refresh timer 84 that schedules a refresh of the DRAM devices in the memory modules 32 a - c at a suitable rate, such as once every 64 ms. The refresh timer 84 periodically outputs a refresh trigger signal on line 88 that causes the memory control state machine 54 to issue an auto refresh command on the control bus 38 .
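As a rough worked example of the timing involved, the sketch below computes how often the refresh timer 84 would have to trigger a refresh if the refreshes are spread evenly over the period. The 64 ms period comes from the text; the count of 8192 rows is an assumption that matches the 13-bit row addresses used in the description of FIG. 7, and the even spacing and the function name are likewise assumptions.

```python
def refresh_interval_us(period_ms=64, rows=8192):
    """Average time between row refreshes, in microseconds.

    Assumes one row is refreshed per trigger and that triggers are evenly
    spaced over the refresh period (both are illustrative assumptions).
    """
    return period_ms * 1000 / rows

print(refresh_interval_us())  # 7.8125
```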

The use of ECCs in the memory controller 50 shown in FIG. 2 can significantly improve the reliability of data stored in the DRAM devices in the memory modules 32 a - c . Furthermore, the refresh timer 84 can cause the DRAMs to be refreshed at a slower refresh rate since resulting data bit errors can be corrected. The use of a slower refresh rate can provide the significant advantage of reducing the power consumed by the DRAM. However, the use of ECCs requires that a significant portion of the DRAM storage capacity be used to store the ECCs, thus effectively reducing the storage capacity of the DRAM. Further, the use of ECCs can reduce the rate at which the DRAM can be refreshed because the ECC must be used to check and possibly correct each item of data read from the DRAM during refresh. Furthermore, the need to perform ECC processing on all read data during refresh can consume a significant amount of power. Also, if the ECCs are not used during normal operation, it is necessary to refresh the DRAM array at the normal refresh rate while checking the entire array for data errors and correcting any errors that are found before switching to the normal operating mode.
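The storage cost mentioned above can be estimated for one common class of code. The sketch below assumes a single-error-correcting, double-error-detecting (SECDED) Hamming code, which the text does not specify, and computes how many check bits such a code needs for a given data word width. The function names are hypothetical.

```python
def secded_check_bits(data_bits):
    """Check bits needed by a SECDED Hamming code for `data_bits` of data."""
    r = 1
    # Single-error correction needs 2**r >= data_bits + r + 1.
    while 2 ** r < data_bits + r + 1:
        r += 1
    return r + 1  # one extra overall parity bit adds double-error detection

def overhead(data_bits):
    """Check bits stored per data bit."""
    return secded_check_bits(data_bits) / data_bits

print(secded_check_bits(64), overhead(64))  # 8 0.125
```

Under this assumption, a 64-bit data word carries 8 check bits, so one eighth as much storage again is devoted to the ECC.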

There is therefore a need for a method and system that eliminates or corrects data storage errors produced during refresh of a DRAM either without the use of ECCs or without the need to repetitively correct data errors with ECCs.

SUMMARY OF THE INVENTION

A system and method for refreshing rows of dynamic random access memory cells avoid data loss even though some of the memory cells are operational but prone to errors during refresh. The system and method refresh the rows of memory cells that do not contain any error-prone memory cells at a first rate, and refresh the rows of memory cells that contain at least one error-prone memory cell at a second rate that is higher than the first rate. The rows containing an error-prone memory cell are preferably refreshed at a more rapid rate by detecting when a row of memory cells is refreshed that has a row address that is offset from the row containing an error-prone memory cell by a predetermined quantity of rows, such as half of the rows. After detecting that such an offset row is being refreshed, the row containing at least one error-prone memory cell is refreshed. The rows of memory cells containing at least one error-prone memory cell are detected by writing data to the memory cells in the dynamic random access memory. Following a refresh of the memory cells, the data stored in the memory cells are read to detect data read errors. These data read errors may be detected by storing error correcting codes along with the data, which are then read and processed to identify and correct the read data errors.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of a conventional computer system.

FIG. 2 is a block diagram of a conventional memory controller that may be used in the computer system of FIG. 1.

FIG. 3 is a block diagram of a computer system according to one embodiment of the invention.

FIG. 4 is a block diagram of a memory controller according to one embodiment of the invention that may be used in the computer system of FIG. 3.

FIG. 5 is a flow chart showing a procedure for transferring error-prone row addresses from a memory module to the memory controller of FIG. 4 and for storing the error-prone row addresses in the memory controller.

FIG. 6 is a flow chart showing a procedure for identifying error-prone row addresses and for storing information about the error-prone row addresses in a memory module.

FIG. 7 is a schematic diagram illustrating the manner in which the memory controller of FIG. 3 may insert extra refreshes of rows containing at least one error-prone memory cell.

FIG. 8 is a block diagram of a computer system according to another embodiment of the invention.

FIG. 9 is a block diagram of a computer system according to still another embodiment of the invention.

DETAILED DESCRIPTION

A computer system 100 according to one embodiment of the invention is shown in FIG. 3. The computer system 100 uses many of the same components that are used in the conventional computer system 10 of FIG. 1. Therefore, in the interest of brevity, these components have been provided with the same reference numerals, and an explanation of their operation will not be repeated. The computer system 100 of FIG. 3 differs from the computer system 10 of FIG. 1 by including memory modules 102 a - c that each include a non-volatile memory 110 a - c , respectively (only 110 a is shown in FIG. 3). The non-volatile memories 110 a - c store row addresses identifying rows containing one or more memory cells in the DRAM devices in the respective modules 102 a - c that are prone to errors because they discharge at a relatively high rate. The computer system 100 also differs from the computer system 10 of FIG. 1 by including circuitry that detects and identifies these error-prone memory cells and subsequently takes protective action. More specifically, as described in greater detail below, a memory controller 120 in the computer system 100 uses ECC techniques to determine which memory cells are error-prone during refresh. Once these error-prone memory cells have been identified, the memory controller 120 inserts additional refreshes for the rows containing these memory cells. As a result, this more rapid refresh is performed only on the rows containing memory cells that need to be refreshed at a more rapid rate so that power is not wasted refreshing memory cells that do not need to be refreshed at a more rapid rate.

One embodiment of the memory controller 120 that is used in the computer system 100 is shown in FIG. 4. The memory controller 120 uses many of the same components that are used in the conventional memory controller 50 of FIG. 2. Again, in the interest of brevity, these components have been provided with the same reference numerals, and an explanation of their operation will not be repeated except to the extent that they perform different or additional functions in the memory controller 120 . In addition to the components included in the memory controller 50 , the memory controller 120 includes a failing address register and comparator unit (“FARC”) 124 that stores the row addresses containing error-prone memory cells requiring refreshes at a more rapid rate. The FARC 124 is coupled to the raw write data bus 18 D to receive from the CPU 14 (FIG. 3) the row addresses that are stored in the non-volatile memories 110 a - c (FIG. 3). At power-up of the computer system 100 , the CPU 14 performs a process 130 to either transfer the row addresses from the non-volatile memories 110 a - c to the FARC 124 as shown in the flow-chart of FIG. 5 or to test the DRAMs in the memory modules 102 a - c to determine which rows contain at least one error-prone memory cell and then program the non-volatile memories 110 a - c and the FARC, as shown in the flow-chart of FIG. 6.

With reference, first, to FIG. 5, the process 130 is entered during power-on at step 134 . The non-volatile memories 110 a - c are then read at 136 by the CPU 14 coupling read addresses to the non-volatile memories 110 a - c and the I/O controller coupling control signals to the non-volatile memories 110 a - c through line 137 . The FARC 124 is then initialized at 140 before continuing at 142 by the CPU 14 coupling the row addresses through the raw write data bus 18 D and the data bus 126 .
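The power-on flow of FIG. 5 can be modeled in software roughly as follows. This is only a sketch: the `Farc` class, its methods, and the use of plain lists to stand in for the non-volatile memories 110 a - c are hypothetical, and the model stores bare row addresses without recording which module each address came from, a detail the text leaves open.

```python
class Farc:
    """Hypothetical software stand-in for the FARC 124."""

    def __init__(self):
        self.rows = set()

    def initialize(self):
        self.rows.clear()

    def store(self, row):
        self.rows.add(row)

    def hit(self, row):
        return row in self.rows


def power_on_load(nonvolatile_memories, farc):
    """Mirror steps 136, 140 and 142 of process 130."""
    # Step 136: read the row addresses held in each module's non-volatile memory.
    addresses = [row for nvm in nonvolatile_memories for row in nvm]
    # Step 140: initialize the FARC before loading it.
    farc.initialize()
    # Step 142: transfer the row addresses into the FARC.
    for row in addresses:
        farc.store(row)
    return farc


farc = power_on_load([[2, 17], [], [4098]], Farc())
print(sorted(farc.rows))  # [2, 17, 4098]
```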

In the event row addresses have not yet been stored in the non-volatile memories 110 a - c , the memory controller 120 may determine which rows contain error-prone memory cells and program the non-volatile memories 110 a - c with the addresses of such rows. The non-volatile memories 110 a - c are initially programmed by the CPU 14 writing data to the DRAMs in the memory modules 102 a - c and then reading the stored data from the DRAMs after the DRAMs have been refreshed over a period. Any errors that have arisen as a result of excessive discharge of memory cells during the refresh are detected by the ECC check/correct circuit 76 . As the DRAMs are read, the row addresses coupled to the DRAMs through the address bus 18 A are stored in address holding registers 128 and coupled to the FARC 124 . If the read data are in error, the ECC check/correct circuit 76 outputs an R_ERROR that is coupled through line 148 to the memory control state machine 54 . The memory control state machine 54 then processes the R_ERROR signal using the process 150 shown in FIG. 6. The process is initiated by the memory control state machine 54 upon receipt of the R_ERROR signal at step 154 . The address holding register 128 is then read at 156 , and a determination is made at 160 whether the row responsible for the R_ERROR signal being generated is a new row in which an error-prone memory cell had not previously been detected. If an error-prone memory cell was previously detected, the row address being output from the read address holding register 128 has already been recorded for extra refreshes. The process 150 can therefore progress directly to the final continue step 162 without the need for further action.

If an error-prone memory cell had not previously been detected in the current row, the row address being output from the address holding register 128 is transferred to the FARC 124 at step 164 . This is accomplished by the memory control state machine 54 outputting a “FAIL” signal on line 132 that causes the FARC 124 to store the current row address, which is output from the address holding registers 128 on bus 138 . The address is also appended at step 168 to the non-volatile memory 110 in the memory module 102 a - c containing the DRAM having the error-prone memory cell. This is accomplished by coupling data identifying the row addresses containing error-prone memory cells to the raw write data bus 18 D. The data identifying the row addresses are then coupled to the memory modules 102 a - c for storage in the non-volatile memories 110 a - c.
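The decision made in process 150 of FIG. 6 amounts to a "record once" rule, which might be sketched as below. The function name and the use of a set and a list to stand in for the FARC 124 and a non-volatile memory 110 are hypothetical; the step numbers in the comments refer to the flow chart as described in the text.

```python
def handle_r_error(holding_register_row, farc_rows, nvm_rows):
    """Model the response to one R_ERROR signal.

    Returns True if the row was newly recorded, False if it was already known.
    """
    # Step 156: read the row address captured in the address holding register.
    row = holding_register_row
    # Step 160: is this a row in which no error-prone cell was seen before?
    if row in farc_rows:
        return False          # already scheduled for extra refreshes (step 162)
    farc_rows.add(row)        # step 164: the FAIL signal stores the row in the FARC
    nvm_rows.append(row)      # step 168: append the row to the non-volatile memory
    return True


farc_rows, nvm_rows = set(), []
for failing_row in (2, 17, 2):       # row 2 produces an error twice
    handle_r_error(failing_row, farc_rows, nvm_rows)
print(nvm_rows)  # [2, 17]
```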

Once either the process 130 of FIG. 5 or the process 150 of FIG. 6 has been completed for all rows, the row addresses identifying rows containing one or more error-prone memory cells have been stored in the FARC 124 . The memory controller 120 is then ready to insert extra refreshes of such rows. As is well known in the art, when an auto-refresh command is issued to a DRAM, an internal refresh counter in the DRAM generates row addresses that are used to select the rows being refreshed. However, since these row addresses are not coupled from the DRAMs to the memory controller 120 , the address of each row being refreshed must be determined in the memory controller 120 . This is accomplished by using a refresh shadow counter 170 to generate refresh row addresses in the same manner that the refresh counter in the DRAMs generates such addresses. Furthermore, for the memory controller 120 , the addresses that are used for refreshing the memory cells in the DRAMs are generated by the memory controller 120 . When the memory control state machine 54 issues an auto-refresh command to a DRAM, it outputs a trigger signal on line 174 that resets the refresh shadow counter 170 and the refresh timer 84 and causes the refresh shadow counter 170 to begin outputting incrementally increasing row addresses. These incrementally increasing row addresses are coupled to the DRAMs via the address bus 18 A, and they are also coupled to the FARC 124 via bus 176 . However, the most significant bit (“MSB”) of the row address is applied to an inverter 178 so that the FARC 124 receives a row address that is offset from the current row address by one-half the number of rows in the DRAMs. This offset row address is compared to the addresses of the rows containing error-prone memory cell(s) that are stored in the FARC 124 . In the event of a match, the FARC 124 outputs a HIT signal on line 180 .
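Inverting the MSB of a row address is the same as adding one-half the number of rows, modulo the number of rows, which is why the inverter 178 yields the offset address. A minimal sketch of the inverter and the comparison follows, assuming 13-bit row addresses as in the description of FIG. 7; the constant and function names are hypothetical.

```python
ROW_BITS = 13                      # assumed width, as in the FIG. 7 example
MSB_MASK = 1 << (ROW_BITS - 1)     # 4096, one-half of the 8192 rows

def offset_address(counter_value):
    """Model of the inverter 178: flip the MSB of the shadow counter output."""
    return counter_value ^ MSB_MASK

def hit(counter_value, error_prone_rows):
    """Model of the FARC comparison: True corresponds to the HIT signal."""
    return offset_address(counter_value) in error_prone_rows

# Flipping the MSB equals adding half the rows, modulo the number of rows.
assert all(offset_address(a) == (a + 4096) % 8192 for a in range(8192))

print(hit(0b1000000000010, {0b0000000000010}))  # True
```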

The memory control state machine 54 responds to the HIT signal by inserting an extra refresh of the row identified by the offset address. For this purpose, the address bus 18 A receives all but the most significant bit of the row address from the refresh shadow counter 170 and the most significant bit from the FARC 124 on line 182 . As a result, the row identified by the offset is refreshed twice as often as other rows, i.e., once when the address is output from the refresh shadow counter 170 and once when the row address offset from the address by one-half the number of rows is output from the refresh shadow counter 170 .

The manner in which extra refreshes of rows occur will be apparent with reference to FIG. 7, which shows the output of the refresh shadow counter 170 (FIG. 4) on the left hand side and the addresses of the rows actually being refreshed on the right hand side. Every 64 ms, the refresh shadow counter 170 outputs row addresses that increment from “0000000000000” to “1111111111111.” For purposes of illustration, assume that row “0000000000010” contains one or more error-prone memory cells. This row will be refreshed in normal course when the refresh shadow counter 170 outputs “0000000000010” on the third count of the counter 170 . When the refresh shadow counter 170 has counted three counts past one-half of the rows, it outputs count “1000000000010.” However, the MSB is inverted by the inverter 178 so that the FARC 124 receives a count of “0000000000010.” Since this count corresponds to an address for a row containing one or more error-prone memory cells, a refresh of row “0000000000010” is inserted between row “1000000000010” and row “1000000000011,” as shown on the right hand side of FIG. 7.
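The right hand side of FIG. 7, as described above, can be reproduced with a short simulation. This is a sketch under the same assumptions as the example in the text (13-bit addresses, one error-prone row at address 2); the function name is hypothetical.

```python
def refresh_sequence(error_prone_rows, row_bits=13):
    """List the rows refreshed during one full pass of the shadow counter."""
    msb = 1 << (row_bits - 1)
    sequence = []
    for count in range(1 << row_bits):
        sequence.append(count)             # normal refresh of the counted row
        partner = count ^ msb              # address seen by the comparator
        if partner in error_prone_rows:    # a match triggers the extra refresh
            sequence.append(partner)
    return sequence


seq = refresh_sequence({0b0000000000010})
i = seq.index(0b1000000000010)
print([format(r, "013b") for r in seq[i:i + 3]])
# ['1000000000010', '0000000000010', '1000000000011']
```

Row 2 appears twice in the sequence, once at its own count and once half a pass later, while every other row appears once.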

Although the memory controller 120 refreshes rows containing one or more error-prone memory cells twice as often as other rows, it may alternatively refresh rows containing error-prone memory cells more frequently. This can be accomplished by inverting the MSB and the next to MSB (“NTMSB”) of the row address coupled from the refresh shadow counter 170 to the FARC 124 . A row would then be refreshed when the refresh shadow counter 170 outputs its address, when the refresh shadow counter 170 outputs its address with the NTMSB inverted, when the refresh shadow counter 170 outputs its address with the MSB inverted, and when the refresh shadow counter 170 outputs its address with both the MSB and the NTMSB inverted. Other variations will be apparent to one skilled in the art.
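The four refresh points listed above are the counter values obtained by applying every combination of the two inverted bits to the row address. The sketch below enumerates them for an arbitrary set of inverted bit positions; the function name is hypothetical, and bit positions 12 and 11 are the MSB and NTMSB only under the assumed 13-bit address width.

```python
def refresh_counts(row, inverted_bit_positions):
    """Counter values at which `row` is refreshed in one pass of the counter."""
    masks = [0]
    for bit in inverted_bit_positions:
        # Each additional inverted bit doubles the number of combinations.
        masks += [m | (1 << bit) for m in masks]
    return sorted(row ^ m for m in masks)


print(refresh_counts(2, [12]))      # [2, 4098]
print(refresh_counts(2, [12, 11]))  # [2, 2050, 4098, 6146]
```

With both bits inverted, the refreshes of the row are spaced one quarter of the rows apart, so the row is refreshed four times as often as other rows.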

A computer system 190 according to another embodiment of the invention is shown in FIG. 8. In this embodiment, the computer system 190 includes the conventional memory controller 30 of FIG. 1 coupled to memory modules 194 a - c . Each of the memory modules 194 a - c includes several DRAMs 196 , although only one DRAM is shown in FIG. 8. The DRAM 196 includes the FARC 124 , which is coupled to a refresh counter 198 through inverting circuitry 200 . The FARC 124 is initialized with data stored in a non-volatile memory 202 that identifies the addresses of the rows containing one or more error-prone memory cells. The non-volatile memory 202 is initially programmed in the same manner that the non-volatile memories 110 a - c were programmed, as explained above, using ECC circuitry 204 . The inverting circuitry 200 inverts appropriate bits of refresh addresses generated by the refresh counter 198 to schedule extra refreshes of rows containing one or more error-prone memory cells. The DRAM 196 also includes a memory control state machine 210 that controls the operation of the above-described components.

A computer system 220 according to another embodiment of the invention is shown in FIG. 9. This embodiment includes several memory modules 224 a - c coupled to a memory controller 230 . The memory modules 224 a - c each include the ECC generator 72 and ECC check/correct circuit 76 of FIGS. 2 and 3 as well as the other components that are used to determine which rows contain one or more error-prone memory cells. The computer system 220 does not include a syndrome bus 40 , of course, since the ECC syndromes are generated in the memory modules 224 a - c . However, once the memory modules 224 a - c have determined the addresses of rows containing one or more error-prone memory cells, they program a non-volatile memory device 234 in each of the memory modules 224 a - c with those addresses. DRAMs 238 each include the FARC 124 , the refresh counter 198 , the inverting circuitry 200 , and the memory control state machine 210 of FIG. 8 to schedule extra refreshes of rows containing one or more error-prone memory cells, as previously explained.

Although the components of the various embodiments have been explained as being in either a memory controller, a memory module or a DRAM, it will be understood that there is substantial flexibility in the location of many components. For example, the FARC 124 may be either in the memory controller as shown in FIG. 4, the DRAMs as shown in FIGS. 8 and 9, or in the memory modules separate from the DRAMs. Furthermore, although the present invention has been described with reference to the disclosed embodiments, persons skilled in the art will recognize that changes may be made in form and detail without departing from the spirit and scope of the invention.




