Title:
Memory hub and access method having internal prefetch buffers
Document Type and Number:
United States Patent 7412566

Abstract:
A memory module includes a memory hub coupled to several memory devices. The memory hub includes history logic that predicts on the basis of read memory requests which addresses in the memory devices from which date are likely to be subsequently read. The history logic applies prefetch suggestions corresponding to the predicted addresses to a memory sequencer, which uses the prefetch suggestions to generate prefetch requests that are coupled to the memory devices. Data read from the memory devices responsive to the prefetch suggestions are stored in a prefetch buffer. Tag logic stores prefetch addresses corresponding to addresses from which data have been prefetched. The tag logic compares the memory request addresses to the prefetch addresses to determine if the requested read data are stored in the prefetch buffer. If so, the requested data are read from the prefetch buffer. Otherwise, the requested data are read from the memory devices.

Inventors:
Lee, Terry R. (Boise, ID, US)
Jeddeloh, Joseph M. (Shoreview, MN, US)
      Plaque It!

Sponsored by:
Flash of Genius
Application Number:
11/510150
Publication Date:
08/12/2008
Filing Date:
08/24/2006
View Patent Images:
Images are available in PDF form when logged in. To view PDFs, Login  or  Create Account (Free!)
Assignee:
Micron Technology, Inc. (Boise, ID, US)
Primary Class:
Other Classes:
709/250
International Classes:
G06F12/08
Field of Search:
709/229, 709/104, 709/250, 710/22, 711/154, 711/113, 711/114, 710/3, 710/100, 709/233, 711/145, 711/137, 711/118, 711/213, 711/112
US Patent References:
0751703February, 1904Chilton et al.137/190
3742253THREE STATE LOGIC DEVICE WITH APPLICATIONSJune, 1973Kronies307/247
4045781Memory module with selectable byte addressing for digital data processing systemAugust, 1977Levy et al.364/200
4240143Hierarchical multi-processor network for memory sharingDecember, 1980Besemer et al.364/200
4245306Selection of addressed processor in a multi-processor networkJanuary, 1981Besemer et al.364/200
4253144Multi-processor communication networkFebruary, 1981Bellamy et al.364/200
4253146Module for coupling computer-processorsFebruary, 1981Bellamy et al.364/200
4608702Method for digital clock recovery from Manchester-encoded signalsAugust, 1986Hirzel et al.375/110
4707823Fiber optic multiplexed data acquisition systemNovember, 1987Holdren et al.370/1
4724520Modular multiport data hubFebruary, 1988Athanas et al.364/200
4831520Bus interface circuit for digital data processorMay, 1989Rubinfeld et al.364/200
4891808Self-synchronizing multiplexerJanuary, 1990Williams370/112
4930128Method for restart of online computer system and apparatus for carrying out the sameMay, 1990Suzuki et al.371/12
4953930CPU socket supporting socket-to-socket optical communicationsSeptember, 1990Ramsey et al.350/96.11
5133059Computer with multiple processors having varying priorities for access to a multi-element memoryJuly, 1992Ziegler et al.395/425
5241506Semiconductor memory circuit apparatusAugust, 1993Motegi et al.365/210
5243703Apparatus for synchronously generating clock signals in a data processing systemSeptember, 1993Farmwald et al.395/325
5251303System for DMA block data transfer based on linked control blocksOctober, 1993Fogg, Jr. et al.395/275
5269022Method and apparatus for booting a computer system by restoring the main memory from a backup memoryDecember, 1993Shinjo et al.395/700
5313590System having fixedly priorized and grouped by positions I/O lines for interconnecting router elements in plurality of stages within parrallel computerMay, 1994Taylor395/325
5317752Fault-tolerant computer system with auto-restart after power-fallMay, 1994Jewett et al.395/750
5319755Integrated circuit I/O using high performance bus interfaceJune, 1994Farmwald et al.395/325
5327553Fault-tolerant computer system with /CONFIG filesystemJuly, 1994Jewett et al.395/575
5355391High speed bus systemOctober, 1994Horowitz et al.375/36
5432823Method and circuitry for minimizing clock-data skew in a bus systemJuly, 1995Gasbarro et al.375/356
5432907Network hub with integrated bridgeJuly, 1995Picazo, Jr. et al.395/200
5442770Triple port cache memoryAugust, 1995Barratt395/403
5461627Access protocol for a common channel wireless networkOctober, 1995Rypinski370/95.2
5465229Single in-line memory moduleNovember, 1995Bechtolsheim et al.345/477
5479370Semiconductor memory with bypass circuitDecember, 1995Furuyama et al.365/189.12
5497476Scatter-gather in data processing systemMarch, 1996Oldfield et al.395/439
5502621Mirrored pin assignment for two sided multi-chip layoutMarch, 1996Schumacher et al.361/760
5544319Fiber optic memory coupling system with converter transmitting and receiving bus data in parallel fashion and diagnostic data in serial fashionAugust, 1996Acton et al.395/200.07
5566325Method and apparatus for adaptive memory accessOctober, 1996Bruce, II et al.395/494
5577220Method for saving and restoring the state of a CPU executing code in protected mode including estimating the value of the page table base registerNovember, 1996Combs et al.395/416
5581767Bus structure for multiprocessor system having separated processor section and control/memory sectionDecember, 1996Katsuki et al.395/800
5606717Memory circuitry having bus interface for receiving information in packets and access time registersFebruary, 1997Farmwald et al.395/856
5638334Integrated circuit I/O using a high performance bus interfaceJune, 1997Farmwald et al.365/230.03
5638534Memory controller which executes read and write commands out of orderJune, 1997Mote, Jr.395/485
5659798Method and system for initiating and loading DMA controller registers by using user-level programsAugust, 1997Blumrich et al.395/846
5687325Application specific field programmable gate arrayNovember, 1997Chang395/284
5706224Content addressable memory and random access memory partition circuitJanuary, 1998Srinivasan et al.365/49
5710733Processor-inclusive memory moduleJanuary, 1998Chengson et al.365/52
5715456Method and apparatus for booting a computer system without pre-installing an operating systemFebruary, 1998Bennett et al.395/652
5729709Memory controller with burst addressing circuitMarch, 1998Harness395/405
5748616Data link module for time division multiplexing control systemsMay, 1998Riley370/242
5796413Graphics controller utilizing video memory to provide macro command capability and enhanched command bufferingAugust, 1998Shipp et al.345/522
5818844Address generation and data path arbitration to and from SRAM to accommodate multiple transmitted packetsOctober, 1998Singh et al.370/463
5819304Random access memory assemblyOctober, 1998Nilsen et al.711/5
5822255Semiconductor integrated circuit for supplying a control signal to a plurality of object circuitsOctober, 1998Uchida365/194
5832250Multi set cache structure having parity RAMs holding parity bits for tag data and for status data utilizing prediction circuitry that predicts and generates the needed parity bitsNovember, 1998Whittaker395/471
5875352Method and apparatus for multiple channel direct memory access controlFebruary, 1999Gentry et al.395/843
5875454Compressed data cache storage systemFebruary, 1999Craft et al.711/113
5887159Dynamically determining instruction hint fieldsMarch, 1999Burrows395/567
5889714Adaptive precharge management for synchronous DRAMMarch, 1999Schumann et al.365/203
5928343Memory module having memory devices containing internal device ID registers and method of initializing sameJuly, 1999Farmwald et al.710/104
5963942Pattern search apparatus and methodOctober, 1999Igata707/6
5966724Synchronous memory device with dual page and burst mode operationsOctober, 1999Ryan711/105
5973935Interdigitated leads-over-chip lead frame for supporting an integrated circuit dieOctober, 1999Schoenfeld et al.361/813
5973951Single in-line memory moduleOctober, 1999Bechtolsheim et al.365/52
5978567System for distribution of interactive multimedia and linear programs by enabling program webs which include control scripts to define presentation by client transceiverNovember, 1999Rebane et al.395/200.49
5987196Semiconductor structure having an optical signal path in a substrate and method for forming the sameNovember, 1999Noble385/14
6011741Computer memory cards using flash EEPROM integrated circuit chips and memory-controller systemsJanuary, 2000Wallace et al.365/221
6023726User configurable prefetch control system for enabling client to prefetch documents from a network serverFebruary, 2000Saksena709/219
6029250Method and apparatus for adaptively adjusting the timing offset between a clock signal and digital signals transmitted coincident with that clock signal, and memory device and system using sameFebruary, 2000Keeth713/400
6031241Capillary discharge extreme ultraviolet lamp source for EUV microlithography and other related applicationsFebruary, 2000Silfvast et al.250/504R
6033951Process for fabricating a storage capacitor for semiconductor memory devicesMarch, 2000Chao438/253
6038630Shared access control device for integrated system with multiple functional units accessing external structures over multiple data busesMarch, 2000Foster et al.710/132
6061263Small outline rambus in-line memory moduleMay, 2000Boaz et al.365/51
6061296Multiple data clock activation with programmable delay for use in multiple CAS latency memory devicesMay, 2000Ternullo, Jr. et al.365/233
6067262Redundancy analysis for embedded memories with built-in self test and built-in self repairMay, 2000Irrinki et al.365/201
6067649Method and apparatus for a low power self test of a memory subsystemMay, 2000Goodwin714/718
6073190System for dynamic buffer allocation comprising control logic for controlling a first address buffer and a first data buffer as a matched pairJune, 2000Rooney710/56
6076139Multimedia computer architecture with multi-channel concurrent memory accessJune, 2000Welker et al.711/104
6079008Multiple thread multiple data predictive coded parallel processing system and methodJune, 2000Clery, III712/11
6092158Method and apparatus for arbitrating between command streamsJuly, 2000Harriman et al.711/151
6098158Software-enabled fast bootAugust, 2000Lay et al.711/162
6105075Scatter gather memory system for a hardware accelerated command interpreter engineAugust, 2000Ghaffari710/5
6111757SIMM/DIMM memory moduleAugust, 2000Dell et al.361/737
6125431Single-chip microcomputer using adjustable timing to fetch data from an external memorySeptember, 2000Kobayashi711/154
6128703Method and apparatus for memory prefetch operation of volatile non-coherent dataOctober, 2000Bourekas et al.711/138
6128706Apparatus and method for a load bias--load with intent to semaphoreOctober, 2000Bryg et al.711/141
6131149Apparatus and method for reading data from synchronous memory with skewed clock pulsesOctober, 2000Lu et al.711/167
6134624High bandwidth cache systemOctober, 2000Burns et al.710/113
6137709Small outline memory moduleOctober, 2000Boaz et al.365/51
6144587Semiconductor memory deviceNovember, 2000Yoshida365/189.05
6145033Management of display FIFO requests for DRAM access wherein low priority requests are initiated when FIFO level is below/equal to high threshold valueNovember, 2000Chee710/57
6157743Method for retrieving compressed texture data from a memory systemDecember, 2000Goris et al.382/233
6157962Multipath I/O storage systems with multiipath I/O request mechanismsDecember, 2000Hodges et al.710/1
6167465System for managing multiple DMA connections between a peripheral device and a memory and performing real-time operations on data carried by a selected DMA connectionDecember, 2000Parvin et al.710/22
6167486Parallel access virtual channel memory system with cacheable channelsDecember, 2000Lee et al.711/120
6175571Distributed memory switching hubJanuary, 2001Haddock et al.370/423
6185352Optical fiber ribbon fan-out cablesFebruary, 2001Hurley385/114
6185676Method and apparatus for performing early branch prediction in a microprocessorFebruary, 2001Poplingher et al.712/239
6186400Bar code reader with an integrated scanning component module mountable on printed circuit boardFebruary, 2001Dvorkis et al.235/462.45
6191663Echo reduction on bit-serial, multi-drop busFebruary, 2001Hannah333/17.3
6201724Semiconductor memory having improved register array access speedMarch, 2001Ishizaki et al.365/49
6208180Core clock correction in a 2/N mode clocking schemeMarch, 2001Fisch et al.327/141
6212590Computer system having integrated bus bridge design with delayed transaction arbitration mechanism employed within laptop computer docked to expansion baseApril, 2001Melo et al.710/119
6219725Method and apparatus for performing direct memory access transfers involving non-sequentially-addressable memory locationsApril, 2001Diehl et al.710/26
6223301Fault tolerant memoryApril, 2001Santeler et al.714/6
6233376Embedded fiber optic circuit boards and integrated circuitsMay, 2001Updegrove385/14
6243769Dynamic buffer allocation for a computer systemJune, 2001Rooney710/56
6243831Computer system with power loss protection mechanismJune, 2001Mustafa et al.714/24
6246618Semiconductor integrated circuit capable of testing and substituting defective memories and method thereofJune, 2001Yamamoto et al.365/200
6247107Chipset configured to perform data-directed prefetchingJune, 2001Christie711/216
6249802Method, system, and computer program product for allocating physical memory in a distributed shared memory networkJune, 2001Richardson et al.709/200
6252821Method and apparatus for memory address decode in memory subsystems supporting a large number of memory devicesJune, 2001Nizar et al.365/238.6
6256692CardBus interface circuit, and a CardBus PC having the sameJuly, 2001Yoda et al.710/104
6266730High-frequency bus systemJuly, 2001Perino et al.710/126
6272609Pipelined memory controllerAugust, 2001Jeddeloh711/169
6285349Correcting non-uniformity in displaysSeptember, 2001Smith345/147
6286083Computer system with adaptive memory arbitration schemeSeptember, 2001Chin et al.711/151
6294937Method and apparatus for self correcting parallel I/O circuitrySeptember, 2001Crafts et al.327/158
6301637High performance data pathsOctober, 2001Krull et al.711/112
6327642Parallel access virtual channel memory systemDecember, 2001Lee et al.711/120
6330205Virtual channel synchronous dynamic random access memoryDecember, 2001Shimizu et al.365/230.06
6347055Line buffer type semiconductor memory device capable of direct prefetch and restore operationsFebruary, 2002Motomura365/189.05
6349363Multi-section cache with different attributes for each sectionFebruary, 2002Cai et al.711/129
6356573Vertical cavity surface emitting laserMarch, 2002Jonsson et al.372/46
6367074Operation of a systemApril, 2002Bates et al.717/11
6370068Semiconductor memory devices and methods for sampling data therefrom based on a relative position of a memory cell array section containing the dataApril, 2002Rhee365/196
6370611Raid XOR operations to synchronous DRAM using a read buffer and pipelining of synchronous DRAM burst read dataApril, 2002Callison et al.711/105
6373777Semiconductor memoryApril, 2002Suzuki365/230.03
6381190Semiconductor memory device in which use of cache can be selectedApril, 2002Shinkai365/230.03
6389514Method and computer system for speculatively closing pages in memoryMay, 2002Rokicki711/136
6392653Device for processing acquisition data, in particular image dataMay, 2002Malandain et al.345/501
6401149Methods for context switching within a disk controllerJune, 2002Dennin et al.710/58
6401213Timing circuit for high speed memoryJune, 2002Jeddeloh713/401
6405280Packet-oriented synchronous DRAM interface supporting a plurality of orderings for data block transfers within a burst sequenceJune, 2002Ryan711/105
6421744Direct memory access controller and method thereforJuly, 2002Morrison et al.710/22
6430696Method and apparatus for high speed data capture utilizing bit-to-bit timing correction, and memory device using sameAugust, 2002Keeth713/503
6433785Method and apparatus for improving processor to graphics device throughputAugust, 2002Garcia et al.345/531
6434639System for combining requests associated with one or more memory locations that are collectively associated with a single cache line to furnish a single memory operationAugust, 2002Haghighi710/39
6434696Method for quickly booting a computer systemAugust, 2002Kang713/2
6434736Location based timing scheme in memory designAugust, 2002Schaecher et al.716/17
6438622Multiprocessor system including a docking systemAugust, 2002Haghighi et al.710/1
6438668Method and apparatus for reducing power consumption in a digital processing systemAugust, 2002Esfahani et al.711/165
6449308High-speed digital distribution systemSeptember, 2002Knight, Jr. et al.375/212
6453393Method and apparatus for interfacing to a computer memorySeptember, 2002Holman et al.711/154
6457116Method and apparatus for controlling contexts of multiple context processing elements in a network of multiple context processing elementsSeptember, 2002Mirsky et al.712/16
6460108Low cost data streaming mechanismOctober, 2002McCoskey et al.710/310
6460114Storing a flushed cache line in a memory buffer of a controllerOctober, 2002Jeddeloh711/120
6462978Method of designing semiconductor integrated circuit device and semiconductor integrated circuit deviceOctober, 2002Shibata et al.365/63
6463059Direct memory access execution engine with indirect addressing of circular queues in addition to direct memory addressingOctober, 2002Movshovich et al.370/389
6467013Memory transceiver to couple an additional memory channel to an existing memory channelOctober, 2002Nizar711/1
6470422Buffer memory management in a system having multiple execution entitiesOctober, 2002Cai et al.711/129
6473828Virtual channel synchronous dynamic random access memoryOctober, 2002Matsui711/104
6477592System for I/O interfacing for semiconductor chip utilizing addition of reference element to each data element in first data stream and interpret to recover data elements of second data streamNovember, 2002Chen et al.710/52
6477614Method for implementing multiple memory buses on a memory moduleNovember, 2002Leddige et al.711/5
6477621Parallel access virtual channel memory systemNovember, 2002Lee et al.711/120
6479322Semiconductor device with two stacked chips in one resin body and method of producingNovember, 2002Kawata et al.438/109
6487556Method and system for providing an associative datastore within a data processing systemNovember, 2002Downs et al.707/101
6487628Peripheral component interface with multiple data channels and reduced latency over a system area networkNovember, 2002Duong et al.710/313
6490188Semiconductor devices having mirrored terminal arrangements, devices including same, and methods of testing such semiconductor devicesDecember, 2002Nuxoll et al.365/63
6493803Direct memory access controller with channel width configurability supportDecember, 2002Pham et al.711/147
6496193Method and apparatus for fast loading of texture data into a tiled memoryDecember, 2002Surti et al.345/552
6496909Method for managing concurrent access to virtual memory data structuresDecember, 2002Schimmel711/163
6501471Volume renderingDecember, 2002Venkataraman et al.345/424
6502161Memory system including a point-to-point linked memory subsystemDecember, 2002Perego et al.711/5
6505287Virtual channel memory access controlling circuitJanuary, 2003Uematsu711/170
6523092Cache line replacement policy enhancement to avoid memory page thrashingFebruary, 2003Fanning711/134
6523093Prefetch buffer allocation and filtering systemFebruary, 2003Bogin et al.711/137
6526483Page open hint in transactionsFebruary, 2003Cho et al.711/154
6526498Method and apparatus for retiming in a network of multiple context processing elementsFebruary, 2003Mirsky et al.712/11
6539490Clock distribution without clock delay or skewMarch, 2003Forbes et al.713/401
6552564Technique to reduce reflections and ringing on CMOS interconnectionsApril, 2003Forbes et al.326/30
6553479Local control of multiple context processing elements with major contexts and minor contextsApril, 2003Mirsky et al.712/16
6564329System and method for dynamic clock generationMay, 2003Cheung et al.713/322
6587912Method and apparatus for implementing multiple memory buses on a memory moduleJuly, 2003Leddige et al.711/5
6590816Integrated memory and method for testing and repairing the integrated memoryJuly, 2003Perner365/200
6594713Hub interface unit and application unit interfaces for expanded direct memory access processorJuly, 2003Fuoco et al.710/31
6594722Mechanism for managing multiple out-of-order packet streams in a PCI host bridgeJuly, 2003Willke, II et al.710/313
6598154Precoding branch instructions to reduce branch-penalty in pipelined processorsJuly, 2003Vaid et al.712/237
6615325Method for switching between modes of operationSeptember, 2003Mailloux et al.711/154
6622227Method and apparatus for utilizing write buffers in memory control/interfaceSeptember, 2003Zumkehr et al.711/167
6628294Prefetching of virtual-to-physical address translation for display dataSeptember, 2003Sadowsky et al.345/568
6629220Method and apparatus for dynamic arbitration between a first queue and a second queue based on a high priority transaction typeSeptember, 2003Dyer711/158
6631440Method and apparatus for scheduling memory calibrations based on transactionsOctober, 2003Jenne et al.711/105
6633959Non-uniform memory access (NUMA) data processing system that provides notification of remote deallocation of shared dataOctober, 2003Arimilli et al.711/141
6636110Internal clock generating circuit for clock synchronous semiconductor memory deviceOctober, 2003Ooishi et al.327/565
6646929Methods and structure for read data synchronization with minimal latencyNovember, 2003Moss et al.365/194
6647470Memory device having posted write per commandNovember, 2003Janzen711/154
6658509Multi-tier point-to-point ring memory interfaceDecember, 2003Bonella et al.710/100
6662304Method and apparatus for bit-to-bit timing correction of a high speed memory busDecember, 2003Keeth et al.713/400
6665202Content addressable memory (CAM) devices that can identify highest priority matches in non-sectored CAM arrays and methods of operating sameDecember, 2003Lindahl et al.365/49
6667895Integrated circuit device and module with integrated circuitsDecember, 2003Jang et al.365/63
6681292Distributed read and write caching implementation for optimized input/output applicationsJanuary, 2004Creta et al.711/119
6681302Page open hint in transactionsJanuary, 2004Cho et al.711/154
6697926Method and apparatus for determining actual write latency and accurately aligning the start of data capture with the arrival of data at a memory deviceFebruary, 2004Johnson et al.711/167
6704817Computer architecture and system for efficient management of bi-directional busMarch, 2004Steinman et al.710/100
6715018Computer including installable and removable cards, optical interconnection between cards, and method of assembling a computerMarch, 2004Farnworth et al.710/300
6718440Memory access latency hiding with hint bufferApril, 2004Maiyuran et al.711/137
6721195Reversed memory module socket and motherboard incorporating sameApril, 2004Brunelle et al.365/63
6724685Configuration for data transmission in a semiconductor memory system, and relevant data transmission methodApril, 2004Braun et al.365/233
6728800Efficient performance based scheduling mechanism for handling multiple TLB operationsApril, 2004Lee et al.710/54
6731548Reduced power registered memory module and methodMay, 2004Pax365/189.12
6735679Apparatus and method for optimizing access to memoryMay, 2004Herbst et al.711/167
6735682Apparatus and method for address calculationMay, 2004Segelken et al.711/220
6745275Feedback system for accomodating different memory module loadingJune, 2004Chang710/305
6751113Arrangement of integrated circuits in a memory moduleJune, 2004Bhakta et al.365/63
6751703Data storage systems and methods which utilize an on-board cacheJune, 2004Chilton711/113
6751722Local control of multiple context processing elements with configuration contextsJune, 2004Mirsky et al.712/15
6754117System and method for self-testing and repair of memory modulesJune, 2004Jeddeloh365/201
6754812Hardware predication for conditional instruction path branchingJune, 2004Abdallah et al.712/234
6756661Semiconductor device, a semiconductor module loaded with said semiconductor device and a method of manufacturing said semiconductor deviceJune, 2004Tsuneda et al.257/673
6760833Split embedded DRAM processorJuly, 2004Dowling712/34
6771538Semiconductor integrated circuit and nonvolatile memory elementAugust, 2004Shukuri et al.365/185.05
6772295System and method for managing data in an I/O cacheAugust, 2004Spencer et al.711/137
6775747System and method for performing page table walks on speculative software prefetch operationsAugust, 2004Venkatraman711/137
6782466Arrangement and method for accessing data in a virtual memory arrangementAugust, 2004Steele et al.711/209
6785780Distributed processor memory module and methodAugust, 2004Klein et al.711/48
6788104Field programmable logic device with efficient memory utilizationSeptember, 2004Singh et al.326/39
6789173Node controller for performing cache coherence control and memory-shared multiprocessor systemSeptember, 2004Tanaka et al.711/147
6792059Early/on-time/late gate bit synchronizerSeptember, 2004Yuan et al.375/354
6792496Prefetching data for peripheral component interconnect devicesSeptember, 2004Aboulenein et al.710/306
6795899Memory system with burst length shorter than prefetch lengthSeptember, 2004Dodd et al.711/137
6799246Memory interface for reading/writing data from/to a memorySeptember, 2004Wise et al.711/117
6799268Branch ordering bufferSeptember, 2004Boggs et al.712/228
6804760Method for determining a type of memory present in a systemOctober, 2004Wiliams711/170
6804764Write clock and data window tuning based on rank selectOctober, 2004LaBerge et al.711/170
6807630Method for fast reinitialization wherein a saved system image of an operating system is transferred into a primary memory from a secondary memoryOctober, 2004Lay et al.713/2
6811320System for connecting a fiber optic cable to an electronic deviceNovember, 2004Abbott385/58
6816947System and method for memory arbitrationNovember, 2004Huffman711/151
6820181Method and system for controlling memory accesses to memory modules having a memory hub architectureNovember, 2004Jeddeloh et al.711/169
6821029High speed serial I/O technology using an optical linkNovember, 2004Grung et al.385/92
6823023Serial bus communication systemNovember, 2004Hannah375/296
6829705System information display method and apparatusDecember, 2004Smith713/1
6845409Data exchange methods for a switch which selectively forms a communication channel between a processing unit and multiple devicesJanuary, 2005Talagala et al.710/20
6889304Memory device supporting a dynamically configurable core organizationMay, 2005Perego et al.711/170
6904556Systems and methods which utilize parity setsJune, 2005Walton et al.714/766
6910109Tracking memory page stateJune, 2005Holman et al.711/156
6947672High-speed optical data linksSeptember, 2005Jiang et al.398/135
6980042Delay line synchronizer apparatus and methodDecember, 2005LaBerge327/291
7107415Posted write buffers and methods of posting write requests in memory modulesSeptember, 2006Jeddeloh et al.711/154
7318130System and method for thermal throttling of memory modulesJanuary, 2008Morrow et al.711/154
20010039612Apparatus and method for fast bootingNovember, 2001Lee713/2
20020112119Dual-port buffer-to-memory interfaceAugust, 2002Halbert et al.711/115
20020116588Software management systems and methods for automotive computing devicesAugust, 2002Beckert et al.711/161
20020120709Shared-memory controller for use in a multimedia processor systemAugust, 2002Chow et al.709/213
20020144064Controlling cache memory in external chipset using processorOctober, 2002Fanning711/144
20020178319Optical bus arrangement for computer systemNovember, 2002Sanchez-Olea710/305
20030005223System boot time reduction methodJanuary, 2003Coulson et al.711/118
20030014578Routability for memeory devicesJanuary, 2003Pax710/100
20030043158Method and apparatus for reducing inefficiencies in shared memory devicesMarch, 2003Wasserman et al.345/545
20030043426Optical interconnect in high-speed memory systemsMarch, 2003Baker et al.359/109
20030065836Controller data sharing using a modular DMA architectureApril, 2003Pecone710/62
20030093630Techniques for processing out-of -order requests in a processor-based systemMay, 2003Richard et al.711/154
20030095559Systems including packet interfaces, switches, and packet DMA circuits for splitting and merging packet streamsMay, 2003Sano et al.370/419
20030149809Method and apparatus for timing and event processing in wireless systemsAugust, 2003Jensen et al.710/22
20030156639Frame rate control system and methodAugust, 2003Liang375/240.01
20030163649Shared bypass bus structureAugust, 2003Kapur et al.711/146
20030177320Memory read/write reorderingSeptember, 2003Sah et al.711/158
20030193927Random access memory architecture and serial interface with continuous packet handling capabilityOctober, 2003Hronik370/351
20030217223Combined command setNovember, 2003Nino, Jr. et al.711/105
20030223295Field programmable gate array with a variably wide word width memoryDecember, 2003Ozguz et al.365/201
20030229762Apparatus, method, and system for synchronizing information prefetch between processors and memory controllersDecember, 2003Maiyuran et al.711/137
20030229770Memory hub with internal cache and/or memory access predictionDecember, 2003Jeddeloh711/213
20040019728Multiple hardware partitions under one input/output hubJanuary, 2004Sharma710/306
20040022094Cache usage for concurrent multiple streamsFebruary, 2004Radhakrishnan et al.365/200
20040024959System and method for optically interconnecting memory devicesFebruary, 2004Taylor711/105
20040024978Memory hub and access method having internal row cachingFebruary, 2004Jeddeloh711/154
20040034753Memory hub bypass circuit and methodFebruary, 2004Jeddeloh711/163
20040044833System and method for optimizing interconnections of memory devices in a multichip moduleMarch, 2004Ryan711/5
20040044857Method and system for controlling memory accesses to memory modules having a memory hub architectureMarch, 2004Jeddeloh et al.711/154
20040049649Computer system and method with memory copy commandMarch, 2004Durrant711/165
20040064602Claiming cycles on a processor bus in a system having a PCI to PCI bridge north of a memory controllerApril, 2004George710/22
20040122988System for controlling data transfer protocol with a host bus interfaceJune, 2004Han et al.710/5
20040126115System having multiple agents on optical and electrical busJuly, 2004Levy et al.398/116
20040128449Method and system to improve prefetching operationsJuly, 2004Osborne et al.711/137
20040144994Apparatus and methods for optically-coupled memory systemsJuly, 2004Lee et al.257/200
20040158677Buffered writes and memory page controlAugust, 2004Dodd711/118
20040236885Arrangement and method for system of locally deployed module units, and contact unit for connection of such a module unitNovember, 2004Fredriksson et al.710/100
20040260864Reconfigurable memory module and methodDecember, 2004Lee et al.711/101
20040268061Mechanism for handling explicit writeback in a cache coherent multi-node architectureDecember, 2004Khare et al.711/151
20050044327Asynchronous, independent and multiple process shared memory system in an adaptive computing architectureFebruary, 2005Howard et al.711/147
20050066137Method and system for controlling memory accesses to memory modules having a memory hub architectureMarch, 2005Jeddeloh et al.711/154
20050071542Prefetch mechanism for use in a system including a host connected to a plurality of memory modules via a serial memory interconnectMarch, 2005Weber et al.711/105
20050078506POSTED PRECHARGE AND MULTIPLE OPEN-PAGE RAM ARCHITECTUREApril, 2005Rao et al.365/154
20050105350Memory channel test fixture and methodMay, 2005Zimmerman365/201
20050149774System and method for read synchronization of memory modulesJuly, 2005Jeddeloh et al.713/500
20050166006System including a host connected serially in a chain to one or more memory modules that include a cacheJuly, 2005Talbot et al.711/105
20050172084Buffer control system and method for a memory system having memory request buffersAugust, 2005Jeddeloh711/154
20050216678Memory hub and method for providing memory sequencing hintsSeptember, 2005Jeddeloh711/154
20050223161Memory hub and access method having internal row cachingOctober, 2005Jeddeloh711/105
20050246558Power management using a pre-determined thermal characteristic of a memory moduleNovember, 2005Ku713/300
20050286506System and method for an asynchronous data buffer having buffer write and read pointersDecember, 2005LaBerge370/363
20060085616Method and system for dynamically adjusting DRAM refresh rateApril, 2006Zeighami et al.711/167
20060168407Memory hub system and method having large virtual page sizeJuly, 2006Stern711/154
20070011392Reconfigurable memory module and methodJanuary, 2007Lee et al.711/5
20070033353Memory hub and method for memory sequencingFebruary, 2007Jeddeloh711/154
Foreign References:
EP0849685June, 1998Communication bus system between processors and memory modules
JP2001265539September, 2001ARRAY TYPE STORAGE DEVICE AND INFORMATION PROCESSING SYSTEM
WO/1993/019422September, 1993FIBER OPTIC MEMORY COUPLING SYSTEM
WO/1998/057489December, 1998MODULAR SYSTEM FOR ACCELERATING DATA SEARCHES AND DATA STREAM OPERATIONS
WO/2002/027499April, 2002SHARED TRANSLATION ADDRESS CACHING
Other References:
“Free On-Line Dictionary of Computing” entry ‘Flash Erasable Programmable Read-Only Memory, online May 17, 2004 [http://foldoc.doc.ic.ac.uk/foldoc/foldoc.cgi?flash+memory].
Intel, “Flash Memory PCI Add-In Card for Embedded Systems”, Application Note AP-758, Sep. 1997, pp. i-13.
Intel, “Intel 840 Chipset: 82840 Memory Controller Hub (MCH)”, Datasheet, www.intel.com/design/chipsets/datashts/298020.htm, Oct. 1999, pp. 1-178.
Micron Technology, Inc., “Synchronous DRAM Module 512MB/1GB (x72, ECC) 168-PIN Registered FBGA SDRAM DIMM”, Micron Technology, Inc., 2002, pp. 1-23.
Shanley, T. et al., “PCI System Architecture”, Third Edition, Mindshare, Inc., 1995, pp. 24-25.
Hellwagner, Hermann et al., “Enabling a PC Cluster for High-Performance Computing”, The Speedup Journal, vol. 11, No. 1, Jun. 1997, pp. 1-9.
Primary Examiner:
Bataille, Pierre-michel
Attorney, Agent or Firm:
Dorsey & Whitney LLP
Parent Case Data:

CROSS-REFERENCE TO RELATED APPLICATION

This application is a continuation of U.S. patent application Ser. No. 10/601,252, filed Jun. 20, 2003, now U.S. Pat. No. 7,260,685.

Claims:
The invention claimed is:

1. A system for prefetching data in a memory system comprising: a link interface receiving memory requests for access to memory cells in at least one of a plurality of memory devices; a memory device interface coupled to the memory devices, the memory device interface being operable to transfer memory requests to the memory devices for access to memory cells in at least one of the memory devices and to receive read data responsive to at least some of the memory requests; and a prefetch circuit coupled to the link interface to receive memory requests from the link interface and coupled to the memory device interface to receive data from the memory cells, the prefetch circuit operable to predict addresses that are likely to be accessed in the memory devices based on the memory requests and to prefetch and store data from memory cells according to the predicted addresses, the prefetch circuit further operable to provide the prefetched data to the link interface responsive to subsequent memory requests, the prefetch circuit being configured to detect a pattern from which addresses to memory requests likely to be accessed are predicted, and to generate prefetch suggestions indicative of addresses corresponding to the predicted memory requests.

2. The system of claim 1 wherein the prefetch circuit is operable to group the predicted memory request addresses into a plurality of sets corresponding to respective strides, and causes the prefetched data received from the memory cells to be stored into a plurality of sections each corresponding to a respective stride.

3. The system of claim 1 wherein the prefetch circuit is operable to selectively enable prefetching data based on the nature of the memory requests received from the link interface.

4. The system of claim 3 wherein the prefetch circuit is operable to selectively enable prefetching based on a percentage of memory requests for read data from the stored prefetch data compared to read data from the memory cells.

5. The system of claim 1 wherein the prefetch circuit comprises a plurality of prefetch buffers coupled to the memory device interface for receiving and storing prefetched data from memory cells being accessed based on the predicted addresses.

6. The system of claim 5 wherein the plurality of prefetch buffers comprises a single static random access memory device partitioned into a plurality of sections corresponding a plurality of strides.

7. The system of claim 5 wherein the prefetch circuit comprises a data read control circuit coupled to the memory device interface, the link interface, and the plurality of prefetch buffers, the data read control circuit operable to determine from the memory requests received from the link interface if the data corresponding to the memory requests are stored in the plurality of prefetch buffers, the data read control circuit further operable to transfer data from the plurality of prefetch buffers if the requested data are stored in the plurality of prefetch buffers and to transfer data from the memory device interface if the data are stored in the memory cells.

8. The system of claim 7 wherein the data read control circuit comprises a tag logic unit coupled to the link interface and the plurality of prefetched buffers, the tag logic unit operable to store addresses of the prefetched data stored in the plurality of prefetch buffers, the tag logic unit further operable to receive memory requests from the link interface, compare the addresses of the received memory requests to the addresses of the stored prefetched data, and generate a hit control signal indicative of whether an address match was determined.

9. The system of claim 7 wherein the data read control circuit comprises a multiplexer having data inputs coupled to the plurality of prefetch buffers, to the tag logic unit and to the memory device interface, and a data output coupled to the link interface, the multiplexer operable to transfer the data input from the plurality of prefetch buffers to the data output responsive to an active hit control signal received from the tag logic unit, and to transfer the data input form the memory device interface to the data output responsive to an inactive hit control signal received from the tag logic unit.

10. The system of claim 1 wherein the prefetch circuit is further operable to prefetch data from memory cells that are currently accessed such that the prefetching does not require the selected memory cells to be precharged.

11. The system of claim 1 wherein the link interface comprises an optical input/output port.

12. A system for prefetching data in a memory system comprising: a link interface receiving memory requests for access to memory cells in at least one of a plurality of memory devices; a memory device interface coupled to the memory devices, the memory device interface being operable to transfer memory requests to the memory devices for access to memory cells in at least one of the memory devices and to receive read data responsive to at least some of the memory requests; and a prefetch circuit coupled to the link interface to receive memory requests from the link interface and coupled to the memory device interface to receive data from the memory cells, the prefetch circuit operable to predict addresses that are likely to be accessed in the memory devices based on the memory requests and to prefetch and store data from memory cells according to the predicted addresses, the prefetch circuit further operable to provide the prefetched data to the link interface responsive to subsequent memory requests, the prefetch circuit comprising a single static random access memory device partitioned into a plurality of sections corresponding a plurality of strides and being coupled to the memory device interface for receiving and storing prefetched data from memory cells being accessed based on the predicted addresses.

13. A system for prefetching data in a memory system comprising: a link interface receiving memory requests for access to memory cells in at least one of a plurality of memory devices; a memory device interface coupled to the memory devices, the memory device interface being operable to transfer memory requests to the memory devices for access to memory cells in at least one of the memory devices and to receive read data responsive to at least some of the memory requests; and a prefetch circuit coupled to the link interface to receive memory requests from the link interface and coupled to the memory device interface to receive data from the memory cells, the prefetch circuit operable to predict addresses that are likely to be accessed in the memory devices based on the memory requests and to prefetch and store data from memory cells according to the predicted addresses, the prefetch circuit further operable to provide the prefetched data to the link interface responsive to subsequent memory requests, the prefetch circuit further comprising: a plurality of prefetch buffers coupled to the memory device interface for receiving and storing prefetched data from memory cells being accessed based on the predicted addresses; and a data read control circuit coupled to the memory device interface, the link interface, and the plurality of prefetch buffers, the data read control circuit operable to determine from the memory requests received from the link interface if the data corresponding to the memory requests are stored in the plurality of prefetch buffers, the data read control circuit further operable to transfer data from the plurality of prefetch buffers if the requested data are stored in the plurality of prefetch buffers and to transfer data from the memory device interface if the data are stored in the memory cells.

14. The system of claim 13 wherein the data read control circuit comprises a tag logic unit coupled to the link interface and the plurality of prefetched buffers, the tag logic unit operable to store addresses of the prefetched data stored in the plurality of prefetch buffers, the tag logic unit further operable to receive memory requests from the link interface, compare the addresses of the received memory requests to the addresses of the stored prefetched data, and generate a hit control signal indicative of whether an address match was determined.

15. The system of claim 13 wherein the data read control circuit comprises a multiplexer having data inputs coupled to the plurality of prefetch buffers, to the tag logic unit and to the memory device interface, and a data output coupled to the link interface, the multiplexer operable to transfer the data input from the plurality of prefetch buffers to the data output responsive to an active hit control signal received from the tag logic unit, and to transfer the data input form the memory device interface to the data output responsive to an inactive hit control signal received from the tag logic unit.

16. A memory hub comprising: a link interface receiving memory requests for access to memory cells in at least one of a plurality of memory device; a memory device interface coupled to the memory devices, the memory device interface being operable to transfer memory requests to the memory devices for access to memory cells in at least one of the memory devices and to receive data responsive to at least some of the memory requests; a prefetch circuit coupled to the link interface to receive memory requests from the link interface and coupled to the memory device interface to receive data from the memory cells, the prefetch circuit operable to predict addresses that are likely to be accessed in the memory devices based on the memory requests and generate prefetch suggestions indicative of the predicted addresses to prefetch data from memory cells, the prefetch circuit further operable to store the data that are prefetched and transfer the prefetch data to the link interface; and a memory sequencer coupled to the link interface, the memory device interface and the prefetch circuit, the memory sequencer operable to transfer memory requests received from the link interface to the prefetch circuit to determine if the data corresponding to the memory requests are stored in the prefetch circuit, the memory sequencer further being operable to transfer memory requests received from the link interface to the memory device interface to fetch the data corresponding to the memory requests if the data are not stored in the prefetch circuit, the memory sequencer further being operable to generate prefetch requests responsive to the prefetch suggestions received from the prefetch circuit to fetch data corresponding to the prefetch suggestions.

17. The memory hub of claim 16 wherein the prefetch circuit is operable to detect a pattern from which addresses to memory requests likely to be accessed are predicted, and operable to generate prefetch suggestions indicative of addresses corresponding to the predicted memory requests.

18. The memory hub of claim 17 wherein the prefetch circuit is further operable to group the predicted memory request addresses into a plurality of sets corresponding to respective strides, and to cause the prefetched data received from the memory cells to be stored into a plurality of sections each corresponding to a respective stride.

19. The memory hub of claim 17 wherein the prefetch circuit is further operable to selectively enable prefetching data based on the nature of the memory requests received from the link interface.

20. The memory hub of claim 19 wherein the prefetch circuit is configured to selectively enable prefetching based on a percentage of memory requests for read data from the stored prefetch data compared to read data from the memory cells.

21. The memory hub of claim 16 wherein the prefetch circuit comprises a plurality of prefetch buffers coupled to the memory device interface for receiving and storing prefetched data from memory cells being accessed based on the prefetch suggestions.

22. The memory hub of claim 21 wherein the plurality of prefetch buffers comprises a single static random access memory device partitioned into a plurality of sections corresponding a plurality of strides.

23. The memory hub of claim 21 wherein the plurality of prefetch buffers comprises a plurality of registers that are enabled and partitioned according to a plurality of respective strides.

24. The memory hub of claim 21 wherein the prefetch circuit comprises a data read control circuit coupled to the memory device interface, the link interface, and the plurality of prefetch buffers, the data read control circuit operable to determine from the memory requests received from the link interface if the data corresponding to the memory requests are stored in the plurality of prefetch buffers, the data read control circuit further operable to transfer data from the plurality of prefetch buffers if the requested data are stored in the plurality of prefetch buffers and to transfer data from the memory device interface if the data are stored in the memory cells.

25. The memory hub of claim 24 wherein the data read control circuit comprises a tag logic unit coupled to the link interface and the plurality of prefetched buffers, the tag logic unit operable to store addresses of the prefetched data stored in the plurality of prefetch buffers, the tag logic unit further operable to receive memory requests from the link interface, compare the addresses of the received memory requests to the addresses of the stored prefetched data, and generate a hit control signal indicative of whether an address match was determined.

26. The memory of hub of claim 25 wherein the tag logic unit is further coupled the memory sequencer to which the hit control signal is applied, and wherein the memory sequencer is prevented from transferring a memory request to the memory device interface responsive to receiving an active hit control signal.

27. The memory hub of claim 24 wherein the data read control circuit comprises a multiplexer having data inputs coupled to the plurality of prefetch buffers, to the tag logic unit and to the memory device interface, and a data output coupled to the link interface, the multiplexer operable to transfer the data input from the plurality of prefetch buffers to the data output responsive to an active hit control signal received from the tag logic unit, and to transfer the data input from the memory device interface to the data output responsive to an inactive hit control signal received from the tag logic unit.

28. The memory hub of claim 16 wherein the memory sequencer is operable to generate and transfer prefetch requests to the memory device interface only when memory requests are not being transferred from the link interface to the memory sequencer.

29. The memory hub of claim 16 wherein the prefetch circuit is further operable to prefetch data from memory cells that are currently accessed such that the act of prefetching data does not require the selected memory cells to be precharged.

30. The memory hub of claim 16 wherein the link interface comprises an optical input/output port.

31. A computer system, comprising: a central processing unit (“CPU”); a system controller coupled to the CPU, the system controller having an input port and an output port; an input device coupled to the CPU through the system controller; an output device coupled to the CPU through the system controller; a storage device coupled to the CPU through the system controller; a plurality of memory modules, each of the memory modules comprising: a plurality of memory devices; and a system for prefetching data comprising: a link interface receiving memory requests for access to memory cells in at least one of the memory devices; a memory device interface coupled to the memory devices, the memory device interface being operable to transfer memory requests to the memory devices for access to memory cells in at least one of the memory devices and to receive read data responsive to at least some of the memory requests; and a prefetch circuit coupled to the link interface to receive memory requests from the link interface and coupled to the memory device interface to receive data from the memory cells, the prefetch circuit operable to predict addresses that are likely to be accessed in the memory devices based on the memory requests and to prefetch and store data from memory cells according to the predicted addresses, the prefetch circuit further operable to provide the prefetched data to the link interface responsive to subsequent memory requests, the prefetch circuit being configured to detect a pattern from which addresses to memory requests likely to be accessed are predicted, and operable to generate prefetch suggestions indicative of addresses corresponding to the predicted memory requests.

32. The computer system of claim 31 wherein the prefetch circuit is operable to group the predicted memory request addresses into a plurality of sets corresponding to respective strides, and causes the prefetched data received from the memory cells to be stored into a plurality of sections each corresponding to a respective stride.

33. The computer system of claim 31 wherein the prefetch circuit is operable to selectively enable prefetching data based on the nature of the memory requests received from the link interface.

34. The computer system of claim 33 wherein the prefetch circuit is operable to selectively enable prefetching based on a percentage of memory requests for read data from the stored prefetch data compared to read data from the memory cells.

35. The computer system of claim 31 wherein the prefetch circuit comprises a plurality of prefetch buffers coupled to the memory device interface for receiving and storing prefetched data from memory cells being accessed based on the predicted addresses.

36. The computer system of claim 35 wherein the plurality of prefetch buffers comprises a single static random access memory device partitioned into a plurality of sections corresponding a plurality of strides.

37. The computer system of claim 35 wherein the prefetch circuit comprises a data read control circuit coupled to the memory device interface, the link interface, and the plurality of prefetch buffers, the data read control circuit operable to determine from the memory requests received from the link interface if the data corresponding to the memory requests are stored in the plurality of prefetch buffers, the data read control circuit further operable to transfer data from the plurality of prefetch buffers if the requested data are stored in the plurality of prefetch buffers and to transfer data from the memory device interface if the data are stored in the memory cells.

38. The computer system of claim 37 wherein the data read control circuit comprises a tag logic unit coupled to the link interface and the plurality of prefetched buffers, the tag logic unit operable to store addresses of the prefetched data stored in the plurality of prefetch buffers, the tag logic unit further operable to receive memory requests from the link interface, compare the addresses of the received memory requests to the addresses of the stored prefetched data, and generate a hit control signal indicative of whether an address match was determined.

39. The computer system of claim 37 wherein the data read control circuit comprises a multiplexer having data inputs coupled to the plurality of prefetch buffers, to the tag logic unit and to the memory device interface, and a data output coupled to the link interface, the multiplexer operable to transfer the data input from the plurality of prefetch buffers to the data output responsive to an active hit control signal received from the tag logic unit, and to transfer the data input form the memory device interface to the data output responsive to an inactive hit control signal received from the tag logic unit.

40. The computer system of claim 31 wherein the prefetch circuit is further operable to prefetch data from memory cells that are currently accessed such that the prefetching does not require the selected memory cells to be precharged.

41. The computer system of claim 31 wherein the link interface comprises an optical input/output port.

42. A computer system, comprising: a central processing unit (“CPU”); a system controller coupled to the CPU, the system controller having an input port and an output port; an input device coupled to the CPU through the system controller; an output device coupled to the CPU through the system controller; a storage device coupled to the CPU through the system controller; a plurality of memory modules, each of the memory modules comprising: a plurality of memory devices; and a memory hub comprising: a link interface receiving memory requests for access to memory cells in at least one of the memory devices; a memory device interface coupled to the memory devices, the memory device interface being operable to transfer memory requests to the memory devices for access to memory cells in at least one of the memory devices and to receive data responsive to at least some of the memory requests; a prefetch circuit coupled to the link interface to receive memory requests from the link interface and coupled to the memory device interface to receive data from the memory cells, the prefetch circuit operable to predict addresses that are likely to be accessed in the memory devices based on the memory requests and generate prefetch suggestions indicative of the predicted addresses to prefetch data from memory cells, the prefetch circuit further operable to store the data that are prefetched and transfer the prefetched data to the link interface; and a memory sequencer coupled to the link interface, the memory device interface and the prefetch circuit, the memory sequencer operable to transfer memory requests received from the link interface to the prefetch circuit to determine if the data corresponding to the memory requests are stored in the prefetch circuit, the memory sequencer further being operable to transfer memory requests received from the link interface to the memory device interface to fetch the data corresponding to the memory requests if the data are not stored, in the prefetch circuits, the memory sequencer further operable to generate prefetch requests responsive to the prefetch suggestions received from the prefetch circuit to fetch data corresponding to the prefetch suggestions.

43. The computer system of claim 42 wherein the prefetch circuit is operable to detect a pattern from which addresses to memory requests likely to be accessed are predicted, and operable to generate prefetch suggestions indicative of addresses corresponding to the predicted memory requests.

44. The computer system of claim 43 wherein the prefetch circuit is further operable to group the predicted memory request addresses into a plurality of sets corresponding to respective strides, and to cause the prefetched data received from the memory cells to be stored into a plurality of sections each corresponding to a respective stride.

45. The computer system of claim 43 wherein the prefetch circuit is further operable to selectively enable prefetching data based on the nature of the memory requests received from the link interface.

46. The computer system of claim 45 wherein the prefetch circuit is configured to selectively enable prefetching based on a percentage of memory requests for read data from the stored prefetch data compared to read data from the memory cells.

47. The computer system of claim 42 wherein the prefetch circuit comprises a plurality of prefetch buffers coupled to the memory device interface for receiving and storing prefetched data from memory cells being accessed based on the prefetch suggestions.

48. The computer system of claim 47 wherein the plurality of prefetch buffers comprises a single static random access memory device partitioned into a plurality of sections corresponding a plurality of strides.

49. The computer system of claim 47 wherein the plurality of prefetch buffers comprises a plurality of registers that are enabled and partitioned according to a plurality of respective strides.

50. The computer system of claim 47 wherein the prefetch circuit comprises a data read control circuit coupled to the memory device interface, the link interface, and the plurality of prefetch buffers, the data read control circuit operable to determine from the memory requests received from the link interface if the data corresponding to the memory requests are stored in the plurality of prefetch buffers, the data read control circuit further operable to transfer data from the plurality of prefetch buffers if the requested data are stored in the plurality of prefetch buffers and to transfer data from the memory device interface if the data are stored in the memory cells.

51. The computer system of claim 50 wherein the data read control circuit comprises a tag logic unit coupled to the link interface and the plurality of prefetched buffers, the tag logic unit operable to store addresses of the prefetched data stored in the plurality of prefetch buffers, the tag logic unit further operable to receive memory requests from the link interface, compare the addresses of the received memory requests to the addresses of the stored prefetched data, and generate a hit control signal indicative of whether an address match was determined.

52. The computer system of claim 51 wherein the tag logic unit is further coupled the memory sequencer to which the hit control signal is applied, and wherein the memory sequencer is prevented from transferring a memory request to the memory device interface responsive to receiving an active hit control signal.

53. The computer system of claim 50 wherein the data read control circuit comprises a multiplexer having data inputs coupled to the plurality of prefetch buffers, to the tag logic unit and to the memory device interface, and a data output coupled to the link interface, the multiplexer operable to transfer the data input from the plurality of prefetch buffers to the data output responsive to an active hit control signal received from the tag logic unit, and to transfer the data input from the memory device interface to the data output responsive to an inactive hit control signal received from the tag logic unit.

54. The computer system of claim 42 wherein the memory sequencer is operable to generate and transfer prefetch requests to the memory device interface only when memory requests are not being transferred from the link interface to the memory sequencer.

55. The computer system of claim 42 wherein the prefetch circuit is further operable to prefetch data from memory cells that are currently accessed such that the act of prefetching data does not require the selected memory cells to be precharged.

56. The computer system of claim 42 wherein the link interface comprises an optical input/output port.

57. A computer system, comprising: a central processing unit (“CPU”); a system controller coupled to the CPU, the system controller having an input port and an output port; an input device coupled to the CPU through the system controller; an output device coupled to the CPU through the system controller; a storage device coupled to the CPU through the system controller; a plurality of memory modules, each of the memory modules comprising: a plurality of memory devices; and a system for prefetching data comprising: a link interface receiving memory requests for access to memory cells in at least one of the memory devices; a memory device interface coupled to the memory devices, the memory device interface being operable to transfer memory requests to the memory devices for access to memory cells in at least one of the memory devices and to receive read data responsive to at least some of the memory requests; and a prefetch circuit coupled to the link interface to receive memory requests from the link interface and coupled to the memory device interface to receive data from the memory cells, the prefetch circuit operable to predict addresses that are likely to be accessed in the memory devices based on the memory requests and to prefetch and store data from memory cells according to the predicted addresses, the prefetch circuit further operable to provide the prefetched data to the link interface responsive to subsequent memory requests, the prefetch circuit comprising a single static random access memory device partitioned into a plurality of sections corresponding a plurality of strides and being coupled to the memory device interface for receiving and storing prefetched data from memory cells being accessed based on the predicted addresses.

58. A computer system, comprising: a central processing unit (“CPU”); a system controller coupled to the CPU, the system controller having an input port and an output port; an input device coupled to the CPU through the system controller; an output device coupled to the CPU through the system controller; a storage device coupled to the CPU through the system controller; a plurality of memory modules, each of the memory modules comprising: a plurality of memory devices; and a system for prefetching data comprising: a link interface receiving memory requests for access to memory cells in at least one of the memory devices; a memory device interface coupled to the memory devices, the memory device interface being operable to transfer memory requests to the memory devices for access to memory cells in at least one of the memory devices and to receive read data responsive to at least some of the memory requests; and a prefetch circuit coupled to the link interface to receive memory requests from the link interface and coupled to the memory device interface to receive data from the memory cells, the prefetch circuit operable to predict addresses that are likely to be accessed in the memory devices based on the memory requests and to prefetch and store data from memory cells according to the predicted addresses, the prefetch circuit further operable to provide the prefetched data to the link interface responsive to subsequent memory requests, the prefetch circuit further comprising: a plurality of prefetch buffers coupled to the memory device interface for receiving and storing prefetched data from memory cells being accessed based on the predicted addresses; and a data read control circuit coupled to the, memory device interface, the link interface, and the plurality of prefetch buffers, the data read control circuit operable to determine from the memory requests received from the link interface if the data corresponding to the memory requests are stored in the plurality of prefetch buffers, the data read control circuit further operable to transfer data from the plurality of prefetch buffers if the requested data are stored in the plurality of prefetch buffers and to transfer data from the memory device interface if the data are stored in the memory cells.

59. The computer system of claim 58 wherein the data read control circuit comprises a tag logic unit coupled to the link interface and the plurality of prefetched buffers, the tag logic unit operable to store addresses of the prefetched data stored in the plurality of prefetch buffers, the tag logic unit further operable to receive memory requests from the link interface, compare the addresses of the received memory requests to the addresses of the stored prefetched data, and generate a hit control signal indicative of whether an address match was determined.

60. The computer system of claim 58 wherein the data read control circuit comprises a multiplexer having data inputs coupled to the plurality of prefetch buffers, to the tag logic unit and to the memory device interface, and a data output coupled to the link interface, the multiplexer operable to transfer the data input from the plurality of prefetch buffers to the data output responsive to an active hit control signal received from the tag logic unit, and to transfer the data input from the memory device interface to the data output responsive to an inactive hit control signal received from the tag logic unit.

61. A method of prefetching data in a memory system comprising: receiving memory requests for access to memory cells in a plurality of memory devices; coupling the memory requests to the memory devices, wherein at least some of the memory requests being memory requests to read data; receiving read data responsive to the read memory requests; prefetching data from the memory devices that are likely to be accessed in the memory devices based on the received memory requests and storing the prefetched data; transferring the prefetched data responsive to subsequent memory requests; dividing the stored prefetched data into a plurality of sections each corresponding to a respective stride; and storing the prefetched data from the memory devices indicative of the predicted addresses in the section corresponding to the respective stride.

62. The method of claim 61 further comprising: storing prefetch addresses corresponding to the stored prefetched data; receiving memory request addresses corresponding to subsequent memory requests; comparing each received memory request address to the prefetch addresses; and determining that the requested read data are stored as prefetched data in the event of an address match.

63. The method of claim 62 further comprising: transferring read data stored as prefetched data responsive to determining that the requested read data are stored as prefetched data; and transferring read data from the memory devices responsive to determining that the requested read data are not stored as prefetched data.

64. The method of claim 61 further comprising prefetching data from memory cells that are currently accessed such that the prefetching does not require the selected memory cells to be precharged.

65. The method of claim 61 wherein prefetching data from the memory devices comprises prefetching data only when the memory requests are not being received.

66. The method of claim 61 further comprising selectively enabling prefetching based on the nature of the received memory requests.

67. The method of claim 61 wherein the act of receiving memory requests for access to memory cells in a plurality of memory devices comprises receiving optical signals corresponding to the memory requests.

68. A method of reading data in a memory hub comprising: receiving memory requests for access to a memory device; coupling the memory requests to the memory device responsive to the received memory request, at least some of the memory requests being memory requests to read data; receiving read data responsive to the read memory requests; predicting addresses that are likely to be accessed in the memory device based on the read memory requests; generating prefetch requests indicative of the predicted addresses; prefetching and storing the read data from the memory device responsive to the prefetch requests; determining from a read memory request if the requested read data are stored as prefetched data; transferring the prefetched data if a determination has been made; transferring data from the memory device if a determination has not been made; grouping the predicted addresses into a plurality of sets corresponding to respective strides; dividing the stored prefetched data into a plurality of sections each corresponding to a respective stride; and storing the prefetched data from the memory devices indicative of the predicted addresses in the section corresponding to the stride containing the predicted address from which the data was read.

69. The method of claim 68 further comprising: storing prefetch addresses corresponding to the stored prefetched data; receiving memory request addresses corresponding to subsequent memory requests; comparing each received memory request address to the prefetch addresses; and determining that the requested read data are stored as prefetched data in the event of an address match.

70. The method of claim 68 further comprising prefetching data from memory cells that are currently accessed such that the prefetching does not require the selected memory cells to be precharged.

71. The method of claim 68 wherein prefetching and storing the read data from the memory device responsive to the prefetch requests comprise prefetching data only when the memory requests are not being received.

72. The method of claim 68 further comprising selectively enabling prefetching based on the nature of the received memory requests.

73. The method of claim 68 wherein the act of receiving memory requests for access to memory cells in a plurality of memory devices comprises receiving optical signals corresponding to the memory requests.

Description:

TECHNICAL FIELD

This invention relates to computer systems, and, more particularly, to a computer system having a memory hub coupling several memory devices to a processor or other memory access device.

BACKGROUND OF THE INVENTION

Computer systems use memory devices, such as dynamic random access memory (“DRAM”) devices, to store data that are accessed by a processor. These memory devices are normally used as system memory in a computer system. In a typical computer system, the processor communicates with the system memory through a processor bus and a memory controller. The processor issues a memory request, which includes a memory command, such as a read command, and an address designating the location from which data or instructions are to be read. The memory controller uses the command and address to generate appropriate command signals as well as row and column addresses, which are applied to the system memory. In response to the commands and addresses, data are transferred between the system memory and the processor. The memory controller is often part of a system controller, which also includes bus bridge circuitry for coupling the processor bus to an expansion bus, such as a PCI bus.

Although the operating speed of memory devices has continuously increased, this increase in operating speed has not kept pace with increases in the operating speed of processors. Even slower has been the increase in operating speed of memory controllers coupling processors to memory devices. The relatively slow speed of memory controllers and memory devices limits the data bandwidth between the processor and the memory devices.

In addition to the limited bandwidth between processors and memory devices, the performance of computer systems is also limited by latency problems that increase the time required to read data from system memory devices. More specifically, when a memory device read command is coupled to a system memory device, such as a synchronous DRAM (“SDRAM”) device, the read data are output from the SDRAM device only after a delay of several clock periods. Therefore, although SDRAM devices can synchronously output burst data at a high data rate, the delay in initially providing the data can significantly slow the operating speed of a computer system using such SDRAM devices.

One approach to alleviating the memory latency problem is to use multiple memory devices coupled to the processor through a memory hub. In a memory hub architecture, a system controller or memory controller is coupled to several memory modules, each of which includes a memory hub coupled to several memory devices. The memory hub efficiently routes memory requests and responses between the controller and the memory devices. Computer systems employing this architecture can have a higher bandwidth because a processor can access one memory device while another memory device is responding to a prior memory access. For example, the processor can output write data to one of the memory devices in the system while another memory device in the system is preparing to provide read data to the processor. Although computer systems using memory hubs may provide superior performance, they nevertheless often fail to operate at optimum speed for several reasons. For example, even though memory hubs can provide computer systems with a greater memory bandwidth, they still suffer from latency problems of the type described above. More specifically, although the processor may communicate with one memory device while another memory device is preparing to transfer data, it is sometimes necessary to receive data from one memory device before the data from another memory device can be used. In the event data must be received from one memory device before data received from another memory device can be used, the latency problem continues to slow the operating speed of such computer systems.

One technique that has been used to reduce latency in memory devices is to prefetch data, i.e., read data from system memory before the data are requested by a program being executed. Generally the data that are to be prefetched are selected based on a pattern of previously fetched data. The pattern may be as simple as a sequence of addresses from which data are fetched so that data can be fetched from subsequent addresses in the sequence before the data are needed by the program being executed. The pattern, which is known as a “stride,” may, of course, be more complex.

Although data prefetching can reduce memory access latencies in conventional computer systems, prefetching of data has not been effectively used in a manner that provides optimum performance in computer systems using memory hubs. In particular, the vast amount of data that can be addressed in a computer system having several memory hubs makes it difficult to accurately predict which data will be subsequently needed. Furthermore, even if the data that will be required can be correctly anticipated, it can be unduly time consuming to couple the data from memory devices in a memory module, and through a memory hub in the memory module to a prefetch buffer in the system controller or memory controller. The need to couple the data from the memory module to the prefetch buffer can also reduce the memory bandwidth of the system if the data are being prefetched at a time when normal memory accesses are being attempted.

There is therefore a need for a computer architecture that provides the advantages of a memory hub architecture and also minimize the latency problems common in such systems, thereby providing memory devices with high bandwidth and low latency.

SUMMARY OF THE INVENTION

A memory module that may be used in a computer system includes a plurality of memory devices coupled to a memory hub. The memory hub includes a link interface receiving memory requests for access to memory cells in at least one of the memory devices. A memory device interface couples memory requests to the memory devices and receives read data responsive to at least some of the memory requests. A history logic unit included in the memory hub receives memory requests from the link interface and predicts on the basis of the memory requests the addresses in the memory devices that are likely to be accessed. The history logic unit then generates prefetching suggestions indicative of the predicted addresses. The memory hub also includes a memory sequencer that couples memory requests to the memory device interface responsive to memory requests received from the link interface. The memory sequencer also generates and couples prefetching requests to the memory device interface responsive to prefetching suggestions received from the history logic unit. A prefetch buffer included in the memory hub receives and stores read data from memory cells being accessed responsive to the prefetching requests. Finally, a data read control unit included in the memory hub determines from a read memory request received from the link interface if the read data are stored in the prefetch buffer. If the read data are stored in the prefetch buffer, the read data are read from the prefetch buffer. If the read data are not stored in the prefetch buffer, the read data are read from the memory devices.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of a computer system according to one example of the invention in which a memory hub is included in each of a plurality of memory modules.

FIG. 2 is a block diagram of a memory hub used in the computer system of FIG. 1, which contains a prefetch buffer according to one example of the invention.

DETAILED DESCRIPTION OF THE INVENTION

A computer system 100 according to one example of the invention is shown in FIG. 1. The computer system 100 includes a processor 104 for performing various computing functions, such as executing specific software to perform specific calculations or tasks. The processor 104 includes a processor bus 106 that normally includes an address bus, a control bus, and a data bus. The processor bus 106 is typically coupled to cache memory 108 , which, as previously mentioned, is usually static random access memory (“SRAM”). Finally, the processor bus 106 is coupled to a system controller 110 , which is also sometimes referred to as a “North Bridge” or “memory controller.”

The system controller 110 serves as a communications path to the processor 104 for a variety of other components. More specifically, the system controller 110 includes a graphics port that is typically coupled to a graphics controller 112 , which is, in turn, coupled to a video terminal 114 . The system controller 110 is also coupled to one or more input devices 118 , such as a keyboard or a mouse, to allow an operator to interface with the computer system 100 . Typically, the computer system 100 also includes one or more output devices 120 , such as a printer, coupled to the processor 104 through the system controller 110 . One or more data storage devices 124 are also typically coupled to the processor 104 through the system controller 110 to allow the processor 104 to store data or retrieve data from internal or external storage media (not shown). Examples of typical storage devices 124 include hard and floppy disks, tape cassettes, and compact disk read-only memories (CD-ROMs).

The system controller 110 is coupled to several memory modules 130 a, b . . . n , which serve as system memory for the computer system 100 . The memory modules 130 are preferably coupled to the system controller 110 through a high-speed link 134 , which may be an optical or electrical communication path or some other type of communications path. In the event the high-speed link 134 is implemented as an optical communication path, the optical communication path may be in the form of one or more optical fibers, for example. In such case, the system controller 110 and the memory modules will include an optical input/output port or separate input and output ports coupled to the optical communication path. The memory modules 130 are shown coupled to the system controller 110 in a multi-drop arrangement in which the single high-speed link 134 is coupled to all of the memory modules 130 . However, it will be understood that other topologies may also be used, such as a point-to-point coupling arrangement in which a separate high-speed link (not shown) is used to couple each of the memory modules 130 to the system controller 110 . A switching topology may also be used in which the system controller 110 is selectively coupled to each of the memory modules 130 through a switch (not shown). Other topologies that may be used will be apparent to one skilled in the art.

Each of the memory modules 130 includes a memory hub 140 for controlling access to 8 memory devices 148 , which, in the example illustrated in FIG. 2, are synchronous dynamic random access memory (“SDRAM”) devices. However, a fewer or greater number of memory devices 148 may be used, and memory devices other than SDRAM devices may, of course, also be used. The memory hub 140 is coupled to each of the system memory devices 148 through a bus system 150 , which normally includes a control bus, an address bus and a data bus.

One example of the memory hub 140 of FIG. 1 is shown in FIG. 2. The memory hub 140 includes a link interface 152 that is coupled to the high-speed link 134 . The nature of the link interface 152 will depend upon the characteristics of the high-speed link 134 . For example, in the event the high-speed link 134 is implemented using an optical communications path, the link interface 152 will include an optical input/output port or separate input and output ports and will convert optical signals received through the optical communications path into electrical signals and electrical signals into optical signals that are transmitted to the optical communications path. In any case, the link interface 152 may include a variety of conventional interface circuitry such as, for example, a first-in, first-out buffer (not shown), for receiving and storing memory requests as they are received through the high-speed link 134 . The memory requests can then be stored in the link interface until they can be processed by the memory hub 140 .

A memory request received by the link interface 152 is processed by first transferring the request to a memory sequencer 160 . The memory sequencer 160 converts the memory requests from the format output from the system controller 110 (FIG. 1) into a memory request having a format that can be used by the memory devices 148 . These re-formatted request signals will normally include memory command signals, which are derived from memory commands contained in the memory request received by the memory hub 140 , and row and column address signals, which are derived from an address contained in the memory request received by the memory hub 140 . In the event the memory request is a write memory request, the re-formatted request signals will normally include write data signals which are derived from write data contained in the memory request received by the memory hub 140 . For example, where the memory devices 148 are conventional DRAM devices, the memory sequencer 160 will output row address signals, a row address strobe (“RAS”) signal, an active low write/active high read signal (“W*/R”), column address signals and a column address strobe (“CAS”) signal. The re-formatted memory requests are preferably output from the sequencer 160 in the order they will be used by the memory d