Friday, December 12, 2008

Intel QuickPath Interconnect

The Intel QuickPath Interconnect ("QuickPath", "QPI")[1][2][3] is a point-to-point processor interconnect developed by Intel to compete with HyperTransport. Prior to the announcement of the name, Intel referred to it as Common System Interface or "CSI". Earlier incarnations were known at YAP (Yet Another Protocol) and YAP+. The development was conducted at Intel's MMDC (Massachusetts Microprocessor Design Center) by members of DEC's Alpha Development Group (acquired by Intel). It will replace the Front Side Bus (FSB) for Desktop, Xeon, and Itanium platforms. Intel first delivered it in November 2008 on the Intel Core i7 desktop processor and the X58 chipset, and it will be used on new Nehalem-based Xeon processors[4] and Tukwila-based Itanium processors.[5]

The QPI is an element of a system architecture that Intel calls the QuickPath architecture that implements what Intel calls QuickPath technology.[6] In its simplest form on a single-processor motherboard, a single QPI is used to connect the processor to the IO Hub (e.g., to connect a Core i7 to an X58.) In more complex instances of the architecture, separate QPI links connect one or more processors and one or more IO hubs or routing hubs in a network on the motherboard, allowing all of the components to access other components via the network. As with AMD's Hypertransport, the QuickPath Architecture assumes that the processors will have integrated memory controllers, so a multiprocessor system implements a NUMA architecture.

Each QPI comprises two 20-bit point-to-point links, one in each direction, with a separate clock in each direction, for a total of 42 signals. Each signal is a differential pair, so the total number of pins is 84.

Performance numbers for QuickPath are reported to be 4.8 to 6.4 Gigatransfers per second (GT/s) per direction. Therefore the bandwidth amounts to 12.0 to 16.0 GB/s per direction, or 24.0 to 32.0 GB/s per link.[7]

On high-reliability servers, A QPI link can operate in a degraded mode. If one or more of the 20+1 signals fails, the interface will operate using 15+1, 10+1, or even 5+1 remaining signals, even reassigning the clock to a data signal if the clock fails.

The initial Nehalem implementation uses a 20-bit wide 25.6 GB/s link (as reported in the Intel Nehalem Speech on IDF). This 25.6 GB/s link provides exactly double the theoretical bandwidth of Intel's 1600 MHz FSB used in the X48 chipset.

0 Comments:

 
Copyright 2008 | Langite Blogger Template by Bloganol