Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Contents
[hide]
3.1 Conroe/Merom (65 nm) 3.2 Conroe-L/Merom-L 3.3 Penryn/Wolfdale (45 nm) 3.4 Dunnington
4 Steppings
o o
5 System requirements
o
[edit]Features
The Core microarchitecture returned to lower clock rates and improved the usage of both available clock cycles and power when compared with the preceding NetBurst microarchitecture of the Pentium 4/Dbranded CPUs.[1] The Core microarchitecture provides more efficient decoding stages, execution units, caches, and buses, reducing the power consumption of Core 2-branded CPUs while increasing their processing capacity. Intel's CPUs have varied widely in power consumption according to clock rate, architecture, and semiconductor process, shown in the CPU power dissipationtables. Like the last NetBurst CPUs, Core based processors feature multiple cores and hardware virtualization support (marketed as Intel VT-x), as well as Intel 64 and SSSE3. However, Core-based processors do not have the Hyper-Threading Technology found in Pentium 4 processors. This is because the Core microarchitecture is a descendant of the P6 microarchitecture used by Pentium Pro, Pentium II, Pentium III, and Pentium M. The L1 cache size was enlarged in the Core microarchitecture, from 32KB on Pentium II/III (16 KB L1 Data + 16 KB L1 Instruction) to 64 KB L1 cache/core (32 KB L1 Data + 32 KB L1 Instruction) on Pentium M and Core/Core 2. It also lacks an L3 Cache found in the Gallatin core of the Pentium 4 Extreme Edition, although an L3 Cache is present in high-end versions of Core-based Xeons. Both an L3 cache and Hyperthreading were reintroduced in the Nehalem microarchitecture.
[edit]Technology
The Intel Core Microarchitecture was designed from the ground up, but is similar to the Pentium M microarchitecture in design philosophy. The Penryn pipeline is 14 stages long[2] less than half ofPrescott's, a signature feature of wide order execution cores. Penryn's successor, Nehalem has 20-24 pipeline stages.[2] Core's execution unit is 4 issues wide, compared to the 3-issue cores of P6,Pentium M, and NetBurst microarchitectures. The new architecture is a dual core design with linked L1 cache and shared L2 cache engineered for maximum performance per watt and improved scalability. One new technology included in the design is Macro-Ops Fusion, which combines two x86 instructions into a single micro-operation. For example, a common code sequence like a compare followed by a conditional jump would become a single micro-op. Other new technologies include 1 cycle throughput (2 cycles previously) of all 128-bit SSE instructions and a new power saving design. All components will run at minimum speed, ramping up speed dynamically as needed (similar to AMD's Cool'n'Quiet power-saving technology, as well as Intel's own SpeedStep technology from earlier mobile processors). This allows the chip to produce less heat, and consume as little power as possible.
comparable). Merom, the mobile variant, is listed at 35 watts TDP for standard versions and 5 watts TDP for Ultra Low Voltage (ULV) versions.[citation needed] Previously, Intel announced that it would now focus on power efficiency, rather than raw performance. However, at IDF in the spring of 2006, Intel advertised both. Some of the promised numbers were:
20% more performance for Merom at the same power level (compared to Core Duo) 40% more performance for Conroe at 40% less power (compared to Pentium D) 80% more performance for Woodcrest at 35% less power (compared to the original dual-core Xeon)
[edit]Processor
cores
The processors of the Core microarchitecture can be categorized by number of cores, cache size, and socket; each combination of these has a unique code name and product code that is used across a number of brands. For instance, code name "Allendale" with product code 80557 has two cores, 2 MB L2 cache and uses the desktop socket 775, but has been marketed as Celeron, Pentium, Core 2 and Xeon, each with different sets of features enabled. Most of the mobile and desktop processors come in two variants that differ in the size of the L2 cache, but the specific amount of L2 cache in a product can also be reduced by disabling parts at production time. Wolfdale-DP and all quad-core processors except Dunnington QC are multi-chip modules combining two dies. For the 65 nm processors, the same product code can be shared by processors with different dies, but the specific information about which one is used can be derived from the stepping.
fab
core s
Mobile
Desktop, UP Server
CL DP Server Server
MP Server
SingleCore 6 5 nm
65 nm
Merom -L 80537
Conroe-L 80557
SingleCore 4 5 nm
45 nm
Penryn -L 80585
WolfdaleCL 80588
DualCore 65 nm
65 n m
DualCore
45 n m
Penryn Penryn Wolfdale Wolfdale Wolfdale Wolfdale80570 -3M 80576 -3M -CL DP
45 nm
80577
80571
80588
80573
QuadCore 65 nm
65 n m
Kentsfiel d 80562
QuadCore 45 nm
45 n m
Penryn Yorkfield Yorkfiel Yorkfield Harpertow Dunningto -QC -6M d -CL n n QC 80581 80580 80569 80584 80574 80583
SixCore 45 nm
45 n m
Dunningto n 80582
[edit]Conroe/Merom
(65 nm)
The original Core 2 processors are based around the same dies that can be identified as CPUID Family 6 Model 15. Depending on their configuration and packaging, their code names are Conroe (LGA 775, 4 MB L2 cache), Allendale (LGA 775, 2 MB L2 cache), Merom (Socket M, 4 MB L2 cache) and Kentsfield (Multichip module, LGA 775, 2x4MB L2 cache). Merom and Allendale processors with limited features can be found in Pentium Dual Core and Celeron processors, while Conroe, Allendale and Kentsfield also are sold as Xeon processors. Additional code names for processors based on this model are Woodcrest (LGA 771, 4 MB L2 cache), Clovertown (MCM, LGA 771, 2x4MB L2 cache) and Tigerton (MCM, Socket 604, 2x4MB L2 cache), all of which are marketed only under the Xeon brand.
Processor
Brand name
Model (list)
Cores
L2 Cache
Socket
TDP
Merom-2M
U7xxx
2 MiB BGA479
10 W
4 MiB
17 W
2-4 MiB
Merom
X7xxx
4 MiB
Socket P
44 W
5x0 1 5x5
30 W
31 W
Merom-2M
Celeron Dual-Core
T1xxx
512-1024 KiB
Socket P
35 W
Merom-2M
Pentium Dual-Core
T2xxx T3xxx
1 MiB
Socket P
35 W
3xxx 2 3xxx
Conroe-CL
E6xx5
2-4 MB
LGA 771
Conroe-XE
Core 2 Extreme
X6xxx
4 MB
LGA 775 75 W
Allendale
Pentium Dual-Core
E2xxx
1 MB
LGA 775 65 W
Allendale
Celeron
E1xxx
512 KB
LGA 775 65 W
Kentsfield
Xeon
32xx
2x4 MiB
Kentsfield
Core 2 Quad
Q6xxx
2x4 MiB
Kentsfield XE
Core 2 Extreme
QX6xxx
2x4 MiB
Woodcrest
51xx
4 MB
L53xx
40-50 W
Clovertown
E53xx
2x4 MB
LGA 771
80 W
X53xx Xeon Tigerton-DC E72xx 2 2x4 MB L73xx Socket 604 Tigerton E73xx 4 2x2-2x4 MB
120-150 W
80 W
50 W
80 W
X73xx [edit]Conroe-L/Merom-L
2x4 MB
130 W
The Conroe-L and Merom-L processors are based around the same core as Conroe and Merom, but only contain a single core and 1 MB of L2 cache, significantly reducing production cost and power consumption of the processor at the expense of performance compared to the dual-core version. It is used only in ultralow voltage Core 2 Solo U2xxx and in Celeron processors and is identified as CPUID family 6 model 22.
Processor
Brand name
L2 Cache
Socket
TDP
Merom-L
2 MiB
BGA479 5.5 W
Merom-L
Celeron M
5x0
512 KiB
Socket M 27 W Socket P
Merom-L
5x3
LGA 775 35 W
LGA 771 65 W
(45 nm)
In Intel's Tick-Tock cycle, the 2007/2008 "Tick" was the shrink of the Core microarchitecture to 45 nanometers as CPUID model 23. In Core 2 processors, it is used with the code names Penryn (Socket P), Wolfdale (LGA 775) and Yorkfield (MCM, LGA 775), some of which are also sold as Celeron, Pentium and Xeon processors. In the Xeon brand, the Wolfdale-DP and Harpertown code names are used for LGA 771 based MCMs with two or four active Wolfdale cores. The chips come in two sizes, with 6 MB and 3 MB L2 cache. The smaller version is commonly called Penryn-3M and Wolfdale-3M as well as Yorkfield-6M, respectively. The single-core version of Penryn, listed as Penryn-L here, is not a separate model like Merom-L but a version of the Penryn-3M model with only one active core.
Processor
Brand name
Model (list)
Cores
L2 Cache
Socket
TDP
Penryn-L
Core 2 Solo
SU3xxx
3 MiB
BGA956
5.5 W
SU7xxx Penryn-3M SU9xxx BGA956 SL9xxx Penryn Core 2 Duo SP9xxx 2 6 MiB 25/28 W 17 W 3 MB 10 W
Penryn
P9xxx
6 MiB
2 MiB
3 MiB
35 W
6 MiB
6 MiB
Socket P
35-55 W
Penryn-QC
Core 2 Quad
Q9xxx
2x3-2x6 MiB
Socket P
45 W
X9xxx
6 MiB Socket P
44 W
QX9xxx
2x6 MiB
45 W
Socket P
35 W
FC-BGA 956
10 W
Socket P
35 W
FC-BGA 956
10 W
1 MiB
Socket P
35 W
Penryn-L
SU2xxx
Celeron
E3xxx 1 MB E2210
Wolfdale-3M Pentium
3 MB
45-65 W 6 MB
Xeon Wolfdale-CL
30x4
1 LGA 771
30 W
31x3
65 W
LGA 775
6595 W
LGA 771
80 W
22 MB
23 MB LGA 775
6595 W
Yorkfield
Q9x5x
26 MB
Yorkfield XE
Core 2 Extreme
QX9xxx
26 MB
130136 W
QX9xx5
LGA 771
150 W
E52xx
65 W
Wolfdale-DP
L52xx
6 MB
LGA 771
20-55 W
80 W
80 W
Harpertown
L54xx
26 MB
LGA 771
40-50 W
X54xx [edit]Dunnington
120-150 W
The Xeon "Dunnington" processor (CPUID Family 6, model 30) is closely related to Wolfdale but comes with six cores and an on-chip L3 cache and is designed for servers with Socket 604, so it is marketed only as Xeon, not as Core 2.
Socket
TDP
E74xx
4-6
8-16 MB
90 W
Dunnington Xeon
L74xx
4-6
12 MB
X7460 [edit]Steppings
16 MB
130 W
The Core microarchitecture uses a number of steppings, which unlike previous microarchitectures not only represent incremental improvements but also different sets of features like cache size and low power modes. Most of these steppings are used across brands, typically by disabling some of the features and limiting clock frequencies on low-end chips.
Steppings with a reduced cache size use a separate naming scheme, which means that the releases are no longer in alphabetic order. Additional steppings have been used in internal and engineering samples, but are not listed in the tables. Many of the high-end Core 2 and Xeon processors use Multi-Chip Modules of two or three chips in order to get larger cache sizes or more than two cores.
[edit]Steppings
using 65 nm process
Desktop (Conroe) Server Desktop (Woodcrest, (Kentsfi Clovertown,T eld) igerton)
Mobile (Merom)
L C Ma X X Ste Rel 2 Cel Ce Pe Ar P x. Pent Core e Core e ppi eas ca ero Core 2 ler nti ea UI clo ium 2 o 2 o ng ed ch n on um D ck n n e
Xeon
5100
T5000 U7000
T7000 L7000X
E2
m FB iB Hz xx
7000
000 00
0 000 0
0 0 7300 0
Jul 111 2 2.4 5xx T200 T5000 E1 06 E2 E400 M 0 G T10 0T30 T7000U 00 M0 200 m FD 000 0 7 m iB Hz 00 00 7000 0
U2000
22 04 x0
Steppings B2/B3, E1 and G0 of model 15 (cpuid 06fx) processors are evolutionary steps of the standard Merom/Conroe die with 4 MiB L2 cache, with the short-lived E1 stepping only being used in mobile processors. Stepping L2 and M0 are the "Allendale" chips with just 2 MiB L2 cache, reducing production cost and power consumption for low-end processors. The G0 and M0 steppings improve idle power consumption in C1E state and add the C2E state in desktop processors. In mobile processors, all of which support C1 through C4 idle states, steppings E1, G0, and M0 add support for the Mobile Intel 965 Express (Santa Rosa) platform with Socket P, while the earlier B2 and L2 steppings only appear for the Socket M based Mobile Intel 945 Express (Napa refresh) platform. The model 22 stepping A1 (cpuid 10661h) marks a significant design change, with just a single core and 1 MiB L2 cache further reducing the power consumption and manufacturing cost for the low-end. Like the earlier steppings, A1 is not used with the Mobile Intel 965 Express platform. Steppings G0, M0 and A1 mostly replaced all older steppings in 2008. In 2009, a new stepping G2 was introduced to replace the original stepping B2.[5]
[edit]Steppings
using 45 nm process
Server (Wolfdal Desktop eDesktop (Yorkfield DP, Har (Wolfdale) ) pertown, Dunning ton)
Mobile (Penryn)
R C St el A P ep ea re U pi se a I ng d D
L 2 c a c h e
M ax . Celer Pentiu cl on m oc k
Core 2
C C X el Pe o e er nti r Core 2 o o um e n n 2
X e o n
Xeon
N C ov 0 20 07
1 10 6 3. 0 7 M 00 6 m i G 7 m B Hz 6
E 8 0 0 0
3 1 QX9000 0 0
5200 540 0
1 M 3 2. 82 0 M 40 M ar m 6 7xx i G 0 20 m 7 08 B Hz 6
E E5 7 000 0 E2 0 000 0
M C ar 1 20 08
1 10 6 3. 0 7 M 20 6 m i G 7 m B Hz 7
3 Q9000Q 3 X9000 0 0
1 M 3 2. 82 0 M 50 M ar m 6 i G 1 20 m 7 08 B Hz 7
Q8000 Q9000
3 3 0 0
A E ug 0 20 08
1 10 6 3. 0 7 M 33 6 m i G 7 m B Hz A
E 8 0 0 0
82 1 3 2. 7xx 90 T4000 SU3000 T6000 S E E5 E R A m 0 M 93 0SU20 SU200 U7000 P8000SU 30 000 7 0 ug m 6 i G 00T30 0SU40 9000 00 E6 0 20 7 0
08
A B Hz 00
00
000 0
000S
Se A p 1 20 08
1 50 3 2. 0 3 M 67 6 m i G D m B Hz 1
7400
In the model 23 (cpuid 01067xh), Intel started marketing stepping with full (6 MiB) and reduced (3 MiB) L2 cache at the same time, and giving them identical cpuid values. All steppings have the newSSE4.1 instructions. Stepping C1/M1 was a bug fix version of C0/M0 specifically for quad core processors and only used in those. Stepping E0/R0 adds two new instructions (XSAVE/XRSTOR) and replaces all earlier steppings. In mobile processors, stepping C0/M0 is only used in the Intel Mobile 965 Express (Santa Rosa refresh) platform, whereas stepping E0/R0 supports the later Intel Mobile 4 Express (Montevina) platform. Model 30 stepping A1 (cpuid 106d1h) adds an L3 cache as well as six instead of the usual two cores, which leads to an unusually large die size of 503 mm.[6] As of February 2008, it has only found its way into the very high-end Xeon 7400 series (Dunnington).
[edit]System
requirements
compatibility
[edit]Motherboard
Conroe, Conroe XE and Allendale all use Socket LGA 775; however, not every motherboard is compatible with these processors. Supporting chipsets are:
Intel: 865G/PE/P, 945G/GZ/GC/P/PL, 965G/P, 975X, P/G/Q965, Q963, 946GZ/PL, P3x, G3x, Q3x, X38, X48, P4x , 5400 Express, Intel G31, G33 Chipsets
NVIDIA: nForce4 Ultra/SLI X16 for Intel, nForce 570/590 SLI for Intel, nForce 650i Ultra/650i SLI/680i LT SLI/680i SLI and nForce 750i SLI/780i SLI/790i SLI/790i Ultra SLI.
VIA: P4M800, P4M800PRO, P4M890, P4M900, PT880 Pro/Ultra, PT890. SiS: 662, 671, 671fx, 672, 672fx ATI: Radeon Xpress 200 and CrossFire Xpress 3200 for Intel See also: List of Intel chipsets The currently released Yorkfield XE model QX9770 (45 nm with 1600FSB) currently has limited chipset compatibility - with only X38, P35 (With Overclocking) and some high-performance X48 and P45 motherboards being compatible. BIOS updates are gradually being released to provide support for
the new Penryn technology, and the new QX9775 is only compatible with D5400XS. The Wolfdale-3M model E7200 also has limited compatibility (at least the Xpress 200 chipset is incompatible)[citation needed]. Although a motherboard may have the required chipset to support Conroe, some motherboards based on the above mentioned chipsets do not support Conroe. This is because all Conroe-based processors require a new power delivery feature set specified in Voltage Regulator-Down (VRD) 11.0. This requirement is a result of Conroe's significantly lower power consumption, compared to the Pentium 4/D CPUs it is replacing. A motherboard that has both a supporting chipset and VRD 11 supports Conroe processors, but even then some boards will need an updated BIOS to recognize Conroe's FID (Frequency ID) and VID (Voltage ID).
[edit]Synchronous
memory modules
Unlike the previous Pentium 4 and Pentium D design, the Core 2 technology sees a greater benefit from memory running synchronously with the Front Side Bus (FSB). This means that for the Conroe CPUs with FSB of 1066 MT/s, the ideal memory performance for DDR2 is PC2-8500. In a few configurations, using PC2-5300 instead of PC2-4200 can actually decrease performance. Only when going to PC2-6400 is there a significant performance increase. While DDR2 memory models with tighter timing specifications do improve performance, the difference in real world games and applications is often negligible.[7] Optimally, the memory bandwidth afforded should match the bandwidth of the FSB, that is to say that a CPU with a 533 MT/s rated bus speed should be paired with RAM matching the same rated speed, for example DDR2 533, or PC2-4200. A common myth[citation needed] is that installing interleaved RAM will offer double the bandwidth. This myth is false; at most the increase in bandwidth by installing interleaved RAM is roughly 510%. The AGTL+ PSB used by all NetBurst processors as well as current and medium-term (pre-QuickPath) Core 2 processors provide a 64-bit data path. Current chipsets provide for a couple of either DDR2 or DDR3 channels.
Matched memory and maximum bandwidth Processor model Front side single channel / dual channel bus DDR DDR2 DDR3
mobile:
PC2-4200
PC3-8500
E6n00, E6n20, X6n00, E7n00, Q6n00 and QX6n00 mobile: T9400, T9550, T9600, P7350, 1066 MT/s P7450, P8400, P8600, P8700, P9500, P9600, SP9300, SP9400, X9100
desktop:
(DDR400)
3.2 GB/s
(DDR2533)
4.264 GB/s
(DDR31066)
8.530 GB/s
PC2-8500 (DDR21066)
8.532 GB/s
667 MT/s
PC-3200 (DDR400)
3.2 GB/s
PC2-5300 (DDR2667)
5.336 GB/s
PC3-10600 (DDR31333)
10.670 GB/s
T5n70, T6400, T7n00 (Socket P), L7300, L7500, X7n00, T8n00, T9300, T9500, X9000 desktop: E4n00, Pentium E2nn0, Pentium E5nn0, Celeron 4n0, E3n00
mobile:
PC2-6400 (DDR2800)
6.400 GB/s
PC3-6400 (DDR3800)
6.400 GB/s
PC2-8500 (DDR21066)
8.532 GB/s
PC3-12800 (DDR31600)
12.800 GB/s
desktop:
QX9770, QX9775
On jobs requiring large amounts of memory access, the quad-core Core 2 processors can benefit significantly[8] from using a PC2-8500 memory, which runs exactly the same speed as the CPU's FSB; this is not an officially supported configuration, but a number of motherboards offer it. The Core 2 processor does not require the use of DDR2. While the Intel 975X and P965 chipsets require this memory, some motherboards and chipsets support both the Core 2 and DDR memory. When using DDR memory, performance may be reduced because of the lower available memory bandwidth.
[edit]Chip
errata
The Core 2 memory management unit (MMU) in X6800, E6000 and E4000 processors does not operate to previous specifications implemented in previous generations of x86 hardware. This may cause problems, many of them serious security and stability issues, with existing operating system software. Intel's documentation states that their programming manuals will be updated "in the coming months" with information on recommended methods of managing the translation lookaside buffer (TLB) for Core 2 to avoid issues, and admits that, "in rare instances, improper TLB invalidation may result in unpredictable system behavior, such as hangs or incorrect data."[9] Among the issues noted:
Non-execute bit is shared across the cores. Floating point instruction non-coherencies. Allowed memory corruptions outside of the range of permitted writing for a process by running common instruction sequences.
Intel errata Ax39, Ax43, Ax65, Ax79, Ax90, Ax99 are said to be particularly serious. [10] 39, 43, 79, which can cause unpredictable behavior or system hang, have been fixed in recent steppings. Among those who have noted the errata to be particularly serious are OpenBSD's Theo de Raadt[11] and DragonFly BSD's Matthew Dillon.[12] Taking a contrasting view was Linus Torvalds, calling the TLB issue "totally insignificant", adding, "The biggest problem is that Intel should just have documented the TLB behavior better."[13] Microsoft has issued update KB936357 to address the errata by microcode update,[14] with no performance penalty. BIOS updates are also available to fix the issue.