

## **AP-829**

# APPLICATION NOTE

100 MHz 2-Way SMP Pentium® III Xeon<sup>TM</sup> Processor/Intel® 440GX AGPset AGTL+ Layout Guidelines

March 1999

Information in this document is provided in connection with Intel products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel's Terms and Conditions of Sale for such products, Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right. Intel products are not intended for use in medical, life saving, or life sustaining applications.

Intel may make changes to specifications and product descriptions at any time, without notice.

Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them.

The Pentium<sup>®</sup> II Xeon™ processor, Pentium<sup>®</sup> III Xeon™ processor, and the Intel<sup>®</sup> 440GX AGPset may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request.

Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order.

Copies of documents which have an ordering number and are referenced in this document, or other Intel literature, may be obtained by calling 1-800-548-4725 or by visiting Intel's website at http://www.intel.com

Copyright © Intel Corporation 1999.

\* Third-party brands and names are the property of their respective owners.



## **CONTENTS**

| PAGE                                                               | PAGE                                                                      |
|--------------------------------------------------------------------|---------------------------------------------------------------------------|
| 1. 0.INTRODUCTION5                                                 | 5.0. MORE DETAILS AND INSIGHTS22                                          |
| 2. 0.ABOUT THIS DOCUMENT5                                          | 5.1. Textbook Timing Equations22                                          |
| 2.1. Document Organization5                                        | 5.2. Effective Impedance and                                              |
| 2.2. References                                                    | Tolerance/Variation                                                       |
| 2.3. Definition of Terms5                                          | 5.3. Power/Reference Planes, PCB Stackup, and High Frequency Decoupling23 |
| 3.0. AGTL+ DESIGN GUIDELINE7                                       | 5.3.1. POWER DISTRIBUTION23                                               |
| 3.1. Initial Timing Analysis8                                      | 5.3.2. REFERENCE PLANES AND PCB<br>STACKUP23                              |
| 3.2. Determine General Topology, Layout,                           | 5.3.3. HIGH FREQUENCY DECOUPLING 25                                       |
| and Routing Desired11                                              | 5.3.4. SC330 CONNECTOR25                                                  |
| 3.3. Pre-Layout Simulation13                                       | 5.4. Clock Routing26                                                      |
| 3.3.1. METHODOLOGY13                                               | 5.5. Conclusion                                                           |
| 3.3.2. SIMULATION CRITERIA13                                       |                                                                           |
| 3.4. Place and Route Board14                                       | 6. 0.VREF GUARDBAND27                                                     |
| 3.4.1. ESTIMATE COMPONENT TO COMPONENT SPACING FOR AGTL+ SIGNALS14 | 7. 0.OVERDRIVE REGION27                                                   |
| 3.4.2. LAYOUT AND ROUTE BOARD14                                    | 8. 0.FLIGHT TIME DEFINITION AND                                           |
| 3.5. Post-Layout Simulation                                        | MEASUREMENT29                                                             |
| 3.5.1. INTERSYMBOL INTERFERENCE 15                                 |                                                                           |
| 3.5.2. CROSSTALK ANALYSIS15                                        | FIGURES                                                                   |
| 3.5.3. MONTE CARLO ANALYSIS                                        | Figure 1. Example 2-Way SMP Network                                       |
| 3.6. Validation                                                    | Topology12                                                                |
| 3.6.1. MEASUREMENTS16                                              | Figure 2. Test Load vs. Actual System Load 16                             |
| 3.6.2. FLIGHT TIME SIMULATION16                                    | Figure 3. Rising Edge Noise Margin17                                      |
| 3.6.3. FLIGHT TIME HARDWARE                                        | Figure 4. Aggressor and Victim Networks19                                 |
| VALIDATION16                                                       | Figure 5. Transmission Line Geometry: (A) Microstrip (B) Stripline20      |
| 4.0. THEORY16                                                      | Figure 6. One Signal Layer and One Reference Plane24                      |
| 4.1. AGTL+16                                                       |                                                                           |
| 4.2. Timing Requirements17                                         | Figure 7. Layer Switch with One Reference Plane24                         |
| 4.3. Noise Margin17                                                | Figure 8. Layer Switch with Multiple Reference                            |
| 4.3.1. FALLING EDGE OR LOW LEVEL                                   | Planes (same type)24                                                      |
| NOISE MARGIN18 4.3.2. RISING EDGE OR HIGH LEVEL                    | Figure 9. Layer Switch with Multiple Reference Planes                     |
| NOISE MARGIN18                                                     | Figure 10. One Layer with Multiple Reference                              |
| 4.4. Crosstalk Theory19                                            | Planes25                                                                  |
| 4.4.1. CROSSTALK MANAGEMENT21                                      |                                                                           |
| 4.4.2. POTENTIAL TERMINATION CROSSTALK PROBLEMS22                  |                                                                           |

#### AP-829



| TABLES                                                                                                               |
|----------------------------------------------------------------------------------------------------------------------|
| Table 1. Pentium <sup>®</sup> III Xeon <sup>™</sup> Processor and 82443GX AGTL+ Parameters for Example Calculations9 |
| Table 2. Example TFLT_MAX Calculations for 100 MHz Bus10                                                             |
| Table 3. Example TFLT_MIN Calculations (Frequency Independent)10                                                     |
| Table 4. Segments Descriptions and Lengths for Figure 1                                                              |

| Table 5. Example Backward Crosstalk Coupling               |
|------------------------------------------------------------|
| Factors with $\varepsilon_r$ = 4.5, $V_{OH\_MAX}$ = 1.5 V, |
| and $Z_0 = 65 \Omega$ 21                                   |
| Figure 11. Overdrive Region and VREF                       |
| Guardband28                                                |
| Figure 12. Rising Edge Flight Time                         |
| Measurement28                                              |
|                                                            |



#### 1.0. INTRODUCTION

The Pentium® III Xeon™ processor is a follow-on to the Pentium® Pro and Pentium® II Xeon™ processors. The Intel® 440BX AGPset and Intel® 440GX AGPset have been designed to provide a high-performance memory, Advanced Graphics Port (AGP), and I/O subsystem for Pentium III Xeon processor based systems. The Intel 440BX AGPset integrates a memory controller that supports up to 1 Gbyte of main memory (Intel 440GX AGPset supports up to 2 Gbyte).

The Pentium III Xeon processor implements a synchronous, latched bus protocol that allows a full clock cycle for signal transmission and a full clock cycle for signal interpretation and generation. This protocol simplifies interconnect timing requirements and supports 100 MHz system designs using conventional interconnect technology. The Pentium III Xeon processor system bus operates using GTL+signaling levels with a new type of buffer utilizing active negation and multiple termination. This new bus logic is called *Assisted Gunning Transceiver Logic*, or *AGTL+*.

The goal of this layout guideline is to provide the system designer with the information needed for the 100 MHz 2-way SMP Pentium III Xeon processor and 82443GX AGTL+ bus portion of PCB layout. The topology and information presented in this application note also apply to Pentium II Xeon processor/Intel 440GX AGPset system bus design. This layout guideline does not cover designs using other AGPsets or PCIsets. This document provides guidelines and methodologies that are to be used with good engineering practices. See the Pentium® III Xeon<sup>TM</sup> Processor at 500 and 550 MHz datasheet and the applicable Intel 440GX AGPset specification for component specific electrical details. Intel strongly recommends running analog simulations using the available I/O buffer models together with layout information extracted from your specific design.

#### 2.0. ABOUT THIS DOCUMENT

#### 2.1. Document Organization

This section defines terms used in the document. Section 3.0. discusses specific system guidelines. This is a step-by-step methodology that Intel has successfully used to design 2-way SMP Pentium III Xeon processor systems using the 82443GX. Section 4.0. introduces the theories that are applicable to this layout guideline. Section 5.0. contains more details and

insights. The items in Section 5.0. expand on some of the rationale for the recommendations in the step-bystep methodology. This section also includes equations that may be used for reference.

The actual guidelines start on Section 3.0.

#### 2.2. References

- Pentium® III Xeon<sup>TM</sup> Processor at 500 and 550 MHz (Order Number 245094-001)
- Pentium<sup>®</sup> Ill Xeon<sup>TM</sup> Processor Power Distribution Guidelines (Order Number 245095-001)
- Pentium<sup>®</sup> II Xeon<sup>™</sup> Processor Bus Terminator Design Guidelines (Order Number 243774-001)
- Pentium® II Processor Developer's Manual (Order Number 243341)
- VRM 8.2 DC-DC Converter Design Guidelines (Order Number 243773-002)
- VRM 8.3 DC-DC Converter Design Guidelines (Order Number 243870-002)

#### 2.3. Definition of Terms

**Aggressor** — a network that transmits a coupled signal to another network is called the aggressor network.

AGTL+ — The Pentium III Xeon processor system bus uses a bus technology called AGTL+, or Assisted Gunning Transceiver Logic. AGTL+ buffers are opendrain and require pull-up resistors for providing the high logic level and termination. The Pentium III Xeon processor AGTL+ output buffers differ from GTL+buffers with the addition of an active pMOS pull-up transistor to "assist" the pull-up resistors during the first clock of a low-to-high voltage transition. Additionally, the Pentium III Xeon processor Single Edge Connector (S.E.C.) cartridge contains internal 150  $\Omega$  pull-up resistors to provide termination at each bus load.

**Bus Agent** — a component or group of components that, when combined, represent a single load on the AGTL+ bus.

Corner — describes how a component performs when all parameters that could impact performance are adjusted to have the same impact on performance. Examples of these parameters include variations in manufacturing process, operating temperature, and operating voltage. The results in performance of an electronic component that may change as a result of



(including, but not limited to): clock to output time, output driver edge rate, output drive current, and input drive current. Discussion of the "slow" corner would mean having a component operating at its slowest, weakest drive strength performance. Similar discussion of the "fast" corner would mean having a component operating at its fastest, strongest drive strength performance. Operation or simulation of a component at its slow corner and fast corner is expected to bound the extremes between slowest, weakest performance and fastest, strongest performance.

**Crosstalk** — the reception on a victim network of a signal imposed by aggressor network(s) through inductive and capacitive coupling between the networks.

- Backward Crosstalk coupling which creates a signal in a victim network that travels in the opposite direction as the aggressor's signal.
- Forward Crosstalk coupling which creates a signal in a victim network that travels in the same direction as the aggressor's signal.
- Even Mode Crosstalk coupling from multiple aggressors when all the aggressors switch in the same direction that the victim is switching.
- Odd Mode Crosstalk coupling from multiple aggressors when all the aggressors switch in the opposite direction that the victim is switching.

**Edge Finger** — The cartridge electrical contact which interfaces to the SC330 connector.

**Flight Time** — is a term in the timing equation that includes the signal propagation delay, any effects the system has on the  $T_{CO}$  of the driver, plus any adjustments to the signal at the receiver needed to guarantee the setup time of the receiver.

More precisely, *flight time* is defined to be:

The time difference between a signal at the input pin of a receiving agent crossing V<sub>REF</sub> (adjusted to meet the receiver manufacturer's conditions required for AC timing specifications; i.e., ringback, etc.), and the output pin of the driving agent crossing V<sub>REF</sub> if the driver was driving the Test Load used to specify the driver's AC timings.

See Section 3.6.2. for details regarding flight time simulation and validation.

Figure 11 in Appendix A shows the *V<sub>REF</sub> Guardband* boundaries, where maximum and minimum flight time

measurements are taken. The  $V_{REF}$  Guardband takes into account sources of noise that may affect the way an AGTL+ signal becomes valid at the receiver. See the definition of the  $V_{REF}$  Guardband.

- Maximum and Minimum Flight Time Flight time variations can be caused by many different parameters. The more obvious causes include variation of the board dielectric constant, changes in load condition, crosstalk, V<sub>TT</sub> noise, V<sub>REF</sub> noise, variation in termination resistance and differences in I/O buffer performance as a function of temperature, voltage and manufacturing process. Some less obvious causes include effects of Simultaneous Switching Output (SSO) and packaging effects.
- The Maximum Flight Time is the largest flight time a network will experience under all variations of conditions. Maximum flight time is measured at the appropriate V<sub>REF</sub> Guardband boundary.
- The Minimum Flight Time is the smallest flight time a network will experience under all variations of conditions. Minimum flight time is measured at the appropriate V<sub>REF</sub> Guardband boundary.

For more information on flight time and the V<sub>REF</sub> Guardband, see Appendix A of this guideline and the Pentium® II Processor Developer's Manual.

GTL+ is the bus technology used by the Pentium Pro processor. This is an incident wave switching, opendrain bus with pull-up resistors which provide both the high logic level and termination. It is an enhancement to the GTL (Gunning Transceiver Logic) technology. See the Pentium® II Processor Developer's Manual for more details of GTL+

**Network** — the trace of a Printed Circuit Board (PCB) that completes an electrical connection between two or more components.

**Network Length** — the distance between extreme bus agents on the network and does not include the distance connecting the end bus agents to the termination resistors.

Overdrive Region — is the voltage range, at a receiver, located above and below V<sub>REF</sub> for signal integrity analysis. See the *Pentium® II Processor Developer's Manual* for more details.

**Overshoot** — Maximum voltage allowed for a signal at the processor core pad. See the *Pentium*® *III Xeon*<sup>™</sup> *Processor at 500 and 550 MHz* datasheet for overshoot specification.



**Pad** — a feature of a semiconductor die contained within an internal logic package on the S.E.C cartridge substrate used to connect the die to the package bond wires. A pad is only observable in simulation.

**Pin** — a feature of a logic package contained within the S.E.C. cartridge used to connect the package to an internal substrate trace.

**Ringback** — is the voltage that a signal rings back to after achieving its maximum absolute value. Ringback may be due to reflections, driver oscillations, etc. See the *Pentium® Ill Xeon*<sup>TM</sup> *Processor at 500 and 550 MHz* datasheet for ringback specification.

**Settling Limit** — defines the maximum amount of ringing at the receiving pin that a signal must reach before its next transition. See the *Pentium® III Xeon*<sup>TM</sup> *Processor at 500 and 550 MHz* datasheet for settling limit specification.

**Setup Window** — is the time between the beginning of Setup to Clock (T<sub>SU\_MIN</sub>) and the arrival of a valid clock edge. This window may be different for each type of bus agent in the system.

Simultaneous Switching Output (SSO) Effects — refers to the difference in electrical timing parameters and degradation in signal quality caused by multiple signal outputs simultaneously switching voltage levels (i.e., high-to-low) in the opposite direction from a single signal (i.e., low-to-high) or in the same direction (e.g., high-to-low). These are respectively called odd-mode switching and even-mode switching. This simultaneous switching of multiple outputs creates higher current swings that may cause additional propagation delay (or "pushout"), or a decrease in propagation delay (or "pull-in"). These SSO effects may impact the setup and/or hold times and are not always taken into account by simulations. System timing budgets should include margin for SSO effects.

**Stub** — the branch from the trunk terminating at the pad of an agent.

**Test Load** — Intel uses a 25  $\Omega$  test load for specifying its components.

**Trunk** — the main connection, excluding interconnect branches, terminating at agent pads.

Undershoot — Maximum voltage allowed for a signal to extend below V<sub>SS</sub> at the processor core pad. See the *Pentium*<sup>®</sup> *Ill Xeon*<sup>™</sup> *Processor at 500 and 550 MHz* datasheet for undershoot specifications.

**Victim** — a network that receives a coupled crosstalk signal from another network is called the victim network.

 $V_{REF}$  Guardband — A guardband ( $\Delta V_{REF}$ ) defined above and below  $V_{REF}$  to provide a more realistic model accounting for noise such as crosstalk,  $V_{TT}$  noise, and  $V_{REF}$  noise.

#### 3.0. AGTL+ DESIGN GUIDELINE

The following step-by-step guideline was developed for systems based on two Pentium III Xeon processor loads and one 82443GX load. Systems using custom chipsets will require timing analysis and analog simulations specific to those components.

The guideline recommended in this section is based on experience developed at Intel while developing many different Pentium Pro processor family and Pentium II Xeon processor based systems. Begin with an initial timing analysis and topology definition. Perform prelayout analog simulations for a detailed picture of a working "solution space" for the design. These prelayout simulations help define routing rules prior to placement and routing. After routing, extract the interconnect database and perform post-layout simulations to refine the timing and signal integrity analysis. Validate the analog simulations when actual systems become available. The validation section describes a method for determining the flight time in the actual system.

Guideline Methodology:

- Initial Timing Analysis
- · Determine General Topology, Layout, and Routing
- Pre-Layout Simulation (Sensitivity sweep)
- · Place and Route Board
  - Estimate Component to Component Spacing for AGTL+ Signals
  - Layout and Route Board
- Post-Layout Simulation
  - Interconnect Extraction
  - Intersymbol Interference (ISI), Crosstalk, and Monte Carlo Analysis
- Validation
  - Measurements
  - Determining Flight Time



#### 3.1. Initial Timing Analysis

Perform an initial timing analysis of the system using Equation 1 and Equation 2 shown below. These equations are the basis for timing analysis. To complete the initial timing analysis, values for clock skew and clock jitter are needed, along with the component specifications. These equations contain a multi-bit adjustment factor, M<sub>ADJ</sub>, to account for multi-bit switching effects such as SSO pushout or pull-in that are often hard to simulate. These equations **do not** take into consideration all signal integrity factors that affect timing. Additional timing margin should be budgeted to allow for these sources of noise.

#### **Equation 1. Setup Time**

 $T_{CO\_MAX} + T_{SU\_MIN} + CLK_{SKEW} + CLK_{JITTER} + T_{FLT\_MAX} + M_{ADJ} \le Clock\ Period$ 

#### **Equation 2. Hold Time**

TCO\_MIN + TFLT\_MIN - MADJ ≥ THOLD + CLKSKEW

Symbols used in Equation 1 and Equation 2:

- TCO\_MAX is the maximum clock to output specification.<sup>1</sup>
- T<sub>SU\_MIN</sub> is the minimum required time specified to setup before the clock. <sup>1</sup>
- CLK<sub>JITTER</sub> is the maximum clock edge-to-edge variation.
- CLK<sub>SKEW</sub> is the maximum variation between components receiving the same clock edge.
- T<sub>FLT\_MAX</sub> is the maximum flight time as defined in Section 2.3.
- T<sub>FLT\_MIN</sub> is the minimum flight time as defined in Section 2.3.
- M<sub>ADJ</sub> is the multi-bit adjustment factor to account for SSO pushout or pull-in.
- T<sub>CO\_MIN</sub> is the minimum clock to output specification. <sup>1</sup>
- T<sub>HOLD</sub> is the minimum specified input hold time.

#### NOTE

1. The Clock to Output (T<sub>CO</sub>) and Setup to Clock (T<sub>SU</sub>) timings are both measured from the signals last crossing of V<sub>REF</sub>, with the requirement that the signal does not violate the ringback or edge rate limits. See the Pentium<sup>®</sup> III Xeon<sup>™</sup> Processor at 500 and 550 MHz datasheet and the Pentium<sup>®</sup> II Processor Developer's Manual for more details.

Solving these equations for  $T_{FLT}$  results in the following equations:

#### **Equation 3. Maximum Flight Time**

 $T_{FLT\_MAX} \leq Clock \ Period - T_{CO\_MAX} - T_{SU\_MIN} - \\ CLK_{SKEW} - CLK_{JITTER} - M_{ADJ}$ 

#### **Equation 4. Minimum Flight Time**

T<sub>FLT</sub> MIN ≥ T<sub>HOLD</sub> + CLK<sub>SKEW</sub> - T<sub>CO</sub> MIN + M<sub>ADJ</sub>

There are multiple cases to consider. Note that while the same trace connects two components, component A and component B, the minimum and maximum flight time requirements for component A driving component B as well as component B driving component A must be met. The cases to be considered are:

- Pentium® III Xeon<sup>TM</sup> processor driving Pentium III Xeon processor
- Pentium III Xeon processor driving 82443GX
- 82443GX driving Pentium III Xeon processor

A designer using components other than those listed above must evaluate additional combinations of driver and receiver

Table 1 lists the AGTL+ component timings of the Pentium III Xeon processor and 82443GX defined at the pins. These timings are for reference only; obtain component specifications from the *Pentium® III Xeon™ Processor at 500 and 550 MHz* datasheet and appropriate Intel 440GX AGPset component specification.



| Table 1. Tentian in Acon Trocessor and 024430X ACTE Transfers for Example Calculations |                                             |         |  |  |
|----------------------------------------------------------------------------------------|---------------------------------------------|---------|--|--|
| IC Parameters                                                                          | Pentium® III Xeon™ Processor at 100 MHz Bus | 82443GX |  |  |
| Clock to Output maximum (T <sub>CO_MAX</sub> )                                         | 2.70                                        | 4.45    |  |  |
| Clock to Output minimum (T <sub>CO_MIN</sub> )                                         | -0.07                                       | 0.83    |  |  |
| Setup time (T <sub>SU_MIN</sub> )                                                      | 1.75                                        | 2.98    |  |  |
| Hold time (T <sub>HOLD</sub> )                                                         | 0.62                                        | -0.09   |  |  |

Table 1. Pentium® III Xeon™ Processor and 82443GX AGTL+ Parameters for Example Calculations¹,2

#### NOTES:

- 1. All times in nanoseconds.
- Numbers in table are for reference only. These timing parameters are subject to change. Please check the appropriate component datasheets for valid timing parameter values.

Table 2 gives an example AGTL+ initial maximum flight time calculation for a 100 MHz, 2-way Pentium III Xeon processor/Intel 440GX AGPset system bus. Note that assumed values for clock skew and clock jitter were used. Clock skew and clock jitter values are dependent on the clock components and distribution method chosen for a particular design and must be budgeted into the initial timing equations as appropriate for each design.

Intel highly recommends adding margin as shown in the "Madj" column to offset the degradation caused by SSO pushout and other multi-bit switching effects. The "Recommended TFLT\_MAX" column contains the recommended maximum flight time after incorporating the Madj value. If edge rate, ringback, and monotonicity requirements are not met, flight time correction must be performed as documented in the *Pentium® II Processor Developer's Manual* with the additional requirements noted in Appendix A. The commonly used "textbook" equations used to calculate the expected signal propagation rate of a board are included in Section 5.1.

Simulation and control of baseboard design parameters can ensure that signal quality and maximum and minimum flight times are met. Baseboard propagation speed is highly dependent on transmission line geometry configuration (stripline vs. microstrip), dielectric constant, and loading. This layout guideline includes high-speed baseboard design practices that may improve the amount of timing and signal quality margin. The magnitude of MADI is highly dependent on baseboard

design implementation (stackup, decoupling, layout, routing, reference planes, etc.) and needs to be characterized and budgeted appropriately for each design.

Table 3 is an example calculation for minimum flight time that is frequency independent. Intel highly recommends adding margin as shown in the " $M_{ADJ}$ " column to offset the degradation caused by SSO pull-in and other multi-bit switching effects. The "Recommended  $T_{FLT\_MIN}$ " column contains the recommended minimum flight time after incorporating the  $M_{ADJ}$  value.

Table 2 and Table 3 are derived assuming:

- CLK<sub>SKEW</sub> = 0.15 ns (PCB skew only assumes zero driver skew by tying clock driver outputs)
- $CLK_{JITTER} = 0.15 \text{ ns}$

Positive or negative jitter of up to 150 ps is allowed between adjacent cycles, but will result in up to 100 ps of AGTL+ I/O and CMOS timing degradation (i.e.,  $T_{\rm CO}$ ,  $T_{\rm SU}$ ,  $T_{\rm HOLD}$  will all increase by 100 ps). Thus, a system with jitter of 250 ps would need flight times that are 300 ps (100 ps additional jitter + 100 ps I/O timing degradation for both the source and receiver) better than a system with jitter of 150 ps. See the *Pentium® III Xeon™ Processor at 500 and 550 MHz* datasheet for details on the amount of I/O timing degradation and clock jitter specifications.



Table 2. Example T<sub>FLT MAX</sub> Calculations for 100 MHz Bus<sup>1</sup>

| Driver                                                     | Receiver                         | Clk<br>Period | TCO_MAX | TSU_MIN | CIKSKEW | CIKJITTER | MADJ | Recommended TFLT_MAX2 |
|------------------------------------------------------------|----------------------------------|---------------|---------|---------|---------|-----------|------|-----------------------|
| Pentium <sup>®</sup> III<br>Xeon <sup>™</sup><br>processor | Pentium III<br>Xeon<br>processor | 10.00         | 2.70    | 1.75    | 0.15    | 0.15      | 2.15 | 3.10                  |
| Pentium III<br>Xeon<br>processor                           | 82443GX                          | 10.00         | 2.70    | 2.98    | 0.15    | 0.15      | 0.92 | 3.10                  |
| 82443GX                                                    | Pentium III<br>Xeon<br>processor | 10.00         | 4.45    | 1.75    | 0.15    | 0.15      | 0.40 | 3.10                  |

#### NOTES:

- 1. All times in nanoseconds.
- The flight times in this column include margin to account for the following phenomena which Intel has observed when multiple bits are switching simultaneously. These multi-bit effects can adversely affect flight time and signal quality and are sometimes not accounted for in simulation. Accordingly, maximum flight times depend on the baseboard design and additional adjustment factors or margins are recommended.
  - · SSO pushout or pull-in.
  - Rising or falling edge rate degradation at the receiver caused by inductance in the current return path, requiring
    extrapolation that causes additional delay.
  - Crosstalk on the PCB and internal to the package can cause variation in the signals.

There are additional effects that **may not** necessarily be covered by the multi-bit adjustment factor and should be budgeted as appropriate to the baseboard design. Examples include:

- The effective board propagation constant (SEFF), which is a function of:
  - Dielectric constant (ε<sub>Γ</sub>) of the PCB material.
  - The type of trace connecting the components (stripline or microstrip).
  - The length of the trace and the load of the components on the trace. Note that the board propagation constant multiplied by the trace length is a component of the flight time but not necessarily equal to the flight time.

Table 3. Example TFLT MIN Calculations (Frequency Independent)1

| Driver                                      | Receiver                   | THOLD | CIKSKEW | TCO_MIN | MADJ | Recommended TFLT_MIN <sup>2</sup> |
|---------------------------------------------|----------------------------|-------|---------|---------|------|-----------------------------------|
| Pentium <sup>®</sup> III Xeon™<br>processor | Pentium III Xeon processor | 0.62  | 0.15    | -0.07   | 0.26 | 1.10                              |
| Pentium III Xeon processor                  | 82443GX                    | -0.10 | 0.15    | -0.07   | 0.98 | 1.10                              |
| 82443GX                                     | Pentium III Xeon processor | 0.62  | 0.15    | 0.83    | 1.16 | 1.10                              |



#### NOTES:

- 1. All times in nanoseconds.
- 2. The flight times in this column include margin to account for the following phenomena which Intel has observed when multiple bits are switching simultaneously. These multi-bit effects can adversely affect flight time and signal quality and are sometimes not accounted for in simulation. Accordingly, maximum flight times depend on the baseboard design and additional adjustment factors or margins are recommended.
  - SSO pushout or pull-in.
  - Rising or falling edge rate degradation at the receiver caused by inductance in the current return path, requiring extrapolation that causes additional delay.
  - Crosstalk on the PCB and internal to the package can cause variation in the signals.

There are additional effects that **may not** necessarily be covered by the multi-bit adjustment factor and should be budgeted as appropriate to the baseboard design. Examples include:

- The effective board propagation constant (SEFF), which is a function of:
  - Dielectric constant (ε<sub>r</sub>) of the PCB material.
  - The type of trace connecting the components (stripline or microstrip).
  - The length of the trace and the load of the components on the trace. Note that the board propagation constant multiplied by the trace length is a component of the flight time but not necessarily equal to the flight time.

# 3.2. Determine General Topology, Layout, and Routing Desired

After calculating the timing budget, determine the approximate location of the Pentium III Xeon processors and Intel 440GX AGPset on the base board. An example topology is shown in Figure 1. This example topology is valid for the Intel 440GX AGPset only, and does not cover designs using other AGPsets or PCIsets. The 82443GX should be placed electrically in the center of the bus. The Pentium III Xeon processor slots should be placed on either end of the bus to allow the processors to terminate each end. Two termination resistors are added in this topology — resistor Rt-CS at the CS-Fork intersection and resistor Rt-T at the T-Fork intersection. The CS-Fork is the connection point of stub segments L7 and L8. L7 is the length of the resistor stub taken from

the CS-Fork to the pin of the resistor package. L8 is the length of trace from the CS-Fork to the 82443GX package pin. The T-Fork is where segment L6 forks into segments L4 and L5. Table 4 gives the segment descriptions and length recommendations for the investigated topology shown in Figure 1. For this given topology and segment lengths, resistor values of Rt-T =  $85~\Omega$  and Rt-CS =  $150~\Omega$  (both with a 1% tolerance) are recommended

For 2-way SMP Pentium III Xeon processor/Intel 440GX AGPset designs, a termination card must be placed in the unused slot when only one processor is populated. This is necessary to ensure signal integrity requirements are met. Refer to the *Pentium® II Xeon™ Processor Bus Terminator Design Guidelines* for details regarding termination card design and fabrication.





Figure 1. Example 2-Way SMP Network Topology

Table 4. Segments Descriptions and Lengths for Figure 11

| Segment | Description                                | Min length (inches)                                                           | Max length (inches) |  |
|---------|--------------------------------------------|-------------------------------------------------------------------------------|---------------------|--|
| L1      | Processor core package pin to on-card fork | Fixed length in Pentium® III Xeon™ processor package                          |                     |  |
| L2      | Stub length of on-card resistor            | Minimum and maximum Traces in Pentium III Xeo processor package               |                     |  |
| L3      | Cartridge edge finger to on-card fork      | Minimum and maximum traces contained in<br>Pentium III Xeon processor package |                     |  |
| L4      | SC330 connector pin to T-Fork              | 1.50                                                                          | 1.75                |  |
| L5      | SC330 connector pin to T-Fork              | 1.50                                                                          | 1.75                |  |
| L6      | T-Fork to CS-Fork                          | 3.00                                                                          | 3.25                |  |
| L7      | Rt-CS pin to CS-Fork                       | 0.00                                                                          | 0.25                |  |
| L8      | CS-Fork to 82443GX pin                     | 1.00                                                                          | 1.25                |  |
| L9      | T-Fork to Rt-T pin                         | 0.00 0.25                                                                     |                     |  |

#### NOTE:

<sup>1.</sup> L4 does not have to equal L5.



#### 3.3. Pre-Layout Simulation

#### 3.3.1. METHODOLOGY

Pentium III Xeon processor designs require analog simulations. Start simulations prior to layout. Pre-layout simulations provide a detailed picture of the working "solution space" that meets flight time and signal quality requirements. The layout recommendations in the previous sections are based on pre-layout simulations conducted at Intel. By basing board layout guidelines on the solution space, the iterations between layout and post-layout simulation can be reduced.

Intel recommends running simulations at the **device pads** for signal quality and at the **device pins** for timing analysis. However, simulation results at the device pins may be used later to correlate simulation performance against actual system measurements.

Pre-layout analysis includes a sensitivity analysis using parametric sweeps. Parametric sweep analysis involves varying one or two system parameters while all others such as driver strength, package, Z0, and S0 are held constant. This way, the sensitivity of the proposed bus topology to varying parameters can be analyzed systematically. Sensitivity of the bus to minimum flight time, maximum flight time, and signal quality should be covered. Suggested sweep parameters include trace lengths, termination resistor values, and any other factors that may affect flight time, signal quality, and feasibility of layout. Minimum flight time and worst signal quality are typically analyzed using fast I/O buffers and interconnect. Maximum flight time is typically analyzed using slow I/O buffers and slow interconnect.

Outputs from each sweep should be analyzed to determine which regions meet timing and signal quality specifications. To establish the working solution space, find the common space across all the sweeps that result in passing timing and signal quality. The solution space should allow enough design flexibility for a feasible, cost-effective layout.

#### 3.3.2. SIMULATION CRITERIA

Accurate simulations require that the actual range of parameters be used in the simulations. Intel has consistently measured the cross-sectional resistivity of the PCB copper to be approximately 1 ohm\*mil<sup>2</sup>/inch,

not the 0.662 ohm\*mil<sup>2</sup>/inch value for annealed copper that is published in reference material. Using the 1 ohm\*mil<sup>2</sup>/inch value may increase the accuracy of lossy simulations.

Positioning drivers with faster edges closer to the middle of the network typically results in more noise than positioning them towards the ends. However, Intel has shown that the worst-case noise margin can be generated by drivers located in all positions (given appropriate variations in the other network parameters). Therefore, Intel recommends simulating the networks from all driver locations, and analyzing each receiver for each possible driver.

Analysis has shown that **both fast and slow corner conditions** must be run for both rising and falling edge transitions. The fast corner is needed because the fast edge rate creates the most noise. The slow corner is needed because the buffer's drive capability will be a minimum, causing the V<sub>OL</sub> to shift up, which may cause the noise from the slower edge to exceed the available budget. Slow corner models may produce minimum flight time violations on rising edges if the transition starts from a higher V<sub>OL</sub>. So, Intel **highly recommends** checking for minimum and maximum flight time violations with both the fast and slow corner models. The fast and slow corner I/O buffer models are contained in the Pentium III Xeon and Intel 440GX AGPset electronic models provided by Intel.

The transmission line package models must be inserted between the output of the buffer and the net it is driving. Likewise, the package model must also be placed between a net and the input of a receiver model. This is generally done by editing the simulator's net description or topology file.

Intel has found wide variation in noise margins when varying the stub impedance and the PCB's Z0 and S0. Intel therefore recommends that PCB parameters be controlled as tightly as possible, with a sampling of the allowable Z0 and S0 simulated. The recommended effective line impedance (Z<sub>EFF</sub>) is 65  $\Omega$  +/-10%. Intel recommends running uncoupled simulations using the Z0 of the package stubs; and performing fully coupled simulations if increased accuracy is needed or desired. Accounting for crosstalk within the device package by varying the stub impedance was investigated and was not found to be sufficiently accurate. This lead to the development of full package models for the component packages.



#### 3.4. Place and Route Board

# 3.4.1. ESTIMATE COMPONENT TO COMPONENT SPACING FOR AGTL+ SIGNALS

Estimate the number of layers that will be required. Then determine the expected interconnect distances between each of the components on the AGTL+ bus. Using the estimated interconnect distances, verify that the placement can support the system timing requirements.

The maximum network length between the bus agents is determined by the required bus frequency and the maximum flight time propagation delay on the PCB. The minimum network length is independent of the required bus frequency. Table 2 and Table 3 assume values for CLK<sub>SKEW</sub> and CLK<sub>JITTER</sub> — parameters that are controlled by the system designer. In order to reduce system clock skew to a minimum, clock buffers which allow their outputs to be tied together are recommended. Intel strongly recommends running analog simulations to ensure that each design has adequate noise and timing margin.

#### 3.4.2. LAYOUT AND ROUTE BOARD

Route the board satisfying the estimated space and timing requirements. Also stay within the solution space set from the pre-layout sweeps. Estimate the printed circuit board parameters from the placement and other information including the following general guidelines:

- Distribute V<sub>TT</sub> with a power plane or a partial power plane. If this cannot be accomplished, use as wide a trace as possible and route the V<sub>TT</sub> trace with the same topology as the AGTL+ traces.
- Keep the overall length of the bus as short as possible (but don't forget minimum component-to-component distances to meet hold times).
- Plan to minimize crosstalk with the following guidelines developed for the example topology given (signal spacing recommendations were based on fully coupled simulations — spacing may be decreased based upon the amount of coupled length):
  - Use a spacing to line width to dielectric thickness ratio of at least 3:1:2. If ε<sub>Γ</sub> = 4.5, this should limit coupling to 3.4%.
  - Minimize the dielectric process variation used in the PCB fab.
  - Eliminate parallel traces between layers not separated by a power or ground plane.

- Route intra-group AGTL+ signals (AGTL+ signals in the same group) with 6/9 mil or 5/10 mil trace width/spacing.
- Route the same type of AGTL+ I/O signals in isolated signal groups. See the *Pentium® II Processor Developer's Manual* for a description of the different group types. Recommendation for AGTL+ group to group (inter-group) trace width/spacing is 6/18 mil or 5/15 mil.
- Route AGTL+ to non-AGTL+ signals with 6/24 mil or 5/20 mil trace width/spacing.

The spacing between the various bus agents causes variations in trunk impedance and stub locations. These variations cause reflections which can cause constructive or destructive interference at the receivers. A reduction of noise may be obtained by a minimum spacing between the agents. Unfortunately, a tighter spacing results in reduced component placement options and lower hold margins. Therefore, adjusting the inter-agent spacing may be one way to change the network's noise margin, but mechanical constraints often limit the usefulness of this technique. Always be sure to validate signal quality after making any changes in agent locations or changes to inter-agent spacing.

There are six AGTL+ signals that can be driven by more than one agent simultaneously. These signals may require more attention during the layout and validation portions of the design. When a signal is asserted (driven low) by two or more agents on the same clock edge, the two falling edge wave fronts will meet at some point on the bus and can sum to form a negative voltage. The ringback from this negative voltage can easily cross into the overdrive region. The signals are AERR#, BERR#, BINIT#, BNR#, HIT#, and HITM#.

This document addresses AGTL+ layout for the 2-way Pentium III Xeon processor/Intel 440GX AGPset system. Power distribution and chassis requirements for cooling, connector location, memory location, etc., may constrain the system topology and component placement location, therefore constraining the board routing. These issues are not directly addressed in this document. Section 2.2 contains a listing of several documents that address some of these issues.

#### 3.5. Post-Layout Simulation

Following layout, extract the interconnect information for the board from the CAD layout tools. Run simulations to verify that the layout meets timing and noise requirements. A small amount of "tuning" may be



required; experience at Intel has shown that sensitivity analysis dramatically reduces the amount of tuning required. The post layout simulations should take into account the expected variation for all interconnect parameters.

Intel specifies signal integrity at the device pads and therefore recommends running simulations at the device pads for signal quality. However, Intel specifies core timings at the device pins, so simulation results at the device pins should be used later to correlate simulation performance against actual system measurements.

#### 3.5.1. INTERSYMBOL INTERFERENCE

Intersymbol Interference (ISI) refers to the distortion or change in the waveform shape caused by the voltage and transient energy on the network when the driver begins its next transition.

Intersymbol Interference (ISI) occurs when transitions in the current cycle interfere with transitions in subsequent cycles. ISI can occur when the line is driven high, low, and then high in consecutive cycles (the opposite case is also valid). When the driver drives high on the first cycle and low on the second cycle, the signal may not settle to the minimum  $V_{\rm OL}$  before the next rising edge is driven. This results in improved flight times in the third cycle. ISI simulations for the topology given in this section were performed by comparing flight times for the first and third cycle. ISI effects do not necessarily span only 3 cycles so it may be necessary to simulate beyond 3 cycles for certain designs. After simulating and quantifying ISI effects, adjust the timing budget accordingly to take these conditions into consideration.

#### 3.5.2. CROSSTALK ANALYSIS

AGTL+ crosstalk simulations can consider the Pentium III Xeon processor core package, 82443GX package, and SC330 connectors as non-coupled. Treat the traces on the Pentium III Xeon processor cartridge and baseboard as fully coupled for maximum crosstalk conditions. Simulate the traces as lossless for worst case crosstalk, and lossy where more accuracy is needed. Evaluate both odd and even mode crosstalk conditions.

AGTL+ Crosstalk simulation involves the following cases:

- Intra-group AGTL+ crosstalk
- Inter-group AGTL+ crosstalk
- CMOS to AGTL+ crosstalk

#### 3.5.3. MONTE CARLO ANALYSIS

Perform a Monte Carlo analysis to refine the passing solution space region. A Monte Carlo analysis involves randomly varying parameters (independent of one another) over their tolerance range. This analysis intends to ensure that no regions of failing flight time and signal quality exist between the extreme corner cases run in prelayout simulations. For the example topology, vary the following parameters during Monte Carlo simulations:

- · Lengths L4 through L9
- Termination resistance R<sub>TT</sub> on Pentium® III Xeon<sup>TM</sup> processor cartridge #1
- Termination resistance R<sub>TT</sub> on Pentium III Xeon processor cartridge #2
- Z<sub>0</sub> of traces on Pentium III Xeon processor cartridge #1
- Z<sub>0</sub> of traces on Pentium III Xeon processor cartridge #2
- S<sub>0</sub> of traces on Pentium III Xeon processor cartridge #1
- S<sub>0</sub> of traces on Pentium III Xeon processor cartridge #2
- Rt-T resistance
- · Rt-CS resistance
- · Z<sub>0</sub> of traces on base board
- · So of traces on base board
- Fast and slow corner Pentium III Xeon processor I/O buffer models for cartridge #1
- Fast and slow corner Pentium III Xeon processor I/O buffer models for cartridge #2
- Fast and slow package models for Pentium III Xeon processor cartridge #1
- Fast and slow package models for Pentium III Xeon processor cartridge #2
- Fast and slow corner 82443GX I/O buffer models
- Fast and slow 82443GX package models

Refer to the Pentium<sup>®</sup> III Xeon™ Processor at 500 and 550 MHz datasheet and electronic I/O buffer models for the parameter ranges of the Pentium III Xeon processor and Intel 440GX AGPset.

#### 3.6. Validation

Build systems and validate the design and simulation assumptions.



#### 3.6.1. MEASUREMENTS

Note that the AGTL+ specification for signal quality is at the **pad** of the component. The expected method of determining the signal quality is to run analog simulations for the pin and the pad. Then correlate the simulations at the pin against actual system measurements at the pin. Good correlation at the pin leads to confidence that the simulation at the pad is accurate. Controlling the temperature and voltage to correspond to the I/O buffer model extremes should enhance the correlation between simulations and the actual system.

#### 3.6.2. FLIGHT TIME SIMULATION

As defined earlier in Section 2.3., flight time is the time difference between a signal crossing  $V_{REF}$  at the input pin of the receiver, and the output pin of the driver crossing  $V_{REF}$  were it driving a *test load*. The timings in the tables and topologies discussed in this guideline assume the actual system load is 25  $\Omega$  and is **equal to the** *test load*.



Figure 2. Test Load vs. Actual System Load

Figure 2 above shows the different configurations for  $T_{CO}$  testing and flight time simulation. The flip-flop represents the logic input and driver stage of a typical AGTL+ I/O buffer.  $T_{CO}$  timings are specified at the driver pin output.  $T_{FLIGHT-SYSTEM}$  is usually reported by a simulation tool as the time from the driver pad starting its transition to the time when the receiver's input pin sees a valid data input. Since both timing numbers ( $T_{CO}$  and  $T_{FLIGHT-SYSTEM}$ ) will include propagation time from the pad to the pin, it is necessary to subtract this time ( $T_{REF}$ ) from the reported flight time to avoid double

counting.  $T_{REF}$  is defined as the time that it takes for the driver output pin to reach the measurement voltage,  $V_{REF}$ , starting from the beginning of the driver transition at the pad.  $T_{REF}$  must be generated using the same test load for  $T_{CO}$ . Intel provides this timing value in the AGTL+I/O buffer models.

In this manner, the following valid delay equation is satisfied:

#### Valid Delay Equation

This valid delay equation is the total time from when the driver sees a valid clock pulse to the time when the receiver sees a valid data input.

## 3.6.3. FLIGHT TIME HARDWARE VALIDATION

When a measurement is made on the actual system,  $T_{CO}$  and flight time do not need  $T_{REF}$  correction since these are the actual numbers. These measurements include all of the effects pertaining to the driver-system interface and the same is true for the  $T_{CO}$ . Therefore the addition of the measured  $T_{CO}$  and the measured flight time must be equal to the valid delay calculated above.

#### 4.0. THEORY

#### 4.1. AGTL+

AGTL+ is the electrical bus technology used for the Pentium III Xeon processor bus. This is an incident wave switching, open-drain bus with external pull-up resistors that provide both the high logic level and termination at each load. The Pentium III Xeon processor AGTL+ drivers contain a full cycle active pull-up device to improve system timings. The AGTL+ specification defines:

- Termination voltage (V<sub>TT</sub>).
- Receiver reference voltage (V<sub>REF</sub>) as a function of termination voltage (V<sub>TT</sub>).
- Pentium® III Xeon<sup>TM</sup> processor termination resistance (R<sub>TT</sub>).
- Input low voltage (V<sub>IL</sub>).
- Input high voltage (V<sub>IH</sub>).



- NMOS on resistance (R<sub>ONN</sub>).
- Edge rate specifications.
- · Ringback specifications.
- · Overshoot/Undershoot specifications.
- · Settling Limit.

The complete AGTL+ specification can be found in the *Pentium® III Xeon™ Processor at 500 and 550 MHz* datasheet. Layout recommendations for the AGTL+ bus can be found in Section 3 of this document.

#### 4.2. Timing Requirements

The system timing for AGTL+ is dependent on many things. Each of the following elements combine to determine the maximum and minimum frequency the AGTL+ bus can support:

- The range of timings for each of the agents in the system.
  - Clock to output  $[T_{CO}]$ . (Note that the system load is likely to be different from the "specification" load therefore the  $T_{CO}$  observed in the system may not be the same as the  $T_{CO}$  from the specification.)
  - The minimum required setup time to clock [T<sub>SU\_MIN</sub>] for each receiving agent.
- The range of flight time between each component. This includes:
  - The velocity of propagation for the loaded printed circuit board [S<sub>EFF</sub>].

- The board loading impact on the effective T<sub>CO</sub> in the system.
- The amount of skew and jitter in the system clock generation and distribution.
- Changes in flight time due to crosstalk, noise, and other effects.

#### 4.3. Noise Margin

The goal of this section is to describe the total amount of noise that can be tolerated in a system (the noise budget), identify the sources of noise in the system, and recommend methods to analyze and control the noise so that the allowed noise budget is not exceeded.

There are several sources of noise which must be accounted for in the system noise budget, including:

- V<sub>REF</sub> variation
- V<sub>CCCORE</sub> variation
- Variation in V<sub>TT</sub>
- Crosstalk
- Ringback due to impedance variation along the network, termination mismatch, and/or stubs on the network
- Simultaneous Switching Output Effects

Calculate the total noise budget by taking the difference in the worst case specified input level and the worst case driven output level.



Figure 3. Rising Edge Noise Margin



Sections 4.3.1 and Section 4.3.2. discuss noise margin calculations. These sections do not discuss ringback tolerant receivers that can increase the effective noise margin. See the appropriate component datasheets for information about ringback tolerance.

# 4.3.1. FALLING EDGE OR LOW LEVEL NOISE MARGIN

Equation 5 below shows a method of calculating falling edge noise margin when the Pentium III Xeon processor is driving. An example calculation follows.

#### **Equation 5. Low Level Noise Margin**

Noise Margin<sub>LOW</sub> LEVEL = 
$$V_{ILMAX} - V_{OLMAX}$$
  
 $\Rightarrow V_{REF\_MIN} - 100 \text{ mV} - V_{OLMAX}$   
 $\Rightarrow [[2/3 (V_{TT\_MIN})] - 1\%] - 100 \text{ mV} - V_{OLMAX}$ 

Symbols for Equation 5 are:

- V<sub>ILMAX</sub> is the maximum specified valid input low level from the component specification. For this example, 100 mV below the reference voltage is assumed
- V<sub>OLMAX</sub> is the maximum output low level the component will drive. This V<sub>OLMAX</sub> maximum condition corresponds to the slow corner components and models.
- V<sub>REF\_MIN</sub> is the minimum valid voltage reference used for the threshold reference.
- V<sub>TT\_MIN</sub> is the minimum termination voltage.

For the following example calculations for low level and high level noise margin, an  $R_{ON\_MAX}$  equal to  $12.5~\Omega$  is assumed, along with  $V_{REF}$  and  $V_{TT}$  tolerance assumptions. These specs should be obtained from the  $Pentium^{\oplus}$  III  $Xeon^{TM}$  Processor at 500 and 550 MHz datasheet.

Solving for V<sub>REF</sub> MIN with 1% V<sub>REF</sub> uncertainty:

$$V_{REF\_MIN} = [ 2/3 ( V_{TT\_MIN} ) ] - 1\%$$
  
=  $[ 2/3 (1.5 V - 9\%) ] - 1\%$   
=  $[ 2/3 (1.37 V) ] - 1\%$   
=  $901 \text{ mV}$ 

The output low current in the case of  $V_{TT\_MIN}$ , can be calculated as shown below:

$$I = V/R = 1.37/(25 \Omega + 12.5 \Omega) = 36.5 \text{ mA}$$

Then the  $V_{OLMAX}$  for  $V_{REF\_MIN}$  is (36.5 mA \* 12.5  $\Omega$ ) = 456 mV

Then,

Noise Margin<sub>LOW LEVEL</sub>

These example calculations are for an effective termination resistance of 25  $\Omega$ . These calculations **do not** include any resistive drop along the trace.

# 4.3.2. RISING EDGE OR HIGH LEVEL NOISE MARGIN

Equation 6 below shows a method of calculating rising edge noise margin when the Pentium III Xeon processor is driving. An example calculation follows.

#### **Equation 6. High Level Noise Margin**

Noise Margin<sub>HIGH</sub> LEVEL = 
$$V_{OH\_MIN} - V_{IH\_MIN} \Rightarrow V_{TT\_MIN} - (V_{REF\_MAX} + 100 \text{ mV})$$

Symbols for Equation 6 are:

- V<sub>IH\_MIN</sub> is the minimum specified valid input high level from the component specification. For this example, 100 mV above the reference voltage is assumed.
- V<sub>OH\_MIN</sub> is the minimum output high level the component will drive.
- V<sub>TT\_MIN</sub> is the minimum termination voltage. This is assumed to be 1.5 V - 9%, or 1.37 V.
- V<sub>REF\_MAX</sub> is the maximum valid voltage reference used for the threshold reference. Since V<sub>REF</sub> is defined as a function of V<sub>TT</sub>, the maximum V<sub>REF</sub> with V<sub>TT\_MIN</sub> is 2/3 \* (1.37 V) + 1% = 922 mV

V<sub>OH\_MIN</sub> for AGTL+ signals is V<sub>TT\_MIN</sub>.

Then Noise Margin<sub>HIGH</sub> LEVEL

```
= V_{TT\_MIN} - (V_{REF\_MAX} + 100 mV)
= 1.37 V - 922 mV - 100 mV
= 348 mV
```



#### 4.4. Crosstalk Theory

AGTL+ signals swing across a smaller voltage range and have a correspondingly smaller noise margins than technologies that have traditionally been used in personal computer designs. This requires that designers using AGTL+ be more aware of crosstalk than they may have been in past designs.

Crosstalk is caused through capacitive and inductive coupling between networks. Crosstalk appears as both backward crosstalk and as forward crosstalk. Backward crosstalk creates an induced signal on a victim network that propagates in a direction opposite that of the aggressor's signal. Forward crosstalk creates a signal that propagates in the same direction as the aggressor's signal. On the AGTL+ bus, a driver on the aggressor network is not at the end of the network, therefore it sends signals in both directions on the aggressor's network. Figure 4 shows a driver on the aggressor network and a receiver on the victim network that are not at the ends of the network. The signal propagating in each direction causes crosstalk on the victim network.



Figure 4. Aggressor and Victim Networks

19





Figure 5. Transmission Line Geometry: (A) Microstrip (B) Stripline

Additional aggressors are possible in the z-direction, if adjacent signal layers are not routed in mutually perpendicular directions. Because crosstalk coupling coefficients decrease rapidly with increasing separation, it is rarely necessary to consider aggressors which are at least five line widths separated from the victim. The maximum crosstalk occurs when all the aggressors are switching in the same direction at the same time.

There is crosstalk internal to the IC packages, which can also affect the signal quality.

Backward crosstalk is present in both stripline and microstrip geometry's (see Figure 5). A way to remember which geometry is stripline and which is microstrip is that a stripline geometry requires **stripping** a layer away to see the signal lines. The backward coupled amplitude is proportional to the backward crosstalk coefficient, the aggressor's signal amplitude, and the coupled length of the network up to a maximum which is dependent on the rise/fall time of the aggressor's signal. Backward crosstalk reaches a maximum (and remains constant) when the propagation time on the coupled network length exceeds one half of the rise time of the aggressor's signal. Assuming the ideal ramp on the aggressor from 0% to 100% voltage swing, and the fall time on an unloaded coupled network, then:

$$Length \ for \ Max \ Backward \ Crosstalk = \frac{\frac{1}{2} \times Fall \ Time}{Board \ Delay \ Per \ Unit \ Length}$$

An example calculation if the fast corner fall time is 1.5 V/ns and board delay is 175 ps/inch (2.1 ns/foot) follows:

Fall time = 1.5 V/1.5 V/ns = 1 ns

Length for Max Backward Crosstalk

 $= \frac{1}{2}$  \* 1 ns \* 1000 ps/ns /175 ps/in

= 2.86 inches

Agents on the AGTL+ bus drive signals in each direction on the network. This will cause backward crosstalk from segments on two sides of a driver. The pulses from the backward crosstalk travel toward each other and will meet and add at certain moments and positions on the bus. This can cause the voltage (noise) from crosstalk to double.

Table 5 provides example coupling factors for various stripline space to width to dielectric thickness ratios with dielectric constant  $\mathcal{E}_T = 4.5$ ,  $V_{OH\_MAX} = 1.5$  V, and  $Z_0 = 65~\Omega$ . Note that the fast edge rates of falling edges place limits on the maximum coupled length allowable, and this table illustrates the potential consequences of maximum coupled lengths. Also, it should be noted that multiple parallel coupled lines will increase the impact on the noise budget.



| Space:Width:Thickness | Coupling Factor | Maximum Crosstalk (mV) |
|-----------------------|-----------------|------------------------|
| 24:4:8                | 0.65%           | 9.8                    |
| 20:4:8                | 1.3%            | 19. 5                  |
| 16:4:8                | 1.75%           | 26.2                   |
| 14:4:8                | 2.5%            | 37.5                   |
| 12:4:8                | 3.4%            | 51.0                   |
| 8:4:8                 | 6.55%           | 98.2                   |
| 4:4:8                 | 13.5%           | 202.5                  |

Table 5. Example Backward Crosstalk Coupling Factors with  $\mathcal{E}_r$  = 4.5,  $V_{OH~MAX}$  = 1.5  $V_{OH~MAX}$  and  $Z_0$  = 65  $\Omega$ 

Forward crosstalk is absent in stripline topologies, but present in microstrip. (This is for the ideal case with a uniform dielectric constant. In actual boards, forward crosstalk is nearly absent in stripline topologies, but abundant in microstrip.) The forward coupled amplitude is proportional to the forward crosstalk coefficient, the aggressor's signal edge rate (dv/dt), and the coupled network's electrical length. The forward crosstalk coefficient is also a function of the geometry. Unlike backward crosstalk, forward crosstalk can grow with coupled section length, and may transition in a direction similar to or opposite to that of the aggressor's edge. Unlike backward crosstalk, forward crosstalk on the victim signal will continue to grow as it passes through more coupled length before the aggressor's wave front is absorbed by the termination.

#### 4.4.1. CROSSTALK MANAGEMENT

To minimize crosstalk (and the "cost" of crosstalk) in terms of noise margin budget:

- Route adjacent trace layers in different directions (orthogonal preferred) to minimize the forward and backward crosstalk that can occur from parallel traces on adjacent layers. This reduces the source of crosstalk.
- Maximize the spacing between traces. Where traces have to be close and parallel to each other, minimize the distance that they are close together, and maximize the distance between sections that have close spacing. Routing close together could occur where multiple signals have to route between a pair of pins. When this happens the signals should be spread apart where possible. Also note that routing multiple layers in the same direction between reference planes

can result in parallel traces that are close enough to each other to have significant crosstalk.

- Minimize the variation in board impedance (Z<sub>0</sub>). For the example topology, 65 Ω +/- 10% was assumed.
- Minimize the nominal board impedance within the AGTL+ specification while maintaining the same trace width/spacing ratio. For a given dielectric constant, this reduces the trace width/trace height ratio, which reduces the backward and forward crosstalk coefficients. Having reduced crosstalk coefficients reduces the magnitude of the crosstalk.
- Minimize the dielectric constant used in the PCB fabrication. All else being equal, this puts the traces closer to their reference planes and reduces the magnitude of the crosstalk.
- Watch out for voltage doubling at a receiving agent, caused by the adding of the backward crosstalk on either side of a driver. Minimize the total network length of signals that have coupled sections. If there has to be closely spaced/coupled lines, place them near the center of the net. This will cause the point in time that voltage doubling occurs to be before the setup window.
- Route synchronous signals that could be driven by different components in separate groups to minimize crosstalk between these groups. The Pentium® III Xeon™ processor uses a split transaction bus with six independent sub buses (arbitration, request, error, snoop, response, and data). This implies, that in a given clock cycle, each sub bus could be driven by a different agent. If these two agents are at the opposite process corner (one fast and one slow), then separating the bus types will reduce the impact of crosstalk.



Simulation shows that space to line to dielectric ratios of less than 3:1:2 can produce excessive crosstalk between networks on the Pentium III Xeon processor bus. This is due to the lower voltage swing of AGTL+, high frequencies (even with the controlled edge rate buffers) and likely long parallel traces.

# 4.4.2. POTENTIAL TERMINATION CROSSTALK PROBLEMS

The use of commonly used "pull-up" resistor networks for AGTL+ termination may not be suitable. These networks have a common power or ground pin at the extreme end of the package, shared by 13 to 19 resistors (for 14- and 20-pin components). These packages generally have too much inductance to maintain the voltage/current needed at each resistive load. Intel recommends using discrete resistors, resistor networks with separate power/ground pins for each resistor, or working with a resistor network vendor to obtain resistor networks that have acceptable characteristics.

#### 5.0. MORE DETAILS AND INSIGHTS

#### 5.1. Textbook Timing Equations

The "textbook" equations used to calculate the propagation rate of a PCB are the basis for spreadsheet calculations for timing margin based on the component parameters. These equations are:

#### **Equation 7. Intrinsic Impedance**

$$Z_0 = \sqrt{\frac{L_0}{C_0}}^{(\Omega)}$$

#### **Equation 8. Stripline Intrinsic Propagation Speed**

$$S_0$$
  $STRIPLINE = 1.017*\sqrt{\varepsilon_r}$  (ns/ft)

#### Equation 9. Microstrip Intrinsic Propagation Speed

$$S_{0\_MICROSTRIP} = 1.017*\sqrt{0.475*\varepsilon_r + 0.67}$$
 (ns/ft)

#### **Equation 10. Effective Propagation Speed**

$$S_{EFF} = S_0 * \sqrt{1 + \frac{C_D}{C_0}}$$
 (ns/ft)

#### Equation 11. Effective Impedance

$$Z_{EFF} = \frac{Z_0}{\sqrt{1 + \frac{C_D}{C_0}}} (\Omega)$$

#### **Equation 12. Distributed Trace Capacitance**

$$C_0 = \frac{S_0}{Z_0}$$
 (pF/ft)

#### **Equation 13. Distributed Trace Inductance**

$$L_0 = 12*Z_0*S_0 \text{ (nH/ft)}$$

Symbols for Equation 7 through Equation 13 are:

- S<sub>0</sub> is the speed of the signal on an unloaded PCB in ns/ft. This is referred to as the board propagation constant.
- S<sub>0\_MICROSTRIP</sub> and S<sub>0\_STRIPLINE</sub> refer to the speed of the signal on an unloaded microstrip or stripline trace on the PCB in ns/ft.
- Z<sub>0</sub> is the intrinsic impedance of the line in Ω and is a
  function of the dielectric constant (ε<sub>T</sub>), the line width,
  line height and line space from the plane(s). The
  equations for Z<sub>0</sub> are not included in this document.
  See the MECL System Design Handbook by William
  R. Blood, Jr. for these equations.
- C<sub>0</sub> is the distributed trace capacitance of the network in pF/ft.
- L<sub>0</sub> is the distributed trace inductance of the network in nH/ft.
- C<sub>D</sub> is the sum of the capacitance of all devices and stubs divided by the length of the network's trunk, not including the portion connecting the end agents to the termination resistors in pF/ft.



 S<sub>EFF</sub> and Z<sub>EFF</sub> are the effective propagation constant and impedance of the PCB when the board is "loaded" with the components.

# 5.2. Effective Impedance and Tolerance/Variation

The impedance of the PCB needs to be controlled when the PCB is fabricated. The method of specifying control of the impedance needs to be determined to best suit each situation. Using stripline transmission lines (where the trace is between two reference planes) is likely to give better results than microstrip (where the trace is on an external layer using an adjacent plane for reference with solder mask and air on the other side of the trace). This is in part due to the difficulty of precise control of the dielectric constant of the solder mask, and the difficulty in limiting the plated thickness of microstrip conductors, which can substantially increase crosstalk.

The effective line impedance ( $Z_{EFF}$ ) is recommended to be 65  $\Omega$  +/-10%, where  $Z_{EFF}$  is defined by Equation 11.

# 5.3. Power/Reference Planes, PCB Stackup, and High Frequency Decoupling

#### 5.3.1. POWER DISTRIBUTION

Designs using the Pentium III Xeon processor require several different voltages. The following paragraphs describe some of the impact of three common methods used to distribute the required voltages. Refer to the Pentium® III Xeon™ Processor Power Distribution Guidelines for more information on power distribution.

The most conservative method of distributing these voltages is for each of them to have a dedicated plane. If any of these planes are used as an "AC ground" reference for traces to control trace impedance on the board, then the plane needs to be AC coupled to the system ground plane. This method may require more total layers in the PCB than other methods. 1 ounce/ft² thick copper is recommended for all power and reference planes.

A second method of power distribution is to use partial planes in the immediate area needing the power, and to place these planes on a routing layer on an as-needed basis. These planes still need to be decoupled to ground to ensure stable voltages for the components being supplied. This method has the disadvantage of reducing

area that can be used to route traces. These partial planes may also change the impedance of adjacent trace layers. (For instance, the impedance calculations may have been done for a microstrip geometry, and adding a partial plane on the other side of the trace layer may turn the microstrip into a stripline.)

## 5.3.2. REFERENCE PLANES AND PCB STACKUP

The type and number of layers for the PCB need to be chosen to balance many requirements. Many of these requirements include:

- The maximum trace resistance for AGTL+ signal paths should not exceed 2 ohms. Depending on the trace width chosen and PCB vendor's process tolerance, this may require 1 ounce/ft<sup>2</sup> thick copper instead of 1/2 ounce/ft<sup>2</sup> thickness. A higher trace resistivity increases the voltage drop along the trace, which reduces the falling edge noise margin.
- Providing enough routing channels to support the minimum and maximum timing requirements of the components.
- Providing stable voltage distribution for each of the components.
- Providing uniform impedance for the Pentium<sup>®</sup> III Xeon<sup>TM</sup> processor bus and other signals as needed.
- Provide a ground plane under the principal component side of the baseboard. Preferably under both sides if active components are mounted on both sides.
- Minimizing coupling/crosstalk between the networks.
- Minimizing RF emissions.
- Maximizing PCB yield.
- Minimizing PCB cost.
- Minimizing cost to assemble PCB.

The following baseboard layout recommendations should help reduce the amount of Simultaneous Switching Output (SSO) effects experienced.

It is recommended that baseboard stackup be arranged such that AGTL+ signal routes do not traverse multiple signal layers, as this can create discontinuities in the signal's return path. It is also recommended that each AGTL+ signal have a single reference plane for the entire route. Figure 6 shows the ideal case where a particular signal is routed entirely within the same signal layer, with a ground layer as the single reference plane.





Figure 6. One Signal Layer and One Reference Plane



Figure 7. Layer Switch with One Reference Plane

When it is not possible to route the entire AGTL+ signal on a single layer, there are methods to reduce the effects of layer switches whereby the signal still references the same plane (see Figure 7). Figure 8 shows another method of minimizing layer switch discontinuities, but may be less effective than Figure 7. In this case, the signal still references the same type of reference plane (ground). In such a case, it is good practice to stitch (i.e., connect) the two ground planes together with vias in the vicinity of the signal transition via.



Figure 8. Layer Switch with Multiple Reference Planes (same type)

When routing and stackup constraints require that an AGTL+ signal reference multiple planes, one method of minimizing adverse effects is to add high-frequency decoupling wherever the transitions occur, as shown in Figure 9 and Figure 10. Such decoupling should, again, be in the vicinity of the signal transition via and use capacitors with minimal effective series resistance (ESR) and effective series inductance (ESL). Dual vias for these caps are recommended since via inductance may sometimes be higher than the actual capacitor inductance.





Figure 9. Layer Switch with Multiple Reference Planes



Figure 10. One Layer with Multiple Reference
Planes

#### 5.3.3. HIGH FREQUENCY DECOUPLING

This section contains several high frequency decoupling recommendations that will improve the return path for an AGTL+ signal. These design recommendations will very likely reduce the amount of SSO effects.

Just as layer switching and multiple reference planes can create discontinuities in an AGTL+ signal return path, discontinuities may also occur when a signal transitions between the baseboard and cartridge. Therefore,

providing adequate high-frequency decoupling across VCC<sub>CORE</sub> and ground at the SC330 connector interface on the baseboard will minimize the discontinuity in the signal's reference plane at this junction. Please note that these additional high-frequency decoupling capacitors are in addition to the high-frequency decoupling already on the processor.

Transmission line geometry also influences the return path of the reference plane. The following are decoupling recommendations that take this into consideration:

- A signal that transitions from a stripline to another stripline should have close proximity decoupling between all four reference planes.
- A signal that transitions from a stripline to a microstrip (or vice versa) should have close proximity decoupling between the three reference planes.
- A signal that transitions from a stripline or microstrip through vias or pins to a component (82443GX, etc.) should have close proximity decoupling across all involved reference planes to ground for the device.

#### 5.3.4. SC330 CONNECTOR

Internal studies indicate that the use of thermal reliefs on the connector pin layout pattern (especially ground pins) should be minimized. Such reliefs (cartwheels or wagon-wheels) increase the net ground inductance and reduce the integrity of the ground plane to which many signals are referenced. Increased ground inductance has been shown to aggravate SSO effects. Also, the anti-pad diameter (clearance holes in the planes) for the signal pins should be minimized since large anti-pads also reduce the integrity of the ground plane and increase inductance.

Some additional layout and EMI-reduction guidelines regarding the SC330 connector follow:

- Extend power/ground planes up to the SC330 connector pins.
- Extend the reference planes for AGTL+ and other controlled-impedance signals up to the SC330 connector pins.
- Minimize or remove thermal reliefs on power/ground pins.
- Route VTT power with the widest signal trace or miniplane as possible. Place decoupling caps across VTT and ground in the vicinity of the connector pins.



- Use a ground plane under the principal component side of the baseboard (and secondary side if it contains active components).
- Distribute decoupling capacitors across power and ground pins evenly around the connector (less than 0.5 inch spacing) on the primary and secondary sides.
- Minimize serpentine traces on outer layers.

#### 5.4. Clock Routing

Analog simulations are required to ensure clock net signal quality and skew is acceptable. The clock skew in Pentium III Xeon processor based systems must be kept to a minimum (the calculations and simulations for the example topology given in this document have a total clock skew of 150 ps and 150 ps of clock jitter). For a given design, the clock distribution system, including the clock components, must be evaluated to ensure these same values are valid assumptions. The *Pentium*® *III* Xeon<sup>TM</sup> Processor at 500 and 550 MHz datasheet specifies clock signal quality requirements for Pentium III Xeon processor systems. To help meet these specifications, follow these general guidelines:

- Tie clock driver outputs if clock buffer supports this mode of operation.
- Match the electrical length and type of traces on the PCB (microstrip and stripline may have different propagation velocities).
- Maintain consistent impedance for the clock traces.
  - Minimize the number of vias in each trace.
  - Minimize the number of different trace layers used to route the clocks.

- Keep other traces away from clock traces.
- Lump the loads at the end of the trace if multiple components are to be supported by a single clock output.
- Have equal loads at the end of each network.

The ideal way to route each clock trace is on the same single inner layer, next to a ground plane, isolated from other traces, with the same total trace length, to the same type of single load, with an equal length ground trace parallel to it, and driven by a zero skew clock driver. When deviations from ideal are required, going from a single layer to a pair of layers adjacent to power/ground planes would be a good compromise. The fewer number of layers the clocks are routed on, the smaller the impedance difference between each trace is likely to be. Maintaining an equal length and parallel ground trace for the total length of each clock ensures a low inductance ground return and produces the minimum current path loop area. (The parallel ground trace will have lower inductance than the ground plane because of the mutual inductance of the current in the clock trace.)

#### 5.5. Conclusion

AGTL+ routing requires a significant amount of effort. Planning ahead and leaving the necessary time available for correctly designing a board layout will provide the designer with the best chance of avoiding the more difficult task of debugging inconsistent failures caused by poor signal integrity. Intel recommends planning a layout schedule that allows time for each of the tasks outlined in this document.



# APPENDIX A DEFINITIONS OF FLIGHT TIME MEASUREMENTS/CORRECTIONS AND SIGNAL QUALITY

Acceptable signal quality must be maintained over all operating conditions to ensure reliable operation. Signal Quality is defined by four parameters: Overshoot, Undershoot, Settling Limit, and Ringback. The Pentium® III Xeon™ Processor at 500 and 550 MHz datasheet describes and specifies values for these parameters. Timings are measured at the pins of the driver and receiver, while signal integrity is observed at the receiver chip pad. When signal integrity at the pad violates the following guidelines and adjustments need to be made to flight time, the adjusted flight time obtained at the chip pad can be assumed to have been observed at the package pin, usually with a small timing error penalty.

#### 6.0. VREF GUARDBAND

To account for noise sources that may affect the way an AGTL+ signal becomes valid at a receiver,  $V_{REF}$  is shifted by  $\Delta V_{REF}$  for measuring minimum and maximum flight times. The  $V_{REF}$  Guardband region is bounded by  $V_{REF}$ - $\Delta V_{REF}$  and  $V_{REF}$ + $\Delta V_{REF}$ .  $\Delta V_{REF}$  has a value of 100 mV, which accounts for the following noise sources:

- 50 mV for motherboard coupling
- 35 mV for V<sub>TT</sub> noise
- 15 mV for V<sub>REF</sub> noise

The example topology covered in this guideline assumes ringback tolerance within 20 mV of  $V_{REF}$ . Since  $V_{REF}$  is

guardbanded by 100 mV, this places the absolute ringback limits at:

- 1.12 V for rising edge ringback
- 0.88 V for falling edge ringback

A violation of these ringback limits requires flight time correction as documented in the *Pentium® II Processor Developer's Manual*.

#### 7.0. OVERDRIVE REGION

The overdrive region is the voltage range, at a receiver, from V<sub>REF</sub> to V<sub>REF</sub> + 200 mV for a low-to-high going signal and V<sub>REF</sub> to V<sub>REF</sub> - 200 mV for a high-to-low going signal. The overdrive regions encompass the V<sub>REF</sub> Guardband. So, when  $V_{REF}$  is shifted by  $\Delta V_{REF}$  for timing measurements, the overdrive region does not shift by  $\Delta V_{REF}$ . Figure 11 below depicts this relationship. Corrections for edge rate and ringback are documented in the Pentium® II Processor Developer's Manual. However, there is an exception to the documented correction method. The Pentium® II Processor Developer's Manual states that extrapolations should be made from the last crossing of the overdrive region back to V<sub>REF</sub>. Simulations performed on this topology should extrapolate back to the appropriate V<sub>REF</sub> Guardband boundary, and not V<sub>REF</sub>. So, for maximum rising edge correction, extrapolate back to V<sub>REF</sub> +  $\Delta$ V<sub>REF</sub>. For maximum falling edge corrections, extrapolate back to  $V_{REF}$  -  $\Delta V_{REF}$ .





Figure 11. Overdrive Region and VREF Guardband



Figure 12. Rising Edge Flight Time Measurement



# 8.0. FLIGHT TIME DEFINITION AND MEASUREMENT

Timing measurements consist of minimum and maximum flight times to take into account that devices can turn on or off anywhere in a  $V_{REF}$  Guardband region. This region is bounded by  $V_{REF}\text{-}\Delta V_{REF}$  and  $V_{REF}\text{-}\Delta V_{REF}$ . The minimum flight time for a rising edge is measured from the time the driver crosses

 $V_{REF}$  when terminated to a test load, to the time when the signal first crosses  $V_{REF}\text{-}\Delta V_{REF}$  at the receiver (see Figure 12). Maximum flight time is measured to the point where the signal first crosses  $V_{REF}\text{+}\Delta V_{REF}$ , assuming that ringback, edge rate, and monotonicity criteria are met. Similarly, minimum flight time measurements for a falling edge are taken at the  $V_{REF}\text{+}\Delta V_{REF}$  crossing and maximum flight time is taken at the  $V_{REF}\text{-}\Delta V_{REF}$  crossing.