Electromigration and IR Voltage Drop- EMIR.pdf

AhmedAbdelazeem28 1,241 views 52 slides Oct 02, 2024
Slide 1
Slide 1 of 52
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52

About This Presentation

A system's signal integrity (SI) is a measure of how much an electrical signal changes between entering and exiting a circuit. For digital electronics, that signal is an electric current in which voltage varies between a high and a low value over time.

Signal integrity is fundamental to any mo...


Slide Content

IC Reliability
EMIR
Ahmed Abdelazeem

Index
1.Main Concepts of Reliability
2.Electromigration
3.IR drop
1
Infantile
Mortalities
Failure
Rate
Operational Life (Random Failures)
Wear-out
Period
Time
Accelerated Life Test
(Increased Stress)
|HALT|
|HASS|
Full Life Test (Hours, Miles, Cycles, etc.)
Reliability Tests

•Physics
•Mathematics
•General knowledge on IC and its design
•General knowledge on IC fabrication technologies
•Basic knowledge of MOS and FinFETtransistors
2
Prerequisites

Quality vs Reliability
•Quality
▪A complete number of features and characteristics of a product that deliver specified needs.
•Reliability
▪The capacity of a product to perform its required functions under specified conditions for a particular period. In
other words, reliability is the quality over time.
3
quality reliability
Source: https://www.techly.it/sedia-per-ufficio-easy-colore-blu.html

Reasons of Growing Importance of IC
Reliability
4
Evolution of IC applications
Increase of IC component
number
Increase of IC power
consumption
Increase of IC performance

Evolution of IC Applications
•Most important: automotiveand IoT
5
Everywhere

Automotive Integrated Circuits
6
•Factory-installed electronics
•Integrated Circuits as % of total car cost
Voice/Data
Communications
Cabin
Environment
Controls
DSRC
Entertainment
S y s t e m
Hill-
Hold Regenerative
Control
Bra k ing
Antilock
T h e P r e s s u r e
Braking
Monitoring
Parking
system
S e c urit yS y s t e m
ActiveExhaust
Nois eS u p p r e s s i o n
ActiveSuspension
Ba t t e ry
Management
Repair
LaneCorrection
ElectronicToll
Collection
Digital
Turn
S ig na
ls
Navigat
ion
S y s t e
m
Differen
tial
EV/H EV
Active
CabinNoise
Suppression
Interior
Lightering
Auto-
Dimm ing
Mirror
Event
D a t a
Recorder
Accident
Recorder
Instrument
Cluster
Driver
Alertness
Monitoring
Windshield
WiperControl
Parental
Controls
Airbag
D e ploy m e nt
Remote
Keyless
Ent ry
Blindspot
Direction
L a n e
Departure
Warning
Tra ns m is s ion
Control
ActiveYaw
Control
Electronic
stability
control
S e a t
Position
control
AdaptiveFront
Lighting
AdaptiveCruise
Control
Automatic
Bra k ing
ElectricP ow e r
Steering
ElectronicThrottle
Control
ElectronicValve
Timing
Cylinder
De-activation
Active
Vibration
Control
Drive
Shaft
IdleStop/
S t a rt OBDII
Eng ine
Control
Night
Vision
Head-Up
Display
1980
Electronic fuel
injection
10%
Airbag ABS/ESP
22%
2000
Advanced driver assist Active-passive
safety
Powertrain Radar/ vision infotainment
35%
2010
2030
50%
Source: https://mfjenterprises.com

IoT Integrated Circuits
7
•Essence of IoT •Billions of devices
0.5
2003
PCs
8.7
22.9
50.1
2012
Smart watch
11.27
2015
Consumer electronics
2016
Smart traffic
2016
Healthcare
2020
Smart home
35
Source: https://internetofthingsagenda.techtarget.com/definition/Internet-of-Things-IoT

Exact Prediction of IC Evolution -Moore’s
Law
8
1965. The number of
transistors in ICs will
double every 18
months
Source: https://www.intel.com/content/www/us/en/homepage.html

Increase of IC Component Number
9
3 transistors 54 billion transistors27 million
transistors
First IC
Intel Pentium 3
Google TPUv4
… …

Increase of IC Absolute Power
Consumption
10
2 W 2 kW150 W
=2x
First IC
Intel Pentium 3
Huawei Kirin 9000e
… …
Electricalpanel

Increase of IC Specific Power Consumption
11
∼1W/cm
2
∼300W/cm
2
∼8W/cm
2

First IC
Intel Pentium 3
Google TPUv4
……
Nuclear reactor

Increase of IC Performance
12
10 kHz 8x8x5,2 GHz2,3 GHz

First IC
Intel Pentium 3
Qualcomm snapdragon 865
……

The Importance of IC Reliability
13
Radiation induced circuit glitches
Power rail glitch
Meta-stability due to design flows
(failure to synchronize the circuit)
IC design complexity increases Importance of IC reliability grows
New techniques and tools needed
Source: www.FutureTimeline.net

Causing Decrease of IC Reliability
14
Aging
Self-Heating
Process Variability
Cross Talk
ESD and Latchup
EMI
IR Drop
Electromigration
Radiation
Metastability
Signal Integrity
Power Integrity

Trends of Parameter Evolution With
Regards to IC Thermal Mode
15
•IC complexity increases, consideration and control of thermal mode during IC design becomes more important.
Year 2009 2012 2015 2018 2021 2024
Gate length(nm) 54 35 22 15,7 11,1 7,9
Average specific power consumption (W /
mm
2
)
0,45 0,6 0,75 0,9 1,05 1,2
In case of portable devices, maximum
temperatureof p-n junction(
0
C)
105 105 105 105 105 105
Maximum operational temperature range (
0
C)-40...150-40...150-40...150-40...150-40...150-40...150
In case of optimal flow of air, environment
temperature of IC body (
0
C)
45 45 45 45 45 45

IC Technology, Power Consumption and
Temperature Connection
16
•IC complexity increases and connections between different parameters also increases.
•For effective thermal management, it is necessary to take it into account during the whole design
flow.
Dynamic
Static
Technology, nm Temperature
Power consumption (W)

IC Design Flowand Thermal Analysis
17
IC
Design
Flow
Timing
IR Drop
Rail EM
Thermal
Extraction
Power
•IC design success is largely determined by the availability of
such design environment that combines the interdependent
tasks of timing, power, leakage, thermal and signal analysis.
•Such analysis should be applied throughout the entire IC
design implementation process.
•By IC complexity growth (SoC, Multichip module, 3D IC)
and their use in portable devices, thermal analysis gets
particular importance

Relationship of Power and IC Thermal
Mode
18
•By shrinking of the technology, temperature effect on power consumption increases. It is more vividly
expressed for leakage power.
•Multi-core architecture design trends have taken the
direction of increasing the power density by integrating
more processing units on the chip (with a fixed chip
area).
•If the IC power density keeps increasing, it will eventually
reach the same magnitude of nuclear power plants.
•With such increased power densities, ICs face a
tremendous increase in heat generation that has a direct
impact on the lifetime of ICs.
Dynamic PowerLeakage Power
TotalPower
Temperature

Thermal Reliability of ICs
19
•Temperature difference in different parts of semiconductor crystal can switch from 10 to 20
o
C.
•MTTF is 50-75 years in case of 60
o
C.
•MTTF is 1000-1500 hours in case of 125
o
C for ICs of average complexity and 85-90
o
C for complex processor ICs.
Black’s equation
�??????????????????(??????)=??????�
−??????
??????
(
??????
??????
????????????
)
MTF-Mean Time to Failure at T; A-Empirical constant that depends on technology;J-current density;
n=1,0 ÷2,0 ; J=(0,2÷2,0)10
6
A/cm
2
;Ea-Activation energy=0,5÷0,7eV; k-Boltzmann’s constant;T-
absolute temperature of IC element.
1000
100
10
1
100806040200
T(K)
MTTF(T)=AJ
-2
exp(E
a/kT)
E
a=0.68eV
EM MTTF (300K) / MTTF(300K+)

T

Effect of IC Complexity on Thermal Mode
20
•IC complexity growth caused application of SoC structures in which
thermal issues become more acute.
•This is mainly related to the localized heat generation of several processing
units of SoCs.
•Localized heat generation creates several localized high temperatures,
known as thermal hot spots.
•The existence of several thermal hot spots would imply that there are other
localized cold spots, which leads to the creation of the undesirable spatial
thermal gradients.
Layoutand thermal response of the
UltraSPARCT1 MPSoC

Impact of Temperature: Summary
21
Temperature
Performance
Power
Reliability
Cost
Higher T →lower performance
Higher T →higher (cooling) costs
Higher T →higher leakage
Higher T →higher error rate
Higher power (density) →higher T
Fault redundancy
techniques

higher power
Low
-
power
techniques

higher error rate

Gate-level Average Power Calculation
22
Leakage
Power
Internal
Power
Switching
Power
Average
Power
??????(??????�????????????�)=??????(�????????????�??????????????????)+½∗(��????????????��??????�−??????�??????�????????????∗??????�??????�∗????????????)+½∗??????∗??????
2
∗??????�??????�∗????????????

Gate-level Average Power Calculation
23

IR Drop
24
IR drop
•When voltage is applied to PG metals the current starts flowing and some voltage is droppeddue
to finite resistance of a metal wire and current.
-Increased delays
-Degraded performance
•Solutions
-MinimizePG resistance
-More vias
-Widerwires
-Inserting Decaps

How Does a Power Rail IR Drop Occur?
25

IR Drop (2)
•IR drop: Voltage drops caused
by current flowing from the
power source through the
resistive power network to the
on-chip devices is called IR drop.
•Ground bounce: Voltage
spikes caused by current flowing
from on-chip devices though the
resistive ground network to the
ground pins (or bumps)
•IR drop and ground bounce
combine to impact silicon
performance.
26

IR Drop Impacts on Setup and Hold Time
•In the case where the IR drop occurs within the signal path, the signal is slowed, potentially causing setup time
violations for this signal path
•In the case where IR drop occurs on a clock buffer, the clock signal beyond this buffer is slowed, potentially
causing hold time violations for all signals clocked by this clock branch.
27

IR Drop Effects
28
1.2V
Minimum
Tolerance
level
Ideal voltage level
Actual voltage level
A
B
Y
A
B
Y
D D
CP
QB
CP
QB
t = 25 psPD t = 40 psPD tsu tSU
Result in unpredictable
performance
Result in irregular or
permanent failures
Dynamic IR Drop can lead to the
following effects:
• Performance degradation
–Excessive path delay
–Excessive clock skew
• Functional failure
–Voltage drop reduces noise margin

IR Drop Impact on Path Delay
29
•Design Statistics
-5 element delay chain of the same buffer
-32/28 nm process
-1.05V ideal VDD
1 2 3 4 5
Actual VDD/VSS 1.05V/0V 0.92V/0V 0.89V/0.05V 0.88V/0.12V 0.85V/0.1V
Measured delay
Results Delay1 Delay2 Delay3 Delay4 Delay5 Total
Ideal VDD/VSS
(1.05/0V)
40ps 51ps 53ps 54ps 56ps 254ps
Actual VDD/VSS
44ps 53ps 49ps 59ps 54ps 259ps
Ideal and Actual
difference (%)
8.6 1.6 3.2 15.8 14.9 7.3

IR Drop Occur
•The power supply (VDD and VSS) in a chip is uniformly distributed through the metal rails and stripes which is called
a Power Delivery Network (PDN) or power grid.
•Each metal layer used in PDN has finite resistivity.
•When current flows through the power delivery network, a part of the applied voltage will be dropped in PDN as per
Ohm’s law
•The amount of voltage drop will be V = I.R, called the IR drop.
•If the resistivity of metal wire is high or the amount of current following through the power net is high, A significant
amount of voltage may be dropped in the power delivery network which will cause a lesser amount of voltage
available to the standard cells than the actual amount of voltage applied.
•If V1 voltage is applied at the power port and current I is following in a particular net which has total resistance R,
then the voltage available (V2) to the other end for the standard cell will be
•Standard cells or macros sometimes do not get the minimum operating voltage which is required to operate them due
to IR drop in the power delivery network even the application of sufficient voltage in the power port.
•Voltage drop in the power delivery network before reaching the standard cells is called IR drop
•This drop may cause the poor performance of the chip due to the increase of delay of standard cells and may cause
thefunctional failure of the chip due to setup/hold timing violation.
30

Types of IR drop
•There are two types of IR drops in the ASIC design:
▪Static IR drop
▪Dynamic IR drop
•Static IR drop is the voltage drop in the power delivery network (PDN) when there are no inputs switching,
which means the circuit is in the static stage.
•dynamic IR drop is the voltage drop in the power delivery network when the inputs are continuously switching,
which means the circuit is in a functional state. Dynamic IR drop will depend on the switching rate of the
instance.
•When the inputs are switching continuously, more current would flow in the instances and also in PDN. So
there will be more IR drop in the PDN. Therefore, dynamic IR drop is more than the static IR drop.
31

Static Voltage Drop Background
32
•On-chip power/ground network ➔mesh of resistors
•Average current (I
avg) of instances is estimated from Average power
•Instances ➔DC current sources

Static Voltage Drop on P/G Network
•Average current is calculated for each instance
•V
staticis computed at every node (Ohm's law ...)
•Wire / via electromigration (EM) is post-processed from static current density
33

Dynamic Voltage Drop Problem Definition
34
•On-chip power/ground network ➔R,L,C mesh
•Switching instances ➔i(t,V) sources
•Non-switching instances ➔Effective decaps, ESR and leakage

Dynamic Voltage Drop on P/G Network
•PWL current for each instance
•V
dynamicwaveform is computed at every node by transient simulation
35

Static VS. Dynamic IR drop Analysis
36

Difference Between Static and Dynamic
37
Static Dynamic
All instances will draw an average current
(DC)
Switching instances will draw transient
current (AC)
Non-switching instances will draw only
leakage
Total Average demand will be much less than
real peak demand current for the chip
Dynamic will see the real peak demand
current
Demand current is completely supplied by
battery
Portion of the demand current is supplied by
decaps
(Intrinsic / Intentional / PG caps)
Doesn’t matter when an instance switches
Instances will draw the current all the time
Instances will draw transient current only
when
it switches
Simultaneous switching causes huge peak
current demand
No drop across package due to Ldi/dt effects
(Current is constant)
Ldi/dt drop across package and die
inductance

IR Drop Example for Chip
38

Reasons for IR drop
IR drop could occur due to various reasons, but some main reasons are as follows.
•Poor design of the power delivery network (lesser metal width and more separation in the power stripes)
•inadequate via in-power delivery network
•Inadequate number of decapcells availability
•High cell density and high switching in a particular region
•High impedance of the power delivery network
•Rush current
•Insufficient number of voltage sources
•High RC value of the metal layer used to create the power delivery network
39

Method of Reducing Dynamic IR Drop
40

Method of Reducing Static IR Drop
1.The red area means a voltage drop of more than 10% of the
nominal supply voltage. The solution is to use wider
power stripes or use more metal on higher levels.
2.Additional power stripes are added to the design and
are marked in cyan and magenta.
3.This IR drop plot is made after an increase in the number of
power stripes. This plot shows a very low voltage drop,
which is required for a functional chip.
41

Methods to improve IR drop
Methods to improve static IR drop
•We can go for higher layers if available
•Increase the width of the straps.
•Increase the number of wires.
•Check if any via is missing then add more via.
Methods to improve dynamic IR drop
•Use de-cap cells.
•Increase the number of straps
42
Metal N+1 Metal N+1
Metal N
Metal N
Via Via

Electromigration
43

Electromigration
44
Electromigration
•The movement of atoms caused by the current through a metal.
-High-density current and material
-Temperature
-Size
•Current types for checks
-Absolute
-Average
-Peak
-Root-mean-square (RMS)
Alyo

Electromigration effect(2)
45
•EM effect sources:
-High current density causes current-driven migration of metal atoms. This effect is called
Electromigration (EM).
-Each metal layer with fixed width and length has limited current currying ability.
M1
M1
M1
M1 M1
Current rise
approximately
by1AMP
Temperatureof
wireincreases
Resistanceof
wire goeslow
Electronsflow
faster
--
-
-
-
--
-
-
-
-
-
-
-
-
-
metal
current
This is correlating process. As a result, the metal will
become thinner or will be damaged

Electromigration (3)
46
Electromigration
•The associated accumulation or loss of material results in damage.
-Deposition of atoms (Hillock->short)
-Depletion of atoms (Void->open)
•Solutions
-Widerwires
-More vias
-Shorterlengths
-Switch to higher layers
Open
Short

Reasons of Electromigration
Reasons of EM Violation:
•High Fanout Net(Multiple fanout cells switch simultaneously, draws larger current from the driver)
•Higher Driver strength Cells (delivers large current unnecessarily, heating the wire)
•Higher Frequency (quick transitions)
•Narrow Metal Width.
•Metal slotting (resulting in narrower widths)
•Long Nets (because of larger resistance, higher localized temperature)
47

Prevention techniques for EM
During the physical design, the following techniques could be used to prevent the EM issue.
•Decrease Drive’s drive Strength.
•Insert Buffer on long nets.
•Increase the width of the wire.
•Adding more vias (Multi-Cut Vias)
•Break the fanout (have a smaller fanout)
•Switch the net to higher metal layers.
48

Analysis Output from Power Grid Analysis
49

References
1.MohsenRaji,BehnamGhavami.LifetimeReliability-awareDesignofIntegratedCircuits.Springer;1sted.
2023edition;2022
2.S.Jayanthy,M.C.Bhuvaneswari.TestGenerationofCrosstalkDelayFaultsinVLSICircuits.Springer;1sted.
2019edition;2018
3.YanfengChen,BoZhang:Measuring,Equivalent-Small-ParameterAnalysisofDC/DCSwitched-Mode
Converter.Springer;1sted;2018
4.ZuberPatel,ShilpiGupta.AdvancesinVLSIandEmbeddedSystems:SelectProceedingsofAVES2019.
Springer;1sted.2021;2020
5.S.Kundu,S.Chattopadhyay.Network-on-Chip:TheNextGenerationofSystem-on-ChipIntegration.CRC
Press;1stedition;2018
6.GuihermeArroz,JoseMonteiro,ArlindoOliveira.ComputerArchitecture:DigitalCircuitsTo
Microprocessors.WorldScientific;2018
7.RalphMorrison.FastCircuitBoards:EnergyManagement.Wiley;1stedition;2018
50

Thank You ☺