Pipelining (CSCE 430/830)
What is Pipelining?
•A way of speeding up execution of instructions
•Key idea: overlap execution of multiple instructions
The Laundry Analogy
•Ann, Brian, Cathy, and Dave each have one load of clothes to wash, dry, and fold
•Washer takes 30 minutes
•Dryer takes 30 minutes
•“Folder” takes 30 minutes
•“Stasher” takes 30 minutes to put clothes into drawers
If we do laundry sequentially...
[Figure: sequential laundry timeline from 6 PM to 2 AM; each load A-D goes through four 30-minute stages (wash, dry, fold, stash) back to back.]
•Time Required: 8 hours for 4 loads
To Pipeline, We Overlap Tasks
[Figure: pipelined laundry timeline from 6 PM to about 9:30 PM; a new load starts every 30 minutes, so loads A-D overlap.]
•Time Required: 3.5 Hours for 4 Loads
To Pipeline, We Overlap Tasks
•Pipelining doesn’t help the latency of a single task; it helps the throughput of the entire workload
•Pipeline rate is limited by the slowest pipeline stage
•Multiple tasks operate simultaneously
•Potential speedup = number of pipe stages
•Unbalanced lengths of pipe stages reduce speedup
•Time to “fill” the pipeline and time to “drain” it reduce speedup
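To make the fill-and-drain point concrete, here is a minimal Python sketch (not from the slides; the constants simply mirror the laundry example) that reproduces the 8-hour and 3.5-hour totals and shows why four loads fall short of the 4x potential speedup:

# Four loads, four balanced 30-minute stages (wash, dry, fold, stash).
STAGE_MIN = 30          # minutes per stage (all stages balanced)
NUM_STAGES = 4          # wash, dry, fold, stash
NUM_LOADS = 4           # A, B, C, D

# Sequential: each load finishes all stages before the next load starts.
sequential = NUM_LOADS * NUM_STAGES * STAGE_MIN           # 480 min = 8 hours

# Pipelined: once the pipeline is full, one load finishes every stage time.
pipelined = (NUM_STAGES + NUM_LOADS - 1) * STAGE_MIN      # 210 min = 3.5 hours

print(f"sequential: {sequential / 60:.1f} h")             # 8.0 h
print(f"pipelined:  {pipelined / 60:.1f} h")              # 3.5 h
print(f"speedup:    {sequential / pipelined:.2f}x")       # ~2.29x, below 4x due to fill/drain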
Pipelining a Digital System
•Key idea: break a big computation up into pieces
•Separate each piece with a pipeline register (modeled in the sketch below)
[Figure: a 1 ns combinational block split into five 200 ps stages, with a pipeline register between successive stages.]
1 nanosecond = 10^-9 second
1 picosecond = 10^-12 second
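As a software analogy (a hedged sketch, not the hardware in the figure), the loop below models the pipeline registers explicitly: on every clock tick each register captures the output of the stage feeding it, so all five stages work on different inputs at once. The stage functions and input values are made up purely for illustration.

# Five arbitrary stage functions standing in for pieces of a big computation.
stages = [lambda x: x + 1,
          lambda x: x * 2,
          lambda x: x - 3,
          lambda x: x * x,
          lambda x: x % 10]

regs = [None] * len(stages)     # pipeline register after each stage
inputs = [5, 7, 9]              # a stream of repeated computations
results = []

cycle = 0
while inputs or any(r is not None for r in regs):
    cycle += 1
    # Clock edge: the last register's value leaves the pipeline...
    if regs[-1] is not None:
        results.append(regs[-1])
    # ...and every stage consumes the register written by the stage before it.
    for i in range(len(stages) - 1, 0, -1):
        regs[i] = stages[i](regs[i - 1]) if regs[i - 1] is not None else None
    regs[0] = stages[0](inputs.pop(0)) if inputs else None
    print(f"cycle {cycle}: registers = {regs}")

print("results:", results)      # one result per cycle once the pipeline is full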
Pipelining a Digital System
•Why do this? Because it's faster for repeated computations
[Figure: timing comparison.
Non-pipelined: 1 operation finishes every 1 ns.
Pipelined: 1 operation finishes every 200 ps.]
Comments about pipelining
•Pipelining increases throughput, but not
latency
–Answer available every 200ps, BUT
–A single computation still takes 1ns
•Limitations:
–Computations must be divisible into equal-size stages
–Pipeline registers add overhead
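A quick calculation with these numbers (five 200 ps stages vs. one 1 ns block, register overhead ignored; the operation count is an arbitrary choice for illustration) makes the latency/throughput distinction explicit:

STAGE_PS = 200              # delay of each pipeline stage, in picoseconds
NUM_STAGES = 5
N_OPS = 1000                # number of back-to-back operations

# Latency of a single operation is the same in both designs (ignoring overhead).
latency_ps = NUM_STAGES * STAGE_PS                       # 1000 ps = 1 ns

# Throughput differs: the non-pipelined design starts a new operation every
# 1 ns; the pipelined design finishes one every 200 ps once the pipeline fills.
non_pipelined_total = N_OPS * latency_ps                 # 1,000,000 ps
pipelined_total = (NUM_STAGES + N_OPS - 1) * STAGE_PS    #   200,800 ps

print(f"latency per op:      {latency_ps} ps (both designs)")
print(f"non-pipelined total: {non_pipelined_total} ps")
print(f"pipelined total:     {pipelined_total} ps")
print(f"throughput speedup:  {non_pipelined_total / pipelined_total:.2f}x")  # ~4.98x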
Pipelining a Processor
•Recall the 5 steps in instruction execution (traced cycle by cycle in the sketch below):
1.Instruction Fetch (IF)
2.Instruction Decode and Register Read (ID)
3.Execute operation or calculate address (EX)
4.Memory access (MEM)
5.Write result into register (WB)
•Review: Single-Cycle Processor
–All 5 steps done in a single clock cycle
–Dedicated hardware required for each step
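The sketch below (illustrative only; the three lw instructions are borrowed from the timing example later in this section) prints which of the five stages each instruction occupies in each cycle of an ideal pipeline, previewing the diagrams on the following slides.

STAGES = ["IF", "ID", "EX", "MEM", "WB"]
instrs = ["lw $1, 100($0)", "lw $2, 200($0)", "lw $3, 300($0)"]

n_cycles = len(STAGES) + len(instrs) - 1
print(f"{'cycle:':<16}" + "".join(f"{c + 1:>5}" for c in range(n_cycles)))

for i, instr in enumerate(instrs):
    cells = []
    for c in range(n_cycles):
        stage = c - i               # instruction i enters IF in cycle i + 1
        cells.append(f"{STAGES[stage]:>5}" if 0 <= stage < len(STAGES) else " " * 5)
    print(f"{instr:<16}" + "".join(cells))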
Review - Single-Cycle Processor
•What do we need to add to actually split the datapath into stages?
The Basic Pipeline For MIPS
[Figure: four instructions flowing through the 5-stage MIPS pipeline (Ifetch, Reg, ALU, DMem, Reg) over cycles 1-7; a new instruction enters the pipeline every cycle.]
What do we need to add to actually split the datapath into stages?
Basic Pipelined Processor
Pipeline example: lw
[Figures: the lw instruction stepping through the pipelined datapath one stage at a time: IF, ID, EX, MEM, WB.]
Can you find a problem?
Single-Cycle vs. Pipelined Execution
[Figure: single-cycle vs. pipelined execution of three loads (time axis in ps).
Non-pipelined: each lw goes through Instruction Fetch, REG RD, ALU, MEM, REG WR and takes 800 ps, so lw $1, 100($0), lw $2, 200($0), and lw $3, 300($0) start 800 ps apart.
Pipelined: the same five stages take 200 ps each; a new lw starts every 200 ps and overlaps with the instructions ahead of it.]
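Using the figure's numbers (800 ps per instruction on the single-cycle datapath, five 200 ps stages on the pipelined one), a short calculation gives the completion times for the three loads; with only three instructions the speedup is well below the 4x bound set by the slowest stage.

SINGLE_CYCLE_PS = 800       # one lw per 800 ps clock on the single-cycle datapath
STAGE_PS = 200              # each pipeline stage takes 200 ps
NUM_STAGES = 5
n_instructions = 3          # lw $1, lw $2, lw $3

single_cycle_total = n_instructions * SINGLE_CYCLE_PS              # 2400 ps
pipelined_total = (NUM_STAGES + n_instructions - 1) * STAGE_PS     # 1400 ps

print(single_cycle_total, "ps non-pipelined")
print(pipelined_total, "ps pipelined")
# 2400 / 1400 ~= 1.7x here; as the instruction count grows, the ratio
# approaches 800 / 200 = 4x (limited by the slowest stage).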
Speedup
•Consider the unpipelined processor introduced previously. Assume that
it has a 1 ns clock cycle and that it uses 4 cycles for ALU operations and
branches and 5 cycles for memory operations, and assume that the relative
frequencies of these operations are 40%, 20%, and 40%, respectively.
Suppose that, due to clock skew and setup, pipelining the processor adds
0.2 ns of overhead to the clock. Ignoring any latency impact, how much
speedup in the instruction execution rate will we gain from a pipeline?
Average instruction execution time
= 1 ns * ((40% + 20%) * 4 + 40% * 5)
= 4.4 ns
Speedup from pipelining
= Average instruction time unpipelined / Average instruction time pipelined
= 4.4 ns / 1.2 ns = 3.7
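The arithmetic above can be double-checked in a few lines; the CPI weighting and the 1.2 ns pipelined cycle follow directly from the stated assumptions.

clock_ns = 1.0              # unpipelined clock cycle
overhead_ns = 0.2           # clock skew and setup overhead added by pipelining

# Unpipelined: CPI weighted by the mix (ALU 40%, branch 20%, memory 40%).
cpi_unpipelined = 0.40 * 4 + 0.20 * 4 + 0.40 * 5           # = 4.4
avg_time_unpipelined = cpi_unpipelined * clock_ns           # = 4.4 ns

# Pipelined: ideally one instruction per cycle, but the cycle is stretched.
avg_time_pipelined = 1 * (clock_ns + overhead_ns)           # = 1.2 ns

print(f"speedup = {avg_time_unpipelined / avg_time_pipelined:.2f}x")   # ~3.67x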
Comments about Pipelining
•The good news
–Multiple instructions are being processed at the same time
–This works because stages are isolated by registers
–Best case speedup of N
•The bad news
–Instructions interfere with each other - hazards
»Example: different instructions may need the same
piece of hardware (e.g., memory) in the same clock cycle
»Example: an instruction may require a result produced
by an earlier instruction that is not yet complete
Pipeline Hazards
•Limits to pipelining: hazards prevent the next instruction
from executing during its designated clock cycle
–Structural hazards: two different instructions use the same h/w
in the same cycle
–Data hazards: an instruction depends on the result of a prior
instruction still in the pipeline (see the sketch after this list)
–Control hazards: pipelining of branches & other instructions
that change the PC
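As a concrete illustration of a data hazard, the sketch below assumes an ideal 5-stage pipeline with no forwarding and a made-up two-instruction program; it flags the case where a register is read in ID before the instruction that produces it has reached WB.

STAGES = ["IF", "ID", "EX", "MEM", "WB"]

# (instruction text, destination register, source registers) - hypothetical program
program = [
    ("add $1, $2, $3", "$1", ["$2", "$3"]),
    ("sub $4, $1, $5", "$4", ["$1", "$5"]),
]

def stage_cycle(instr_index, stage):
    """Cycle (1-based) in which the given instruction occupies the given stage."""
    return instr_index + STAGES.index(stage) + 1

for i, (name, dest, _) in enumerate(program):
    for j, (later, _, later_srcs) in enumerate(program[i + 1:], start=i + 1):
        if dest in later_srcs and stage_cycle(j, "ID") < stage_cycle(i, "WB"):
            print(f"data hazard: '{later}' reads {dest} in cycle "
                  f"{stage_cycle(j, 'ID')}, but '{name}' writes it in cycle "
                  f"{stage_cycle(i, 'WB')}")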