SlidePub
Home
Categories
Login
Register
Home
Design
Fundamentals of Quantitative Design and Analysis
Fundamentals of Quantitative Design and Analysis
ssusered4546
31 views
37 slides
Sep 14, 2025
Slide
1
of 37
Previous
Next
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
About This Presentation
Chapter 1: Fundamentals of Quantitative Design and Analysis
Size:
1.83 MB
Language:
en
Added:
Sep 14, 2025
Slides:
37 pages
Slide Content
Slide 1
Copyright © 2019, Elsevier Inc. All rights reserved. Chapter 1 Fundamentals of Quantitative Design and Analysis Computer Architecture A Quantitative Approach , Sixth Edition
Slide 2
Instructor: Ramalingam Sridhar 338K Davis Hall Ph : 645-3186 Class: http://www.cse.buffalo.edu/~rsridhar/cse490-590 Office Hours: : Tuesday 1:30-3:30pm and/or by appointment Teaching assistants: Aryan Pandey
[email protected]
Tyler DeAngelo
[email protected]
Ronan Kasmier
[email protected]
Copyright © 2019, Elsevier Inc. All rights reserved.
Slide 3
Prerequisites : CSE341 Class website : http://www.cse.buffalo.edu/~rsridhar/cse490-590/ Text Book : Computer Architecture, Sixth Edition: A Quantitative Approach (The Morgan Kaufmann Series in Computer Architecture and Design) (6th Edition), 2017. John L. Hennessy and David A. Patterson Reference Book : David A. Patterson and John L. Hennessy, Computer Organization and Design: The Hardware/Software Interface, 5th edition, Elsevier, 2014 (Book used for CSE341) Copyright © 2019, Elsevier Inc. All rights reserved.
Slide 4
Course Description Computer architecture is the science and art of selecting and interconnecting hardware components to create a computer that meets functional, performance and cost goals. In this course, students will learn how to completely design a correct single processor computer, including processor datapath , processor control, pipelining optimization, instruction-level parallelism and multi-core, memory/cache systems, and I/O. This course is targeted at senior-level undergraduates and first-year graduate students who already took CSE 341: Computer Organization and CSE: 241 Digital Systems or CSE220: Systems Programming. Students should have a good working understanding of digital logic, basic processor design and organization, pipelining, and simple cache design. computer organization, logic design, and the equivalent knowledge are necessary to succeed in computer architecture. Copyright © 2019, Elsevier Inc. All rights reserved.
Slide 5
Topics covered Computer Architecture Overview Benchmarking and Evaluation Review of Computer Organization & Digital Systems Pipelining; Complex Pipelining Memory Review; Cache Design; Virtual Memory Instruction Level Parallelism; Dataflow Very Long Instruction Word Processor Data Level Parallelism Graphic Processing Units Loop Level Parallelism Domain specific architecture Copyright © 2019, Elsevier Inc. All rights reserved.
Slide 6
Exams/Quizzes/Projects/Grading Grading 3 Quizzes - 10 % (4 4 2) 1 Midterm - 25 % 1 Final - 35 % 2 Projects - 30 % Grade Assignment : (Letter grades carry normal numerical values) (90‐100 = A, 88‐89.9 = A‐, 86‐87.9 = B+, 80‐85.9 = B, 78‐79.9 = B‐, 75‐77.9 = C+, 70‐74.9 = C, 65‐69.9 = C‐, 60‐64.9 = D, 1‐ 59 = F). Curving may be applied if deemed appropriate by the instructor. Copyright © 2019, Elsevier Inc. All rights reserved.
Slide 7
Academic Integrity and Accessibility resources https://www.buffalo.edu/academic-integrity.html Academic Integrity issues: Take it serious https://www.buffalo.edu/studentlife/who-we-are/departments/accessibility.html Accessibility resources Class discussions – Piazza UBLearns – resources and grades Copyright © 2019, Elsevier Inc. All rights reserved.
Slide 8
Computer Technology Performance improvements: Improvements in semiconductor technology Feature size, clock speed Improvements in computer architectures Enabled by HLL compilers, UNIX Lead to RISC architectures Together have enabled: Lightweight computers Productivity-based managed/interpreted programming languages Copyright © 2019, Elsevier Inc. All rights reserved. Introduction
Slide 9
Single Processor Performance Copyright © 2019, Elsevier Inc. All rights reserved. Introduction
Slide 10
Copyright © 2019, Elsevier Inc. All rights reserved. Current Trends in Architecture Cannot continue to leverage Instruction-Level parallelism (ILP) Single processor performance improvement ended in 2003 New models for performance: Data-level parallelism (DLP) Thread-level parallelism (TLP) Request-level parallelism (RLP) These require explicit restructuring of the application Introduction
Slide 11
Copyright © 2019, Elsevier Inc. All rights reserved. Classes of Computers Personal Mobile Device (PMD) e.g. start phones, tablet computers Emphasis on energy efficiency and real-time Desktop Computing Emphasis on price-performance Servers Emphasis on availability, scalability, throughput Clusters / Warehouse Scale Computers Used for “Software as a Service ( SaaS )” Emphasis on availability and price-performance Sub-class: Supercomputers, emphasis: floating-point performance and fast internal networks Internet of Things/Embedded Computers Emphasis: price Classes of Computers
Slide 12
Copyright © 2019, Elsevier Inc. All rights reserved. Parallelism Classes of parallelism in applications: Data-Level Parallelism (DLP) Task-Level Parallelism (TLP) Classes of architectural parallelism: Instruction-Level Parallelism (ILP) Vector architectures/Graphic Processor Units (GPUs) Thread-Level Parallelism Request-Level Parallelism Classes of Computers
Slide 13
Copyright © 2019, Elsevier Inc. All rights reserved. Flynn’s Taxonomy Single instruction stream, single data stream (SISD) Single instruction stream, multiple data streams (SIMD) Vector architectures Multimedia extensions Graphics processor units Multiple instruction streams, single data stream (MISD) No commercial implementation Multiple instruction streams, multiple data streams (MIMD) Tightly-coupled MIMD Loosely-coupled MIMD Classes of Computers
Slide 14
Copyright © 2019, Elsevier Inc. All rights reserved. Defining Computer Architecture “Old” view of computer architecture: Instruction Set Architecture (ISA) design i.e. decisions regarding: registers, memory addressing, addressing modes, instruction operands, available operations, control flow instructions, instruction encoding “Real” computer architecture: Specific requirements of the target machine Design to maximize performance within constraints: cost, power, and availability Includes ISA, microarchitecture, hardware Defining Computer Architecture
Slide 15
Instruction Set Architecture Class of ISA General-purpose registers Register-memory vs load-store RISC-V registers 32 g.p., 32 f.p. Copyright © 2019, Elsevier Inc. All rights reserved. Defining Computer Architecture Register Name Use Saver x0 zero constant 0 n/a x1 ra return addr caller x2 sp stack ptr callee x3 gp gbl ptr x4 tp thread ptr x5-x7 t0-t2 temporaries caller x8 s0/fp saved/ frame ptr callee Register Name Use Saver x9 s1 saved callee x10-x17 a0-a7 arguments caller x18-x27 s2-s11 saved callee x28-x31 t3-t6 temporaries caller f0-f7 ft0-ft7 FP temps caller f8-f9 fs0-fs1 FP saved callee f10-f17 fa0-fa7 FP arguments callee f18-f27 fs2-fs21 FP saved callee f28-f31 ft8-ft11 FP temps caller
Slide 16
Instruction Set Architecture Memory addressing RISC-V: byte addressed, aligned accesses faster Addressing modes RISC-V: Register, immediate, displacement (base+offset) Other examples: autoincrement, indexed, PC-relative Types and size of operands RISC-V: 8-bit, 32-bit, 64-bit Copyright © 2019, Elsevier Inc. All rights reserved. Defining Computer Architecture
Slide 17
Instruction Set Architecture Operations RISC-V: data transfer, arithmetic, logical, control, floating point See Fig. 1.5 in text Control flow instructions Use content of registers (RISC-V) vs. status bits (x86, ARMv7, ARMv8) Return address in register (RISC-V, ARMv7, ARMv8) vs. on stack (x86) Encoding Fixed (RISC-V, ARMv7/v8 except compact instruction set) vs. variable length (x86) Copyright © 2019, Elsevier Inc. All rights reserved. Defining Computer Architecture
Slide 18
Copyright © 2019, Elsevier Inc. All rights reserved. Trends in Technology Integrated circuit technology (Moore’s Law) Transistor density : 35%/year Die size: 10-20%/year Integration overall: 40-55 %/year DRAM capacity: 25-40%/year ( slowing) 8 Gb (2014), 16 Gb (2019), possibly no 32 Gb Flash capacity: 50-60%/year 8-10X cheaper/bit than DRAM Magnetic disk capacity: recently slowed to 5%/year Density increases may no longer be possible, maybe increase from 7 to 9 platters 8-10X cheaper/bit then Flash 200-300X cheaper/bit than DRAM Trends in Technology
Slide 19
Copyright © 2019, Elsevier Inc. All rights reserved. Bandwidth and Latency Bandwidth or throughput Total work done in a given time 32,000-40,000X improvement for processors 300-1200X improvement for memory and disks Latency or response time Time between start and completion of an event 50-90X improvement for processors 6-8X improvement for memory and disks Trends in Technology
Slide 20
Copyright © 2019, Elsevier Inc. All rights reserved. Bandwidth and Latency Log-log plot of bandwidth and latency milestones Trends in Technology
Slide 21
Copyright © 2019, Elsevier Inc. All rights reserved. Transistors and Wires Feature size Minimum size of transistor or wire in x or y dimension 10 microns in 1971 to .011 microns in 2017 Transistor performance scales linearly Wire delay does not improve with feature size! Integration density scales quadratically Trends in Technology
Slide 22
Copyright © 2019, Elsevier Inc. All rights reserved. Power and Energy Problem: Get power in, get power out Thermal Design Power (TDP) Characterizes sustained power consumption Used as target for power supply and cooling system Lower than peak power (1.5X higher), higher than average power consumption Clock rate can be reduced dynamically to limit power consumption Energy per task is often a better measurement Trends in Power and Energy
Slide 23
Copyright © 2019, Elsevier Inc. All rights reserved. Dynamic Energy and Power Dynamic energy Transistor switch from 0 -> 1 or 1 -> 0 ½ x Capacitive load x Voltage 2 Dynamic power ½ x Capacitive load x Voltage 2 x Frequency switched Reducing clock rate reduces power, not energy Trends in Power and Energy
Slide 24
Copyright © 2019, Elsevier Inc. All rights reserved. Power Intel 80386 consumed ~ 2 W 3.3 GHz Intel Core i7 consumes 130 W Heat must be dissipated from 1.5 x 1.5 cm chip This is the limit of what can be cooled by air Trends in Power and Energy
Slide 25
Copyright © 2019, Elsevier Inc. All rights reserved. Reducing Power Techniques for reducing power: Do nothing well Dynamic Voltage-Frequency Scaling Low power state for DRAM, disks Overclocking, turning off cores Trends in Power and Energy
Slide 26
Copyright © 2019, Elsevier Inc. All rights reserved. Static Power Static power consumption 25-50% of total power Current static x Voltage Scales with number of transistors To reduce: power gating Trends in Power and Energy
Slide 27
Copyright © 2019, Elsevier Inc. All rights reserved. Trends in Cost Cost driven down by learning curve Yield DRAM: price closely tracks cost Microprocessors: price depends on volume 10% less for each doubling of volume Trends in Cost
Slide 28
Copyright © 2019, Elsevier Inc. All rights reserved. Integrated Circuit Cost Integrated circuit Bose-Einstein formula: Defects per unit area = 0.016-0.057 defects per square cm (2010) N = process-complexity factor = 11.5-15.5 (40 nm, 2010) Trends in Cost
Slide 29
Copyright © 2019, Elsevier Inc. All rights reserved. Dependability Module reliability Mean time to failure (MTTF) Mean time to repair (MTTR) Mean time between failures (MTBF) = MTTF + MTTR Availability = MTTF / MTBF Dependability
Slide 30
Copyright © 2019, Elsevier Inc. All rights reserved. Measuring Performance Typical performance metrics: Response time Throughput Speedup of X relative to Y Execution time Y / Execution time X Execution time Wall clock time: includes all system overheads CPU time: only computation time Benchmarks Kernels (e.g. matrix multiply) Toy programs (e.g. sorting) Synthetic benchmarks (e.g. Dhrystone) Benchmark suites (e.g. SPEC06fp, TPC-C) Measuring Performance
Slide 31
Copyright © 2019, Elsevier Inc. All rights reserved. Principles of Computer Design Take Advantage of Parallelism e.g. multiple processors, disks, memory banks, pipelining, multiple functional units Principle of Locality Reuse of data and instructions Focus on the Common Case Amdahl’s Law Principles
Slide 32
Copyright © 2019, Elsevier Inc. All rights reserved. Principles of Computer Design The Processor Performance Equation Principles
Slide 33
Copyright © 2019, Elsevier Inc. All rights reserved. Principles of Computer Design Principles Different instruction types having different CPIs
Slide 34
Copyright © 2019, Elsevier Inc. All rights reserved. Principles of Computer Design Principles Different instruction types having different CPIs
Slide 35
Fallacies and Pitfalls All exponential laws must come to an end Dennard scaling (constant power density) Stopped by threshold voltage Disk capacity 30-100% per year to 5% per year Moore’s Law Most visible with DRAM capacity ITRS disbanded Only four foundries left producing state-of-the-art logic chips 11 nm, 3 nm might be the limit Copyright © 2019, Elsevier Inc. All rights reserved.
Slide 36
Fallacies and Pitfalls Microprocessors are a silver bullet Performance is now a programmer’s burden Falling prey to Amdahl’s Law A single point of failure Hardware enhancements that increase performance also improve energy efficiency, or are at worst energy neutral Benchmarks remain valid indefinitely Compiler optimizations target benchmarks Copyright © 2019, Elsevier Inc. All rights reserved.
Slide 37
Fallacies and Pitfalls The rated mean time to failure of disks is 1,200,000 hours or almost 140 years, so disks practically never fail MTTF value from manufacturers assume regular replacement Peak performance tracks observed performance Fault detection can lower availability Not all operations are needed for correct execution Copyright © 2019, Elsevier Inc. All rights reserved.
Tags
Categories
Design
Download
Download Slideshow
Get the original presentation file
Quick Actions
Embed
Share
Save
Print
Full
Report
Statistics
Views
31
Slides
37
Age
79 days
Related Slideshows
1
MGV Residential Design projects for different clients, including a New Mexico Adobe project-1-.pdf
mannyvesa
26 views
16
EUNITED_Advocacy and Public Engagement through Visual Media
GeorgeDiamandis11
31 views
31
DESIGN THINKINGGG PPT 2 TOPIC IDEATION.pptx
HibaZaidi2
24 views
36
DESIGN THINKING CHAPTER 1 PPTT PPT 1.pptx
HibaZaidi2
28 views
112
Hinduism and Its History - PowerPoint Slides.pptx
ConorMcCormack10
24 views
20
Service Attributes of Manufactured Parts.pptx
MustafaEnesKrmac
24 views
View More in This Category
Embed Slideshow
Dimensions
Width (px)
Height (px)
Start Page
Which slide to start from (1-37)
Options
Auto-play slides
Show controls
Embed Code
Copy Code
Share Slideshow
Share on Social Media
Share on Facebook
Share on Twitter
Share on LinkedIn
Share via Email
Or copy link
Copy
Report Content
Reason for reporting
*
Select a reason...
Inappropriate content
Copyright violation
Spam or misleading
Offensive or hateful
Privacy violation
Other
Slide number
Leave blank if it applies to the entire slideshow
Additional details
*
Help us understand the problem better