Computer Architecture, Nguyễn Thanh Sơn, Chapter 1: Abstracts and Technology (sinhvienzone.com)



Computer Architecture
Computer Science & Engineering
Chapter 1 OVERVIEW: Abstracts and Technology
BK TP.HCM, Faculty of Computer Science & Engineering, 07-Sep-13
CuuDuongThanCong.com https://fb.com/tailieudientucntt

The Computer Revolution
  • Progress in computer technology
      • Based on Moore's Law
  • Makes novel applications feasible
      • Computers in automobiles
      • Cell phones
      • Human genome project
      • World Wide Web
      • Search engines
  • Computers are now used everywhere

History of Computer Development
  • First generation, 1945-1955
      • Vacuum tubes, plug boards
  • Second generation, 1955-1965
      • Transistors, batch systems
  • Third generation, 1965-1980
      • ICs and multiprogramming
  • Fourth generation, 1980-present
      • Personal computers (desktop, laptop)
      • Supercomputers, data centers, clusters, etc.

Classes of Computers
  • Desktop computers
      • General purpose, variety of software
      • Subject to cost/performance tradeoff
  • Server computers
      • Network based
      • High capacity, performance, reliability
      • Range from small servers to building sized
  • Embedded computers
      • Hidden as components of systems
      • Stringent power/performance/cost constraints

The Processor Market
  [Figure]

What You Will Learn
  • How programs are translated into the machine language
      • And how the hardware executes them
  • The hardware/software interface
  • What determines program performance
      • And how it can be improved
  • How hardware designers improve performance
  • What is parallel processing

Understanding Performance
  • Algorithm
      • Determines number of operations executed
  • Programming language, compiler, architecture
      • Determine number of machine instructions executed per operation
  • Processor and memory system
      • Determine how fast instructions are executed
  • I/O system (including OS)
      • Determines how fast I/O operations are executed

Below Your Program
  • Application software
      • Written in high-level language
  • System software
      • Compiler: translates HLL code to machine code
      • Operating System: service code
          • Handling input/output
          • Managing memory and storage
          • Scheduling tasks & sharing resources
  • Hardware
      • Processor, memory, I/O controllers

Levels of Program Code
  • High-level language
      • Level of abstraction closer to problem domain
      • Provides for productivity and portability
  • Assembly language
      • Textual representation of instructions
  • Hardware representation
      • Binary digits (bits)
      • Encoded instructions and data

Components of a Computer
  • Same components for all kinds of computer
      • Desktop, server, embedded
  • Input/output includes
      • User-interface devices
          • Display, keyboard, mouse
      • Storage devices
          • Hard disk, CD/DVD, flash
      • Network adapters
          • For communicating with other computers

...

CPI Example
  • Alternative compiled code sequences using instructions in classes A, B, C
  • Sequence 1: IC = 5
      • Clock Cycles = 2×1 + 1×2 + 2×3 = 10
      • Avg CPI = 10/5 = 2.0
  • Sequence 2: IC = 6
      • Clock Cycles = 4×1 + 1×2 + 1×3 = 9
      • Avg CPI = 9/6 = 1.5
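The arithmetic in the CPI Example can be checked with a short Python sketch. The preview does not include the slide's table, so the per-class CPI values (A = 1, B = 2, C = 3) and the per-class instruction counts are assumptions read off the products in the clock-cycle sums; the names `CPI_BY_CLASS`, `SEQUENCES`, and `summarize` are illustrative only.

```python
# Cycles per instruction for each class. The preview omits the slide's table;
# these values are inferred from the products 2×1 + 1×2 + 2×3 (assumption).
CPI_BY_CLASS = {"A": 1, "B": 2, "C": 3}

# Instruction counts per class, also read off the slide's arithmetic.
SEQUENCES = {
    "Sequence 1": {"A": 2, "B": 1, "C": 2},
    "Sequence 2": {"A": 4, "B": 1, "C": 1},
}


def summarize(counts, cpi_by_class=CPI_BY_CLASS):
    """Return (instruction count, clock cycles, average CPI) for one sequence."""
    instruction_count = sum(counts.values())
    clock_cycles = sum(cpi_by_class[cls] * n for cls, n in counts.items())
    return instruction_count, clock_cycles, clock_cycles / instruction_count


for name, counts in SEQUENCES.items():
    ic, cycles, avg_cpi = summarize(counts)
    print(f"{name}: IC = {ic}, clock cycles = {cycles}, avg CPI = {avg_cpi:.1f}")
```

Sequence 2 executes more instructions (6 versus 5) yet needs fewer clock cycles (9 versus 10), which is why instruction count alone is not a reliable indicator of performance.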
Performance Summary
  • Performance depends on
      • Algorithm: affects IC, possibly CPI
      • Programming language: affects IC, CPI
      • Compiler: affects IC, CPI
      • Instruction set architecture: affects IC, CPI, Tc

Power Trends
  • In CMOS IC technology
  [Figure]

Reducing Power
  • Suppose a new CPU has
      • 85% of capacitive load of old CPU
      • 15% voltage and 15% frequency reduction
  • The power wall
      • We can't reduce voltage further
      • We can't remove more heat
  • How else can we improve performance?

Uniprocessor Performance
  [Figure]

Multiprocessors
  • Multicore microprocessors
      • More than one processor per chip
  • Requires explicitly parallel programming
      • Compare with instruction level parallelism
          • Hardware executes multiple instructions at once
          • Hidden from the programmer
      • Hard to do
          • Programming for performance
          • Load balancing
          • Optimizing communication and synchronization

Manufacturing ICs
  • Yield: proportion of working dies per wafer
  [Figure]

AMD Opteron X2 Wafer
  • X2: 300mm wafer, 117 chips, 90nm technology
  • X4: 45nm technology

Integrated Circuit Cost
  • Nonlinear relation to area and defect rate
      • Wafer cost and area are fixed
      • Defect rate determined by manufacturing process
      • Die area determined by architecture and circuit design

SPEC CPU Benchmark
  • Programs used to measure performance
      • Supposedly typical of actual workload
  • Standard Performance Evaluation Corp (SPEC)
      • Develops benchmarks for CPU, I/O, Web, …
  • SPEC CPU2006
      • Elapsed time to execute a selection of programs
          • Negligible I/O, so focuses on CPU performance
      • Normalize relative to reference machine
      • Summarize as geometric mean of performance ratios
          • CINT2006 (integer) and CFP2006 (floating-point)

CINT2006 for Opteron X4 2356
  [Figure]

SPEC Power Benchmark
  • Power consumption of server at different workload levels
      • Performance: ssj_ops/sec
      • Power: Watts (Joules/sec)

SPECpower_ssj2008 for X4
  [Figure]

Pitfall: MIPS as a Performance Metric
  • MIPS: Millions of Instructions Per Second
      • Doesn't account for
          • Differences in ISAs between computers
          • Differences in complexity between instructions
  • CPI varies between programs on a given CPU

Concluding Remarks
  • Cost/performance is improving
      • Due to underlying technology development
  • Hierarchical layers of abstraction
      • In both hardware and software
  • Instruction set architecture
      • The hardware/software interface
  • Execution time: the best performance measure
  • Power is a limiting factor
      • Use parallelism to improve performance

...

  • ... a program
      • 10 s on A, 15 s on B
  • Execution Time_B / Execution Time_A = 15 s / 10 s = 1.5
  • So A is 1.5 times faster than B

...

Opening the Box

...
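The Reducing Power slide lists scaling factors, but the preview text does not carry the formula they plug into. The sketch below assumes the standard dynamic-power relation for CMOS logic, P = C × V² × f, which is not shown in the preview; `dynamic_power` is an illustrative helper name, and all quantities are relative to the old CPU.

```python
def dynamic_power(capacitive_load, voltage, frequency):
    """Dynamic power of CMOS logic, P = C * V^2 * f (assumed relation, relative units)."""
    return capacitive_load * voltage ** 2 * frequency


# Old CPU is the baseline; every quantity is normalized to 1.0.
old_power = dynamic_power(1.0, 1.0, 1.0)

# New CPU: 85% of the capacitive load, with voltage and frequency each reduced by 15%.
new_power = dynamic_power(0.85, 0.85, 0.85)

ratio = new_power / old_power
print(f"P_new / P_old = {ratio:.3f}")
```

Under that assumption the ratio is 0.85⁴, so the new CPU would draw roughly half the power of the old one. Because voltage enters squared, lowering it is the most effective lever, which is why the slide's remark that voltage cannot be reduced further leads into the power wall.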

Posted: 28/01/2020, 23:05
