l25-instructcyclepipelining

8/6/2019 L25-InstructCyclePipelining


Indirect Cycle

May require memory access to fetch operands

Indirect addressing requires more memory accesses

Can be thought of as an additional instruction subcycle



Instruction Cycle with Indirect cycle



Instruction Cycle State Diagram



Data Flow (Instruction Fetch)

Depends on CPU design

In general:

Fetch

PC contains address of next instruction

Address moved to MAR

Address placed on address bus

Control unit requests memory read

Result placed on data bus, copied to MBR, then to IR

Meanwhile PC incremented by 1
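
The fetch steps above can be sketched as register transfers. This is a minimal illustration, not a model of any particular CPU: the dictionary-based registers, the `fetch_cycle` name, and the sample memory contents are all assumptions made for the example.

```python
# Hypothetical register-transfer model of the fetch cycle described above.
def fetch_cycle(regs, memory):
    """Perform one instruction fetch on a dict of registers (PC, MAR, MBR, IR)."""
    regs["MAR"] = regs["PC"]            # address moved to MAR (and onto the address bus)
    regs["MBR"] = memory[regs["MAR"]]   # memory read: result arrives on the data bus in MBR
    regs["PC"] += 1                     # meanwhile PC is incremented by 1
    regs["IR"] = regs["MBR"]            # MBR copied to IR
    return regs


# Made-up memory image: two instructions stored at addresses 100 and 101.
memory = {100: "LOAD 200", 101: "ADD 201"}
regs = fetch_cycle({"PC": 100, "MAR": None, "MBR": None, "IR": None}, memory)
print(regs)
```

In hardware the PC increment overlaps the memory read; the sequential statements here only show which register receives which value.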



Data Flow (Fetch Cycle)



Data Flow (Data Fetch)

IR is examined to determine whether the operand uses indirect addressing

If so, indirect cycle is performed

Rightmost N bits of MBR transferred to MAR

Control unit requests memory read

Result (address of operand) moved to MBR
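
The indirect cycle can be sketched in the same register-transfer style. The 12-bit address field (`ADDRESS_BITS`), the instruction word layout, and the memory contents below are assumptions chosen only to make the example concrete.

```python
ADDRESS_BITS = 12  # assumed width N of the address field in an instruction word


def indirect_cycle(regs, memory):
    """Replace the address field held in MBR with the operand address it points to."""
    address_mask = (1 << ADDRESS_BITS) - 1
    regs["MAR"] = regs["MBR"] & address_mask   # rightmost N bits of MBR transferred to MAR
    regs["MBR"] = memory[regs["MAR"]]          # memory read: address of operand moved to MBR
    return regs


# Hypothetical instruction word 0x1200: opcode 0x1, address field 0x200.
# Location 0x200 holds 0x350, the actual address of the operand.
memory = {0x200: 0x350}
regs = indirect_cycle({"MAR": 0, "MBR": 0x1200}, memory)
print(hex(regs["MAR"]), hex(regs["MBR"]))
```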



Data Flow (Indirect Cycle)



Data Flow (Interrupt)

Simple

Predictable

Current PC saved to allow resumption after interrupt

Contents of PC copied to MBR

MBR written to memory

PC loaded with address of interrupt handling routine

Next instruction (first of interrupt handler) can be fetched
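
A minimal sketch of the interrupt cycle follows. The slide does not say where in memory the PC is saved, so the sketch assumes a downward-growing stack addressed by a hypothetical `SP` register; the handler address is also made up.

```python
def interrupt_cycle(regs, memory, handler_address):
    """Save the current PC and redirect the PC to the interrupt handler."""
    regs["MBR"] = regs["PC"]            # contents of PC copied to MBR
    regs["SP"] -= 1                     # assumption: save location is the top of a stack
    regs["MAR"] = regs["SP"]            # address of the save location placed in MAR
    memory[regs["MAR"]] = regs["MBR"]   # MBR written to memory
    regs["PC"] = handler_address        # PC loaded with address of the handler routine
    return regs


memory = {}
regs = {"PC": 105, "MAR": 0, "MBR": 0, "SP": 1000}
interrupt_cycle(regs, memory, handler_address=500)
print(regs["PC"], memory)
```

After this cycle the next fetch reads the first instruction of the handler, and the saved value at the stack location allows the interrupted program to resume.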



Data Flow (Interrupt Cycle)



Instruction Pipelining



Prefetch

Fetch involves accessing main memory

Execution usually does not access main memory

So we can fetch next instruction during execution of current instruction

Called instruction prefetch



Two Stage Instruction Pipeline



Improved Performance

But not doubled:

Fetch usually shorter than execution

Prefetch more than one instruction?

Any jump or branch means that prefetched instructions are not the required instructions

Add more stages to improve performance
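
Why the speedup falls short of 2 can be checked with a small timing model. This is a sketch under stated assumptions: a single prefetch buffer, fixed fetch and execute times, and no branches. The function names and the sample timings are illustrative only.

```python
def sequential_time(n, fetch, execute):
    """Total time when each instruction is fetched, then executed, with no overlap."""
    return n * (fetch + execute)


def two_stage_time(n, fetch, execute):
    """Total time when the fetch of instruction i+1 overlaps the execution of instruction i.

    The first fetch and the last execute cannot be overlapped; in between,
    each step takes as long as the slower of the two stages.
    """
    return fetch + (n - 1) * max(fetch, execute) + execute


n = 100
for fetch, execute in [(1, 1), (1, 2)]:
    speedup = sequential_time(n, fetch, execute) / two_stage_time(n, fetch, execute)
    print(f"fetch={fetch} execute={execute} speedup={speedup:.2f}")
```

With equal stage times the speedup approaches 2 for long instruction sequences; when execution takes longer than fetch, the fetch stage idles and the speedup drops. Branches reduce it further because the prefetched instruction is discarded.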



Pipelining

Fetch instruction

Decode instruction

Calculate operands

Fetch operands

Execute instructions

Write result

Overlap these operations
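
Overlapping the six operations can be illustrated by generating a timing table. The sketch assumes every stage takes one time unit and that there are no branches or resource conflicts; the two-letter stage abbreviations and function names are chosen for this example.

```python
# Assumed abbreviations for the six stages listed above.
STAGES = ["FI", "DI", "CO", "FO", "EI", "WR"]


def pipelined_cycles(num_instructions, num_stages=len(STAGES)):
    """Time units needed when a new instruction enters the pipeline every time unit."""
    return num_stages + num_instructions - 1


def unpipelined_cycles(num_instructions, num_stages=len(STAGES)):
    """Time units needed when each instruction finishes all stages before the next starts."""
    return num_stages * num_instructions


def timing_diagram(num_instructions):
    """Return one row per instruction; column t holds the stage active at time unit t+1."""
    width = pipelined_cycles(num_instructions)
    rows = []
    for i in range(num_instructions):
        row = ["  "] * width
        for s, name in enumerate(STAGES):
            row[i + s] = name   # instruction i enters stage s at time unit i + s
        rows.append(row)
    return rows


for number, row in enumerate(timing_diagram(4), start=1):
    print(f"I{number}: " + " ".join(row))
print(pipelined_cycles(9), "vs", unpipelined_cycles(9), "time units for 9 instructions")
```

Under these assumptions nine instructions need 14 time units instead of 54.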



Timing Diagram for Instruction Pipeline Operation



The Effect of a Conditional Branch on Instruction Pipeline Operation



Six Stage Instruction Pipeline



Dealing with Branches

Multiple Streams

Prefetch Branch Target

Loop buffer

Branch prediction

Delayed branching



Multiple Streams

Replicates the initial portions of the pipeline and allows the pipeline to fetch both instructions that may follow a branch instruction

Have two streams

Prefetch each branch into a separate stream

Leads to bus & register contention

Multiple branches lead to further pipelines being needed

Despite these drawbacks, this strategy can improve performance

Used by IBM 3033 and IBM 370/168



Prefetch Branch Target

Target of branch is prefetched in addition to instructions following branch

Keep target until branch instruction is executed

Used by IBM 360/91



Loop Buffer

A small, very high-speed memory

Maintained by the instruction fetch stage of the pipeline

Contains the n most recently fetched instructions, in sequence

If a branch is to be taken, hardware first checks whether the branch target is in the buffer

Very good for small loops or jumps

Similar to a cache dedicated to instructions

Used by CRAY-1
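
The behaviour described above can be sketched as a small buffer keyed by instruction address. The `LoopBuffer` class, its method names, and the capacity of 4 are assumptions for illustration; a real loop buffer is a hardware structure, not a dictionary.

```python
from collections import OrderedDict


class LoopBuffer:
    """Holds the n most recently fetched instructions, oldest first."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.entries = OrderedDict()   # instruction address -> instruction

    def record_fetch(self, address, instruction):
        """Called by the fetch stage for every instruction it fetches."""
        self.entries[address] = instruction
        self.entries.move_to_end(address)
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)   # drop the oldest entry

    def lookup(self, target_address):
        """Return the buffered instruction at the branch target, or None on a miss."""
        return self.entries.get(target_address)


buffer = LoopBuffer(capacity=4)
for address in range(6):                        # fetch instructions at addresses 0..5
    buffer.record_fetch(address, f"instr@{address}")

print(buffer.lookup(3))   # short backward branch: target is still in the buffer
print(buffer.lookup(0))   # longer jump: target has already been evicted
```

A branch back to a recent address is served from the buffer without a memory access, which is why the technique suits small loops.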



Branch Prediction

Predict never taken

Assume that jump will not happen

Always fetch next instruction

Used by 68020 & VAX 11/780

Predict always taken

Assume that jump will happen

Always fetch from the branch target instruction

Predict by Opcode

Processor assumes that branch will be taken for certain branch opcodes and not for others

Can get up to 75% success
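
The three static strategies above can be compared on a branch trace. Everything in this sketch is hypothetical: the opcode names, the set of opcodes assumed to be taken, and the trace itself are invented, so the resulting percentages illustrate the mechanism and are not measurements.

```python
# Hypothetical set of opcodes that the processor assumes are taken.
TAKEN_OPCODES = {"LOOP"}


def predict_never_taken(opcode):
    return False


def predict_always_taken(opcode):
    return True


def predict_by_opcode(opcode):
    return opcode in TAKEN_OPCODES


def accuracy(predictor, trace):
    """Fraction of (opcode, actually_taken) pairs that the predictor gets right."""
    correct = sum(predictor(opcode) == taken for opcode, taken in trace)
    return correct / len(trace)


# Invented trace: a loop-closing branch that is mostly taken,
# and an error-check branch that is mostly not taken.
trace = [("LOOP", True)] * 4 + [("LOOP", False)] + [("BEQ", False)] * 4 + [("BEQ", True)]

for predictor in (predict_never_taken, predict_always_taken, predict_by_opcode):
    print(predictor.__name__, accuracy(predictor, trace))
```

All three are static: the prediction depends only on the instruction, never on how the branch behaved earlier in the run.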



Taken/Not taken switch

Based on previous history

One or more bits can be associated with each conditional branch instruction (CBI) that reflect the recent history of the instruction

Good for loops

Branch History Table

A small table is maintained for recently executed branch instructions with one or more bits in each entry

Good for loops

Delayed Branch

Pipeline performance may be improved by rearranging instructions within the program, so that branch instructions occur later.
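
A history-based predictor with "one or more bits" per branch can be sketched as a table of saturating counters. This is one common way to use the history bits, assumed here for illustration; the class name, the branch address, and the loop trace are made up, and the table has no size limit or eviction, unlike a real small table.

```python
class BranchHistoryTable:
    """Per-branch saturating counter; predicts taken when the counter is in its upper half."""

    def __init__(self, bits):
        self.max_count = (1 << bits) - 1
        self.threshold = (self.max_count + 1) // 2
        self.counters = {}   # branch address -> counter (simplification: unbounded table)

    def predict(self, branch_address):
        return self.counters.get(branch_address, 0) >= self.threshold

    def update(self, branch_address, taken):
        count = self.counters.get(branch_address, 0)
        if taken:
            count = min(count + 1, self.max_count)
        else:
            count = max(count - 1, 0)
        self.counters[branch_address] = count


def mispredictions(table, branch_address, outcomes):
    """Count wrong predictions while replaying the actual outcomes of one branch."""
    misses = 0
    for taken in outcomes:
        if table.predict(branch_address) != taken:
            misses += 1
        table.update(branch_address, taken)
    return misses


# A loop-closing branch: taken 4 times, then falls through; the loop is entered 3 times.
outcomes = ([True] * 4 + [False]) * 3
one_bit = mispredictions(BranchHistoryTable(bits=1), 0x40, outcomes)
two_bit = mispredictions(BranchHistoryTable(bits=2), 0x40, outcomes)
print(one_bit, two_bit)
```

With one bit, every loop exit flips the prediction, so the next loop entry is mispredicted as well. With two bits, a single fall-through does not change the prediction, so after warm-up only the exit itself is mispredicted.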



Branch Prediction Flowchart



Branch Prediction State Diagram



Dealing With Branches
