Pipelining

Learned in ECE222.

“Pipelining is at the core of compiler optimization”

https://www.youtube.com/watch?v=3l10o0DYJXg&ab_channel=PERLI ~ 7:10

Pipelining is a process that allows us to increase instruction Throughput (the number of instructions that can be performed in a given time).

Pipelining improves efficiency by executing multiple instructions simultaneously.

They use the analogy of doing your laundry to illustrate pipelining, which I really like.

It is a form of Parallelism.

With pipelining, the computer architecture allows the next instructions to be fetched while the processor is performing arithmetic operations, holding them in a buffer close to the processor until each instruction operation can be performed. The staging of instruction fetching is continuous. The result is an increase in the number of instructions that can be performed during a given time period.

They have data forwarding which gives greater performance.

Else there is this thing called NOPs which implements stalling.

Important not to get confused on

Pipelining does NOT decrease the execution time of an individual instruction (i.e. Latency). Rather, it improves performance by increasing instruction throughput.

Instruction throughput is the important metric because real programs execute billions of instructions. Also see CPU Performance for discussion about this?

RISC-V pipeline has 5 stages.

IF: Instruction fetch
ID: Instruction decode and register file read
EX: Execution or address calculation
MEM: Data memory access
WB: Write back

Implementation in RISC-V

We first explore the implementation ignoring any potential hazard. Let’s look at our Datapath and split it into the 5 stages:

Instructions are data generally move from left to right in the above diagram, with two exceptions:

The write-back stage, which places the result back into the register file in the middle of the Datapath
The selection of the next value of the PC, choosing between the incremented PC and the branch address from the MEM stage

Data flowing from right to left do not affect the current instruction; these reverse data movements influence only later instructions in the pipeline. Note that the first right-to-left flow of data can lead to data hazards and the second leads to control hazards.

We add registers to hold data so that portions of a single datapath can be shared during instruction execution.

Personal Insight: It’s kind of the same idea for Big-O Notation, there is this tradeoff between Space and Time. We make use of more space to make time much faster.

This is exactly the same diagram as the above.

However, there is a bug with the above pipelined implementation. During the WB stage, we need the register to write into, which is provided by the IF/ID pipeline register. Hence, we need to preserve the destination register number in the load instruction.

At this point, we are ready to talk about pipelining the Control Unit as well. Let’s see which control lines are used for each stage of the pipeline:

IF: Assert read instruction memory and write the PC control signals
ID: Nothing special to control
EX: ALUOp and ALUSrc need to be set
MEM: Branch, MemRead and MemWrite
WB: MemToReg, and RegWrite

So we can have the following:

Now, remember that there are 7 control lines (8 control signals since ALUOp takes 2 signals) for the Control Unit that we saw in the non-pipelined version

This forwarding is the same idea as when we talked about the bug that was introduced, during the WB stage, where we need the value of the RegisterRd.

See below for the actual control signals.

There are going to be hazards, such as when you write to a register, and access it right after. If you implement pipelining, you will get the wrong value. Take a look at Pipeline Hazard to understand how we resolve these issues.

🛠️ Steven Gong

Table of Contents

Pipelining

Important not to get confused on

Implementation in RISC-V

Graph View

Backlinks