memcon: tuningddr4 for power...

30
Tuning DDR4 for Power and Performance Mike Micheletti Product Manager Teledyne LeCroy

Upload: phungkien

Post on 06-Feb-2018

229 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Tuning DDR4 for Power and Performance

Mike MichelettiProduct Manager Teledyne LeCroy

Page 2: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Agenda

Introduction – DDR4 Technology Expanded role of MRS Power Features Examined Reliability Features Examined Performance Features Examined

8/22/2013 2

Page 3: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

DDR4 Goals & Motivations

Spec development started in 2005; Officical JEDEC release Aug 2012

• Up to 3.2 Gbps (per pin) 2x Bandwidth

• Single Ended Signaling• Similar clockingEvolutionary Path

• 8 Bit prefetch, same core frequency Lower Cost

• 30-40% power saving (vs DDR3L), • tCAL, LP-ASR, etc.. Power Savings

• C/A parity, CRC, MPR readout, etc…Improved Reliability

Analysts: 50% market penetration by 2015/2016

Page 4: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Power

Reliability (RAS)

Performance

Signalling

Test

New DDR4 Features Categorized

4

TCSRTCARCS to CMD Latency (TCAL) VDDQ TermMax Power Saving Mode0.5KB Page sizeDBI3DS

Write CRCCA ParityMultipurpose Register (MPR) Readout

2133 to 3200 MT/s signalingBank GroupsFine Granularity RefreshSelf Refresh Abort

Gear Down ModeInternal Vref DQDQ Training with MPRPer DRAM Addressability

Page 5: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

DDR4 Compared to DDR3

8/22/2013 5

Spec Items DDR3 DDR4Density    /  Speed 512Mbp~8Gb  

1.6~2.1Gbps2Gb~16Gb1.6~3.2Gbps

Voltage(VDD/VDDQ/VPP)

1.5V/1.5V/NA(1.35V/1.35V/NA)

1.2V/1.2V/2.5V

Vref External Vref (VDD/2) Internal Vref (Req. training)

Interface Data IO CTT (34 ohm) POD (34 ohm)CMD/ADDR IO CTT CTT

Strobe Bi‐dir / differential  Bi‐dir / differential# of banks 8 banks 16 banks (4 BG)

Core Page size(x4/8/16) 1KB / 1KB / 2KB 512B / 1KB / 2KBarchitecture # prefetch 8 bits 8 bits

Added functions RESET/ZQ/Dynamic ODT + CRC/DBI/Multi preamble Physical Package (x4,8/x16) 78 / 96 BGA 78 / 96 BGA

Page 6: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

DDR4: Command Encoding

68/22/2013

Page 7: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Testing DDR4 Protocol

Fast, Easy Connection & Setup No Calibration needed

Comprehensive Bus Analyzer for DDR3 & DDR4 Traditional Waveform & State

Listings Real-Time JEDEC Error

Triggering Detects over 65 JEDEC bus &

timing violations

Page 8: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

New Features enabled with MRS: Auto-Self Refresh / Low Power Auto Self Refresh CRC and C/A Parity Error Check Host Tx / Rx Training Pattern Per DRAM addressability (PDA) Internal DQ Vref per DRAM Gear-down mode (for C/C/A) Dynamic ODT CAL mode

8/22/2013 Company Confidential 8

Page 9: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

DDR4 Mode Register Set (MRS) Overview

8/22/2013 © 2013; AMD 9

MPR Read Format

DLL always Enabled

CRC Clear & Parity Error Status

Page 10: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Key Enhancement: DQ Training with MPR DDR4 allows custom patterns for DQ training

Host uses MR3 [A2=1] command to initiate DQ Training READ BA[1:0] defines the MPR Location (pattern)

8/22/2013 10

Page 11: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Performance Features: DQ Training Sequence

8/22/2013 11

Page 12: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

READ MPR0 (default pattern) Location 0 Back-to-Back Read from MPR is allowed with tCCD=4 nCK for seamless operation

8/22/2013 12

Page 13: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

DDR4: Power Features

Reduced Voltages (1.2V) VDDQ Termination (POD) External Vpp Dynamic Bus Inversion (DBI) 0.5KB Page size Temperature controlled Refresh (SR) Low Power Auto-self Refresh (LP ASR) CS to CMD Latency (tCAL) Max Power Saving Mode (MPSM)

8/22/2013 13

Page 14: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Power Features: VDDQ Termination

8/22/2013 © 2013; Inphi Corporation 14

Pseudo Open Drain (POD)Signaling

DDR3 utilizes center tap termination DDR4 utilizes VDDQ termination

• “Pseudo open drain” signaling• Reduces IO current draw

DBI: minimize number of zeroes Increase % of bits stored as “1” Improves Performance & Signal

integrity Lower “Synchronously switching

output” noise

Page 15: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Power Features: External Vpp• External Vpp for Word-line Voltage

DDR3 utilizes on-die voltage pump to generate higher word line voltage DDR4 utilizes Separate Vpp voltage rail

Externally supplied Vpp @ 2.5V enables more energy efficient memory system

Reduces voltage draw & die space

8/22/2013 © 2013; Inphi Corporation 15

Page 16: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Command Address Latency (CAL) Command and Address receivers disabled (MR4) CS# used to wakeup the receivers CMD and ADDR sent after a delay of tCAL (latency 3 clocks at 2.1GT/s)

8/22/2013 16

Power savings:

• 23% for Idd2n,

• 10% for Idd0

• 13% TDP (dual rank DIMM’s)

Page 17: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Command Address Latency (CAL) Switching Ranks adds CAL Latency CAL mode introduces more latency in multi-threaded IO

8/22/201317

Rank 0

Rank 1

Page 18: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Command Address Latency (CAL) CAL mode is better for sequential IO operations

Only impacts DRAM when exiting from IDLE

8/22/201318

Rank 0

Rank 1

Page 19: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Power Savings: Server DDR4 vs. DDR3 (Heavy Utilization)

© 2013; Intel Corporation 19

DDR4 results based on Intel projected values for IDD.DDR3x results based on supplier provided Idd values.

Page 20: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

DDR4: Features

Reliability CRC on Writes MPR Error Log Command / Address Parity check

8/22/2013 20

Page 21: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

CMD / ADDR Parity Checking When enabled – SDRAM verifies parity before executing the command

Command and the address lines only Additional delay (parity latency) for tMRD & tMOD (4 to 6 CLKs)

PL ranges from 4nCK to 6nCK, depending on clock rate

8/22/2013 © 2013, Samsung Electronics 21

PL+6ns48 to 96 nCKs @ 2133

Page 22: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

CMD / ADDR Parity Error Detection

8/22/2013 22

Controller sees ALERT = “LOW”

for >“48” nCK

4 CLKs

Page 23: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

DDR4: Features

Performance Signaling 1066MHz to 1.6GHz (2133 to 3200 MTs) Training - Preamble training; Internal DQ Vref Gear down mode - For speeds above 2666 MT/s CMD/CTR/ADDR sent at 2t Timing

Bank Groups8/22/2013 23

Page 24: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Bank Groups: DDR4 Similar latency….but higher data rates

So more requests must be kept in-flight to realize higher bandwidth DDR4 supports16 banks divided into 4 bank groups

4 Bank Groups at x4 & x8 2 Bank Groups at x16

8/22/2013 © 2013; Inphi Corporation 24

Separate IO gating structuresallow faster Write-to-Read turnaround between BG

Page 25: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Bank Group RRD_L, CCD_L, WTR_L Violations

Bank Groups require higher latency between ACTIVATE to same BG1600 1866 2133 2400

© 2013; Inphi Corporation

Page 26: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

tRRD-L Violation Check

8/22/2013 26

Page 27: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Verification Issues: “Row Hammer”

8/22/2013 27

A bank of memory is loaded with valid data (green bits) If one row is repeatedly activated (aggressor) in a single

refresh cycle, as the charge decays, crosstalk flips a bit in the neighboring rows (victim).

REFRESH cannot recover the data.

Aggressive row activations can corrupt adjacent rows

Page 28: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Row Usage Report

8/22/2013 28

Page 29: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

DDR4 Features: ScorecardFeature Server Workstation Mobile

0.5KB Page sizeTemperature controlled Refresh (SR)Low Power Auto‐self Refresh (LP ASR)CS to CMD Latency (tCAL)Data Bus Inversion (DBI)Training Bank Groups Gear down mode CRC on Writes Command / Address Parity check

8/22/2013 29

Page 30: Memcon: TuningDDR4 for Power Performancecdn.teledynelecroy.com/.../tuningddr4_for_power_performance.pdf · DQ Training with MPR ... If one row is repeatedly activated ... Memcon:

Questions >?

8/22/2013 30