power-temperature stability and safety analysis for

1
Power-Temperature Stability and Safety Analysis for Multiprocessor Systems Conclusions Motivation Personalized Social Computing § Modern mobile platforms integrate multiple CPUs, GPUs and accelerators § Power consumption and resulting heat dissipation is a major problem for mobile platforms [1] - High temperature affects user experience and reliability - Power and temperature form a positive feedback system § Uncontrolled temperature can also cause safety risks Fixed Point Illustration Thermal and Power models Our Contributions Temperature to Power Constraint Experimental Results § Time granularity of predictions Fixed Point Analysis Ganapati Bhat, Umit Y. Ogras Arizona State University Suat Gumussoy IEEE Member Higher temperature Positive feedback Leakage Dynamic Power consumption § A complete heat-up/cool-down cycle on Exynos 5422 SoC 1 2 3 4 § This work*: 1.Derives the necessary and sufficient conditions for the existence of the fixed point 2.Proves the stability of the fixed point(s) and region of convergence 3.Finds the maximum dynamic power consumption to guarantee a thermally safe temperature 4.Is validated empirically on 8-core big.LITTLE platform Unstable fixed point Stable fixed point Thermal limit Illustration § Theoretical stability analysis for power-temperature dynamics Existence: Derived sufficient and necessary conditions Stability: Found region of stability Safety: Derived maximum allowed dynamic power § Our technique takes 75.2 μs when integrated into Android OS § We validated our approach empirically on the Odroid-XU3 board § On the right side, the prediction easily adapts to change in power due to throttling § Average prediction error is only 2.3 °C § Worst case error is 5.8 °C – When the temperature fixed point is ~100 °C, we did not run the experiment long enough to reach it 1. Calculate the auxiliary temperature as " =− 2. Given " , find as = 1 " 1− +, " " 3. Finally, using we find / as / = −1 Maximum Allowed Dynamic Power (W) Temperature Constraint Input () § Maximum allowed power consumption for various temperature constraints – Power constraint increases linearly with temperature constraint § Lemma 2: Value of " in the temperature iteration increases when " <0, and decreases when " >0 § Theorem 2: Stability of fixed points is as follows – When " has no solution, the temperature iteration diverges, i.e, " → 0 ( → ∞) – Where there are two fixed points, " ; 0, " > is unstable and " ? " > , 1/ is stable. § Lemma 1: " satisfies the following properties " is concave in the interval " ∈ (0, A B ) " has a unique maxima at " > given by " > = 1 2 −1+ 1 4 +1 " is increasing in (0, " > ) and decreasing in ( " > , A B ) § Lemma 2: Value of " in the temperature iteration increases when " <0, and decreases when " >0 § Theorem 2: Stability of fixed points is as follows – When " has no solution, the temperature iteration diverges, i.e, " → 0 ( → ∞) – Where there are two fixed points, " ; 0, " > is unstable and " ? " > , 1/ is stable. § Temperature at a future time step can be written as A +1 +1 I +1 I×A = I×I A I I×A + I×M A M M×A models the effect of temperature in time step in time step +1 models the effect of each power source on the temperature where ?O,P is the switching capacitance, P is the voltage, P is the frequency, T,P is the gate leakage current and A,P , ’,P are leakage power parameters for the VW resource /,P is the temperature-independent component of power consumption § At steady state (→∞) we have = + where 0<<1 and >0 are parameters of reduced order SISO system § Using change of variables we can write, " 1 − " = +, " § Taking logarithm on both sides, " = ln+ ln " + ln(1 − " )+ " =0 § Power consumption can be expressed as the sum of dynamic and temperature dependent leakage P = ?O,P P P + T,P P + P A,P P Z [,\ , \ P = /,P + P A,P P Z [,\ , \ Fixed Point Function § Nonlinear MIMO system due to exponential terms in P No closed-form solutions for this nonlinear system § Solution approach Reduce to SISO model to find an initial estimate of fixed points Then, use Newton’s method to solve the MIMO system 0 1 2 3 4 5 6 7 Prediction Error (°C) References [1] Gaurav Singla, et al., “Predictive Dynamic Thermal and Power Management for Heterogeneous Mobile Platforms,” in Proc. of Design Automation and Test in Europe Conference, March 2015. [2] Ganapati Bhat, Suat Gumussoy, and Umit Y. Ogras. “Power-Temperature Stability and Safety Analysis for Multiprocessor Systems,in ACM Tran. on Embedded Comp. Sys. (ESWEEK Special Issue), October 2017. * The details can be found in [2]

Upload: others

Post on 15-Mar-2022

5 views

Category:

Documents


0 download

TRANSCRIPT

Power-Temperature Stability and Safety Analysis for Multiprocessor Systems

Conclusions

Motivation

Personalized Social Computing

§ Modern mobile platforms integrate multiple CPUs, GPUs and accelerators

§ Power consumption and resulting heat dissipation is a major problem for mobile platforms [1]- High temperature affects user

experience and reliability- Power and temperature form a positive

feedback system§ Uncontrolled temperature can also cause

safety risks

Fixed Point Illustration

Thermal and Power models

Our Contributions

Temperature to Power Constraint

Experimental Results§ Time granularity of predictions

Fixed Point Analysis

Ganapati Bhat, Umit Y. OgrasArizona State University

Suat GumussoyIEEE Member

Higher temperature

Positive feedback

Leakage Dynamic

Power consumption

§A complete heat-up/cool-down cycle on Exynos 5422 SoC

1

2

3

4

§This work*:1.Derives the necessary and sufficient conditions for the existence of the

fixed point2.Proves the stability of the fixed point(s) and region of convergence3.Finds the maximum dynamic power consumption to guarantee a

thermally safe temperature4.Is validated empirically on 8-core big.LITTLE platform

Unstable fixed point

Stable fixed point

Thermal limit Illustration

§ Theoretical stability analysis for power-temperature dynamics–Existence: Derived sufficient and necessary conditions–Stability: Found region of stability–Safety: Derived maximum allowed dynamic power

§ Our technique takes 75.2 μs when integrated into Android OS§ We validated our approach empirically on the Odroid-XU3 board

§ On the right side, the prediction easily adapts to change in power due to throttling

§ Average prediction error is only 2.3 °C

§ Worst case error is 5.8 °C– When the temperature fixed

point is ~100 °C, we did not run the experiment long enough to reach it

1. Calculate the auxiliary temperature as

𝑇"∗ = −𝜅'𝑇∗

2. Given 𝑇"∗, find 𝛼∗ as

𝛼∗ =1𝑇"∗

1 −𝑒+,"∗

𝛽𝑇"∗

3. Finally, using 𝛼∗ we find 𝑃/∗ as

𝑃/∗ = 𝑎 − 1𝑏 𝛼∗𝜅'

Max

imum

Allo

wed

Dyn

amic

Po

wer

(W)

Temperature Constraint Input (℃)

§ Maximum allowed power consumption for various temperature constraints– Power constraint increases linearly with

temperature constraint

§ Lemma 2: Value of 𝑇" in the temperature iteration increases whenℱ 𝑇" < 0, and decreases when ℱ 𝑇" > 0

§ Theorem 2: Stability of fixed points is as follows– When ℱ 𝑇" has no solution, the temperature

iteration diverges, i.e, 𝑇" → 0(𝑇 → ∞)– Where there are two fixed points, 𝑇"; ∈0, 𝑇"> is unstable and 𝑇"? ∈ 𝑇">, 1/𝛼 is

stable.

§ Lemma 1: ℱ 𝑇" satisfies the following properties–ℱ 𝑇" is concave in the interval 𝑇" ∈ (0, AB)–ℱ 𝑇" has a unique maxima at 𝑇"> given by

𝑇"> =12𝛼 − 1 +

14𝛼' + 1

–ℱ 𝑇" is increasing in (0, 𝑇">) and decreasing in (𝑇">,

AB)

§ Lemma 2: Value of 𝑇" in the temperature iteration increases whenℱ 𝑇" < 0, and decreases when ℱ 𝑇" > 0

§ Theorem 2: Stability of fixed points is as follows– When ℱ 𝑇" has no solution, the temperature

iteration diverges, i.e, 𝑇" → 0(𝑇 → ∞)– Where there are two fixed points, 𝑇"; ∈ 0, 𝑇">

is unstable and 𝑇"? ∈ 𝑇">, 1/𝛼 is stable.

§ Temperature at a future time step can be written as𝑇A 𝑘 + 1𝑇' 𝑘 + 1

⋮𝑇I 𝑘 + 1 I×A

= 𝐴I×I

𝑇A 𝑘𝑇' 𝑘⋮

𝑇I 𝑘 I×A

+ 𝐵I×M

𝑃A 𝑘𝑃' 𝑘⋮

𝑃M 𝑘 M×A

– 𝐴 models the effect of temperature in time step 𝑘 in time step 𝑘 + 1

– 𝐵 models the effect of each power source on the temperature

where𝐶?O,P is the switching capacitance, 𝑉P is the voltage, 𝑓P is the frequency, 𝐼T,P is the gate leakage current and 𝜅A,P , 𝜅',P are leakage power parameters for the 𝑖VWresource 𝑃/,P is the temperature-independent component of power consumption

§ At steady state (𝑘 → ∞) we have𝑇 = 𝑎𝑇 + 𝑏𝑃

where 0 < 𝑎 < 1 and 𝑏 > 0 are parameters of reduced order SISO system

§ Using change of variables we can write,𝛽𝑇" 1 − 𝛼𝑇" = 𝑒+,"

§ Taking logarithm on both sides,ℱ 𝑇" = ln 𝛽 + ln 𝑇" + ln(1 − 𝛼𝑇") + 𝑇" = 0

§ Power consumption can be expressed as the sum of dynamic and temperature dependent leakage

𝑃P = 𝐶?O,P𝑉P'𝑓P + 𝐼T,P𝑉P + 𝑉P𝜅A,P𝑇P'𝑒Z[,\,\

𝑃P = 𝑃/,P + 𝑉P𝜅A,P𝑇P'𝑒Z[,\,\

Fixed Point Function§ Nonlinear MIMO system due to exponential

terms in 𝑃P 𝑘– No closed-form solutions for this nonlinear system

§ Solution approach– Reduce to SISO model to find an initial estimate of

fixed points– Then, use Newton’s method to solve the MIMO system

01234567

Pred

ictio

n Er

ror (

°C)

References[1] Gaurav Singla, et al., “Predictive Dynamic Thermal and Power Management for Heterogeneous Mobile Platforms,” in Proc. of Design Automation and Test in Europe Conference, March 2015.[2] Ganapati Bhat, Suat Gumussoy, and Umit Y. Ogras. “Power-Temperature Stability and Safety Analysis for Multiprocessor Systems,” in ACM Tran. on Embedded Comp. Sys. (ESWEEK Special Issue), October 2017.

*Thedetailscanbefoundin[2]