climate prediction coprocessor

Upload: max-power

Post on 03-Jun-2018

220 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/12/2019 Climate Prediction Coprocessor

    1/26

    Grand Challenge Problems : Computational Climate Prediction

    A possible solution :

    A Low Complexity Dynamic Grid Coprocessorfor Distributed Computing

    Do you see any design logic or any other !ind of error"Do you see any design logic or any other !ind of error"

    ContactContact

    Climate Prediction CP#Climate Prediction CP#

    dist$% at &uno dot comdist$% at &uno dot com

    http:''hireme(gee!(n)'Climate Prediction Coprocessor

    http://hireme.geek.nz/http://hireme.geek.nz/http://hireme.geek.nz/
  • 8/12/2019 Climate Prediction Coprocessor

    2/26

  • 8/12/2019 Climate Prediction Coprocessor

    3/26

    5ow is modern numerical forecasting done"

    *umerical atmosphere models are initiali)ed using data collected from radiosondes

    weather satellites and surface weather obser,ations(

  • 8/12/2019 Climate Prediction Coprocessor

    4/26

    Contrasts (goals) *;Ps .predict weather/ GC0s .predict climate/

    4patial co,erage 8egional or Global Global

    9>> !m$/ #sually H $>> !m$

    8ele,ance of initial conditions 5igh 0edium to Low

    8ele,ance of clouds radiation 5igh 0edium

    8ele,ance of surface .land iceocean(((/

    0edium to Low 5igh

    8ele,ance of ocean dynamics Low 5igh

    8ele,ance of model stability 0edium to Low 5igh

    minutes to I> minutes

    Physics 3uations of motion .plus radiati,e transfer e3uations waterconser,ation e3uations ((/

    0ethods 7inite difference expression of continuous e3uations or spectralrepresentationJ run prognostically

    0aximum time step Controlled by spatial resolution .C7L condition/

    http:''hireme(gee!(n)'Climate Prediction Coprocessor

    http://hireme.geek.nz/http://hireme.geek.nz/http://hireme.geek.nz/
  • 8/12/2019 Climate Prediction Coprocessor

    5/26

  • 8/12/2019 Climate Prediction Coprocessor

    6/26

    non local

    ,orxel readsper

    computation

    but 9$ local

    reads

    .M>*N $ML/ R K9I

    edge ,orxel mapping

    cost K=

    .in the form of a $

    x M array/

    *>9 *>$ *>% *>M *>= *>I

    *>Q 9 $ % M *>

    *> = I Q *9>

    *99 *9$ *9% *9M *9= *I

    S.%/ neighbors',orxelT x MO

    S.=/ neighbors',orxelT x M

    .$> O 9$/ R %$

    non local

    reads per

    computationbut .9$ O $>/

    R %$ local

    reads

    .%$* %$L/ R K> edge

    ,orxel mapping cost

    KI

  • 8/12/2019 Climate Prediction Coprocessor

    7/26

    Current 0odels and 0odeling

    4o simulate the entire system on your computer N that will wor! right"

    >( *ot one of these apps uses the distributed groups of ,orxelsapproach in spite of its possible ad,antages(

  • 8/12/2019 Climate Prediction Coprocessor

    8/26

    5adC0% Coupling

  • 8/12/2019 Climate Prediction Coprocessor

    9/26

    A problem in search of a CP#

    A comparison between global and regional resolution in climate modeling

    http:''hireme(gee!(n)'Climate Prediction Coprocessor

    http://hireme.geek.nz/http://hireme.geek.nz/http://hireme.geek.nz/
  • 8/12/2019 Climate Prediction Coprocessor

    10/26

    xample of C#DA processing flow

    9( Copy data from main memory to GP# memory

    $( CP# instructs the process to GP#

    %( GP# execute parallel in each core

    M( Copy the result from GP# memory to main memory

  • 8/12/2019 Climate Prediction Coprocessor

    11/26

    7lynn-s taxonomy

    0ichael ?( 7lynn created one of the earliest classification systems for parallel .and

    se3uential/ computers and programs now !nown as 7lynn-s taxonomy( 7lynn classified

    programs and computers by whether they were operating using a single set or multiple

    sets of instructions whether or not those instructions were using a single or multiple setsof data(

  • 8/12/2019 Climate Prediction Coprocessor

    12/26

    +nstructionle,el parallelism

    A canonical fi,estage pipeline in a 8+4C machine

    +7 R +nstruction 7etch

    +D R +nstruction Decode

    E R xecute

    00 R 0emory access

    ;6 R 8egister write bac!/

    A fi,estage pipelined superscalar processor capable of issuing two instructions per

    cycle( +t can ha,e two instructions in each stage of the pipeline for a total of up to 9>instructions .shown in green/ being simultaneously executed(

    http:''hireme(gee!(n)'Climate Prediction Coprocessor

    http://hireme.geek.nz/http://hireme.geek.nz/http://hireme.geek.nz/
  • 8/12/2019 Climate Prediction Coprocessor

    13/26

    *oteworthy application specific coprocessors

    D;a,e was spun out of the #ni,ersity of 6ritish Columbia .#6C/ in 9 to

    commerciali)e superconductorbased 3uantum computer processors(

    Xuantum computers .XCs/ use 3uantum mechanics .X0/ the rules that underlie the

    beha,ior of all matter and energy to accelerate computation( +t has been !nown for

    some time that once some simple features of X0 are harnessed machines will be built

    capable of outperforming any concei,able con,entional supercomputer( XCs are not &ust

    inherently faster than con,entional computers( >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>

    times more difficult to sol,e than a typical %>electron system(

    As of the late$>>>s most %> electron 3uantum systems ma!e most highend

    supercomputers cho!e or outright fail(

  • 8/12/2019 Climate Prediction Coprocessor

    14/26

    *oteworthy generali)ed coprocessors

    Asynchronous Array of Simple Processors (AsAP) project

    >=( Current data suggests that this is the highest cloc! rate fabricated

    processor designed in any uni,ersity(

    A 9% mm x 9% mm chip utili)ing the exact same design in > nm C0B4would contain more than 9>>> processors and be capable of more than 9

  • 8/12/2019 Climate Prediction Coprocessor

    15/26

    Another possible CP# solution originating from the Game Console trade

    The GM107 GPU contains one GPC, five Maxwell Streaming Multiprocessors SMM!,

    an" two #$%&it memor' controllers 1()%&it total!*

    This is the full implementation of the chip, an" is the same configuration we ship with

    the Ge+orce GT 7-0 Ti*

    http:''hireme(gee!(n)'Climate Prediction Coprocessor

    http://hireme.geek.nz/http://hireme.geek.nz/http://hireme.geek.nz/
  • 8/12/2019 Climate Prediction Coprocessor

    16/26

    http:''hireme(gee!(n)'Climate Prediction Coprocessor

    http://hireme.geek.nz/http://hireme.geek.nz/http://hireme.geek.nz/
  • 8/12/2019 Climate Prediction Coprocessor

    17/26

  • 8/12/2019 Climate Prediction Coprocessor

    18/26

    Climate Prediction Coprocessor: Design Goals Z Philosophy .o,er,iew/

    Create a 9>> 05) coprocessor that is capable of running a global climate simulation at aresolution of 9W x 9W .or less K%I>$R 9$I>> ,orxels/ with a complete model update rate of 9

    5)(

    A computable ,orxel here is considered to be something li!e

    8ecord S real data[current19$2J real data[pre,ious19$2J T '' 8eal R %$ bit signed real number

    #8 with a fabrication cost for the chip itself of $> #8(

  • 8/12/2019 Climate Prediction Coprocessor

    19/26

    Climate Prediction Coprocessor: Design Goals Z Philosophy

    in future( be controlled by a Power PC I>9 Core or whate,er is also used by

    http:''hireme(gee!(n)'Climate Prediction Coprocessor

    http://hireme.geek.nz/http://hireme.geek.nz/http://hireme.geek.nz/
  • 8/12/2019 Climate Prediction Coprocessor

    20/26

    O>(>/ x x 9 .super,isory/

    *BP *A *A $

    CALL4#6 *A *A 9

    8 or >(>/ *B< >T x x %

    CALL 4#6 +7 8G 0AE or 0+* x x M

    A*D .9/ *A x 9

    *A*D .9/ *A x $

    B8 .9/ *A x %

    *B8 .9/ *A x M

    EB8 .9/ *A x =

    *B< .9/ *A x I

    8otate Left .9/ *A x 9

    8otate 8ight .9/ *A x $

    4hift Left .9/ *A x %

    4hift 8ight .9/ *A x M

    ADD x x 9

    4#6

  • 8/12/2019 Climate Prediction Coprocessor

    21/26

    Instruction 40 bit float 40 bit signed int Total of Instructions

    sin.x/ 1degrees2 x *A 9

    cos.x/ 1degrees2 x *A $

    Log[9>.x/ x *A %

    xp.x/ x *A M

    abs.x/ x x =

    +n,erse x *A I

    DC

  • 8/12/2019 Climate Prediction Coprocessor

    22/26

    Climate Prediction Coprocessor: +nstruction 4et Architecture .o,er,iew/

    6+< ] >9 >$ >% >M >= >I >Q > > 9> 99 9$ 9% 9M 9= 9I 9Q 9 9 $>

    +nstruction

    *BP 9 9 9 > > > > 9 9 9 9 9 9 9 9 9 9 9 9 9*BP 9 9 9 9 > > > > > > > > > > C C C C C C

    84< > > > > > > > 4 4 4 4 4 4 > > > > > > >

    84< > > > 9 > > > 4 4 4 4 4 4 > > > C C C C

    CALL > > 9 > ] ] ] 0 0 0 0 0 0 0 0 0 C C C C

    8 > 9 > ] ] ] 0 0 0 0 0 0 0 0 0 C C C C

    0A 9 > E ] ] ] 4 4 4 4 4 4 D D D D D D C

    0A 9 > E ] ] ] 4 4 4 4 4 4 > > > > > > C

    LBG+C > 9 > E ] ] ] 4 4 4 4 4 4 D D D D D D C

    LBG+C .\/ > 9 > E ] ] ] 4 4 4 4 4 > > > > > > > C

    D4P./ > 9 9 E ] ] ] 4 4 4 4 4 4 0 0 0 0 0 0 C

    D4P./ > 9 9 E ] ] ] 0 0 0 0 0 0 D D D D D D C

    Climate Prediction Coprocessor: +nstruction 4et Architecture .o,er,iew/

    \ ;here source and destination registers are the

    same(

    ] +nstructions from the instruction set(

    pro,isional instruction format

    0 Absolute Address 6it .local memory only/(

    S4 4ource 8egister D Destination 8egisterT

    C or ConditionJ ,aries depending on instruction asto its meaning

    Pri,ileged instruction mode E R *o pri,ilegedinstructions in this mode

    > instructions to be fit into %$! $I$9> instructions to be held in IM!

    =$M>> instructions to be held in 9$!

    CALL: ?ump to memory address either

    subroutine or blind(

    0A>>>>> then 9

    *BP state else increment by CCCCCCJ this

    allows up to IM wait sates to be issued in oneinstruction(

    84 or O>(>/ CCC R>>> if applicable to register indicated else

    repeat operation for register O9 .CCC R >>9/ to OQ

    .999/(

    *E bit call should be supported( *E stands for

    *o eEecute( *E technology is used in GP#s to

    segregate areas of memory for use by eitherstorage of processor instructions .or code/ or for

    storage of data( +t is an important CP# security

    feature to pre,ent execution of data(

    http:''hireme(gee!(n)'Climate Prediction Coprocessor

    http://hireme.geek.nz/http://hireme.geek.nz/http://hireme.geek.nz/
  • 8/12/2019 Climate Prediction Coprocessor

    23/26

    Climate Prediction Coprocessor: Architecture .register si)es/

    #egister type #egister $idth #egister format %otes

    7loating point M> 7> to 7%9 7>(9 9stbyteJ 7>($ $ndbyte(((

    4igned +nt M> 4> to 4%9 4>(9 9stbyteJ 4>($ $ndbyte(((

    Address registers $> A> to AQ 7or local address math

    Code ' Data Demarcation $> AA Allow 9! to I%! for program or data

    xception register 9$ 8G . bits/ O 88 .M bits/

    5ow should the Power PC ,iew the indi,idual coprocessors .,ia the Coprocessor +nterface #nit/"

    General Principles

  • 8/12/2019 Climate Prediction Coprocessor

    24/26

    G6 5ard Dis! that only powers up e,ery M hours for data bac!up synchroni)ation thernet Port

    ideo Card for setup and monitoring

    M #46 Ports

    http:''hireme(gee!(n)'Climate Prediction Coprocessor

    http://hireme.geek.nz/http://hireme.geek.nz/http://hireme.geek.nz/
  • 8/12/2019 Climate Prediction Coprocessor

    25/26

    5elp needed5elp needed>>>> 4D8s4D8sto become ,iable(to become ,iable(

    Contact me on ban! transfer information if you wish to donate money to thisContact me on ban! transfer information if you wish to donate money to this

    research pro&ect(research pro&ect(

    Climate Prediction .as well as fluid dynamics of the outer planets and the earth-sClimate Prediction .as well as fluid dynamics of the outer planets and the earth-s

    magnetic field/ will remain an insurmountable computational problem for manymagnetic field/ will remain an insurmountable computational problem for many

    more decades unless a CP# is designed that can cope with the tas! with as littlemore decades unless a CP# is designed that can cope with the tas! with as little added complexity to the host system as possible(added complexity to the host system as possible(

  • 8/12/2019 Climate Prediction Coprocessor

    26/26

    Created by Briginal Concept Last 0odified Last 8e,ision 8e,ision 4tate

    0ax Power $$ *o,ember $>> $= 7ebruary $>9M 4ee (==rQ

    8e,ision 9M Add $ GP# types delete 8andom instruction fix readability