ramp-white hari angepat derek chiou university of texas at austin

21
RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Post on 21-Dec-2015

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

RAMP-White

Hari AngepatDerek Chiou

University of Texas at Austin

Page 2: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Motivation

Coherent shared memory multiprocessor simulator

Support for existing programming models• Operating system support• Single image

• Programming libraries• Legacy applications

2 RAMP-White

Page 3: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Outline

Leon3 Integration• Default architecture• RAMP-White architecture• Baseline Design features• Status

PPC405 MP OS Support• Plan9 Operating System• Porting Status

RAMP-White3

Page 4: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Support for Leon3

Previous design utilized PPC405 hard-cores• Dual-node, multi-image Linux kernels• Segmented global address space

Added support to use Leon3/Grlib components

RAMP-White4

Page 5: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Leon3 Default Architecture

RAMP-White5

Leon 3Mst Slv DbgInt

AHB Bus

MPIntCntrl DSU Eth DDR

Leon 3Mst Slv DbgInt

Original Leon3MP design is shared bus model• Processor cores share AHB bus• Periperals connected to AHB• Interrupt and DSU have direct processor links

Page 6: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

RAMP-White System Architecture

RAMP-White6

Leon 3Mst Slv DbgInt

Leon3 shim

MPIntCntrl DSU Eth DDR2

Leon 3Mst Slv DbgInt

AHB bus

Leon3 shim

Intersection Unit NIU Intersection

UnitNIURouter Router

Adapt bus interfaces to a point-to-point connection scheme

DDR2

AHB busAHB shim AHB shim

Page 7: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Processor Support

IBM PowerPC 405• Hard-core, 300Mhz• Non-coherent I/D caches• PLB Bus interface• Uni-Processor Linux

Gaisler Leon3• Soft-core, ~65Mhz• Write-through snoopy-cache-coherent• AMBA AHB bus interface• SMP Linux

7 RAMP-White

Page 8: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Default Leon3 Core Interfaces

4 bi-directional channels• Master bus interface• Services icache/dcache fills

• Slave bus interface• Invalidates dcache entries via snooping address bus

• Interrupt channel• Driven from multiprocessor interrupt controller

• Debug channel• Grmon DSU interface

8 RAMP-White

Page 9: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Integrating Off-the-Shelf Processor

Generally there is a tight coupling between pipeline/cache/bus-interface• Thus, prefer to keep existing port interface• Also provides forward compatibility with soft-cores

Therefore, add bus shim to convert from processor-specific bus to White connections

9 RAMP-White

Leon 3Mst Slv DbgInt

AHB shim Int/Dbg shim

Page 10: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Processor Adaptation Issues

Increased FPGA resources• Support for RAMP-White infrastructure

Performance impact• Request/reply interaction adds latency

Platform Detection• Static mapping of bus connected devices• Linux/Grmon use configuration registers to detect

platform configuration at run-time• If devices on are connected indirectly, must populate

mapping correctly10 RAMP-White

Page 11: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Baseline Design

Dual Leon3 cores with bus shims Intersection Unit acts as message handler• Will be used to support pluggable coherency

engines Network interface Ring topology

11 RAMP-White

Page 12: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

System Software

Bootstrap:• JTAG for initial configuration• Ethernet for system init, kernel loading, debug

Linux 2.6.21 SMP Kernel SnapGear Linux root file-system Pthreads programming libraries

12 RAMP-White

Page 13: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Current Prototype

Single FPGA, dual core, RAMP-White infrastructure

ICache enabled, DCache disabled Boots Linux in SMP mode via debug memory

initialization Ethernet and NFS mount support Compact initramfs root file system complied

from SnapGear sources

13 RAMP-White

Page 14: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Current Prototype

14 RAMP-White

Page 15: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Future Work

Near-term• Cleanup/bug-fixing/stabilize dual-node platform• Integrate simple microcoded coherency engine for

Spring 2008 Parallel Comp Arch class• Lab to be given out by last week of Feb

• Expand cache hierarchy with soft-core cache models

• Expand design to support multi-FPGA support

15 RAMP-White

Page 16: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Multiprocessor PPC405 OS Support

Previous PPC RAMP-White design used multiple independent operating system images• Pseudo-SMP support in Linux kernel was non-

trivial to implement Alternative strategy opened up by recent

work over the summer by IBM in porting Plan9 to BlueGene PPC

16 RAMP-White

Page 17: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Plan9 Background

Research OS from Bell Labs open-sourced in 2000

Unix-style operating system Resources exposed as file trees Per-process namespaces Standard protocol for sharing resources

17 RAMP-White

Page 18: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Plan9 for HPC Applications

IBM port of Plan9 on a BlueGene grid• Part of DoE FastOS initiative• Allows distributed resource management and

sharing across a large grid• Lighter weight kernel that has less intrusive effect

on HPC apps• Ported to PPC440 with support for JTAG-based

debug and bootstrapping

18 RAMP-White

Page 19: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Plan9 for RAMP-White

Smaller, lower complexity operating system Allows flexible sharing of physical resources• memory, ethernet, disk

Can expose multiple cores as CPU servers• Allows easy task execution/debugging on remote

cores Leverage resurgent interest in using Plan9 for

HPC• HPC applications ported to Plan9

19 RAMP-White

Page 20: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Porting Plan9

• Worked in collaboration with Eric Van Hensbergen

• Resurrected the CerfCube 405 platform– Removed assumptions regarding PPC405 SoC– Adding support for Xilinx peripherals

• Serial port, interrupt controller, network

• Currently boots Plan9 with limited console– Bootstrap, virtual memory, console initialization– Still completing work on interrupt controller, network

20 RAMP-White

Page 21: RAMP-White Hari Angepat Derek Chiou University of Texas at Austin

Questions?

21 RAMP-White