embedded system lab. 최 길 모최 길 모 kilmo choi [email protected] active flash: towards...
TRANSCRIPT
![Page 1: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/1.jpg)
Embedded System Lab.
Embedded System Lab.최 길 모
Kilmo [email protected]
Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines
Devesh Tiwari, Sudharshan S. Vazhkudai, Youngjae Kim, Xiaosong Ma, Simona Boboila, and Peter J. Desnoyers
![Page 2: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/2.jpg)
Embedded System Lab.최 길 모
Contents
Background
Problems and Challenges
Active Flash Approach for In-situ
Active Computation Feasibility
Evaluation
ActiveFlash Prototype based on OpenSSD Platform
Conclusion
![Page 3: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/3.jpg)
Embedded System Lab.최 길 모
Background
![Page 4: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/4.jpg)
Embedded System Lab.최 길 모
Background
Scientific Discovery : Two-Step
Scientific Simulation
Scientific Discovery
Data Analysis and Visualization
![Page 5: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/5.jpg)
Embedded System Lab.최 길 모
Background Large-scale leadership computing applications produce big data
GTC produces ~30TB output data per hour at-scale.
![Page 6: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/6.jpg)
Embedded System Lab.최 길 모
Problems and Challenges
Offline approach suffers from both performance and energy inefficien-
cies
Redundant I/O(simulations write, analyses read)
Excessive data movement
Extra energy cost
Energy efficiency will become the primary metric for system design,
as compute power is expected to increase by x1000 in the next
decade with only a x10 increase in power envelope
Using simulation nodes for data analysis not acceptable
![Page 7: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/7.jpg)
Embedded System Lab.최 길 모
Active Flash Approach for In-situ
SSDs now being adopted in Supercomputers(e.g. Tsbame, Gordon)
higher I/O throughput and storage capability
SSD controllers becoming increasingly powerful
multi-core low-power processors
Idle cycles at SSD controllers
In-situ analysis
analysis on in-transit output data, before it is written to the PFS
eliminates redundant I/O, but it use expensive compute nodes
![Page 8: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/8.jpg)
Embedded System Lab.최 길 모
Active Flash Approach for In-situ
Active flash In-situ analysis on SSDs Exploit the computation at idle cycles of the SSD controller Reduce transfer costs high performance and energy saving
![Page 9: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/9.jpg)
Embedded System Lab.최 길 모
Active Flash Approach for In-situ Three approach to data analysis
offline active flash analysis node
![Page 10: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/10.jpg)
Embedded System Lab.최 길 모
Active Computation Feasibility
Modeling SSD Deployment
Multiple constraints
Capacity
Enough SSDs to sustain output burst
Performance
High I/O bandwidth to SSD space
Fast restart from application checkpoints
Write durability
SSD write endurance limits
![Page 11: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/11.jpg)
Embedded System Lab.최 길 모
Active Computation Feasibility
Staging Ratio
How many simulation nodes share one common SSD?
![Page 12: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/12.jpg)
Embedded System Lab.최 길 모
Active Computation Feasibility
Modeling active computation feasibility
Relatively less compute intensive kernels better suited for active computa-
tion(e.g. regex matching)
Dependent on multiple factors : simulation data production rate, staging
ratio, I/O bandwidth, etc.
![Page 13: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/13.jpg)
Embedded System Lab.최 길 모
Evaluation
Cray XT5 Jaguar supercomputer
Samsung PM830 SSD
Intel Core i7 processors
![Page 14: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/14.jpg)
Embedded System Lab.최 길 모
Evaluation
Feasibility of the analysis node approach
Most data analysis kernels can be placed on SSD controllers without degrading
simulation performance
Additional SSDs are not required for supporting in-situ data analysis on SSDs
Analysis node approach is feasible at higher staging ratios, but at additional infra-
structure cost
![Page 15: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/15.jpg)
Embedded System Lab.최 길 모
Evaluation
Energy and cost saving analysis
Staging ratio = 10
Active Flash and offline approach : y1
analysis node : y2
Offline model consumes more energy
due to the I/O wait time
![Page 16: Embedded System Lab. 최 길 모최 길 모 Kilmo Choi rlfah926@naver.com Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines](https://reader035.vdocuments.site/reader035/viewer/2022062314/56649edb5503460f94beaa18/html5/thumbnails/16.jpg)
Embedded System Lab.최 길 모
Conclusion
Extant approaches to scientific data analysis(e.g. offline and analysis
nodes) are stymied by several inefficiencies in data movement and
energy consumption that results in sub-optimal performance
Active flash is better than either approaches for all of the aforemen-
tioned metrics