board on mathematical sciences & analytics · • autonomous experiments and analysis,...

37
View webinar videos and learn more about BMSA at www.nas.edu /MathFrontiers Board on Mathematical Sciences & Analytics

Upload: others

Post on 01-Jun-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Board on

Mathematical Sciences & Analytics

Page 2: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Feb 12: Machine Learning for Materials Science

Mar 12: Mathematics of Privacy

Apr 9: Mathematics of Gravitational Waves

May 14: Algebraic Geometry

June 11: Mathematics of Transportation

July 9: Cryptography and Cybersecurity

Aug 13: Machine Learning in Medicine

Sept 10: Logic and Foundations

Oct 8: Mathematics of Quantum Physics

Nov 12: Quantum Encryption

Dec 10: Machine Learning for Text

MATHEMATICAL FRONTIERS2019 Monthly Webinar Series, 2-3pm ET

2

This webinar series is made possible by support for BMSA from the

National Science Foundation Division of Mathematical Sciences

and the

Department of Energy Advanced Scientific Computing Research

Page 3: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Rampi Ramprasad, Georgia Institute of Technology

Mark Green, UCLA (moderator)

MATHEMATICAL FRONTIERSMachine Learning for Materials Science

3

Elizabeth Holm, Carnegie Mellon University

Page 4: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

MATHEMATICAL FRONTIERSMachine Learning for Materials Science

4

Elizabeth Holm, Carnegie Mellon University

Professor of Materials Science and Engineering

Incorporating machine intelligence in materials science

and engineering

Page 5: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Demystifying machine intelligence (AI)

• Machine intelligence is comprised of three basic building blocks:

– Data science = statistics of multi-dimensional data

– Machine learning = optimization via feedback loops

– Convolutional neural networks = signal processing

• These three elements combine to give

Deep Learninghttps://xkcd.com/1838/

Page 6: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Opportunities for AI in materials science and engineering

• Discovery and design of materials

• Optimization of materials structure and properties

• Material selection

• Autonomous experiments and analysis, including adaptive experiments

• Managing experimental data streams, including forward modeling

• Extracting information from microstructural images

• Property prediction

• Failure analysis

• Process and quality control

• …

Microstructure

Properties

CompositionProcessing

Page 7: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Search for new materials using data science and machine learning

• Big data: Discovery of new thermoelectric material

M. W. Gaultois, A. O. Oliynyk, A. Mar, T. D. Sparks, G. J. Mulholland, B. Meredig, APL Mater., 4 053213 (2016)

Page 8: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

What is holding physical scientists back?

• Informal survey of 14 physical scientists / engineers who attended a CMU workshop on Machine Learning in Science:

What are the biggest barriers to implementing data science in your research?

• Big data (28%)

• Small data (43%)

• Rare events (43%)

• Rich/multimodal data (57%)

• Data representation (71%)

• Interpretability (79%)

Page 9: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Why interpretability matters: The machine doesn’t always learn the right things

• A CNN-based deep learning system was trained to identify classes of objects in photographs.

• Masking was used to evaluate critical features that the computer associates with an object.

• Some masks made sense:

• Some did not:

Fong, et. al arXiv:1704.03296v1

Page 10: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Black box for tedious tasks:Autonomous microstructural segmentation

using deep learning • Segmenting complex, multi-component microstructures

B. DeCost et al., arXiv:1805.08693

• Accurate (93 ± 3%)

• Objective

• Repeatable

• Indefatigable

• Permanent

• Exactly as interpretable as a graduate student

Original Image Grad student Pixel-Net

Page 11: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Black box for repetitive workflow:Quality control via computer vision

• Autonomous evaluation: identify “out of spec” microstructures

specifiedmicrostructure:

Meet specifications:

Out of spec:

B. L. DeCost, E. A. Holm, Comp. Mater. Sci. 110 126-133 (2015).

Page 12: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Where the black box fails: Extracting knowledge from information

• Extracting information from microstructural images:– Inconel 718 Charpy impact specimens

built using additive manufacturing.

– Two build orientations, horizontal and vertical.

– Charpy impact energy depends on build orientation.

Page 13: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Can you see the difference in the fracture surfaces?

Page 14: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Can you see the difference in the fracture surfaces?

Page 15: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

What is the computer learning?

• Using unsupervised ML with k-means clustering, the computer can identify horizontal and vertical build fractures with 88 ± 3 % accuracy.

• What does the computer see that we cannot?

• Does the distinguishing visual information provide physical insight?

• Has the computer learned fracture mechanics?

A. Kitahara, E. Holm , IMMI 7[3] 148 (2018)

Page 16: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Conclusion: Philosophical musings

• When is a black box OK?

– The overall cost of wrong answers is low.

– The method is better than all alternatives within its domain.

• When does the black box fail?

– The goal is not what, but why.

"All right," said Deep Thought. "The

Answer to the Great Question..."

"Yes..!"

"Of Life, the Universe and Everything..."

said Deep Thought.

"Yes...!"

"Is..." said Deep Thought, and paused.

"Yes...!"

"Is..."

"Yes...!!!...?"

"Forty-two," said Deep Thought, with

infinite majesty and calm.

―Douglas Adams, The Hitchhiker’s

Guide to the Galaxy

Page 17: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Acknowlegements

• Current and former students:– Andrew Kitahara

– Nan Gao

– Bo Lei

– Srujana Yarasi

– Keith Kozlosky

– Ankita Mangal (P&G)

– Toby Francis (UCSB)

– Anna Smith (Merck)

– Brian DeCost (NIST)

17

National Science Foundation

DMR-1507830 and DMR-

1826218

Page 18: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

MATHEMATICAL FRONTIERSMachine Learning for Materials Science

18

Michael E. Tennenbaum Family Chair and GRA Eminent Scholar,

School of Materials Science and Engineering

Rampi Ramprasad, Georgia Institute of Technology

Polymer Genome: An Informatics Platform for Prediction & Design

Page 19: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

POLYMERS ARE UBIQUITOUS

19

High Energy

Density Capacitors

Different applications have different property requirements

(Optimal materials selection is non-trivial)

Solid-State

Battery

Electrolytes

Organic / Flexible

Electronics

Need: high band gap,

high dielectric

constant

Need: low Tg, high

mechanical strength

Need: low band gap,

low carrier

recombination

Page 20: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

POLYMER CHEMICAL UNIVERSE

Poly selenophene

O O

Se n

O

S

O

OO

Sulfone prophane

NH

O

NH

NH

O

NH

Polyurea

Polyimide

n

Polyethylene

n

Polyacetylene Poly oxymethylene Poly vinylidene fluoride

n

Poly naphthalene

S

S

n

Polythiophene

S

O O

n

Poly furan thiophene

H N

n

N H

Polypyrrole Si-aliphatic polyester

Organotin-ester [p(DMT 50/50 DL-Tar/Glu)]

S

Ge NH NH

O n

Ge-containing polyamide

Fe-containing pytpy = 4′-(4-pyridyl)-2,2′:6′,2″-terpyridine

Ru-containing organometallic polymer

bis(ethynyl-benzene)platinum(II)

Organic

Org

anom

eta

llic

Linear

Aromatic

Homocyclic

Heterocyclic

Group – IV

Othe

r groups

Mixed

Organo-Sn

Organo-Si

Organo-Ge

Page 21: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

POLYMER CHEMICAL UNIVERSE

Poly selenophene

O O

Se n

O

S

O

OO

Sulfone prophane

NH

O

NH

NH

O

NH

Polyurea

Polyimide

n

Polyethylene

n

Polyacetylene Poly oxymethylene Poly vinylidene fluoride

n

Poly naphthalene

S

S

n

Polythiophene

S

O O

n

Poly furan thiophene

H N

n

N H

Polypyrrole Si-aliphatic polyester

Organotin-ester [p(DMT 50/50 DL-Tar/Glu)]

S

Ge NH NH

O n

Ge-containing polyamide

Fe-containing pytpy = 4′-(4-pyridyl)-2,2′:6′,2″-terpyridine

Ru-containing organometallic polymer

bis(ethynyl-benzene)platinum(II)

Organic

Org

anom

eta

llic

Linear

Aromatic

Homocyclic

Heterocyclic

Group – IV

Othe

r groups

Mixed

Organo-Sn

Organo-Si

Organo-Ge

Staggering! Where do we start?!

And how do we search efficiently?

Page 22: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

BENCHMARK DATASET

Computational data

via high-throughput DFT

Experimental data

from collaborators,

literature

& data collections

Band gap

Dielectric constant

Glass transition temperature

Atomization energy,

etc.

Data sources

Property space

Chemical space (~900 organic polymers)

+

Page 23: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

MACHINE LEARNING IN MSE

T. Mueller, A. G. Kusne, R. Ramprasad “Machine Learning in Materials Science: Recent Progress and

Emerging Applications”, Reviews in Computational Chemistry, John Wiley & Sons, Inc., Volume 29, (2016).

R. Ramprasad, R. Batra, G. Pilania, A. Mannodi-Kanakkithodi, C. Kim, “Machine Learning and Materials

Informatics: Recent Applications and Prospects”, npj Computational Materials 3, 54 (2017).

Material Property Value

Material 1 P1

Material 2 P2

.

.

.

.

.

.

Material N PN

a Example dataset

b The learning problem

Material Property Value

Material X ?

Material

Material 1

Material 2

.

.

.

Material N

Fingerprint

F11, F12, … F1M

F21, F22, … F2M

.

.

.

FN1, FN2, … FNM

Property Value

P1

P2

.

.

.

PN

Fingerprinting Learning

c Fingerprinting, learning and prediction

f(Fi1, Fi2, …, FiN) = Pi

Prediction Model

Page 24: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

POLYMER FINGERPRINTS (GENOME)

“Polymer Genome: A Data-Powered Polymer Informatics Platform for Property Predictions”

C. Kim, A. Chandrasekaran, T. D. Huan, D. Das and R. Ramprasad, J. Phys. Chem. C (2018)

Distance between rings

Length of sidechain

Aromatics in sidechain

C3-S2-C3

H1-N3-C4

O1-C3-C4

C6H4, CH2,

etc., blocks;

Van der Waals

volume/surface

, etc.

We represent polymers numerically at three length-

scalesBlock-levelAtomic-level Chain-level

Page 25: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

LEARNING FROM DATA

“Polymer Genome: A Data-Powered Polymer Informatics Platform for Property Predictions”

C. Kim, A. Chandrasekaran, T. D. Huan, D. Das and R. Ramprasad, J. Phys. Chem. C (2018)

Gaussian process regression (GPR) Observations

True function

GPR predictions

Uncertainty (σ interval)

x

f(x)

0 2 4 6 108

8

6

4

2

10Data – perfectly known

Fingerprint space

d(i, j) = Fi

!"-Fj

!"!

Pj = wi exp -d (i, j )2

2s 2

i=1

N

å

i

4

3

2

1

d(i,3)

d(i,2)

d(i,1)

d(i,4)

Page 26: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

HIERARCHICAL FINGERPRINTING

26

Higher length-scale

+ Block-leveldescriptors

Van der Waals volume

Types of blocks

Fraction of rotatable bonds

+ Chain-leveldescriptors

Distance between rings

Length of sidechain

Length of main chain

Atomic-leveldescriptors

C3-S2-C3H1-N3-C4

Atom-triples

Train set (R2=0.72)

Test set (R2=0.54)

Test set RMSE=51 K

Train set (R2=0.71)

Test set (R2=0.68)

Test set RMSE=39 K

Train set (R2=0.93)

Test set (R2=0.77)

Test set RMSE=34 K

Impact on glass transition

Page 27: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

ML p

redic

ted (

K)

Experimental (K)

Test set RMSE = 34 K

Train set (R2=0.93)

Test set (R2=0.77)

Experimental (K)

ML p

redic

ted (

K)

Test set RMSE = 24 K

Train set (R2=0.92)

Test set (R2=0.90)

FINGERPRINT-DIMENSION REDUCTION

27

All the fingerprint without RFE

Glass transition temperature

With recursive feature elimination (RFE)

Page 28: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

PROPERTY PREDICTION MODELS

RMSE = 0.6 MPa1/2 RMSE = 0.05 g/cm3RMSE = 18 K

Glass transition temperature Solubility parameter Density

RMSE = 0.3 eV RMSE = 0.5 RMSE = 0.1 RMSE = 0.01 eV/atom

Band gap Dielectric constant Refractive index Atomization energy

Page 29: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

OUTCOME: INSIGHTS

“Mining materials design rules from data: The example of polymer dielectrics”

A. Mannodi-Kanakkithodi, et al, Chemistry of Materials (2017).

Possible blocks

CH2, CO, CS, O, NH, C6H4,

C4H2Sn

Page 30: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

OUTCOME: INSIGHTS

“Mining materials design rules from data: The example of polymer dielectrics”

A. Mannodi-Kanakkithodi, et al, Chemistry of Materials (2017).

Can we determine underlying

factors that govern such

“entangled” behavior?

Certain blocks in a particular

sequence provide optimal

behavior

Possible blocks

CH2, CO, CS, O, NH, C6H4,

C4H2Sn

Page 31: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

OUTCOME: MATERIAL DISCOVERIES

31

Polymer name BOPPPDTC-HDA

(Polythiourea)

BTDA-HDA

(Polyimide)

BTDA-HK511

(Polyimide)

Repeat unit

Synthesized polymer

Energy density (J/cm3) 5 9 10 16

Example: High energy density capacitors

New materials discovered with performance up to 3.5x

of BOPP, the current standard material!

(Metallized)

”Scoping the Polymer Genome: A Roadmap for Rational Polymer Dielectrics Design and

Beyond,”

A. Mannodi-Kanakkithodi, et al, Materials Today 21, 7, 785-796 (2018)

Page 32: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

OUTCOME: ONLINE APPhttps://www.polymergenome.org

Page 33: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

CHALLENGES AND NEXT STEPS

Experimental data capture, and data uncertainty

Other applications / properties

Handling morphological complexity

“Inverse” design (properties to polymers)

33

Targeted

property

Surrogate

models

Inverse

design

Polymers /w

desired property

Page 34: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

ACKNOWLEDGEMENT

Dr. ChihoKim

Dr. AnandChandrasekaran

DeepakKamal

Dr. TranDoan Huan

Dr. ArunMannodi

Kanakkithodi(@ Argonne)

ShrutiVenkatram

AnuragJha

Dr. DeyaDas

(@ Mahindra)

Dr. LihuaChen

Dr. RohitBatra

Dr. AbhirupPatra

PranavShetty

Page 35: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Rampi Ramprasad, Georgia Institute of Technology

Mark Green, UCLA (moderator)

MATHEMATICAL FRONTIERSMachine Learning for Materials Science

35

Elizabeth Holm, Carnegie Mellon University

Page 36: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers

Feb 12: Machine Learning for Materials Science

Mar 12: Mathematics of Privacy

Apr 9: Mathematics of Gravitational Waves

May 14: Algebraic Geometry

June 11: Mathematics of Transportation

July 9: Cryptography and Cybersecurity

Aug 13: Machine Learning in Medicine

Sept 10: Logic and Foundations

Oct 8: Mathematics of Quantum Physics

Nov 12: Quantum Encryption

Dec 10: Machine Learning for Text

MATHEMATICAL FRONTIERS2019 Monthly Webinar Series, 2-3pm ET

36

This webinar series is made possible by support for BMSA from the

National Science Foundation Division of Mathematical Sciences

and the

Department of Energy Advanced Scientific Computing Research

Page 37: Board on Mathematical Sciences & Analytics · • Autonomous experiments and analysis, including adaptive experiments • Managing experimental data streams, including forward modeling

View webinar videos and learn more about BMSA at www.nas.edu/MathFrontiers 37