logical and probabilistic reasoning to support information analysis in uncertain domains marco...

24
Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns [email protected] September 6, 2007 Acknowledgments: This work was funded in part by the Disruptive Technology Office Collaboration and Analyst System Effectiveness (CASE) Program, contract FA8750-06-C-0194 issued by Air Force Research Laboratory (AFRL). The views and conclusions are those of the authors, not of the US Government or its agencies. The contributions of Scott Langevin, Laura Zavala, Jingsong Wang, Jingshan Huang, and Dylan Kane (who prepared several of the slides) are appreciated.

Post on 20-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

Logical and Probabilistic Reasoning to Support

Information Analysis in Uncertain Domains

Marco Valtorta, John Byrnes, and Michael Huhns

[email protected]

September 6, 2007

Acknowledgments: This work was funded in part by the Disruptive Technology Office Collaboration and Analyst System Effectiveness (CASE) Program, contract FA8750-06-C-0194 issued by Air Force Research Laboratory (AFRL). The views and conclusions are those of the authors, not of the US Government or its agencies.

The contributions of Scott Langevin, Laura Zavala, Jingsong Wang, Jingshan Huang, and Dylan Kane (who prepared several of the slides) are appreciated.

Page 2: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

2

USC/HNC BALER Project

Overview

The BALER ProjectMotivation and AimsArchitecture

Conversion of Natural Deduction to Bayesian Network FragmentsNatural DeductionConverter Program

Use of BN Fragments Derived From Proofs Examples (throughout) Proof of Correctness Conclusions

Page 3: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

3

USC/HNC BALER Project

Bayesian and Logical Reasoning: BALER

BALER makes it possible for analysts to confront problems of credibility, relevance, contradictory evidence, and pervasive uncertainty, using

A unique combination of the power of logical and probabilistic reasoning

Numerical analysis of competing hypotheses

Automated linking of relevant evidence Automated propagation of uncertainty

values: good arguments from uncertain data still add strength to a conclusion

Robust reasoning over contradictory information allows analysts to exploit maximal amounts of information

Analysts can enter their own knowledge directly, allowing the system to learn from its users

Probabilities quantify belief in hypotheses to support optimal decision making according to the principle of maximum expected utility

Democracies are stable

Palestine is a

democracy

Palestine is stable

Palestine is an anarchy

Palestinian Parliament

to be dissolved

ContradictoryReports

Information Extraction

Error?

Text Document

Source Reliability?

Logical Inference

HowCertain?

Uncertain rule

How Relevant?

Causal Link

Page 4: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

4

USC/HNC BALER Project

BALER Architectural Concept

Some knowledge is best formalized in logic. This kind of knowledge includesClass-subclass statements, such as “dogs are mammals”Part-whole statements, such as “intake valves are parts of

cylinders”Definitional statements, such as “triangles have three sides”Temporal statements, such as “3:00 p.m. occurs before 4:00

p.m.”Spatial statements, such as “London is located in the UK”

Other knowledge is naturally probabilistic in nature. Examples are“Terrorist cell X planned the bombing”“Suspect Y met with cell leader Z in Syria last March”

BALER reasons both logically and probabilistically, permitting each piece of knowledge to be represented in the most appropriate way

Page 5: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

5

USC/HNC BALER Project

Overview of the BALER Architecture

BALER first generates a logical proof tree to focus the reasoning, then augments it with probabilities, and finally uses Bayesian reasoning to handle uncertainty, credibility, and relevance. The resultant Bayesian network structure is smaller, and thus the computation is tractable

LogicalReasonerFacts & Rules

Causality, Conventions, Hypotheses, and Norms

BayesianNetwork

Generator

Proof Trees

BN FragmentsMatcher &Composer

BN

BNs

BayesianNetwork

Reasoner

BN Situation

Conclusionsand Advice

EvidenceVarious Source

s

Page 6: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

6

USC/HNC BALER Project

BALER Logical Reasoner

Consumes knowledge, which could be provided by: Standard Upper Merged Ontology (SUMO) Magellan ontology IKRIS formalizations http://

nrrc.mitre.org/NRRC/ikris.htm Databases of interest

Provides proofs of the type consumed by the BN Constructor. Initial investigations focus on natural deduction proofs

Features the ability to search in classical, intuitionistic or minimal logic

Features the ability to present high-level outline of proof

Page 7: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

7

USC/HNC BALER Project

Bayesian Network Generator

Consumes a proof Consumes partial conditional

probability information Generates a Bayesian

network Features the ability to

estimate missing probabilities through maximum entropy, and possibly other techniques

We describe a prototype implementation and show its correctness in this talk

LogicalReasonerFacts & Rules

Causality, Conventions, Hypotheses, and Norms

BayesianNetwork

Generator

Proof Trees

BN FragmentsMatcher &Composer

BN

BNs

BayesianNetwork

Reasoner

BN Situation

Conclusionsand Advice

EvidenceVarious Sources

Page 8: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

8

USC/HNC BALER Project

BN Fragment Matcher and Composer

Retrieves BN fragments based on data (evidence) and instantiates the attributes of the nodes

Joins a set of BN fragments Stores instantiated and

composed fragments in a repository

LogicalReasonerFacts & Rules

Causality, Conventions, Hypotheses, and Norms

BayesianNetwork

Generator

Proof Trees

BN FragmentsMatcher &Composer

BN

BNs

BayesianNetwork

Reasoner

BN Situation

Conclusionsand Advice

EvidenceVarious Sources

Page 9: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

9

USC/HNC BALER Project

Bayesian Network Reasoner

Provides updates to a composed Bayesian network given hard, virtual, or soft evidence

Provides value of information computation

Provides analysis of sensitivity to parameters and evidence

LogicalReasonerFacts & Rules

Causality, Conventions, Hypotheses, and Norms

BayesianNetwork

Generator

Proof Trees

BN FragmentsMatcher &Composer

BN

BNs

BayesianNetwork

Reasoner

BN Situation

Conclusionsand Advice

EvidenceVarious Sources

Page 10: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

10

USC/HNC BALER Project

Natural Deduction

Abstract system of first-order logic Designed to mimic the natural reasoning process, as

follows:Make assumptions (“A” is true)

The set of assumptions being relied on at a given step is called the context

Use inference rules to draw conclusionsDischarge assumptions as they become no longer

necessary We use a sound and complete system of rules for classical

first-order logic; variations for intuitionistic and minimal logics require only small modifications

Page 11: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

11

USC/HNC BALER Project

Input Syntax for a Proof Step

<proofStep id="2"><rule>if I</rule><discharge>

<formula>(A)</formula></discharge><premises>

<formula contextID="2">(and (A) (B))</formula>

</premises><conclusion>

<formula>(if (A) (and (A) (B)))</formula></conclusion>

</proofStep>

Page 12: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

12

USC/HNC BALER Project

XMLSchema for Proof Language

Page 13: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

13

USC/HNC BALER Project

Creating Nodes by Formula Create a node for every formula Parse formula into operator and subformulas Create nodes for each subformula and make them

parents of the current node Assign the node a truth table based on the operator Recursively repeat the process for the subformulas until

they cannot be decomposed anymore

Page 14: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

14

USC/HNC BALER Project

Creating Nodes: Contexts

Contexts are provided by the logical reasoner

Their contents are stored as premises

Make each premise a parent of the context

Set the truth table such that the context is only true if all of its subformulas are trueFor a context of two

formulas, this looks like the table for AND

Page 15: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

15

USC/HNC BALER Project

Example: Brown Liquids

We want to show B

Page 16: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

16

USC/HNC BALER Project

Brown Liquids Proof and Corresponding BN

Page 17: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

17

USC/HNC BALER Project

B Logically Follows from the Axioms

Page 18: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

18

USC/HNC BALER Project

Representing Expert Judgment

Page 19: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

19

USC/HNC BALER Project

Composing a Fragment Derived from a Logical Theorywith a Fragment Representing Expert Judgment

Page 20: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

20

USC/HNC BALER Project

Using the Composed Model

Page 21: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

21

USC/HNC BALER Project

Adding One More Fragment

Page 22: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

22

USC/HNC BALER Project

Soft Evidence

Page 23: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

23

USC/HNC BALER Project

Extension of the language meaning that the formula holds for all terms other than a and b

Extending the Proof Converter toFirst-Order Logic

)()( xSxRx

)()( aSaR

baSR ,)()(

)()( bSbR

)(bS)(bR

Explicitly list instantiations occurring in proof

treated like infinite , like . Occurrences from the proof are explicitly represented; a single node represents “all others”

Correctness: show that defined distribution Pr satisfies Pr( A=True ) = Pr*( {M | M ╞ A} )

for Pr* over the class of term models If ├ A then Pr(A=T | =T ) = 1

Page 24: Logical and Probabilistic Reasoning to Support Information Analysis in Uncertain Domains Marco Valtorta, John Byrnes, and Michael Huhns valtorta@sc.edu

24

USC/HNC BALER Project

Conclusions

Our approach enables first-order logic sentences to be combined with Bayesian networks

Our results are based on the assumption that logic proofs and Bayesian reasoning can be handled separately and serially, and that the Bayesian network nodes can attach only to proof nodes without parents

Our converter successfully generates Bayesian networks for any first-order natural deduction proof (that uses the Reeves-Clarke inference rules)

We emphasize that our approach can handle formulas beyond Horn clauses

Additional work underway:Applying more real world examples and probabilistic

knowledge bases