Probabilistic Reasoning & Uncertainty for Artificial Intelligence (CIS 479)
Probabilistic Reasoning
Why use it?
Ad Hoc Uncertainty (MYCIN)
Questions?
Certainty Factors (ad-hoc)
how certain are you that the rule will fire?
Probability
P(E) = number of desired outcomes / total number of outcomes = |event| / |sample space|
P(not E) = 1 - P(E)
P(A or B) = P(A) + P(B) - P(A ∩ B), where P(A ∩ B) = 0 when A and B are mutually exclusive
P(A ∩ B) = P(A) * P(B|A) = P(B) * P(A|B)
For independent events
P(B|A) = P(B)
P(A|B) = P(A)
Bayes Theorem (Non-Independent)
P(Hi|E) ≡ probability that Hi is true given evidence E
P(E|Hi) ≡ probability that evidence E is observed given that Hi is true
P(Hi) ≡ prior probability that Hi is true, regardless of evidence
P(Hi|E) = P(E|Hi) * P(Hi) / Σk P(E|Hk) * P(Hk)
Tanimoto Example on Rain in Seattle
P(H) = 0.8
P(E|H) = 0.02
P(E | ~H) = 0.025
tilde (~) = not
Probability of rain tomorrow with geese on the lake
P(E) = P(E|H) * P(H) + P(E|~H) * P(~H) = (0.02)(0.8) + (0.025)(0.2) = 0.016 + 0.005 = 0.021
P(H|E) = P(E|H) * P(H) / P(E) = 0.016 / 0.021 = 0.7619
P(~H|E) = ?
P(~H|E) = P(E|~H) * P(~H) / P(E) = 0.005 / 0.021 = 0.2381
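The rain-and-geese calculation above can be checked with a short Python sketch (the helper name is mine, not from the notes):

```python
# Posterior for the Seattle rain example from Tanimoto.
# H = "it will rain tomorrow", E = "geese are on the lake".
def bayes_posterior(p_h, p_e_given_h, p_e_given_not_h):
    """Return (P(H|E), P(~H|E)) via Bayes' theorem and total probability."""
    p_e = p_e_given_h * p_h + p_e_given_not_h * (1 - p_h)  # P(E)
    return p_e_given_h * p_h / p_e, p_e_given_not_h * (1 - p_h) / p_e

p_h_given_e, p_not_h_given_e = bayes_posterior(0.8, 0.02, 0.025)
print(round(p_h_given_e, 4))      # 0.7619
print(round(p_not_h_given_e, 4))  # 0.2381
```

The two posteriors sum to 1, as they must for H and ~H.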
Weakness of Bayes Theorem
Probabilistic Inference Nets
Criteria for Problem Characteristics
Fuzzy Set Theory
x ∈ S: in fuzzy set theory this is no longer simply true or false
x ∉ S: in classical set theory an element is either in or out
Question?
What constitutes a big pile of chalk?
If we have three pieces of chalk in the room is that considered a big pile of chalk? Some people might say, yes that is a big pile and some would not. Someplace between those three pieces of chalk and a whole room full of chalk the pile of chalk turns from a small pile into a big pile. This could be a different spot for different people.
F: [0,1]ⁿ → [0,1]
Classical sets use {0,1}: an element either is a member or is not
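One way to model the "big pile of chalk" question is a graded membership function. The sigmoid shape, the crossover point of 50 pieces, and the steepness below are made-up illustration values, not anything from the notes:

```python
import math

# Hypothetical fuzzy membership function for "big pile of chalk".
# Crossover (50 pieces) and steepness are invented for illustration:
# different people would pick different values.
def big_pile(pieces, crossover=50.0, steepness=0.1):
    """Return a membership degree in [0,1] rather than a yes/no answer."""
    return 1.0 / (1.0 + math.exp(-steepness * (pieces - crossover)))

print(round(big_pile(3), 3))    # near 0: three pieces is barely a pile
print(round(big_pile(500), 3))  # near 1: a room full of chalk
```

At the crossover the membership is exactly 0.5, the point where opinions split.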
Tanimoto has examples of independent events and dependent events
Fuzzy Inference Rules
| Expression | Possibilistic (dependent)   | Probabilistic (independent) |
| A          | a                           | a                           |
| B          | b                           | b                           |
| ~A         | 1-a                         | 1-a                         |
| A ∧ B      | min(a,b)                    | a*b                         |
| A ∨ B      | max(a,b)                    | a+b-a*b                     |
| A → B      | max(1-a,b)                  | (1-a)+a*b                   |
| A ⊕ B      | max(min(a,1-b),min(1-a,b))  | a+b-2ab+a²b+ab²-a²b²        |
P(X → (Y ∨ Z))
P(X) = 0.5
P(Y) = 0.1
P(Z) = 0.2
Possibilistic
P(Y ∨ Z) = max(P(Y), P(Z)) = max(0.1, 0.2) = 0.2
P(X → (Y ∨ Z)) = max(1 - P(X), P(Y ∨ Z)) = max(1 - 0.5, 0.2) = max(0.5, 0.2) = 0.5
Probabilistic
P(Y ∨ Z) = P(Y) + P(Z) - P(Y)P(Z) = (0.1 + 0.2) - (0.1)(0.2) = 0.3 - 0.02 = 0.28
P(X → (Y ∨ Z)) = 1 - P(X) + P(X)P(Y ∨ Z) = (1 - 0.5) + (0.5)(0.28) = 0.5 + 0.14 = 0.64
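The worked example can be replayed in Python with the connectives from the table (function names are mine):

```python
# Possibilistic (max/min) vs probabilistic (independence) connectives,
# evaluated on P(X -> (Y or Z)) with P(X)=0.5, P(Y)=0.1, P(Z)=0.2.
def poss_or(a, b):  return max(a, b)           # A v B, possibilistic
def poss_imp(a, b): return max(1 - a, b)       # A -> B, possibilistic
def prob_or(a, b):  return a + b - a * b       # A v B, independent events
def prob_imp(a, b): return (1 - a) + a * b     # A -> B, independent events

x, y, z = 0.5, 0.1, 0.2
print(round(poss_imp(x, poss_or(y, z)), 2))  # 0.5
print(round(prob_imp(x, prob_or(y, z)), 2))  # 0.64
```

The probabilistic reading gives the larger value here because a+b-ab ≥ max(a,b) for values in [0,1].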
Probabilistic Reasoning
S1: Clanking Sound
S2: Low pickup
S3: Starting problem
S4: Parts are hard to find
C1: Repair Estimate > $250
H1: Thrown connecting rod
H2: Wrist Pin Loose
H3: Car Out of Tune
P(Hi | S)
Second Level
H4: Replace or Rebuild Engine
H5: Tune Engine
Steps for Building Inferences
Dempster-Shafer Theory of Evidence
Frame of discernment Ω = {X, Y, Z}
Subsets: {X} {Y} {Z}
{X,Y} {X,Z} {Y,Z}
{X,Y,Z} {}
The mass function maps each subset into [0,1], and all the masses must add up to 1
As more evidence is acquired, views might change
Belief
B(A) + B(~A) ≠ 1
Example (two dice): A = total exposed spots is 7
A covers 6 of the 36 elements of the sample space
m: P(Ω) → [0,1]
m(∅) = 0
Σ X⊆Ω m(X) = 1
Each A ⊆ Ω with m(A) > 0 is a focal element of the evidence
m(F) = 1 → the evidence F is certain
F ∩ A ≠ ∅ → the evidence F is consistent with A
Belief(A) = Σ m(B) over all B ⊆ A
Doubt(A) = Belief(~A)
Plausibility(A) = 1 - Doubt(A)
Belief(∅) = 0, Plausibility(∅) = 0
Belief(Ω) = 1, Plausibility(Ω) = 1
Plausibility(A) ≥ Belief(A)
1 ≥ Belief(A) + Belief(~A)
Plausibility(A) + Plausibility(~A) ≥ 1
If B ⊇ A then
Belief(B) ≥ Belief(A)
Plausibility(B) ≥ Plausibility(A)
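The belief/plausibility definitions above are easy to compute directly from a mass function. In this sketch subsets are frozensets, and the example mass values are made up for illustration (they sum to 1):

```python
# Belief sums mass over subsets of A; plausibility sums mass over every
# focal set that intersects A.
def belief(m, a):
    return sum(v for b, v in m.items() if b <= a)

def plausibility(m, a):
    return sum(v for b, v in m.items() if b & a)

frame = frozenset({"S", "R", "D"})
# Made-up mass function: 0.4 on {S}, 0.3 on {S,R}, 0.3 uncommitted (whole frame).
m = {frozenset({"S"}): 0.4, frozenset({"S", "R"}): 0.3, frame: 0.3}

a = frozenset({"S", "R"})
print(round(belief(m, a), 3))        # 0.7: mass on {S} and {S,R}
print(round(plausibility(m, a), 3))  # 1.0: every focal set intersects {S,R}
print(round(belief(m, a) + belief(m, frame - a), 3))  # 0.7, not 1
```

The last line shows the property B(A) + B(~A) ≠ 1: mass committed to the whole frame supports neither A nor ~A specifically.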
Rule of combination for uncertain evidence (orthogonal sum)
For A ≠ ∅:
[m1 ⊕ m2](A) = Σ X∩Y=A m1(X) m2(Y) / (1 - Σ X∩Y=∅ m1(X) m2(Y))
[m1 ⊕ m2](∅) = 0
Numeric Example
S = Snow
R = Rain
D = Dry
Ω = {S, R, D}
|P(Ω)| = 8
p. 275 Tanimoto
Two pieces of evidence
|         | ∅ | {S}   | {R}   | {D}   | {S,R} | {S,D} | {R,D} | {S,R,D} |
| Mfreeze | 0 | 0.2   | 0.1   | 0.1   | 0.2   | 0.1   | 0.1   | 0.2     |
| Mstorm  | 0 | 0.1   | 0.2   | 0.1   | 0.3   | 0.1   | 0.1   | 0.1     |
| Mboth   | 0 | 0.282 | 0.282 | 0.128 | 0.18  | 0.051 | 0.051 | 0.026   |
The values in each row of the chart add up to one
Belief({S,R}) = Mboth({S,R}) + Mboth({S}) + Mboth({R}) = 0.18 + 0.282 + 0.282 = 0.744
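Dempster's rule applied to Mfreeze and Mstorm can be checked with a short Python sketch (function and variable names are mine); it reproduces the Mboth row and the belief value above:

```python
from itertools import product

# Dempster's rule of combination: multiply masses pairwise, collect the
# products on the intersection of the two sets, then renormalize by the
# mass that fell on the empty set (the conflict).
def combine(m1, m2):
    raw = {}
    for (x, mx), (y, my) in product(m1.items(), m2.items()):
        raw[x & y] = raw.get(x & y, 0.0) + mx * my
    conflict = raw.pop(frozenset(), 0.0)
    return {a: v / (1.0 - conflict) for a, v in raw.items()}

def belief(m, a):
    return sum(v for b, v in m.items() if b <= a)

S, R, D = "S", "R", "D"
# Mass values taken from the Mfreeze / Mstorm table in the notes.
m_freeze = {frozenset(s): v for s, v in [
    ({S}, 0.2), ({R}, 0.1), ({D}, 0.1), ({S, R}, 0.2),
    ({S, D}, 0.1), ({R, D}, 0.1), ({S, R, D}, 0.2)]}
m_storm = {frozenset(s): v for s, v in [
    ({S}, 0.1), ({R}, 0.2), ({D}, 0.1), ({S, R}, 0.3),
    ({S, D}, 0.1), ({R, D}, 0.1), ({S, R, D}, 0.1)]}

m_both = combine(m_freeze, m_storm)
print(round(m_both[frozenset({S})], 3))             # 0.282
print(round(m_both[frozenset({S, D})], 3))          # 0.051
print(round(belief(m_both, frozenset({S, R})), 3))  # 0.744
```

The conflict mass here is 0.22, so every surviving product is divided by 0.78, which is where values like 0.22/0.78 = 0.282 come from.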