Part 5 : Learning Bayesian networks

    • Manual construction of a model for a given problem may be impossible :
      • expert not available
      • problem too complex

      goal : construct a structured model of the (hidden) distribution most likely underlying the observed samples (Automatic Model Learning)
    • Automatic model learning assumptions :
      • unknown distribution
      • training examples are representative of the world
      task : learn a model whose distribution is an approximation to the "training set" model and whose graph structure reflects the true (in)dependencies in the world
    • Learning as optimisation, general approach :
      • define an objective function F(M,D) : a measure that estimates how "good" a given model M is in relation to the given training examples
      • develop an algorithm to find the model that maximises F
      learning is a search/optimisation problem (see the sketch below)
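      A minimal sketch of this search view (the model class, objective, and enumeration here are illustrative assumptions, not a specific algorithm from the notes):

      ```python
      # Learning as optimisation: enumerate candidate models, score each with
      # the objective F(M, D), and return the best-scoring one.
      def learn(candidate_models, F, data):
          """Return the candidate model M that maximises F(M, D)."""
          return max(candidate_models, key=lambda M: F(M, data))
      ```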
    • Likelihood of a Model M
      relative to a dataset D is the probability that the model assigns to the set D : L(M:D) = P_M(D)
    • If the examples D are independent and identically distributed (i.i.d.), the likelihood L(M:D) is L(M:D) = P_M(D) = \prod_{x_i \in D} P_M(x_i)
    • Likelihood : the product of the probabilities assigned by the model to the individual training examples (see the example below)
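      A minimal sketch of this product, assuming a toy model over one binary variable with P(X=1) = 0.7 (illustrative numbers, not from the notes):

      ```python
      from math import prod

      def likelihood(p_model, data):
          """L(M:D) = product of P_M(x_i) over the i.i.d. examples x_i in D."""
          return prod(p_model(x) for x in data)

      p = lambda x: 0.7 if x == 1 else 0.3   # P(X=1) = 0.7, P(X=0) = 0.3
      print(likelihood(p, [1, 1, 0, 1]))     # 0.7 * 0.7 * 0.3 * 0.7 ≈ 0.1029
      ```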
    • Problems with the likelihood function :
      • probability will be minuscule
      • arithmetic underflow
      solution : log-likelihood
    • The Log-likelihood l(M:D) of a Model M relative to a dataset D is the logarithm of the likelihood
      l(M:D) = \log L(M:D) = \log \prod_{x_i \in D} P_M(x_i) = \sum_{x_i \in D} \log P_M(x_i)
    • Likelihood and log-likelihood are monotonically related : l(M:D) has its maximum where L(M:D) is maximal (see the demonstration below)
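      A small numeric demonstration of the underflow problem and the log-likelihood fix (data and model are the illustrative toy assumptions from above):

      ```python
      from math import log, prod

      p = lambda x: 0.7 if x == 1 else 0.3   # same toy model as above
      data = [1, 0, 1] * 400                 # 1200 i.i.d. binary examples

      # Direct likelihood: a product of 1200 probabilities < 1 is about e^-767,
      # below the smallest representable float, so it underflows to 0.0.
      print(prod(p(x) for x in data))        # 0.0

      # Log-likelihood: summing logs stays in a safe numeric range, and since
      # log is monotonic the maximising model is unchanged.
      print(sum(log(p(x)) for x in data))    # ≈ -766.9
      ```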
    • to compensate for overfitting --> learn a model that generalises
    • Generalisation : the model must be more general than a simple summary of the training set
      Overfitting : a model that exactly fits the training data, but is not useful for queries about new situations (see the toy demonstration below)
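      A toy demonstration of overfitting vs. generalisation (hand-picked counts, purely illustrative): a full joint table fits a small training sample better, but a simpler independence model does better on new data from a truly independent world:

      ```python
      from math import log

      train = [(0, 0)] * 4 + [(0, 1)] + [(1, 0)] + [(1, 1)] * 4   # spurious correlation
      test = [(0, 0), (0, 1), (1, 0), (1, 1)] * 3                 # truly independent world

      def log_lik(p, data):
          return sum(log(p(x)) for x in data)

      # Model 1: full joint table, maximum-likelihood estimate from training counts.
      joint = {xy: train.count(xy) / len(train) for xy in [(0, 0), (0, 1), (1, 0), (1, 1)]}
      full = lambda xy: joint[xy]

      # Model 2: assume X and Y are independent; both marginals here are uniform (0.5).
      indep = lambda xy: 0.5 * 0.5

      print(log_lik(full, train), log_lik(indep, train))  # ≈ -11.9 vs -13.9 : full wins on train
      print(log_lik(full, test), log_lik(indep, test))    # ≈ -19.3 vs -16.6 : indep generalises better
      ```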
    • Bias : the potential error introduced by restricting the expressivity of the model class
    • Bias vs Variance
      Put constraints on the class of models allowed to be learned :
      • hard constraint : strictly restricts the class of models
      • soft constraint : an additional regularisation term in the objective function that adds a penalty (see the sketch below)
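      A minimal sketch of a soft constraint (the penalty form, lambda times a parameter count, is an illustrative assumption, not the specific regulariser from the notes):

      ```python
      # Soft constraint: score a model by its log-likelihood minus a complexity
      # penalty, so a more expressive model must "earn" its extra parameters.
      def regularised_score(log_likelihood, n_params, lam=1.0):
          """F(M, D) = l(M:D) - lam * complexity(M), with complexity = n_params."""
          return log_likelihood - lam * n_params
      ```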
    • Variance : the potential error introduced by permitting high expressivity of the model class
    • Bias-Variance tradeoff
      • restriction to simple models makes the hypothesis space smaller and increases the probability of bias error
      • on the other hand, in a smaller hypothesis space it is less likely to find an overfitting model
      vs.
      • permitting complex models reduces the probability of bias error
      • but introduces variance as a potential source of error