Parameter learning

In this tutorial we demonstrate the process of parameter learning, which uses data to determine the distribution(s) for one or more nodes in a Bayesian network.
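
To make the idea concrete, the following minimal sketch estimates one conditional distribution from complete discrete data using normalised counts. It is an illustration of the general technique only, not Bayes Server's implementation or API. The function `learn_cpt` and the variables `Rain` and `WetGrass` are hypothetical and are not part of the Waste network.

```python
from collections import Counter, defaultdict


def learn_cpt(records, child, parents):
    """Estimate P(child | parents) from complete records by normalised counts."""
    counts = defaultdict(Counter)
    for record in records:
        parent_state = tuple(record[p] for p in parents)
        counts[parent_state][record[child]] += 1
    # Normalise each row of counts so that it sums to 1.
    return {
        parent_state: {
            state: n / sum(child_counts.values())
            for state, n in child_counts.items()
        }
        for parent_state, child_counts in counts.items()
    }


# Hypothetical toy data: every record has a value for every variable.
records = [
    {"Rain": "yes", "WetGrass": "yes"},
    {"Rain": "yes", "WetGrass": "yes"},
    {"Rain": "yes", "WetGrass": "no"},
    {"Rain": "no", "WetGrass": "no"},
    {"Rain": "no", "WetGrass": "no"},
    {"Rain": "no", "WetGrass": "no"},
    {"Rain": "no", "WetGrass": "yes"},
]

cpt = learn_cpt(records, child="WetGrass", parents=["Rain"])
```

Each key of `cpt` is a tuple of parent states, and each value is the estimated distribution over the child's states for that parent configuration.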

Note: You can also specify distributions manually. If required, you can learn the distributions of some nodes from data and specify the rest manually.

Generate tutorial data

We will generate data from a known network, and then delete the distributions for that network before proceeding with learning.

In the Bayes Server User Interface, open the Waste network from the Start page (or from File/Open).
From the Application main tab, click Data Sources.
Click Generate -> Generate Custom. This will launch the Data Sampling dialog.
Under Options, expand the .. more items to display further options.
Change the Sample Count to 10000.
Change the Missing Data Probability to 0.05. As a result, approximately 5% of the generated values will be missing. Parameter learning fully supports missing data.
Click Run.
In the Results, you should see that some of the data values are missing, as expected.
Click the + Data Source button.
Close any dialogs until you return to the Network Viewer.

Figure: Data Sampling Results
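
The sampling settings used above can be sketched outside the user interface as follows. This is a rough illustration that assumes each value is blanked independently with the given probability, which may differ from how Bayes Server applies the setting. The function `sample_cases`, the two-node network and its probabilities are hypothetical.

```python
import random


def sample_cases(sample_count, missing_probability, seed=0):
    """Sample cases from a toy two-node network, then hide some values."""
    rng = random.Random(seed)
    cases = []
    for _ in range(sample_count):
        # Sample the parent first, then the child given the parent.
        rain = "yes" if rng.random() < 0.2 else "no"
        p_wet = 0.9 if rain == "yes" else 0.1
        wet = "yes" if rng.random() < p_wet else "no"
        case = {"Rain": rain, "WetGrass": wet}
        # Assumption: each value is set to missing independently.
        for name in list(case):
            if rng.random() < missing_probability:
                case[name] = None
        cases.append(case)
    return cases


cases = sample_cases(sample_count=10000, missing_probability=0.05)
missing = sum(value is None for case in cases for value in case.values())
missing_fraction = missing / (2 * len(cases))
```

With these settings, `missing_fraction` should come out close to 0.05, mirroring the missing values visible in the sampling results.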

Note that parameter learning supports missing data and latent variables.
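
One standard way to learn from data with missing values is the Expectation-Maximization (EM) algorithm, sketched below for a toy network A -> B with binary states. This illustrates the general technique and is not a description of Bayes Server's internals. The function `em_learn` and all probabilities are hypothetical, and missing values are represented by `None`.

```python
import random


def em_learn(records, iterations=30):
    """Learn P(A) and P(B | A) from records (a, b) that may contain None."""
    p_a = [0.5, 0.5]
    p_b_given_a = [[0.5, 0.5], [0.5, 0.5]]
    for _ in range(iterations):
        count_a = [0.0, 0.0]
        count_ab = [[0.0, 0.0], [0.0, 0.0]]
        for a_obs, b_obs in records:
            # E-step: weight every completion consistent with the observed
            # values by its probability under the current parameters.
            completions = [
                (a, b)
                for a in (0, 1) if a_obs is None or a == a_obs
                for b in (0, 1) if b_obs is None or b == b_obs
            ]
            weights = [p_a[a] * p_b_given_a[a][b] for a, b in completions]
            total = sum(weights)
            for (a, b), weight in zip(completions, weights):
                count_a[a] += weight / total
                count_ab[a][b] += weight / total
        # M-step: normalise the expected counts into new parameters.
        n = sum(count_a)
        p_a = [c / n for c in count_a]
        p_b_given_a = [[c / sum(row) for c in row] for row in count_ab]
    return p_a, p_b_given_a


# Hypothetical ground truth, used only to generate demonstration data.
rng = random.Random(1)
true_p_a = [0.3, 0.7]
true_p_b_given_a = [[0.9, 0.1], [0.2, 0.8]]
records = []
for _ in range(5000):
    a = 0 if rng.random() < true_p_a[0] else 1
    b = 0 if rng.random() < true_p_b_given_a[a][0] else 1
    # Hide each value independently with probability 0.05.
    records.append(tuple(None if rng.random() < 0.05 else v for v in (a, b)))

p_a, p_b_given_a = em_learn(records)
```

A latent variable can be viewed as the extreme case in which a column is missing in every record, although identifying its states then depends on the network structure and the starting parameters.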

Delete existing distributions (optional)

By default, parameter learning will overwrite existing distributions, although this is configurable per node.

For this tutorial, we will delete all the existing distributions first, so that any distributions present after learning have clearly been learned from the data.

Note: You can choose not to learn the distributions of certain nodes. You can also choose not to map certain columns of data to nodes.

Repeat the following steps for each node in the network.

Click to select the node.
From the Build main tab, click Distributions -> Edit distribution(s).
From the toolbar, click Delete -> Delete Distribution.

Figure: Waste delete links