snorkel.labeling.model.label_model.LabelModel¶

class snorkel.labeling.model.label_model.LabelModel(cardinality=2, **kwargs)[source]¶

Bases: torch.nn.modules.module.Module, snorkel.labeling.model.base_labeler.BaseLabeler

A model for learning the LF accuracies and combining their output labels.

This class learns a model of the labeling functions’ conditional probabilities of outputting the true (unobserved) label Y, P(lf | Y), and uses this learned model to re-weight and combine their output labels.

This class is based on the approach in [Training Complex Models with Multi-Task Weak Supervision](https://arxiv.org/abs/1810.02840), published in AAAI’19. In this approach, we compute the inverse generalized covariance matrix of the junction tree of a given LF dependency graph, and perform a matrix completion-style approach with respect to these empirical statistics. The result is an estimate of the conditional LF probabilities, P(lf | Y), which are then set as the parameters of the label model used to re-weight and combine the labels output by the LFs.

Currently this class uses a conditionally independent label model, in which the LFs are assumed to be conditionally independent given Y.

Examples

>>> label_model = LabelModel()
>>> label_model = LabelModel(cardinality=3)
>>> label_model = LabelModel(cardinality=3, device='cpu')
>>> label_model = LabelModel(cardinality=3)

Parameters

cardinality (int) – Number of classes, by default 2
**kwargs – Arguments for changing config defaults

Raises

ValueError – If config device set to cuda but only cpu is available

cardinality[source]¶: Number of classes, by default 2

config[source]¶: Training configuration

seed[source]¶: Random seed

__init__(cardinality=2, **kwargs)[source]¶

Initializes internal Module state, shared by both nn.Module and ScriptModule.

Return type: None

Methods

`__init__`([cardinality])	Initializes internal Module state, shared by both nn.Module and ScriptModule.
`add_module`(name, module)	Adds a child module to the current module.
`apply`(fn)	Applies `fn` recursively to every submodule (as returned by `.children()`) as well as self.
`bfloat16`()	Casts all floating point parameters and buffers to `bfloat16` datatype.
`buffers`([recurse])	Returns an iterator over module buffers.
`children`()	Returns an iterator over immediate children modules.
`cpu`()	Moves all model parameters and buffers to the CPU.
`cuda`([device])	Moves all model parameters and buffers to the GPU.
`double`()	Casts all floating point parameters and buffers to `double` datatype.
`eval`()	Sets the module in evaluation mode.
`extra_repr`()	Set the extra representation of the module
`fit`(L_train[, Y_dev, class_balance])	Train label model.
`float`()	Casts all floating point parameters and buffers to float datatype.
`forward`(*input)	Defines the computation performed at every call.
`get_conditional_probs`()	Return the estimated conditional probabilities table.
`get_weights`()	Return the vector of learned LF weights for combining LFs.
`half`()	Casts all floating point parameters and buffers to `half` datatype.
`load`(source)	Load existing label model.
`load_state_dict`(state_dict[, strict])	Copies parameters and buffers from `state_dict` into this module and its descendants.
`modules`()	Returns an iterator over all modules in the network.
`named_buffers`([prefix, recurse])	Returns an iterator over module buffers, yielding both the name of the buffer as well as the buffer itself.
`named_children`()	Returns an iterator over immediate children modules, yielding both the name of the module as well as the module itself.
`named_modules`([memo, prefix])	Returns an iterator over all modules in the network, yielding both the name of the module as well as the module itself.
`named_parameters`([prefix, recurse])	Returns an iterator over module parameters, yielding both the name of the parameter as well as the parameter itself.
`parameters`([recurse])	Returns an iterator over module parameters.
`predict`(L[, return_probs, tie_break_policy])	Return predicted labels, with ties broken according to policy.
`predict_proba`(L)	Return label probabilities P(Y \| lambda).
`register_backward_hook`(hook)	Registers a backward hook on the module.
`register_buffer`(name, tensor[, persistent])	Adds a buffer to the module.
`register_forward_hook`(hook)	Registers a forward hook on the module.
`register_forward_pre_hook`(hook)	Registers a forward pre-hook on the module.
`register_full_backward_hook`(hook)	Registers a backward hook on the module.
`register_parameter`(name, param)	Adds a parameter to the module.
`requires_grad_`([requires_grad])	Change if autograd should record operations on parameters in this module.
`save`(destination)	Save label model.
`score`(L, Y[, metrics, tie_break_policy])	Calculate one or more scores from user-specified and/or user-defined metrics.
`share_memory`()	rtype ~T
`state_dict`([destination, prefix, keep_vars])	Returns a dictionary containing a whole state of the module.
`to`(args, *kwargs)	Moves and/or casts the parameters and buffers.
`train`([mode])	Sets the module in training mode.
`type`(dst_type)	Casts all parameters and buffers to `dst_type`.
`xpu`([device])	Moves all model parameters and buffers to the XPU.
`zero_grad`([set_to_none])	Sets gradients of all model parameters to zero.

Attributes

`T_destination`
`dump_patches`

fit(L_train, Y_dev=None, class_balance=None, **kwargs)[source]¶

Train label model.

Train label model to estimate mu, the parameters used to combine LFs.

Parameters

L_train (ndarray) – An [n,m] matrix with values in {-1,0,1,…,k-1}
Y_dev (Optional[ndarray]) – Gold labels for dev set for estimating class_balance, by default None
class_balance (Optional[List[float]]) – Each class’s percentage of the population, by default None
**kwargs –
Arguments for changing train config defaults.

n_epochs
The number of epochs to train (where each epoch is a single optimization step), default is 100

lr
Base learning rate (will also be affected by lr_scheduler choice and settings), default is 0.01

l2
Centered L2 regularization strength, default is 0.0

optimizer
Which optimizer to use (one of [“sgd”, “adam”, “adamax”]), default is “sgd”

optimizer_config
Settings for the optimizer

lr_scheduler
Which lr_scheduler to use (one of [“constant”, “linear”, “exponential”, “step”]), default is “constant”

lr_scheduler_config
Settings for the LRScheduler

prec_init
LF precision initializations / priors, default is 0.7

seed
A random seed to initialize the random number generator with

log_freq
Report loss every this many epochs (steps), default is 10

mu_eps
Restrict the learned conditional probabilities to [mu_eps, 1-mu_eps], default is None

Raises

Exception – If loss in NaN

Examples

>>> L = np.array([[0, 0, -1], [-1, 0, 1], [1, -1, 0]])
>>> Y_dev = [0, 1, 0]
>>> label_model = LabelModel(verbose=False)
>>> label_model.fit(L)
>>> label_model.fit(L, Y_dev=Y_dev, seed=2020, lr=0.05)
>>> label_model.fit(L, class_balance=[0.7, 0.3], n_epochs=200, l2=0.4)

Return type: None

forward(*input)[source]¶

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

Return type: None

get_conditional_probs()[source]¶

Return the estimated conditional probabilities table.

Return the estimated conditional probabilites table cprobs, where cprobs is an (m, k+1, k)-dim np.ndarray with:

cprobs[i, j, k] = P(lf_i = j-1 | Y = k)

where m is the number of LFs, k is the cardinality, and cprobs includes the conditional abstain probabilities P(lf_i = -1 | Y = y).

Returns: An [m, k + 1, k] np.ndarray conditional probabilities table.
Return type: np.ndarray

get_weights()[source]¶

Return the vector of learned LF weights for combining LFs.

Returns: [m,1] vector of learned LF weights for combining LFs.
Return type: np.ndarray

Example

>>> L = np.array([[1, 1, 1], [1, 1, -1], [-1, 0, 0], [0, 0, 0]])
>>> label_model = LabelModel(verbose=False)
>>> label_model.fit(L, seed=123)
>>> np.around(label_model.get_weights(), 2)  # doctest: +SKIP
array([0.99, 0.99, 0.99])

load(source)[source]¶

Load existing label model.

Parameters: source (str) – Filename to load model from

Example

Load parameters saved in saved_label_model

>>> label_model.load('./saved_label_model.pkl')  # doctest: +SKIP

Return type: None

predict(L, return_probs=False, tie_break_policy='abstain')[source]¶

Return predicted labels, with ties broken according to policy.

Policies to break ties include:

“abstain”: return an abstain vote (-1)
“true-random”: randomly choose among the tied options
“random”: randomly choose among tied option using deterministic hash

NOTE: if tie_break_policy=”true-random”, repeated runs may have slightly different results due to difference in broken ties

Parameters

L (ndarray) – An [n,m] matrix with values in {-1,0,1,…,k-1}
return_probs (Optional[bool]) – Whether to return probs along with preds
tie_break_policy (str) – Policy to break ties when converting probabilistic labels to predictions

Return type

Union[ndarray, Tuple[ndarray, ndarray]]

Returns

np.ndarray – An [n,1] array of integer labels
(np.ndarray, np.ndarray) – An [n,1] array of integer labels and an [n,k] array of probabilistic labels

Example

>>> L = np.array([[0, 0, -1], [1, 1, -1], [0, 0, -1]])
>>> label_model = LabelModel(verbose=False)
>>> label_model.fit(L)
>>> label_model.predict(L)
array([0, 1, 0])

predict_proba(L)[source]¶

Return label probabilities P(Y | lambda).

Parameters: L (ndarray) – An [n,m] matrix with values in {-1,0,1,…,k-1}f
Returns: An [n,k] array of probabilistic labels
Return type: np.ndarray

Example

>>> L = np.array([[0, 0, 0], [1, 1, 1], [1, 1, 1]])
>>> label_model = LabelModel(verbose=False)
>>> label_model.fit(L, seed=123)
>>> np.around(label_model.predict_proba(L), 1)  # doctest: +SKIP
array([[1., 0.],
       [0., 1.],
       [0., 1.]])

save(destination)[source]¶

Save label model.

Parameters: destination (str) – Filename for saving model

Example

>>> label_model.save('./saved_label_model.pkl')  # doctest: +SKIP

Return type: None

score(L, Y, metrics=['accuracy'], tie_break_policy='abstain')[source]¶

Calculate one or more scores from user-specified and/or user-defined metrics.

Parameters

L (ndarray) – An [n,m] matrix with values in {-1,0,1,…,k-1}
Y (ndarray) – Gold labels associated with data points in L
metrics (Optional[List[str]]) – A list of metric names. Possbile metrics are - accuracy, coverage, precision, recall, f1, f1_micro, f1_macro, fbeta, matthews_corrcoef, roc_auc. See sklearn.metrics for details on the metrics.
tie_break_policy (str) – Policy to break ties when converting probabilistic labels to predictions. Same as predict() method above.

Returns

A dictionary mapping metric names to metric scores

Return type

Dict[str, float]

Example

>>> L = np.array([[1, 1, -1], [0, 0, -1], [1, 1, -1]])
>>> label_model = LabelModel(verbose=False)
>>> label_model.fit(L)
>>> label_model.score(L, Y=np.array([1, 1, 1]))
{'accuracy': 0.6666666666666666}
>>> label_model.score(L, Y=np.array([1, 1, 1]), metrics=["f1"])
{'f1': 0.8}