snorkel.labeling.filter_unlabeled_dataframe

snorkel.labeling.filter_unlabeled_dataframe(X, y, L)

Filter out examples not covered by any labeling function.

Parameters
  • X (DataFrame) – Data points in a Pandas DataFrame.

  • y (ndarray) – Matrix of probabilities output by the label model’s predict_proba method.

  • L (ndarray) – Matrix of labels emitted by LFs, with one row per data point and one column per LF.

Return type

Tuple[DataFrame, ndarray]

Returns

  • pd.DataFrame – Data points that were labeled by at least one LF in L.

  • np.ndarray – Probabilities matrix for data points labeled by at least one LF in L.
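
The filtering described above can be illustrated with a minimal sketch. It is not the library's source: `filter_unlabeled_sketch` is a hypothetical stand-in, and it assumes that an LF abstains by emitting -1 and that X, y, and L have one row per data point, in the same order. The sample data is invented for illustration.

```python
import numpy as np
import pandas as pd

ABSTAIN = -1  # assumption: the value an LF emits when it does not vote


def filter_unlabeled_sketch(X, y, L):
    """Hypothetical stand-in that mimics the documented behavior."""
    # True for each row where at least one LF emitted a non-abstain label
    covered = (L != ABSTAIN).any(axis=1)
    # Apply the same row mask to the data points and to the probabilities
    return X.iloc[covered], y[covered]


# Three data points, two LFs; the second data point is labeled by no LF
X = pd.DataFrame({"text": ["good movie", "meh", "terrible plot"]})
L = np.array([[1, -1], [-1, -1], [-1, 0]])
y = np.array([[0.1, 0.9], [0.5, 0.5], [0.8, 0.2]])

X_kept, y_kept = filter_unlabeled_sketch(X, y, L)
print(X_kept["text"].tolist())  # ['good movie', 'terrible plot']
print(y_kept.shape)             # (2, 2)
```

With Snorkel installed, the equivalent call would be filter_unlabeled_dataframe(X, y, L), which returns the filtered DataFrame and probabilities matrix as a tuple. Note that the surviving rows keep their original index labels (0 and 2 in this sketch), so reset the index if later code expects a contiguous range.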