Python Annotations.compute_ambiguity Examples

Programming Language: Python

Namespace/Package Name: medacy.tools

Class/Type: Annotations

Method/Function: compute_ambiguity

Examples at hotexamples.com: 2

Python Annotations.compute_ambiguity - 2 examples found. These are the top rated real world Python examples of medacy.tools.Annotations.compute_ambiguity extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

Annotations(30)

get_entity_annotations(13)

compare_by_index(4)

diff(4)

add_entity(3)

compute_counts(2)

difference(2)

compute_confusion_matrix(2)

compute_ambiguity(2)

compare_by_index_stats(2)

to_html(2)

compare_by_entity(1)

get_entity_count(1)

get_labels(1)

get_spacy_entities(1)

intersection(1)

stats(1)

to_ann(1)

Example #1

Show file

File: dataset.py Project: wolfsohned/medaCy

    def compute_ambiguity(self, dataset):
        """
        Finds occurrences of spans from 'dataset' that intersect with a span from this annotation but do not have this spans label.
        label. If 'dataset' comprises a models predictions, this method provides a strong indicators
        of a model's in-ability to dis-ambiguate between entities. For a full analysis, compute a confusion matrix.

        :param dataset: a Dataset object containing a predicted version of this dataset.
        :param leniency: a floating point value between [0,1] defining the leniency of the character spans to count as different. A value of zero considers only exact character matches while a positive value considers entities that differ by up to :code:`ceil(leniency * len(span)/2)` on either side.
        :return: a dictionary containing the ambiguity computations on each gold, predicted file pair
        """
        if not isinstance(dataset, Dataset):
            raise ValueError("dataset must be instance of Dataset")

        # verify files are consistent
        diff = set([file.ann_path.split(os.sep)[-1] for file in self]).difference(set([file.ann_path.split(os.sep)[-1] for file in dataset]))
        if diff:
            raise ValueError("Dataset of predictions is missing the files: " + str(list(diff)))

        #Dictionary storing ambiguity over dataset
        ambiguity_dict = {}

        for gold_data_file in self:
            prediction_iter = iter(dataset)
            prediction_data_file = next(prediction_iter)
            while str(gold_data_file) != str(prediction_data_file):
                prediction_data_file = next(prediction_iter)

            gold_annotation = Annotations(gold_data_file.ann_path)
            pred_annotation = Annotations(prediction_data_file.ann_path)

            # compute matrix on the Annotation file level
            ambiguity_dict[str(gold_data_file)] = gold_annotation.compute_ambiguity(pred_annotation)


        return ambiguity_dict

Example #2

Show file

File: test_annotation.py Project: yushu-liu/medaCy

 def test_compute_ambiguity(self):
     annotations1 = Annotations(join(self.dataset.get_data_directory(),
                                     self.ann_files[0]),
                                annotation_type='ann')
     annotations2 = Annotations(join(self.dataset.get_data_directory(),
                                     self.ann_files[0]),
                                annotation_type='ann')
     label, start, end, text = annotations2.get_entity_annotations()[0]
     annotations2.add_entity('incorrect_label', start, end, text)
     self.assertEqual(len(annotations1.compute_ambiguity(annotations2)), 1)