Python CasUtil.get_annotations Examples

Programming language: Python

Namespace/package name: pipeline

Class/type: CasUtil

Method/function: get_annotations

Examples at hotexamples.com: 4

Python CasUtil.get_annotations - 4 examples found. These are the top-rated real-world Python examples of pipeline.CasUtil.get_annotations, extracted from open source projects.

Frequently used methods

get_annotations(4)

get_all_annotations(3)

has_annotation(1)
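
The definition of CasUtil itself is not shown on this page. As a rough mental model only, the three helpers listed above might behave like the sketch below. The `Annotation` and `Cas` classes are hypothetical stand-ins whose attributes are inferred from the calls in the examples (the "view" is simplified to the raw document text), and the matching rule in `has_annotation` is an assumption; the real signatures in Tooa/BTWitter may differ.

```python
from dataclasses import dataclass, field
from typing import Iterator, List


@dataclass
class Annotation:
    """Hypothetical stand-in: a typed span over the document text."""
    view: str          # simplified: the raw document text
    begin: int
    end: int
    type: str
    value: str = ""

    def get_covered_text(self) -> str:
        return self.view[self.begin:self.end]


@dataclass
class Cas:
    """Hypothetical stand-in: a document plus its annotations."""
    artifact: str
    annotations: List[Annotation] = field(default_factory=list)

    def get_view(self) -> str:
        return self.artifact

    def add_fs_annotation(self, annot: Annotation) -> None:
        self.annotations.append(annot)


class CasUtil:
    """Assumed behaviour, inferred from the examples on this page."""

    @staticmethod
    def get_all_annotations(cas: Cas) -> Iterator[Annotation]:
        yield from cas.annotations

    @staticmethod
    def get_annotations(cas: Cas, type_name: str) -> Iterator[Annotation]:
        # Lazily yield only the annotations of the requested type.
        return (a for a in cas.annotations if a.type == type_name)

    @staticmethod
    def has_annotation(cas: Cas, text: str, type_name: str) -> bool:
        # Assumption: a match means some annotation of that type covers exactly `text`.
        return any(a.get_covered_text() == text
                   for a in CasUtil.get_annotations(cas, type_name))


cas = Cas("Angela Merkel spricht")
for begin, end in [(0, 6), (7, 13), (14, 21)]:
    cas.add_fs_annotation(Annotation(cas.get_view(), begin, end, "Token"))
cas.add_fs_annotation(Annotation(cas.get_view(), 0, 13, "NER"))

tokens = [a.get_covered_text() for a in CasUtil.get_annotations(cas, "Token")]
```

Returning a generator from `get_annotations` in this sketch matches how Example #2 consumes the result with `next()`.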

Example #1

File: normalizer.py Project: Tooa/BTWitter

    def process(self, cas):
        # Visit every "Token" annotation in the CAS.
        for token_annot in CasUtil.get_annotations(cas, "Token"):
            token = token_annot.get_covered_text()
            normalized = self.normalize_word_token(token)

            # Add an "Error" annotation only when normalization changed the token.
            if normalized != token:
                norm_annot = Annotation(cas.get_view(), token_annot.begin, token_annot.end, "Error", normalized)
                cas.add_fs_annotation(norm_annot)

Example #2

File: casConsumer.py Project: Tooa/BTWitter

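    def process(self, cas):
        # Take the first "Language" annotation of the document.
        lang = next(CasUtil.get_annotations(cas, "Language"))

        # Only documents whose language value is "de" are processed.
        if lang.value != "de":
            return

        filtered_token = []
        for annot in CasUtil.get_all_annotations(cas):
            self.add_to_filtered_token(annot, filtered_token)

        # Map each lowercased token to 1 if has_annotation reports 'NER' for it, else 0.
        for t in filtered_token:
            self.unique_token[t.lower()] = 1 if CasUtil.has_annotation(cas, t, 'NER') else 0

        self.write_output_files(cas, filtered_token)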
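
Example #2 calls `next()` on the result of `get_annotations`, which suggests that the method returns an iterator. With the one-argument form, `next()` raises `StopIteration` when the iterator is empty, so a document without a "Language" annotation would abort `process`. A minimal defensive sketch follows; `first_or_none` and `language_annotations` are hypothetical helpers, and a plain generator stands in for the real `CasUtil.get_annotations(cas, "Language")` call.

```python
def first_or_none(iterator):
    """Return the first item of an iterator, or None if it is empty."""
    return next(iterator, None)


def language_annotations(values):
    # Hypothetical stand-in for CasUtil.get_annotations(cas, "Language").
    return (v for v in values)


found = first_or_none(language_annotations(["de"]))
missing = first_or_none(language_annotations([]))

# The one-argument form raises instead of returning a default.
try:
    next(language_annotations([]))
    raised = False
except StopIteration:
    raised = True
```

A caller could then check for `None` before reading `.value`, instead of relying on the annotation always being present.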

Example #3

File: tokenTagger.py Project: Tooa/BTWitter

    def process(self, cas):
        for token_annot in CasUtil.get_annotations(cas, "Token"):
            token = token_annot.get_covered_text()
            if self.is_token_to_tag(token):
                annot = Annotation(cas.get_view(), token_annot.begin, token_annot.end, self.get_token_type())
                cas.add_fs_annotation(annot)

Example #4

File: casConsumer.py Project: Tooa/BTWitter

    def write_output_files(self, cas, filtered_token):
        # One row with the full document text, one with the filtered tokens.
        self.sent_writer.writerow([cas.document_id, cas.date, cas.artifact])
        self.token_writer.writerow([cas.document_id, cas.date, " ".join(filtered_token)])

        # One row with the covered text of every "Token" annotation, space-separated.
        raw_token = [annot.get_covered_text() for annot in CasUtil.get_annotations(cas, "Token")]
        self.raw_token_writer.writerow([" ".join(raw_token)])
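
The `writerow` calls in Example #4 look like the standard library's `csv.writer` interface, although the page does not show how the writers are created. Assuming that is the case, the row layout can be tried out in isolation. The document id, date, and tokens below are made-up illustration values, and an in-memory buffer replaces the output files.

```python
import csv
import io

# In-memory buffer in place of a real output file.
buffer = io.StringIO()
token_writer = csv.writer(buffer, lineterminator="\n")

# Made-up values standing in for cas.document_id, cas.date and the tokens.
raw_token = ["Das", "ist", "ein", "Test"]
token_writer.writerow(["42", "2015-01-01", " ".join(raw_token)])
token_writer.writerow([" ".join(raw_token)])

output = buffer.getvalue()
```

Joining the tokens with a space keeps each document on a single row, so the token column can be split again with `str.split()` when the file is read back.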