def _collect_multifield(self, recIDs, termslist):
    """Calculate terms from many fields or tags.

    Used together with the multifield tokenizer: the tokenizing
    function is called with the record ID itself (it resolves the
    fields internally).

    @param recIDs: iterable of record IDs to process
    @param termslist: dict mapping recID -> list of terms; updated
        in place and also returned
    @return: the (mutated) termslist dict
    """
    # Hoist the attribute lookup out of the loop.
    tokenizing_function = self.tokenizing_function
    for recID in recIDs:
        new_words = tokenizing_function(recID)
        # dict.get with a default replaces the original two-step
        # "if recID not in termslist: termslist[recID] = []" init.
        termslist[recID] = list_union(new_words, termslist.get(recID, []))
    return termslist
def _collect_recjson(self, recIDs, termslist):
    """Collect terms from recjson with use of bibfield.

    Used together with the recjson tokenizer: each record is fetched
    via get_record and the whole record object is tokenized. Records
    that cannot be fetched (falsy result) are skipped silently.

    @param recIDs: iterable of record IDs to process
    @param termslist: dict mapping recID -> list of terms; updated
        in place and also returned
    @return: the (mutated) termslist dict
    """
    tokenizing_function = self.tokenizing_function
    for recID in recIDs:
        record = get_record(recID)
        if not record:
            # Best-effort: unfetchable records are skipped, as before.
            continue
        new_words = tokenizing_function(record)
        # dict.get with a default replaces the original two-step
        # membership-test-then-init pattern.
        termslist[recID] = list_union(new_words, termslist.get(recID, []))
    return termslist
def _collect_string(self, recIDs, termslist):
    """Collect terms from specific tags or fields.

    Used together with the string tokenizer: for every configured tag,
    fetch the (recID, phrase) rows and tokenize each phrase, merging
    the resulting terms into termslist. Rows whose recID is not in
    recIDs are ignored.

    @param recIDs: iterable of record IDs to process
    @param termslist: dict mapping recID -> list of terms; updated
        in place and also returned
    @return: the (mutated) termslist dict
    """
    # Membership test runs once per returned row; a set makes it O(1)
    # instead of an O(n) list scan per row.
    wanted_recIDs = set(recIDs)
    for tag in self.tags:
        # Some tags carry a dedicated tokenizer; fall back to the default.
        tokenizing_function = self.special_tags.get(tag, self.tokenizing_function)
        for recID, phrase in self._get_phrases_for_tokenizing(tag, recIDs):
            if recID in wanted_recIDs:
                new_words = tokenizing_function(phrase)
                termslist[recID] = list_union(new_words, termslist.get(recID, []))
    return termslist
def _collect_string(self, recIDs, termslist):
    """Collect terms from specific tags or fields.

    Used together with the string tokenizer: for every configured tag,
    fetch the (recID, phrase) rows and tokenize each phrase, merging
    the resulting terms into termslist. Rows whose recID is not in
    recIDs are ignored.

    @param recIDs: iterable of record IDs to process
    @param termslist: dict mapping recID -> list of terms; updated
        in place and also returned
    @return: the (mutated) termslist dict
    """
    # Membership test runs once per returned row; a set makes it O(1)
    # instead of an O(n) list scan per row.
    wanted_recIDs = set(recIDs)
    for tag in self.tags:
        # Some tags carry a dedicated tokenizer; fall back to the default.
        tokenizing_function = self.special_tags.get(
            tag, self.tokenizing_function)
        for recID, phrase in self._get_phrases_for_tokenizing(tag, recIDs):
            if recID in wanted_recIDs:
                new_words = tokenizing_function(phrase)
                termslist[recID] = list_union(new_words, termslist.get(recID, []))
    return termslist
def _collect_string(self, recIDs, termslist):
    """Collect terms from specific tags or fields.

    Used together with the string tokenizer (recjson flavour): each
    record is fetched once, every configured tag's values are gathered
    recursively and tokenized, and the accumulated terms are merged
    into termslist. Records producing no terms leave termslist
    untouched, matching the original behavior.

    @param recIDs: iterable of record IDs to process
    @param termslist: dict mapping recID -> list of terms; updated
        in place and also returned
    @return: the (mutated) termslist dict
    """
    for recID in recIDs:
        rec = get_record(recID)
        new_words = []
        for tag in self.tags:
            # Some tags carry a dedicated tokenizer; fall back to the default.
            tokenizing_function = self.special_tags.get(tag, self.tokenizing_function)
            phrases = []
            # NOTE(review): rec.get(tag) may be None for absent tags —
            # get_values_recursively is presumably None-tolerant; confirm.
            get_values_recursively(rec.get(tag), phrases)
            for phrase in phrases:
                new_words.extend(tokenizing_function(phrase))
        # Single guarded merge replaces the original's duplicated
        # "if new_words" checks; dict.get supplies the empty default.
        if new_words:
            termslist[recID] = list_union(new_words, termslist.get(recID, []))
    return termslist
def _collect_string(self, recIDs, termslist):
    """Collect terms from specific tags or fields.

    Used together with the string tokenizer (recjson flavour): each
    record is fetched once, every configured tag's values are gathered
    recursively and tokenized, and the accumulated terms are merged
    into termslist. Records producing no terms leave termslist
    untouched, matching the original behavior.

    @param recIDs: iterable of record IDs to process
    @param termslist: dict mapping recID -> list of terms; updated
        in place and also returned
    @return: the (mutated) termslist dict
    """
    for recID in recIDs:
        rec = get_record(recID)
        new_words = []
        for tag in self.tags:
            # Some tags carry a dedicated tokenizer; fall back to the default.
            tokenizing_function = self.special_tags.get(
                tag, self.tokenizing_function)
            phrases = []
            # NOTE(review): rec.get(tag) may be None for absent tags —
            # get_values_recursively is presumably None-tolerant; confirm.
            get_values_recursively(rec.get(tag), phrases)
            for phrase in phrases:
                new_words.extend(tokenizing_function(phrase))
        # Single guarded merge replaces the original's duplicated
        # "if new_words" checks; dict.get supplies the empty default.
        if new_words:
            termslist[recID] = list_union(new_words, termslist.get(recID, []))
    return termslist
def test_list_union(self):
    """bibindex engine utils - list union"""
    merged = list_union([1, 2, 3], [1, 3, 4])
    expected = [1, 2, 3, 4]
    self.assertEqual(expected, merged)