Python ACRONYM_REGEX примеры использования

Язык программирования: Python

Пространство имен/Пакет: textacy.constants

Класс/Тип: ACRONYM_REGEX

Примеров на hotexamples.com: 8

Python ACRONYM_REGEX - 8 примеров найдено. Это лучшие примеры Python кода для textacy.constants.ACRONYM_REGEX, полученные из open source проектов. Вы можете ставить оценку каждому примеру, чтобы помочь нам улучшить качество примеров.

Основные методы

Показать Скрыть

search(4)

match(1)

Пример #1

Показать файл

Файл: text_utils.py Проект: nigeljyng/textacy

def is_acronym(token, exclude=None):
    """
    Pass single token as a string, return True/False if is/is not valid acronym.

    Args:
        token (str): single word to check for acronym-ness
        exclude (Set[str]): if technically valid but not actually good acronyms
            are known in advance, pass them in as a set of strings; matching
            tokens will return False

    Returns:
        bool
    """
    # exclude certain valid acronyms from consideration
    if exclude and token in exclude:
        return False
    # don't allow empty strings
    if not token:
        return False
    # don't allow spaces
    if ' ' in token:
        return False
    # 2-character acronyms can't have lower-case letters
    if len(token) == 2 and not token.isupper():
        return False
    # acronyms can't be all digits
    if token.isdigit():
        return False
    # acronyms must have at least one upper-case letter or start/end with a digit
    if (not any(char.isupper() for char in token)
            and not (token[0].isdigit() or token[-1].isdigit())):
        return False
    # acronyms must have between 2 and 10 alphanumeric characters
    if not 2 <= sum(1 for char in token if char.isalnum()) <= 10:
        return False
    # only certain combinations of letters, digits, and '&/.-' allowed
    if not ACRONYM_REGEX.match(token):
        return False
    return True

Пример #2

Показать файл

Файл: text_utils.py Проект: chartbeat-labs/textacy

def is_acronym(token, exclude=None):
    """
    Pass single token as a string, return True/False if is/is not valid acronym.

    Args:
        token (str): single word to check for acronym-ness
        exclude (Set[str]): if technically valid but not actually good acronyms
            are known in advance, pass them in as a set of strings; matching
            tokens will return False

    Returns:
        bool
    """
    # exclude certain valid acronyms from consideration
    if exclude and token in exclude:
        return False
    # don't allow empty strings
    if not token:
        return False
    # don't allow spaces
    if ' ' in token:
        return False
    # 2-character acronyms can't have lower-case letters
    if len(token) == 2 and not token.isupper():
        return False
    # acronyms can't be all digits
    if token.isdigit():
        return False
    # acronyms must have at least one upper-case letter or start/end with a digit
    if (not any(char.isupper() for char in token) and
            not (token[0].isdigit() or token[-1].isdigit())):
        return False
    # acronyms must have between 2 and 10 alphanumeric characters
    if not 2 <= sum(1 for char in token if char.isalnum()) <= 10:
        return False
    # only certain combinations of letters, digits, and '&/.-' allowed
    if not ACRONYM_REGEX.match(token):
        return False
    return True

Пример #3

Показать файл

def test_bad_acronym_regex():
    for item in BAD_ACRONYMS:
        assert ACRONYM_REGEX.search(item) is None

Пример #4

Показать файл

def test_good_acronym_regex():
    for item in GOOD_ACRONYMS:
        assert item == ACRONYM_REGEX.search(item).group()

Пример #5

Показать файл

 def test_bad_acronym_regex(self):
     for item in BAD_ACRONYMS:
         self.assertIsNone(ACRONYM_REGEX.search(item))

Пример #6

Показать файл

 def test_good_acronym_regex(self):
     for item in GOOD_ACRONYMS:
         self.assertEqual(item, ACRONYM_REGEX.search(item).group())

Пример #7

Показать файл

Файл: test_constants.py Проект: chartbeat-labs/textacy

 def test_bad_acronym_regex(self):
     for item in BAD_ACRONYMS:
         self.assertIsNone(ACRONYM_REGEX.search(item))

Пример #8

Показать файл

Файл: test_constants.py Проект: chartbeat-labs/textacy

 def test_good_acronym_regex(self):
     for item in GOOD_ACRONYMS:
         self.assertEqual(item, ACRONYM_REGEX.search(item).group())