Python PersistantBalancedCorpusIO Examples

Programming language: Python

Namespace/Package: io_wrapper

Examples at hotexamples.com: 4

Python PersistantBalancedCorpusIO: 4 examples found. These are the top-rated real-world examples of io_wrapper.PersistantBalancedCorpusIO extracted from open source projects.

Frequently used methods

PersistantBalancedCorpusIO(2)

out(2)

Example #1

File: normalization.py | Project: Gnork/confusion-words

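import nltk
from io_wrapper import PersistantBalancedCorpusIO


def pos_tag_corpus(input_path: str, output_path: str):
    io = PersistantBalancedCorpusIO(input_path, output_path)
    for line in io:
        # Tokenize the line, then tag each token with its part of speech.
        tokens = nltk.word_tokenize(line)
        pos_tags = nltk.pos_tag(tokens)
        # Render each (token, tag) pair as a "token/tag" string.
        pos_tags_str = []
        for pos_tag in pos_tags:
            pos_tags_str.append(nltk.tag.tuple2str(pos_tag))
        result = ' '.join(pos_tags_str)
        io.out(result)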
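The listing does not show how PersistantBalancedCorpusIO is implemented. The only things visible from its usage are that it is constructed from an input path and an output path, that iterating over it yields lines, and that out() accepts one result string. To try the read/transform/write pattern without the original project, a minimal stand-in might look like the sketch below. LineCorpusIO and uppercase_corpus are hypothetical names introduced here, and whatever "persistent" and "balanced" behavior the real class provides is not reproduced.

```python
class LineCorpusIO:
    """Hypothetical stand-in, NOT the real io_wrapper.PersistantBalancedCorpusIO.

    It mirrors only the interface visible in the examples: construct with an
    input path and an output path, iterate to read lines, call out() to write.
    """

    def __init__(self, input_path: str, output_path: str):
        self._input = open(input_path, encoding="utf-8")
        self._output = open(output_path, "w", encoding="utf-8")

    def __iter__(self):
        # Yield each input line without its trailing newline,
        # then close both files once the input is exhausted.
        for line in self._input:
            yield line.rstrip("\n")
        self.close()

    def out(self, text: str) -> None:
        # Write one result line for the line currently being processed.
        self._output.write(text + "\n")

    def close(self) -> None:
        self._input.close()
        self._output.close()


def uppercase_corpus(input_path: str, output_path: str) -> None:
    # Same loop shape as pos_tag_corpus, with a trivial transformation.
    corpus_io = LineCorpusIO(input_path, output_path)
    for line in corpus_io:
        corpus_io.out(line.upper())
```

Calling uppercase_corpus("in.txt", "out.txt") on a two-line file writes the same two lines in upper case. Swapping the stand-in for the real class should only require changing the constructor call, assuming the real interface matches what the examples show.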

Example #2

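import nltk
from io_wrapper import PersistantBalancedCorpusIO


def pos_tag_corpus(input_path: str, output_path: str):
    io = PersistantBalancedCorpusIO(input_path, output_path)
    for line in io:
        # Tokenize the line, then tag each token with its part of speech.
        tokens = nltk.word_tokenize(line)
        pos_tags = nltk.pos_tag(tokens)
        # Render each (token, tag) pair as a "token/tag" string.
        pos_tags_str = []
        for pos_tag in pos_tags:
            pos_tags_str.append(nltk.tag.tuple2str(pos_tag))
        result = ' '.join(pos_tags_str)
        io.out(result)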

Example #3

File: normalization.py | Project: Gnork/confusion-words

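import re
from io_wrapper import PersistantBalancedCorpusIO

# split_tokens_and_pos, join_tokens_and_pos and normalize are helpers
# that are not shown in this excerpt.


def normalize_acrotagged_corpus(input_file, output_file):
    # Matches a span wrapped in III ... III markers.
    rx = re.compile('III[^I]+III')
    io = PersistantBalancedCorpusIO(input_file, output_file)
    for line in io:
        tokens, pos = split_tokens_and_pos(line.split())
        norm_tokens = []
        for token in tokens:
            m = rx.search(token)
            if m:
                # Normalize only the text around the marked span;
                # the span itself is kept unchanged.
                match = m.group(0)
                splits = rx.split(token)
                norm_splits = []
                for split in splits:
                    norm_splits.append(normalize(split))
                norm_token = match.join(norm_splits)
                norm_tokens.append(norm_token)
            else:
                norm_tokens.append(normalize(token))
        result_twp = join_tokens_and_pos(norm_tokens, pos)
        result = ' '.join(result_twp)
        io.out(result)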
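The per-token logic in this example can be tried in isolation. The sketch below reuses the same regular expression and the same search/split/join steps. Because the project's normalize() is not shown, a lowercase-only stand-in is assumed here, and normalize_token is a hypothetical helper name introduced for illustration.

```python
import re

# Same pattern as in the example: a span wrapped in III ... III markers.
MARKER = re.compile('III[^I]+III')


def normalize(text: str) -> str:
    # Hypothetical stand-in for the project's normalize(): lowercase only.
    return text.lower()


def normalize_token(token: str) -> str:
    m = MARKER.search(token)
    if not m:
        # No marked span: normalize the whole token.
        return normalize(token)
    match = m.group(0)
    # Split around the marked span(s), normalize the surrounding pieces,
    # and rejoin them with the first matched span left untouched.
    parts = MARKER.split(token)
    return match.join(normalize(part) for part in parts)


print(normalize_token("HelloIIINASAIIIWorld"))  # helloIIINASAIIIworld
print(normalize_token("Plain"))                 # plain
```

Note that the first match is used as the separator for every split. If a token holds two different marked spans, the second one is replaced by a copy of the first: with this stand-in, "IIIaIIIxIIIbIII" becomes "IIIaIIIxIIIaIII". Whether such tokens occur in the original corpus is not stated in the listing.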

Example #4

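import re
from io_wrapper import PersistantBalancedCorpusIO

# split_tokens_and_pos, join_tokens_and_pos and normalize are helpers
# that are not shown in this excerpt.


def normalize_acrotagged_corpus(input_file, output_file):
    # Matches a span wrapped in III ... III markers.
    rx = re.compile('III[^I]+III')
    io = PersistantBalancedCorpusIO(input_file, output_file)
    for line in io:
        tokens, pos = split_tokens_and_pos(line.split())
        norm_tokens = []
        for token in tokens:
            m = rx.search(token)
            if m:
                # Normalize only the text around the marked span;
                # the span itself is kept unchanged.
                match = m.group(0)
                splits = rx.split(token)
                norm_splits = []
                for split in splits:
                    norm_splits.append(normalize(split))
                norm_token = match.join(norm_splits)
                norm_tokens.append(norm_token)
            else:
                norm_tokens.append(normalize(token))
        result_twp = join_tokens_and_pos(norm_tokens, pos)
        result = ' '.join(result_twp)
        io.out(result)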