Example #1
0
def token_link_text(text, distance=10):
    """Produce the combined token-frequency / token-link JSON structure.

    The input is cleaned of punctuation, stop words are swapped for
    placeholders, and the resulting token list is handed to ``link_op``
    with the requested link ``distance``.
    """
    cleaned = remove_punctuation(text)
    tokens = stop_word_placeheld(cleaned)
    return link_op(tokens, distance=distance)
 def test_freq_dist_dict_full(self):
     """Build the token index for the 2011-1-19 corpus and dump it to disk.

     NOTE(review): despite the name this exercises ``token_index``, not
     ``freq_dist_dict`` — consider renaming to avoid clashing with the
     frequency-distribution test of the same name.
     """
     source_path = '{}{}'.format(base_resources, '2011-1-19raw.txt')
     with open(source_path, 'r') as f:
         text = remove_punctuation(f.read().decode('utf-8'))
         index = token_index(stop_words(text))
         target_path = '{}{}'.format(target_out, '2011-1-19token_index')
         with open(target_path, 'w') as out_file:
             out_file.write(pformat(index))
    def test_freq_dist_dict_full(self):
        """Check 'year' frequencies before and after punctuation removal.

        Also writes the final frequency distribution to the target
        directory for manual inspection.
        """
        with open('{}{}'.format(base_resources, '2011-1-19raw.txt'), 'r') as f:
            text = f.read().decode('utf-8')

            # On the raw text, 'year' should occur strictly between 8 and 12 times.
            freq_dist = freq_dist_dict(stop_words(text).split())
            self.assertGreater(freq_dist[u'year'], 8)
            self.assertLess(freq_dist[u'year'], 12)

            # Stripping punctuation merges more occurrences into the bare token.
            cleaned = remove_punctuation(text)
            freq_dist = freq_dist_dict(stop_words(cleaned).split())
            self.assertGreater(freq_dist[u'year'], 16)

            target_path = '{}{}'.format(target_out, '2011-1-19freq_dist_dict')
            with open(target_path, 'w') as out_file:
                out_file.write(pformat(freq_dist))
Example #4
0
def create_tokens(text):
    """Return the frequency distribution of *text* after cleaning.

    Punctuation is removed and stop words are replaced with placeholders
    before counting.
    """
    cleaned = remove_punctuation(text)
    placeheld = stop_word_placeheld(cleaned)
    return freq_dist_dict(placeheld)
 def test_punctuation_removal_unicode(self):
     """remove_punctuation must accept unicode input and match the unicode
     form of the known-good string."""
     dirty = unicode(self.ick_str)
     cleaned = remove_punctuation(dirty, punct=self.punct)
     self.assertEqual(cleaned, unicode(self.good_str))