Python mapper 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: plos_classification.words

메소드/함수: mapper

hotexamples.com에서의 예제들: 4

Python mapper - 4개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 plos_classification.words.mapper에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: test_words_mapper.py 프로젝트: akud/PLoS-Article-Classification

 def setUp(self):
     categories = [
         'red\n',
         'YelLOw',
         'green',
         'blue'
     ]
     self.mapper = mapper(categories)

예제 #2

파일 보기

파일: test_words_mapper.py 프로젝트: alistairwalsh/PLoS-Article-Classification

 def setUp(self):
     categories = ['red\n', 'YelLOw', 'green', 'blue']
     self.mapper = mapper(categories)

예제 #3

파일 보기

파일: process_sample.py 프로젝트: akud/PLoS-Article-Classification

ytrain = csv.writer(open('data/ytrain.csv','w')) 
ytest = csv.writer(open('data/ytest.csv','w')) 

mindocs = round(0.01*len(s['train']))
maxdocs = round(0.99*len(s['train']))

#create the subject mapping
print datetime.now(), 'creating subject mapping'
#get the subjects
subjects = [f['subject2_hierarchy'] for f in s['train']]
#take the top-level element of each subject for each doc
subjects = [[sub.split('/')[0] for sub in f] for f in subjects]
#sort and take the first one
subjects = [ sorted(sub)[0] for sub in subjects]

mapper = words.mapper(subjects, subjectFile='data/subjects.txt')

#setup word counters
wordcounters = {} 
for textfield in text_fields:
    print datetime.now(), 'creating dictionary for %s' % (textfield)

    wordcounters[textfield] = words.counter(
        [f[textfield] for f in s['train']],
        mindocs=mindocs, maxdocs=maxdocs,
        dictionaryFile='data/dictionary-%s.txt' % (textfield))

#process the sample and write vectors
print datetime.now(), 'converting texts to vectors and storing to csv'
for doc in s['train']:
    subject = sorted([sub.split('/')[0] for sub in doc['subject2_hierarchy']])[0]

예제 #4

파일 보기

ytrain = csv.writer(open('data/ytrain.csv', 'w'))
ytest = csv.writer(open('data/ytest.csv', 'w'))

mindocs = round(0.01 * len(s['train']))
maxdocs = round(0.99 * len(s['train']))

#create the subject mapping
print datetime.now(), 'creating subject mapping'
#get the subjects
subjects = [f['subject2_hierarchy'] for f in s['train']]
#take the top-level element of each subject for each doc
subjects = [[sub.split('/')[0] for sub in f] for f in subjects]
#sort and take the first one
subjects = [sorted(sub)[0] for sub in subjects]

mapper = words.mapper(subjects, subjectFile='data/subjects.txt')

#setup word counters
wordcounters = {}
for textfield in text_fields:
    print datetime.now(), 'creating dictionary for %s' % (textfield)

    wordcounters[textfield] = words.counter(
        [f[textfield] for f in s['train']],
        mindocs=mindocs,
        maxdocs=maxdocs,
        dictionaryFile='data/dictionary-%s.txt' % (textfield))

#process the sample and write vectors
print datetime.now(), 'converting texts to vectors and storing to csv'
for doc in s['train']: