Python get_compounds Examples

Programming language: Python

Namespace/package name: chemtagger

Method/function: get_compounds

Examples at hotexamples.com: 2

Python get_compounds - 2 examples found. These are the top-rated real-world Python examples of chemtagger.get_compounds, extracted from open source projects.

Example #1

File: train.py Project: jtsui/bionlp

import json

import chemtagger  # project-local module (assumed import path)


def tag_sentences():
    '''
    input: train_sentences.json
    output: train_tag_sentences.json
    Tags the chemicals in each sentence using ChemicalTagger
    (http://chemicaltagger.ch.cam.ac.uk/). This method communicates with
    ChemicalTagger through a custom REST API running on pathway.berkeley.edu
    '''
    with open('../data/train_sentences.json') as f:
        sentences = json.load(f)
    # pbar is a progress-bar helper defined elsewhere in the project
    bar, i = pbar(len(sentences)), 0
    print('Tagging chemicals in sentences')
    bar.start()
    chemicals = {}
    for sid, sentence in sentences.items():
        chems = chemtagger.get_compounds(sid, sentence)
        # Keep only sentences in which at least one chemical was found
        if chems:
            chemicals[sid] = chems
        i += 1
        bar.update(i)
    bar.finish()
    # Text mode: json.dump writes str, so 'wb' would fail on Python 3
    with open('../data/train_tag_sentences.json', 'w') as f:
        json.dump(chemicals, f, indent=2, sort_keys=True)
    print('Result dumped to ../data/train_tag_sentences.json')
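
The collection loop in Example #1 can be tried without the REST service. The sketch below is only an illustration: `fake_get_compounds` and `tag_all` are hypothetical names, and the stand-in tagger assumes (based on how the examples use the result) that `get_compounds` returns a list of compound names, or a falsy value when nothing is found.

```python
import json


def fake_get_compounds(sid, sentence):
    """Hypothetical stand-in for chemtagger.get_compounds (no network access)."""
    known = {"ethanol", "acetaldehyde"}  # toy vocabulary, for illustration only
    found = [w for w in sentence.split() if w.lower() in known]
    return found or None


def tag_all(sentences, tagger=fake_get_compounds):
    """Map sentence id -> tagged compounds, skipping sentences with none."""
    chemicals = {}
    for sid, sentence in sentences.items():
        chems = tagger(sid, sentence)
        if chems:  # None and empty lists are both skipped
            chemicals[sid] = chems
    return chemicals


sentences = {
    "s1": "ethanol is oxidized to acetaldehyde",
    "s2": "the enzyme is stable",
}
result = tag_all(sentences)
print(json.dumps(result, indent=2, sort_keys=True))
```

Passing the tagger in as a parameter keeps the loop testable; the real function could be supplied in its place.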

Example #2

File: patterns.py Project: jtsui/bionlp

def extract(sid, sentence):
    reactants = []
    chemicals = chemtagger.get_compounds(sid, sentence)
    if chemicals is None:
        return reactants
    chemicals = sanitize_chemicals(chemicals)
    # Tokens that belong to a chemical name are left unstemmed
    chems = {y for x in chemicals for y in x.split()}
    stemmed_sentence = ' '.join(x if x in chems else STEMMER.stem(x)
                                for x in sentence.split())
    # Pad with spaces so names at the sentence boundaries also match ' name '
    tagged_sentence = ' %s ' % stemmed_sentence
    # Longest names first, so a short name cannot split a longer one
    for chem in sorted(chemicals, key=len, reverse=True):
        tagged_sentence = tagged_sentence.replace(
            ' %s ' % chem, ' $%s$chem ' % chem)
    tagged_sentence = tagged_sentence.strip()
    grouped_sentence = group_list(tagged_sentence)
    # Build the pattern table once rather than on every outer iteration
    for pattern_id, patterns in expand_patterns().items():
        for pattern in patterns:
            # overlapped=True is supported by the third-party regex module, not re
            groups = pattern.findall(grouped_sentence, overlapped=True)
            matches = expand_chems(groups)
            for match in matches:
                if match and len(match) > 1:
                    reactants.append((pattern_id, match))
    return reactants
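
The tagging step in Example #2 wraps each chemical name in `$...$chem` markers, processing the longest names first. The following minimal sketch isolates that step; `tag_chemicals` is a hypothetical helper name, and the input sentence and names are made up for illustration.

```python
def tag_chemicals(sentence, chemicals):
    """Wrap each chemical name in $...$chem markers, longest names first."""
    # Pad with spaces so names at either end of the sentence still match
    tagged = ' %s ' % sentence
    for chem in sorted(chemicals, key=len, reverse=True):
        tagged = tagged.replace(' %s ' % chem, ' $%s$chem ' % chem)
    return tagged.strip()


tagged = tag_chemicals(
    "acetic acid reacts with ethanol",
    ["acid", "acetic acid", "ethanol"],
)
print(tagged)  # $acetic acid$chem reacts with $ethanol$chem
```

Because "acetic acid" is replaced before "acid", the shorter name no longer appears as a space-delimited token and cannot break up the longer one.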