Python QRelFile.key2s 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: JudgeFile

클래스/타입: QRelFile

메소드/함수: key2s

hotexamples.com에서의 예제들: 4

Python QRelFile.key2s - 4개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 JudgeFile.QRelFile.key2s에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

key2s(4)

keys(4)

get(1)

get_value(1)

has_key(1)

자주 사용되는 메소드들

key2s (4)

keys (4)

get (1)

get_value (1)

has_key (1)

예제 #1

파일 보기

파일: WindowExtractor.py 프로젝트: DrDub/window_shopper

def test_extract_text(judge_path, index_path):
    judge_file = QRelFile(judge_path);
    docnos = judge_file.key2s();
    print 'doc number:', len(docnos);
    for docno in filter(is_cluewebB, docnos)[:3]:
        text = extract_text(docno, index_path);
        print text
        print '-' * 20

예제 #2

파일 보기

파일: TextExtractor.py 프로젝트: jinghe/window_shopper

def test_extract_text(judge_path, index_path, collection_type):
    judge_file = QRelFile(judge_path);
    docnos = judge_file.key2s();
    print 'doc number:', len(docnos);
    for docno in docnos[:1]:
        text = extract_text(docno, index_path, collection_type);
        print text
        print '-' * 20

예제 #3

파일 보기

파일: TextExtractor.py 프로젝트: jinghe/window_shopper

def exe_extract_text(judge_path, index_path, out_path, collection_type = 'html'):
    '''
        extract texts of docs in qrel from an index, and store them in out_path in standard trec format
    '''
    import Corpus
    judge_file = QRelFile(judge_path);
    docnos = judge_file.key2s();
    print 'doc number:', len(docnos);
    writer = Corpus.TRECWriter(out_path);
    for docno in docnos:
        text = extract_text(docno, index_path, collection_type)
        writer.write(Corpus.Document(docno, text))

예제 #4

파일 보기

파일: WindowExtractor.py 프로젝트: DrDub/window_shopper

def exe_extract_text(judge_path, index_path, text_db_path):
    judge_file = QRelFile(judge_path);
    docnos = judge_file.key2s();
    docnos = filter(is_cluewebB, docnos);
    #docnos = docnos[:1000];
    print 'doc number:', len(docnos);
    db = bsddb.hashopen(text_db_path, 'w');
    count = 0;
    texts = fastmap.fastmap(lambda docno: extract_text(docno, index_path), 30, docnos);
    assert len(docnos) == len(texts);
    for i in xrange(len(docnos)): 
        db[docnos[i]] = texts[i];
    db.close();