def index_processed_file(index, writer):
    """Build index records for one processed data file.

    Scans ``gram2_<index>.processed`` under GPFS_STORAGE and, for every
    parent line, writes a packed IndexEntry to *writer*: the md5 hex
    digest of the parent word, the file index, the byte offset where the
    parent line starts, and the byte length of the parent line plus its
    child lines.

    Raises ValueError when a non-empty line that is not a parent line is
    encountered; stops at the trailing empty line.

    NOTE(review): Python 2 style — ``md5.update(word)`` is called with a
    str; on Python 3 the word would need encoding to bytes first.
    """
    datafile = os.path.join(GPFS_STORAGE, "gram2_%s.processed" % str(index))
    with open(datafile, 'r') as f:
        pos = 0  # byte offset where the current line starts
        line = f.readline()
        while True:
            if is_parent_line(line):
                word, skip_lines, _ = parse_parent_line(line)
                starting_pos = pos
                md5 = hashlib.md5()
                md5.update(word)
                word_hash = md5.hexdigest()
                # Consume the child lines that belong to this parent.
                # NOTE(review): this skips `skip_lines` full lines, while
                # create_filter() skips only `skips - 1` per parent —
                # confirm which interpretation of parse_parent_line's
                # count is the intended one.
                for i in range(0, skip_lines):
                    f.readline()
                # Chunk spans the parent line plus its skipped children.
                chunk_size = f.tell() - starting_pos
                index_entry = IndexEntry(word_hash, index, starting_pos, chunk_size)
                writer.write(index_entry.pack())
                pos = f.tell()
            else:
                if line == '':
                    break  # last line is empty in the data file, we are done here
                else:
                    raise ValueError('Improper data file %s' % datafile)
            line = f.readline()
def extract_parent_word(index, starting, chunk_size):
    """Return the parent word stored at a given chunk of a processed file.

    Reads ``chunk_size`` bytes at byte offset ``starting`` from
    ``gram2_<index>.processed`` under GPFS_STORAGE and parses the first
    line of that chunk.  Returns None when that line is not a parent line.
    """
    datafile = os.path.join(GPFS_STORAGE, "gram2_%s.processed" % str(index))
    with open(datafile, 'r') as df:
        df.seek(starting)
        chunk = df.read(chunk_size)
        first_line = chunk.split("\n")[0]
        if is_parent_line(first_line):
            word, _, _ = parse_parent_line(first_line)
            return word
        return None
def create_filter(datafile, force=False):
    """Build a Bloom filter of the parent words in *datafile*.

    The filter is written next to the data file as ``<name>.filter``.
    Construction is skipped when that file already exists, unless *force*
    is true.

    NOTE(review): Python 2 code (uses ``xrange``).  ``assert`` is stripped
    under ``-O``; consider raising explicitly for input validation.
    """
    assert os.path.isfile(datafile)
    datadir, datafilename = os.path.split(datafile)
    filter_file = os.path.join(datadir, datafilename + ".filter")
    if force or not os.path.isfile(filter_file):
        # Capacity is passed as a float (1e6); presumably BloomFilter
        # accepts that — TODO confirm an int is not required.
        bf = BloomFilter(capacity=1e6)
        with open(datafile) as df:
            line = next(df)
            try:
                while True:
                    if is_parent_line(line):
                        word, skips, _ = parse_parent_line(line)
                        bf.add(word)
                        # Consume this parent's child lines.
                        # NOTE(review): skips `skips - 1` lines here, while
                        # index_processed_file() skips `skip_lines` full
                        # lines per parent — confirm which count is right.
                        for i in xrange(1, skips):
                            next(df)
                    # Non-parent lines are silently skipped here, unlike
                    # index_processed_file() which raises ValueError —
                    # presumably intentional best-effort; verify.
                    line = next(df)
            except StopIteration:
                # End of file reached: persist the filter to disk.
                with open(filter_file, 'w') as ff:
                    bf.tofile(ff)
                del bf
    print("%s done." % filter_file)