Python chunk_list 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: articlenizer.util

메소드/함수: chunk_list

hotexamples.com에서의 예제들: 5

Python chunk_list - 5개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 articlenizer.util.chunk_list에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: formatting.py 프로젝트: dave-s477/articlenizer

def bio_to_brat_parallel_wrapper(file_names, n_cores):
    """Parallel wrapper for article_list_bio_to_brat

    Args:
        file_names (list of lists): elements: [PosixPath, PosixPath, PosixPath, PosixPath] paths to text, labels and output text and output annotation
        n_cores (int): number of python processes to use (multiprocessing package)
    """
    list_segments = chunk_list(file_names, n_cores)
    with Pool(n_cores) as p:
        p.map(article_list_bio_to_brat, list_segments)

예제 #2

파일 보기

def parse_article_list_parallel_wrapper(in_list, n_cores=4):
    """Parallel wrapper around parse_article_list

    Args:
        in_list ([in_path, out_path]): path to input JATS, location for output plain txt
        n_cores (int, optional): number parallel python processes to spawn (multiprocessing package). Defaults to 4.
    """
    list_segments = chunk_list(in_list, n_cores)
    with Pool(n_cores) as p:
        error_counts = p.map(parse_article_list, list_segments)
    return sum(error_counts)

예제 #3

파일 보기

파일: html_parser.py 프로젝트: dave-s477/articlenizer

def parse_file_list_parallel_wrapper(in_list, out_path='.', n_cores=4):
    """Parallel wrapper for parse_file_list

    Args:
        in_list (list of PosixPaths): List of input files
        out_path (str, optional): Directory in which to write the outputs. Defaults to '.'.
        n_cores (int, optional): Number of python threads to use. Defaults to 4. 

    Returns:
        int: Number of articles extracted 
    """
    list_segments = chunk_list(in_list, n_cores)
    fct_to_execute = partial(parse_file_list, out_path=out_path)
    with Pool(n_cores) as p:
        n_articles = p.map(fct_to_execute, list_segments)
    return sum(n_articles)

예제 #4

파일 보기

파일: articlenizer.py 프로젝트: dave-s477/articlenizer

def preprocess_articles_parallel_wrapper(file_list,
                                         n_cores,
                                         process_unicode=True,
                                         replace_math=True,
                                         correct=True,
                                         corr_cite=True):
    """Parallel wrapper for preprocess_articles

    Args:
        file_list ([input filename, output filename]): pair of file names to read and to write
        n_cores (int): number of python processes to use (multiprocessing package)
        process_unicode (bool, optional): replace unicodes. Defaults to True.
        replace_math (bool, optional): replace math equations. Defaults to True.
        correct (bool, optional): replace string errors. Defaults to True.
        corr_cite (bool, optional): correct citation errors. Defaults to True.
    """
    list_segments = chunk_list(file_list, n_cores)
    fct_to_execute = partial(preprocess_articles,
                             process_unicode=process_unicode,
                             replace_math=replace_math,
                             correct=correct,
                             corr_cite=corr_cite)
    with Pool(n_cores) as p:
        p.map(fct_to_execute, list_segments)

예제 #5

파일 보기

파일: formatting.py 프로젝트: dave-s477/articlenizer

def brat_to_bio_parallel_wrapper(file_names,
                                 n_cores,
                                 process_unicode=True,
                                 replace_math=True,
                                 correct=True,
                                 corr_cite=True):
    """Parallel wrapper for article_list_brat_to_bio

    Args:
        file_names (list of lists): elements: [PosixPath, PosixPath, PosixPath] paths to text, annotation and output base path
        n_cores (int): number of python processes to use (multiprocessing package)
        process_unicode (bool, optional): replace unicodes. Defaults to True.
        replace_math (bool, optional): replace math equations. Defaults to True.
        correct (bool, optional): replace string errors. Defaults to True.
        corr_cite (bool, optional): correct citation errors. Defaults to True.
    """
    list_segments = chunk_list(file_names, n_cores)
    fct_to_execute = partial(article_list_brat_to_bio,
                             process_unicode=process_unicode,
                             replace_math=replace_math,
                             correct=correct,
                             corr_cite=corr_cite)
    with Pool(n_cores) as p:
        p.map(fct_to_execute, list_segments)