Python Utilities.get_alpha_numeric_count Examples

Programming language: Python

Class/type: Utilities

Method/function: get_alpha_numeric_count

Examples at hotexamples.com: 3

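Python Utilities.get_alpha_numeric_count: 3 examples found. These are the top-rated real-world Python examples of Utilities.get_alpha_numeric_count, extracted from open source projects. The page attributes the method to a package named facebook_page_scraper, although all three examples shown here come from the mchrzanowski/SEC10KParser project.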
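The implementation of get_alpha_numeric_count is not shown on this page. Going by the name and by how the examples use it, it presumably returns the number of letters and digits in a string. The following is a minimal sketch under that assumption; the function body is hypothetical and is not taken from SEC10KParser.

```python
def get_alpha_numeric_count(text):
    """Hypothetical stand-in: count the characters in `text` that are letters or digits.

    Note that str.isalnum() in Python 3 also accepts non-ASCII letters and digits.
    """
    return sum(1 for ch in text if ch.isalnum())


# Whitespace and punctuation are ignored, so only 21 characters are counted.
print(get_alpha_numeric_count("Item 3. Legal Proceedings"))  # prints 21
```

Counting only alphanumeric characters makes the result insensitive to differences in whitespace and punctuation, which is likely why the examples use it to compare parsed text rather than comparing raw lengths.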

Example #1

File: RegressionTest.py Project: mchrzanowski/SEC10KParser

def _character_count_test(CIK, filing_year, new_data, corpus_file):
    # Count alphanumeric characters in the freshly parsed output.
    parser_alpha_numeric_count = Utilities.get_alpha_numeric_count(''.join(new_data))

    # Count alphanumeric characters in the stored reference text.
    with open(corpus_file, 'r') as f:
        text_from_file = f.read()
    file_alpha_numeric_count = Utilities.get_alpha_numeric_count(text_from_file)

    # Relative change; float() avoids integer division under Python 2.
    # Note: an empty corpus file (count of 0) would still raise ZeroDivisionError.
    change = (parser_alpha_numeric_count - file_alpha_numeric_count) / float(file_alpha_numeric_count)
    result = abs(change) < Constants.REGRESSION_CHAR_COUNT_CHANGE_THRESHOLD

    # A single parenthesized argument prints the same under Python 2 and 3.
    print("CIK:%r, Year:%r, New Count:%r, Corpus Count:%r, Passed:%r" % (
        CIK, filing_year, parser_alpha_numeric_count, file_alpha_numeric_count, result))

    if not result:
        CorpusAccess.write_comparison_to_file(new_data, text_from_file, CIK, filing_year)
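The pass/fail rule in this example is a relative-change check: the new count passes if it differs from the reference count by less than some fraction. A standalone sketch of that rule might look like the following. The function name, the zero-reference handling, and the 0.05 threshold are assumptions for illustration; the actual value of Constants.REGRESSION_CHAR_COUNT_CHANGE_THRESHOLD is not shown on this page.

```python
def within_relative_threshold(new_count, reference_count, threshold):
    """Return True if new_count deviates from reference_count by less than
    `threshold`, expressed as a fraction of reference_count."""
    if reference_count == 0:
        # Assumed convention: with an empty reference, only an empty result passes.
        return new_count == 0
    # float() keeps this a true division under both Python 2 and Python 3.
    change = (new_count - reference_count) / float(reference_count)
    return abs(change) < threshold


# Hypothetical 5% tolerance.
print(within_relative_threshold(1040, 1000, 0.05))  # 4% change
print(within_relative_threshold(1100, 1000, 0.05))  # 10% change
```

Handling a zero reference count explicitly is a design choice of this sketch; the original function has no such guard.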

Example #2

File: parser.py Project: mchrzanowski/SEC10KParser

def _transform_list_of_hits_into_result(recorder, record_header):
    record = ''.join(recorder)

    #print "original:", record
    record = _cut_text_if_needed(record)
    #print "post:", record
    
    if re.search("SUBSEQUENT", record_header, re.I):
        if not _does_section_mention_litigation(record):
            record = None

    # Almost all valid records contain at least 200 alphanumeric characters;
    # anything shorter is probably something we don't want.
    if record is not None and Utilities.get_alpha_numeric_count(record) < 200:
        record = None
    
    return record

Example #3

File: LegalProceedingParsing.py Project: mchrzanowski/SEC10KParser

def _get_best_result(results):
    ''' get the result with the smallest number of alphanumeric characters '''

    # None (rather than 0) marks "nothing seen yet", so a result whose
    # count is legitimately 0 is not mistaken for the initial state.
    min_count = None
    return_result = None

    for result in results:
        count = Utilities.get_alpha_numeric_count(result)

        if min_count is None or count < min_count:
            min_count = count
            return_result = result

    return return_result
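The selection loop in this example can also be expressed with the built-in min and a key function. The sketch below is one possible equivalent, not the project's code: pick_smallest is a hypothetical name, the lambda is an assumed stand-in for Utilities.get_alpha_numeric_count, and the default argument of min requires Python 3.4 or later.

```python
def pick_smallest(results, key):
    """Return the element of `results` with the smallest key value,
    or None if `results` is empty. Ties go to the earliest element."""
    return min(results, key=key, default=None)


# Assumed stand-in for Utilities.get_alpha_numeric_count.
alnum_count = lambda s: sum(1 for ch in s if ch.isalnum())

candidates = ["abc def", "a-b", "abcd"]   # counts: 6, 2, 4
print(pick_smallest(candidates, key=alnum_count))  # prints a-b
print(pick_smallest([], key=alnum_count))          # prints None
```

Like the explicit loop with a strict less-than comparison, min keeps the first element when several share the smallest count.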