Python Utilities.character_counter 예제들

프로그래밍 언어: Python

클래스/타입: Utilities

메소드/함수: character_counter

hotexamples.com에서의 예제들: 2

Python Utilities.character_counter - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 Utilities.character_counter 패키지로부터 facebook_page_scraper에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

format_CIK(13)

write_result_to_file(10)

AverageMeter(9)

mkdir(9)

tokenizeFile(7)

init_distribution(7)

get_err_from_predict(7)

sanitize_filing_year(6)

failExecution(6)

setup_db(5)

printFrequencies(5)

connect(5)

replace_zero_label_with_neg_one(5)

insert_entries(5)

pre_compute_threshes(4)

getIndentSize(4)

print_to_file(3)

findNextNonWhiteSpaceCharIndex(3)

getPyQt4ModulesDirectory(3)

get_alpha_numeric_count(3)

pwDecode(3)

get_random_name(3)

get_suffix(3)

printInfo(3)

rmTree(3)

is_CIK_valid(3)

parse_time(3)

isInsideTextLiteral(3)

get_prefix(3)

check_number(3)

check_capital(3)

check_bar(3)

get_f_ranking_from_predictions(2)

from_dungeon_level(2)

rot_center(2)

pwEncode(2)

get_auc_from_predict(2)

BhattacharyaCoeff(2)

checkBlacklistedVersions(2)

maximalElements(2)

collateFrequencies(2)

chat(2)

pre_compute_threshes_uci(2)

pre_compute_threshes_8news(2)

character_counter(2)

isSubList(2)

is_inside_frustum(2)

listMerge(2)

loadAll(2)

normalise_plurk_id(2)

예제 #1

파일 보기

파일: tokenvalidity.py 프로젝트: mchrzanowski/SEC10KParser

def are_there_more_left_parentheses_than_right_parentheses(location, hits):
    
    last_sentence_fragment = lfp.wordtokencreation.get_last_sentence_fragment(location, hits, return_as_string=True) 
    char_frequency = Utilities.character_counter(last_sentence_fragment, '(', ')')
        
    if char_frequency['('] > char_frequency[')']:
        return True
    
    return False

예제 #2

파일 보기

파일: tokenvalidity.py 프로젝트: mchrzanowski/SEC10KParser

def was_cut_within_a_table(location, hits):
            
    last_sentence_fragment = lfp.wordtokencreation.get_last_sentence_fragment(location, hits)
    
    if last_sentence_fragment is None:
        return False
    
    compressed_sentence_fragment = lfp.wordtokencreation.get_last_sentence_fragment(location, hits, return_as_string=True)

    #print "FRAGMENT:", compressed_sentence_fragment

    # see whether we picked up a table. 
    # tables normally have units of currency as well as the word follows somewhere.
    # if these hold, then we're probably in a table from a previous section.
    # that means that if we're in a relevant section right now, and the new hit demarcates a new section,
    # then we want to stop recording. if we're not recording, then we probably want to start.
    # if we're in a relevant section and the new hit does *not* have a header that's been whitelisted as being
    # a section, then we can continue recording.
    if re.search("(in)?\s*(millions|thousands|billions)", compressed_sentence_fragment, re.I | re.M | re.S) \
    and re.search("total|follow(s|ing)|balance", compressed_sentence_fragment, re.I | re.M | re.S):
        #print "MATCH ON currency"
        #print 'MATCH ON FOLLOWS|total'
        return True
        
    char_frequency = Utilities.character_counter(compressed_sentence_fragment, '$')
    
    if char_frequency['$'] >= 6:
        #print 'MATCH ON DOLLAR COUNT'
        return True
    
    number_count = 0
    for word in last_sentence_fragment:
        if Utilities.contains_numbers(word):
            number_count += 1
            
    if re.search("total|follow(s|ing)|balance", compressed_sentence_fragment, re.I | re.M | re.S) \
    and number_count >= 6:
        #print "match on number count"
        return True
    
    return False