def __init__(self, wd=None, save=None):
    """Initialize per-subreddit classifier state.

    Args:
        wd: Optional working-directory path of the form
            ".../<subreddit>/<split>". When given, the process chdirs into
            it, the subreddit name is taken from the second-to-last path
            component, and the popularity cutoff is computed from the
            directory's contents via compute_cutoff().
        save: Optional answer string; any value starting with "y"/"Y"
            enables saving of results.
    """
    if wd:
        os.chdir(wd)
        # Path layout assumed to be ".../<subreddit>/<split>", so the
        # subreddit name is the second-to-last component — TODO confirm
        # against callers.
        self.reddit = str(wd).split("/")[-2]
        self.cutoff = compute_cutoff(wd)
    else:
        # No working directory supplied: set the wd-derived fields to None
        # so later access fails with an obvious value rather than an
        # AttributeError on a missing attribute.
        self.reddit = None
        self.cutoff = None
    # Text buckets populated during classification.
    self.popular_texts = []
    self.unpopular_texts = []
    self.corpus = []
    self.all_texts = []
    # Per-word frequency table; missing words count as 0.
    self.word_counts = defaultdict(int)
    self.vectorizer = None
    # Treat any answer beginning with "y" (case-insensitive) as yes;
    # everything else — including None — leaves saving disabled.
    self.save = bool(save) and save.lower().startswith("y")
print 'F1 Score:', F1score print 'Confusion matrix:' print confusion return F1score if __name__ == "__main__": parser = argparse.ArgumentParser(description="Builds a Naive Bayes model for classification.") parser.add_argument("filepath", help="Argument must be the filepath where the text files are located") parser.add_argument("topic_type", help="topic_type is either bow, tfidf or lda") parser.add_argument("valid_or_test", help="Either v or t to test against validation or test set") parser.add_argument("--num_topics", default=10, help="The amount of topics to be grabbed from the LDA model") args = parser.parse_args() cutoff = compute_cutoff(args.filepath) print "Classifying the initial data." classify_initial_data(args.filepath, cutoff, SOURCES) training_data = make_data(SOURCES, args.filepath) if args.valid_or_test[0].lower() == "v": print "Classifying the validation data." validation_filepath = args.filepath + "/validation" classify_initial_data(validation_filepath, cutoff, VALIDATION) validation_data = make_data(VALIDATION, validation_filepath) create_Naive_Bayes(training_data, validation_data, args.topic_type, args.num_topics) else: print "Classifying the testing data." testing_filepath = args.filepath + "/testing"