import numpy as np
from sklearn.cross_validation import StratifiedShuffleSplit  # pre-0.18 scikit-learn API
# read_comments, read_toy_comments, read_slashdot_comments, extract_values and
# extract_global_bag_of_words_processed are project-local helpers (imports not shown).

# Tail of the bag-of-words feature extractor (the function head is not shown
# in this fragment):
    train = count_vect.transform(train_list)
    test = count_vect.transform(test_list)
    return train, test, count_vect.get_feature_names()


set = 1  # 1 = full comment set, 2 = toy set, 3 = Slashdot set

if __name__ == '__main__':
    if set == 1:
        articleList, commentList, parentList, commentCount = read_comments(
            comment_data_path + 'trainTestDataSet.txt', skip_mtn=False)
    elif set == 2:
        articleList, commentList, parentList, commentCount = read_toy_comments(
            comment_data_path + 'trainTestDataSet.txt',
            comment_data_path + 'toyComments.csv')
    elif set == 3:
        articleList, commentList, commentCount = read_slashdot_comments(
            comment_data_path + 'slashdotDataSet.txt', limit=100000)

    # Values
    y = extract_values(articleList, commentList, commentCount, set)

    # Pre-0.18 splitter: constructed from y and iterated directly, yielding
    # (train_indices, test_indices) pairs. The indices are persisted so the
    # same split can be reused across runs.
    sss = StratifiedShuffleSplit(y, 1, test_size=0.95, random_state=42)
    y_train = []
    y_test = []
    for train, test in sss:
        np.save('train_vect', train)
        np.save('test_vect', test)
        y_train = y[train]
        y_test = y[test]

    processed_comment_list = extract_global_bag_of_words_processed(commentList)
    train_v, test_v = np.load('train_vect.npy'), np.load('test_vect.npy')
    train_list = []
    test_list = []
    for v in train_v:
        train_list.append(processed_comment_list[v])
    for v in test_v:
        test_list.append(processed_comment_list[v])
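# The split above uses scikit-learn's pre-0.18 cross_validation API, where the
# splitter is built from y and iterated directly. A minimal self-contained
# sketch of the same 5%/95% stratified split under the current
# model_selection API (an equivalent rewrite for reference, not part of the
# original code; the toy labels are made up):

import numpy as np
from sklearn.model_selection import StratifiedShuffleSplit

y = np.array([0, 0, 0, 1, 1, 1] * 10)  # toy labels standing in for extract_values output
sss = StratifiedShuffleSplit(n_splits=1, test_size=0.95, random_state=42)
train, test = next(sss.split(np.zeros(len(y)), y))  # X is only used for its sample count
y_train, y_test = y[train], y[test]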
def extractSaveValues(df_comments, filename, datatype):
    valueVector = extract_values(df_comments, datatype)
    print "Extracted values"
    save_numpy_matrix(filename, valueVector)
# Second variant of extractSaveValues, taking the separate article/comment
# structures produced by the readers above rather than a single DataFrame:
def extractSaveValues(articleList, commentList, commentCount, filename, datatype):
    valueVector = extract_values(articleList, commentList, commentCount, datatype)
    print "Extracted values"
    save_numpy_matrix(filename, valueVector)
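# save_numpy_matrix is a project-local helper whose definition is not shown in
# this fragment. A minimal sketch of what it presumably does, assuming it is a
# thin wrapper around numpy's .npy serialization (hypothetical, for
# illustration only):

import numpy as np

def save_numpy_matrix(filename, matrix):
    # Persist the matrix to '<filename>.npy' so np.load() can restore it.
    np.save(filename, matrix)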
# Tail of an earlier variant of the bag-of-words feature extractor, with
# debugging output left in:
    test = count_vect.transform(test_list)
    # print count_vect.get_feature_names()[1000:1010]
    # print count_vect.get_feature_names()
    print "Train:", train.shape
    print "Test:", test.shape
    print count_vect.vocabulary_
    return train, test, count_vect.get_feature_names()


if __name__ == '__main__':
    articleList, commentList, parentList, commentCount = read_comments(
        comment_data_path + 'trainTestDataSet.txt')

    # Values: column 3 of the extracted value matrix is used as the target.
    y = extract_values(articleList, commentList, parentList, commentCount)[:, 3]
    sss = StratifiedShuffleSplit(y, 1, test_size=0.40, random_state=42)
    y_train = []
    y_test = []
    for train, test in sss:
        print train
        np.save('train_vect', train)
        np.save('test_vect', test)
        y_train = y[train]
        y_test = y[test]

    processed_comment_list = extract_global_bag_of_words_processed(commentList)
    train_v, test_v = np.load('train_vect.npy'), np.load('test_vect.npy')
    train_list = []
    test_list = []
    for v in train_v:
        train_list.append(processed_comment_list[v])
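# Only the tail of the feature extractor appears above. A minimal sketch of
# the missing head, assuming scikit-learn's CountVectorizer with default
# settings (the function name and vectorizer options are guesses; the original
# definition is not shown):

from sklearn.feature_extraction.text import CountVectorizer

def extract_global_bag_of_words(train_list, test_list):
    # Fit the vocabulary on the training comments only, then map both splits
    # into the shared bag-of-words space.
    count_vect = CountVectorizer()
    train = count_vect.fit_transform(train_list)
    test = count_vect.transform(test_list)
    return train, test, count_vect.get_feature_names()

# Usage against the lists built in the main block above:
#     train, test, feature_names = extract_global_bag_of_words(train_list, test_list)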