Python loadfile 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: data_preprocessing

메소드/함수: loadfile

hotexamples.com에서의 예제들: 2

Python loadfile - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 data_preprocessing.loadfile에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: make_embedding.py 프로젝트: bzwartsenberg/arXivData

    returns:
        model: a dictionary mapping words to word-vectors (embeddings).
    """
    if word2vec_format:
        return gensim.models.KeyedVectors.load_word2vec_format(filepath,
                                                               binary=True)
    else:  #own pretrained model
        return gensim.models.Word2Vec.load(filepath)


if __name__ == "__main__":

    ### load data:
    trainpath = 'train_data/train_data.json'
    testpath = 'test_data/test_data.json'
    traindata = dp.loadfile(trainpath)

    inc_categories = [
        'cond-mat.mes-hall', 'cond-mat.mtrl-sci', 'cond-mat.stat-mech',
        'cond-mat.str-el', 'cond-mat.supr-con', 'cond-mat.soft', 'quant-ph',
        'cond-mat.dis-nn', 'cond-mat.quant-gas', 'hep-th'
    ]
    #
    train_X, train_y = dp.generate_Xy_data_categories(traindata,
                                                      inc_categories,
                                                      ignore_others=True,
                                                      shuffle_seed=0,
                                                      ydatatype='onehot',
                                                      clean_x=True,
                                                      keep_latex_tags=True)

예제 #2

파일 보기

파일: cnn_model.py 프로젝트: bzwartsenberg/arXivData

        returns: the predicted labels or probabilities of docs
        """
        probabilities = self.model.predict(X_ints)

        if return_probabilities:
            return probabilities
        else:
            return np.round(probabilities)


if __name__ == "__main__":

    ### load data:
    trainpath = 'train_data/train_data.json'
    testpath = 'test_data/test_data.json'
    traindata, testdata = dp.loadfile(trainpath), dp.loadfile(testpath)

    inc_categories = [
        'cond-mat.mes-hall', 'cond-mat.mtrl-sci', 'cond-mat.stat-mech',
        'cond-mat.str-el', 'cond-mat.supr-con', 'cond-mat.soft', 'quant-ph',
        'cond-mat.dis-nn', 'cond-mat.quant-gas', 'hep-th'
    ]

    train_X, train_y = dp.generate_Xy_data_categories(traindata,
                                                      inc_categories,
                                                      ignore_others=True,
                                                      shuffle_seed=0,
                                                      ydatatype='onehot',
                                                      clean_x=True,
                                                      keep_latex_tags=True)
    test_X, test_y = dp.generate_Xy_data_categories(testdata,