def prepare_data(self, data_fields, wv_size=600):
    test_data = Data(self.file_name, self.file_path)
    test_df = test_data.csv_df(data_fields)
    # make a copy of the original tweets for later use
    original_df = test_df.copy()
    # pre-process data (same as how we trained)
    test_data.pre_process(test_df)
    # then convert using word2vec
    model = test_data.build_wordvec(size=wv_size, verbose=False)
    # take a look at the max_len of the test set, although we still have to use max_len from train
    max_len_test = test_data.max_len(test_df)
    data = test_data.convert2vec(test_df, self.max_len_train, model,
                                 name='test_' + self.file_name)
    test_data.save_vec(data, name='test_' + self.file_name)
    self.data = data
    self.test_data = test_data
    self.test_df = test_df
    self.original_df = original_df
    print ">>>Done preparing data.<<<\n"
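# Usage sketch (assumptions: the enclosing class name `TweetPredictor` and its
# constructor arguments below are hypothetical, shown only to illustrate the call
# order and what prepare_data leaves on the instance):
#
#     predictor = TweetPredictor(file_name='new_tweets', file_path='data/')
#     predictor.prepare_data(data_fields=['text'], wv_size=600)
#     predictor.data         # word2vec-encoded tweets, padded to self.max_len_train
#     predictor.original_df  # untouched copies of the raw tweets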
data = np.load(data_file)
label = np.load(label_file)

# load original tweets
# ---------------------------------------------------------------------------------
sports_dic = {
    'basketball': 1,
    'hockey': 2,
    'baseball': 3,
    'tennis': 4,
    'volleyball': 5
}
sp_data = Data(sports_dic, file_path)
sp_df = sp_data.csv_df(['text'])  # load data
rm_hashtags = ['#' + s for s in sports_dic.keys()]
sp_data.pre_process(sp_df, rm_list=rm_hashtags)  # pre-process data
sp_df.drop(['tokenized'], axis=1, inplace=True)

# ---------------------------------------------------------------------------------
# set up lstm structure
n_classes = 5
hm_epochs = 20
batch_size = 50
chunk_size = data.shape[2]
n_chunks = data.shape[1]
rnn_size = 300

x = tf.placeholder('float', [None, n_chunks, chunk_size])  # height x width
y = tf.placeholder('float')
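# The layers that consume these placeholders are not shown in this section. Below is a
# minimal sketch of the usual TF 1.x construction for inputs shaped
# [batch, n_chunks, chunk_size]; the function name `recurrent_neural_network`, the
# random_normal-initialized readout layer, and the use of the last time-step output are
# assumptions modeled on the standard static_rnn pattern, not necessarily this repo's code.
def recurrent_neural_network(x):
    layer = {'weights': tf.Variable(tf.random_normal([rnn_size, n_classes])),
             'biases': tf.Variable(tf.random_normal([n_classes]))}

    # reshape [batch, n_chunks, chunk_size] into a list of n_chunks tensors,
    # each [batch, chunk_size], as static_rnn expects
    x = tf.transpose(x, [1, 0, 2])
    x = tf.reshape(x, [-1, chunk_size])
    x = tf.split(x, n_chunks, 0)

    lstm_cell = tf.contrib.rnn.BasicLSTMCell(rnn_size)
    outputs, states = tf.contrib.rnn.static_rnn(lstm_cell, x, dtype=tf.float32)

    # score the n_classes sports from the last time step's output
    return tf.matmul(outputs[-1], layer['weights']) + layer['biases']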