if __name__ == '__main__':
    # Other datasets this driver has been run against:
    # 'THUCNews', 'TOUTIAONews', 'weibo_senti_100k',
    # 'simplifyweibo_4_moods', 'Chinese_conversation_sentiment-master', 'NLPCC2017'
    dataset = 'testtt'

    # Resolve the model class dynamically from the CLI argument,
    # e.g. --model TextCNN -> models.TextCNN.
    model_name = args.model
    model_module = import_module('models.' + model_name)
    config = model_module.Config(dataset)

    # Seed every RNG source and force deterministic cuDNN kernels so that
    # repeated runs produce identical results.
    np.random.seed(2)
    torch.manual_seed(1)
    torch.cuda.manual_seed_all(4)
    torch.backends.cudnn.deterministic = True

    # Build datasets and batch iterators, timing the preparation phase.
    start_time = time.time()
    print('加载数据集')
    train_data, dev_data, test_data = utils.bulid_dataset(config)
    train_iter = utils.bulid_iterator(train_data, config)
    dev_iter = utils.bulid_iterator(dev_data, config)
    test_iter = utils.bulid_iterator(test_data, config)
    time_dif = utils.get_time_dif(start_time)
    print("模型开始之前,准备数据时间:", time_dif)

    # Instantiate the model on the configured device, then train/evaluate/test.
    classifier = model_module.Model(config).to(config.device)
    train.train(config, classifier, train_iter, dev_iter, test_iter)
    # train.test(config, classifier, test_iter)
import numpy as np import pandas as pd from utils import bulid_dataset import matplotlib.pyplot as plt from keras.models import Model, Input, load_model from keras.callbacks import ModelCheckpoint from keras.layers import LSTM, Embedding, Dense, TimeDistributed, Dropout, Bidirectional plt.style.use("ggplot") # 1 加载数据 ner_dataset_dir = '../data/ner_dataset.csv' dataset_dir = '../data/dataset.pkl' # 2 构建数据集 n_words, n_tags, max_len, words,tags,\ X_train, X_test, y_train, y_test=bulid_dataset(ner_dataset_dir,dataset_dir,max_len=50) # 3 构建和训练模型 def train(): input = Input(shape=(max_len, )) model = Embedding(input_dim=n_words, output_dim=50, input_length=max_len)(input) model = Dropout(0.1)(model) model = Bidirectional( LSTM(units=100, return_sequences=True, recurrent_dropout=0.1))(model) out = TimeDistributed(Dense(n_tags, activation='softmax'))( model) # softmax output layer model = Model(input, out) model.compile(optimizer='rmsprop',