# In[6]
# Load the dataset: one sentence per line, whitespace-separated tokens.
data_path = 'ptb.train.txt'  # path to the PTB training corpus
# NOTE(review): explicit encoding added — the original relied on the locale
# default, which is platform-dependent (PTB itself is plain ASCII).
with open(data_path, encoding='utf-8') as f:
    lines = f.readlines()
# raw_data_set holds the corpus as token lists, one list per sentence.
raw_data_set = [sentence.split() for sentence in lines]

# In[7]
# Preprocess the corpus and build the model.
batch_size = 512
# Subsample raw_data_set (window size 5, 5 noise words per context word —
# presumably; confirm the meaning of the two constants against Sample).
sampler = Sample(raw_data_set, 5, 5)
vocab_size = len(sampler.idx2word)
# Draw the training triples: center words, context words, noise words.
centers, all_contexts, noises = sampler.get_sample_data()
# Wrap the triples in a Dataset so DataLoader can batch them.
data_set = MyDataSet(centers, all_contexts, noises)
data_iter = data.DataLoader(data_set,
                            batch_size,
                            shuffle=True,
                            collate_fn=collate_func,
                            num_workers=0)

# Build the embedding layers whose weights are the trained word vectors:
# the first Embedding maps center words, the second maps context/noise words.
embed_size = 200
net = nn.Sequential(nn.Embedding(vocab_size, embed_size),
                    nn.Embedding(vocab_size, embed_size))
loss = SigmoidBinCELoss()

# In[8]
# Training