QueryPairModels

Usage

Command: python main.py --modeltype (modeltype)
Parameter Priority: --(modeltype)_cfg k:v > config/(modeltype)_config.json > --other
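The priority order above can be illustrated with a minimal sketch. It is not the repository's actual code: the function names, the example keys, and the exact `k:v` parsing are assumptions made for illustration only.

```python
import json

def parse_kv_overrides(pairs):
    """Parse 'k:v' strings (assumed format of --(modeltype)_cfg values)."""
    overrides = {}
    for pair in pairs:
        # Split on the first colon only, so values may contain colons.
        key, sep, value = pair.partition(":")
        if not sep:
            raise ValueError(f"expected k:v, got {pair!r}")
        overrides[key] = value
    return overrides

def resolve_params(other_flags, json_config, cfg_overrides):
    """Merge three sources; later updates win, so apply lowest priority first."""
    merged = dict(other_flags)    # lowest priority: --other flags
    merged.update(json_config)    # middle: config/(modeltype)_config.json
    merged.update(cfg_overrides)  # highest: --(modeltype)_cfg k:v
    return merged

# Hypothetical values, chosen only to show which source wins.
other_flags = {"batch_size": "64", "lr": "0.001"}
json_config = json.loads('{"batch_size": "128", "hidden": "288"}')
cfg_overrides = parse_kv_overrides(["hidden:64"])

params = resolve_params(other_flags, json_config, cfg_overrides)
print(params)
```

Here `batch_size` comes from the JSON config, `hidden` from the `k:v` override, and `lr` from the remaining flags.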
Data Reader:
- Preprocessor: Set in the header as "query:(preprocessor_name)" or "query:(preprocessor_name):(preprocessor_index):(preprocessor_columnIdx)". The field is converted using the corresponding preprocessor. Inference/score mode keeps the original fields.
- prefetch_buffer best set to GPU count?
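As a rough illustration of the header convention described above, the sketch below splits a header field into its parts. The tab-separated header, the `doc` and `label` column names, the integer types, and the `FieldSpec` / `parse_header_field` names are all assumptions; the README does not specify what the index and column index mean, so they are only parsed, not interpreted.

```python
from typing import NamedTuple, Optional

class FieldSpec(NamedTuple):
    name: str
    preprocessor: Optional[str]
    index: Optional[int]
    column_idx: Optional[int]

def parse_header_field(field: str) -> FieldSpec:
    """Parse 'name', 'name:preprocessor', or 'name:preprocessor:index:columnIdx'."""
    parts = field.split(":")
    if len(parts) == 1:
        # Plain column, no preprocessor attached.
        return FieldSpec(parts[0], None, None, None)
    if len(parts) == 2:
        return FieldSpec(parts[0], parts[1], None, None)
    if len(parts) == 4:
        return FieldSpec(parts[0], parts[1], int(parts[2]), int(parts[3]))
    raise ValueError(f"unsupported header field: {field!r}")

# Hypothetical header line; the separator is an assumption.
header = "query:xletter\tdoc:bertseq:0:1\tlabel"
specs = [parse_header_field(f) for f in header.split("\t")]
for spec in specs:
    print(spec)
```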
Adding Model:
- Model and config classes inherit from model/base_model
- Parameters follow the three-level priority listed under Usage
- Add the model to the index in main.py
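The steps for adding a model might look roughly like the following sketch. Every name here (`BaseConfig`, `BaseModel`, `MyModel`, `MODEL_INDEX`, `create_model`) is hypothetical and stands in for whatever model/base_model and main.py actually define.

```python
class BaseConfig:
    """Stand-in for the base config class; overrides become attributes."""
    def __init__(self, **overrides):
        for key, value in overrides.items():
            setattr(self, key, value)

class BaseModel:
    """Stand-in for the base model class."""
    def __init__(self, config):
        self.config = config

    def build(self):
        raise NotImplementedError

class MyModelConfig(BaseConfig):
    hidden_size = 64  # class-level default, overridable per instance

class MyModel(BaseModel):
    def build(self):
        return f"mymodel(hidden_size={self.config.hidden_size})"

# Hypothetical index mapping a --modeltype value to (model, config) classes.
MODEL_INDEX = {"mymodel": (MyModel, MyModelConfig)}

def create_model(modeltype, **overrides):
    model_cls, config_cls = MODEL_INDEX[modeltype]
    return model_cls(config_cls(**overrides))

print(create_model("mymodel").build())
print(create_model("mymodel", hidden_size=128).build())
```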

Support list

Model type: cdssm, bert2seq, bertpair2seq, bert_qk, xletter2seq
Mode: train (includes data reader modes eval_auc, eval_bleu), infer, score
Preprocessor name: xletter, bertseq, bertpair

Features

--local N: use N GPUs by setting the CUDA_VISIBLE_DEVICES environment variable
--timeline_enable & --timeline_desc enable profiling; the profile log is saved in the log folder
--init_status to print trainable parameter init source
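The effect of --local N can be sketched as below. This assumes the first N device ids (0 to N-1) are selected, which the README does not state, and `use_local_gpus` is a hypothetical helper name. The variable has to be set before the deep learning framework initializes CUDA for it to take effect.

```python
import os

def use_local_gpus(n: int) -> str:
    """Expose GPUs 0..n-1 to this process via CUDA_VISIBLE_DEVICES."""
    if n < 1:
        raise ValueError("n must be at least 1")
    visible = ",".join(str(i) for i in range(n))
    os.environ["CUDA_VISIBLE_DEVICES"] = visible
    return visible

use_local_gpus(2)
print(os.environ["CUDA_VISIBLE_DEVICES"])
```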
Multi-GPU on a single machine
- The default setting uses a parameter sharing strategy
- Gradient sharing between GPUs
  - --grad_mode:1 enable this mode
  - --grad_float16:1 cast grad to float16 before sharing
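To make the gradient sharing options concrete, here is a framework-free sketch. It assumes that sharing means averaging per-GPU gradients, which the README does not confirm, and it simulates the float16 cast with the standard library's half-precision `struct` format. The function names are hypothetical.

```python
import struct

def to_float16(x: float) -> float:
    """Round-trip a value through IEEE 754 half precision."""
    return struct.unpack("e", struct.pack("e", x))[0]

def share_gradients(per_gpu_grads, grad_float16=False):
    """Average same-length gradient lists, one list per GPU."""
    if grad_float16:
        # Cast before sharing: less data to exchange, lower precision.
        per_gpu_grads = [[to_float16(g) for g in grads] for grads in per_gpu_grads]
    n = len(per_gpu_grads)
    return [sum(values) / n for values in zip(*per_gpu_grads)]

# Hypothetical gradients from two GPUs.
grads_gpu0 = [0.5, -1.0, 0.25]
grads_gpu1 = [1.5, 3.0, 0.75]

avg = share_gradients([grads_gpu0, grads_gpu1])
print(avg)

# Values that are not exactly representable lose precision in float16.
print(to_float16(0.1))
```

The trade-off is the usual one: casting to float16 halves the data exchanged between devices at the cost of rounding error in each gradient.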

Models

1. CDSSM (modeltype=CDSSM): Based on https://www.microsoft.com/en-us/research/publication/a-convolutional-latent-semantic-model-for-web-search/

Options
- input_mode: mstf (mstf ops), pyfunc (extract xletter in data reader), pyfunc_batch (customized ops)
- maxpooling_mode: mstf (mstf ops), emb (sparse embedding)
Training Speed (bs=128, neg=4, 288->64; GM = grad_mode, G16 = grad_float16):

Config	#GPU	Trainer Setting	Speed(e/s)
input=mstf, maxp=mstf	1	GM=0,G16=0	13800
input=mstf, maxp=mstf	1	GM=1,G16=0	13000
input=mstf, maxp=mstf	1	GM=1,G16=1	12300
input=mstf, maxp=mstf	2	GM=0,G16=0	6000
input=mstf, maxp=mstf	2	GM=1,G16=0	27600
input=mstf, maxp=mstf	2	GM=1,G16=1	28000
input=mstf, maxp=emb	1	GM=0,G16=0	7060
input=mstf, maxp=emb	2	GM=0,G16=0	14200
input=mstf, maxp=emb	2	GM=0,G16=0	14400
input=mstf, maxp=emb	2	GM=1,G16=1	13800
input=pyfunc, maxp=emb	2	GM=1,G16=0	1050
input=pyfunc_batch, maxp=emb	2	GM=1,G16=0	3000

2. Bert2Seq, BertPair2Seq
3. Seq2Seq Encoder: xletter, Decoder: term
4. QDocTreeRetrieve: Based on the idea of Learning Tree-based Deep Model for Recommender Systems https://arxiv.org/pdf/1801.02294.pdf

liuqiangict/BERT_FineTune

Folders and files

Latest commit

History

Repository files navigation

QueryPairModels

Usage

Support list

Features

Models

About

Resources

Stars

Watchers

Forks

Languages