Code Example #1
def testVocab():
    # Decode the first eleven vocabulary ids to inspect how each model's
    # vocabulary is laid out (get_tokenizer and AlphaPathLookUp come from
    # the surrounding project; imports are omitted in the original snippet).
    tokenizer = get_tokenizer(name="bert", model_path=AlphaPathLookUp.BertBaseUnCased)
    ids = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
    print(tokenizer.decode(ids))

    tokenizer = get_tokenizer(name="xlnet", model_path=AlphaPathLookUp.XLNetBaseCased)
    print(tokenizer.decode(ids))
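
The get_tokenizer factory used throughout these examples is not shown. Below is a minimal sketch of what it might look like, assuming it dispatches on name to Hugging Face transformers tokenizer classes; the from_pretrained calls are real transformers APIs, but the factory shape itself is a guess, and the project's actual version returns a wrapper object (note the tokenizer.tokenizer access in Code Example #3).

from transformers import BertTokenizer, GPT2Tokenizer, XLNetTokenizer

_TOKENIZER_CLASSES = {
    "bert": BertTokenizer,
    "xlnet": XLNetTokenizer,
    "gpt2": GPT2Tokenizer,
}

def get_tokenizer(name, model_path):
    # Hypothetical reconstruction: pick the tokenizer class by name and load
    # its vocabulary from the given directory or pretrained identifier.
    try:
        cls = _TOKENIZER_CLASSES[name]
    except KeyError:
        raise ValueError("unknown tokenizer: {}".format(name))
    return cls.from_pretrained(model_path)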
Code Example #2
    def __init__(self, config_file):
        # Load the service configuration, which names the tokenizer and the
        # model path to load it from.
        self.config = AlphaConfig.loadConfig(
            os.path.join(AlphaPathLookUp.ConfigPath, config_file))

        self.tokenizer = get_tokenizer(name=self.config.tokenizer,
                                       model_path=self.config.model_path)

        # Paragraph extractor; note it is built without a tokenizer of its
        # own and uses LexRank-based summarization for paragraph filtering.
        self.textExtractor = InformationAbstrator(maxClip=100, tokenizer=None)
        self.textExtractor.initParagraphFilter(
            self.textExtractor.lexrankSummary)
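
The format of the config file is not shown; the two fields read above suggest contents along these lines (hypothetical values, sketched as a Python dict):

# What AlphaConfig.loadConfig presumably exposes after parsing (a guess):
example_config = {
    "tokenizer": "bert",            # forwarded as name=...
    "model_path": "/path/to/bert",  # forwarded as model_path=...
}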
Code Example #3
def testBasicFunctions():
    tokenizer = get_tokenizer(name="gpt2", model_path=AlphaPathLookUp.GPT2Base)
    # tokenizer = get_tokenizer(name="roberta", model_path=programmingalpha.RoBertaBase)
    # tokenizer = get_tokenizer(name="bert", model_path=AlphaPathLookUp.BertBaseUnCased)

    # Show the tokens registered on top of the base vocabulary.
    print(tokenizer.tokenizer.additional_special_tokens)
    print(tokenizer.tokenizer.added_tokens_encoder)

    s = "I am fantastic [CODE] supreme [MATH] !"
    print(tokenizer.tokenize(s))
    s_ids = tokenizer.tokenizeLine(s)
    print(s)
    print(s_ids)

    # tokenizeLine evidently returns the ids as a space-separated string, so
    # split it and decode each id individually (cast to int before decoding).
    for token_id in s_ids.split():
        print(tokenizer.decode([int(token_id)]))
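
[CODE] and [MATH] are not part of GPT-2's standard vocabulary, so the wrapper presumably registers them as additional special tokens. With a raw Hugging Face tokenizer this is typically done as below; add_special_tokens is a real transformers API, but whether the project registers the tokens exactly this way is an assumption.

from transformers import GPT2Tokenizer

tok = GPT2Tokenizer.from_pretrained("gpt2")
# Register the placeholder tokens so the tokenizer never splits them.
tok.add_special_tokens({"additional_special_tokens": ["[CODE]", "[MATH]"]})
print(tok.additional_special_tokens)   # ['[CODE]', '[MATH]']
print(tok.tokenize("I am fantastic [CODE] supreme [MATH] !"))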
Code Example #4
def init():
    global tokenizer
    name = args.tokenizer
    # Keyword arguments avoid depending on get_tokenizer's positional
    # parameter order (the original passed both arguments positionally).
    tokenizer = get_tokenizer(name=name, model_path=path_map_tokenizers[name])
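
path_map_tokenizers is not defined in this snippet; from its use it is evidently a dict mapping tokenizer names to model paths, for example (hypothetical mapping assembled from the paths seen in the other examples):

path_map_tokenizers = {
    "bert": AlphaPathLookUp.BertBaseUnCased,
    "xlnet": AlphaPathLookUp.XLNetBaseCased,
    "gpt2": AlphaPathLookUp.GPT2Base,
}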
Code Example #5
    def __init__(self, config_file):
        # Initialize the base HTTP proxy, then build the tokenizer named in
        # the parsed arguments.
        AlphaHTTPProxy.__init__(self, config_file)
        args = self.args
        self.tokenizer = get_tokenizer(model_path=programmingalpha.BertBaseUnCased,
                                       name=args.tokenizer)