Example no. 1
0
    def _prepare_data(self, files):
        """Read labeled question files and build training data.

        Each line of each input file is expected to look like
        ``<question>#<label>`` (tabs and spaces are removed before
        splitting on ``#``; Python 2 byte strings are decoded as UTF-8).

        :param files: iterable of file paths to read.
        :return: tuple ``(embeddings, labels, queries)`` where
            ``embeddings`` is a 2-D numpy array of feature vectors,
            ``labels`` is a list of integer indices into
            ``self.named_labels``, and ``queries`` is the list of
            cleaned question strings.
        :raises ValueError: if a line's label is not in
            ``self.named_labels``.
        """
        print('prepare data...')

        embeddings = list()
        queries = list()
        labels = list()

        # Iterate the paths directly rather than indexing with
        # xrange(len(files)).
        for path in files:
            with open(path, 'r') as f:
                for line in f:
                    # Normalize the raw line: drop tabs/spaces, strip the
                    # trailing newline, decode to unicode, then split
                    # question from label on '#'. Use a new name instead
                    # of rebinding `line` from str to list.
                    parts = line.replace('\t', '').replace(
                        ' ', '').strip('\n').decode('utf-8').split('#')
                    question = QueryUtils.static_simple_remove_punct(
                        str(parts[0]))
                    # Map the label text to its position in the known
                    # label list (raises ValueError for unknown labels).
                    label = self.named_labels.index(
                        str(parts[1].encode('utf-8')))
                    queries.append(question)
                    labels.append(label)
                    # Tokenize and vectorize one question at a time;
                    # transform() yields a (1, d) sparse row.
                    tokens = [self.cut(question)]
                    embedding = self.feature_extractor.transform(
                        tokens).toarray()
                    embeddings.append(embedding)

        # Stack the (1, d) rows and drop the singleton axis -> (n, d).
        embeddings = np.squeeze(np.array(embeddings))

        return embeddings, labels, queries
Example no. 2
0
 def cut(self, input_):
     """Tokenize a query string.

     Removes punctuation, segments the text with jieba in accurate
     (non-full) mode, and returns the space-joined tokens after
     unescaping them with _uniout.
     """
     cleaned = QueryUtils.static_simple_remove_punct(input_)
     segmented = " ".join(jieba.cut(cleaned, cut_all=False))
     return _uniout.unescape(str(segmented), 'utf8')