Python cal_doc_distance Examples

Programming Language: Python

Namespace/Package Name: familia

Method/Function: cal_doc_distance

Examples at hotexamples.com: 3

Python cal_doc_distance - 3 examples found. These are the top rated real world Python examples of familia.cal_doc_distance extracted from open source projects. You can rate examples to help us improve the quality of examples.

Example #1

Show file

File: familia_wrapper.py Project: yuqin/Familia

    def cal_doc_distance(self, doc1, doc2):
        """计算长文本与长文本之间的距离

        计算两个长文本的主题分布之间的距离，包括jensen_shannon_divergence和hellinger_distance

        Args:
            doc1: 输入文档1，无需分词
            doc2: 输入文档2，无需分词

        Returns:
            返回一个list对象，其中有两个float元素，第一个表示jensen_shannon_divergence距离，
            第二个表示hellinger_distance距离。例如：
            [0.187232, 0.23431]
        """
        return familia.cal_doc_distance(self._inference_engine,
                                        self._tokenizer, doc1, doc2)

Example #2

Show file

    def cal_doc_distance(self, doc1, doc2):
        """计算长文本与长文本之间的距离

        计算两个长文本的主题分布之间的距离，包括jensen_shannon_divergence和hellinger_distance

        Args:
            doc1: 输入文档1分词后的list结果
            doc2: 输入文档2分词后的list结果

        Returns:
            返回一个list对象，其中有两个float元素，第一个表示jensen_shannon_divergence距离，
            第二个表示hellinger_distance距离。例如：
            [0.187232, 0.23431]
        """
        doc1 = ' '.join(doc1)
        doc2 = ' '.join(doc2)
        return familia.cal_doc_distance(self._inference_engine, doc1, doc2)

Example #3

Show file

File: familia_wrapper.py Project: novellll/Familia

    def cal_doc_distance(self, doc1, doc2):
        """计算长文本与长文本之间的距离

        计算两个长文本的主题分布之间的距离，包括jensen_shannon_divergence和hellinger_distance

        Args:
            doc1: 输入文档1，无需分词
            doc2: 输入文档2，无需分词

        Returns:
            返回一个list对象，其中有两个float元素，第一个表示jensen_shannon_divergence距离，
            第二个表示hellinger_distance距离。例如：
            [0.187232, 0.23431]
        """
        return familia.cal_doc_distance(self._inference_engine,
                                                 self._tokenizer,
                                                 doc1,
                                                 doc2)