Python DocumentParser.parse_document Examples

Programming language: Python

Namespace/package name: parser

Class/type: DocumentParser

Method/function: parse_document

Examples on hotexamples.com: 1

Python DocumentParser.parse_document - 1 example found. This is the top-rated real-world Python example of parser.DocumentParser.parse_document, extracted from an open source project.

Frequently used methods

__init__(5)

parse_document(1)

Example #1

File: test.py Project: praveen97uma/docsearch

# `parser` here is the project's own module, not the standard-library
# `parser` module (which was removed in Python 3.10).
from parser import DocumentParser, TextParser  # TextParser is unused in this excerpt
from store import DocumentStoreFactory, TermStoreFactory
from index import IndexFactory
from query import QueryEvaluator

# Sample documents: each URL is paired with a short title text.
url1 = "https://stackoverflow.com/questions/9626535/get-domain-name-from-url"
text1 = "Extracting domain from URL in python"

url2 = "https://ashiknesin.com"
text2 = "How to Get Domain Name from URL String domain in Python"

url3 = "https://answers.splunk.com/answers/188774/how-to-automatically-extract-domain-from-url-throu.html"
text3 = "How to automatically extract domain from URL through conf files at search-time"

# Parse each (url, text) pair; the results are added to the index below.
doc1 = DocumentParser.parse_document(url1, text1)
doc2 = DocumentParser.parse_document(url2, text2)
doc3 = DocumentParser.parse_document(url3, text3)

# Inspect the document store through its private _data attribute (debugging only).
doc_store = DocumentStoreFactory.get_store()
print(doc_store._data)

# Add the parsed documents to the "default" index and print the index.
index = IndexFactory.get_or_create_index("default")

index.add_document(doc1)
index.add_document(doc2)
index.add_document(doc3)

index.display()

# Build a query evaluator from the index and term store factories.
qeval = QueryEvaluator(IndexFactory, TermStoreFactory)
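
The snippet above depends on the docsearch project's own parser, store, index, and query modules, whose internals are not shown on this page. To make the parse, index, and query flow easier to follow, here is a minimal self-contained sketch. Everything in it is an assumption for illustration: SketchDocumentParser, SketchIndex, the Document dataclass, and the search method are hypothetical stand-ins, not the docsearch implementations, and the tokenization rule (lowercase alphanumeric runs) is just one simple choice.

```python
# Hypothetical stand-ins: these are NOT the docsearch classes, only an
# illustration of how a parse -> index -> query flow could be wired up.
import re
from collections import defaultdict
from dataclasses import dataclass


def tokenize(text):
    """Assumed tokenization: lowercase the text and keep alphanumeric runs."""
    return re.findall(r"[a-z0-9]+", text.lower())


@dataclass(frozen=True)
class Document:
    url: str
    text: str
    terms: tuple


class SketchDocumentParser:
    """Stand-in parser: turns a (url, text) pair into a Document with terms."""

    @staticmethod
    def parse_document(url, text):
        return Document(url=url, text=text, terms=tuple(tokenize(text)))


class SketchIndex:
    """A tiny inverted index mapping each term to the URLs that contain it."""

    def __init__(self):
        self._postings = defaultdict(set)

    def add_document(self, doc):
        for term in doc.terms:
            self._postings[term].add(doc.url)

    def search(self, query):
        """Return the sorted URLs whose text contains every query term."""
        terms = tokenize(query)
        if not terms:
            return []
        # .get() avoids creating empty postings for unknown terms.
        matches = set.intersection(*(self._postings.get(t, set()) for t in terms))
        return sorted(matches)


# Usage with two of the sample documents from the example above.
index = SketchIndex()
index.add_document(SketchDocumentParser.parse_document(
    "https://ashiknesin.com",
    "How to Get Domain Name from URL String domain in Python"))
index.add_document(SketchDocumentParser.parse_document(
    "https://stackoverflow.com/questions/9626535/get-domain-name-from-url",
    "Extracting domain from URL in python"))
print(index.search("domain python"))
```

The sketch treats a multi-term query as an AND over the postings sets, which keeps the logic short. The real QueryEvaluator may rank or combine terms differently.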