Python IndexModule.IndexModule示例

编程语言: Python

命名空间/包名称: index_module

类/类型: IndexModule

方法/功能: IndexModule

hotexamples.com的示例: 1

Python IndexModule.IndexModule - 已找到1个示例。这些是从开源项目中提取的最受好评的index_module.IndexModule.IndexModule现实Python示例。您可以评价示例，以帮助我们提高示例质量。

常用方法

显示隐藏

IndexModule(1)

construct_postings_lists(1)

process_json(1)

示例#1

显示文件

文件： setup.py 项目： ericazf/IREngine

	# print(max_page)
	return (max_page)

def crawling():
	print('-----start crawling time: %s-----'%(datetime.today()))
	config = configparser.ConfigParser()
	config.read('../config.ini', 'utf-8')
	root = 'http://news.sohu.com/1/0903/61/subject212846158'
	max_page = get_max_page(root + '.shtml')
	news_pool = get_news_pool(root, max_page, max_page - 5)
	crawl_news(news_pool, 140, config['DEFAULT']['doc_dir_path'], config['DEFAULT']['doc_encoding'])

if __name__ == "__main__":
	print('-----start time:%s-----'%(datetime.today()))

	# 抓取新闻数据
	# crawling()

	# 构建索引
	print('-----start indexing time: %s-----'%(datetime.today()))
	im = IndexModule('../config.ini', 'utf-8')
	im.construct_postings_lists()

	# 推荐阅读
	print('-----start recommending time: %s-----'%(datetime.today()))
	rm = RecommendationModule('../config.ini', 'utf-8')
	rm.find_k_nearest(5, 25)
	print('-----finish time: %s-----'%(datetime.today()))