Simple Search Engine Written by Python

Description

Base on MASASHI Shibata's project
Web Crawler
Use pyltp to segment Chinese word
MongoDB as storage
Flask as web framework

Requirements

Python 2.7
pip

Setup

Clone repository

$ git clone git@github.com:scorpio147wbh/information-retrieval-experiment.git

Download LTP Chinese word segment model from here

Install python packages

$ cd information-retrieval-experiment
$ pip install -r requirements.txt

MongoDB settings

Please rewrite MONGO_URL in config.py
LTP settings

Please rewrite CWS_MODEL_PATH in config.py

Run

$ python run-crawler.py http://nlp.stanford.edu/courses/NAACL2013/ # build a index
$ python run-webapp.py # access to http://127.0.0.1:5000

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
search_engine		search_engine
tests		tests
web_crawler		web_crawler
.gitignore		.gitignore
README.md		README.md
config.py		config.py
requirements.txt		requirements.txt
run_crawler.py		run_crawler.py
run_webapp.py		run_webapp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

search_engine

search_engine

tests

tests

web_crawler

web_crawler

.gitignore

.gitignore

README.md

README.md

config.py

config.py

requirements.txt

requirements.txt

run_crawler.py

run_crawler.py

run_webapp.py

run_webapp.py

Repository files navigation

Simple Search Engine Written by Python

Description

Requirements

Setup

About

Releases

Packages

Contributors 2

Languages

binghaobhw/information-retrieval-experiment

Folders and files

Latest commit

History

Repository files navigation

Simple Search Engine Written by Python

Description

Requirements

Setup

About

Resources

Stars

Watchers

Forks

Languages