SearchEngine Demo with solr
Trec requires the following to run:
In root directory of Trec:
python webapp/app.py
Then you can open a browser and type in xxx.xxx.xxx.xx:8285, you will see this demo.
Deal with trec09 and trec12 web dataset
####Step1: Warc to Mongo#### Parser warc.gz format and insert into Mongodb
####Step2: Mongodb to Solr#### Retrieve doc from mongodb and indexed with solr
####Step3: Solr to WebUI#### Design Web UI to see search results
To contribute to Trec, clone this repo locally and commit your code on a separate branch. Please write unit tests for your code, and run the linter before opening a pull-request:
Trec major versions are just for course project. This means that patch-level changes will be added and bugs will be fixed over a long period. The table below outlines the end-of-support dates for major versions, and the last minor release for that version.
❔ | Major Version | Last Minor Release | Support End Date |
---|---|---|---|
::hourglass:: | 1 | 1 | N/A |
If you're opening issues related to these, please mention the version that the issue relates to.
Trec is licensed under the MIT license. Copyright © 2016, RominYue