amityadav9314/webbed


#Project: A web crawler cum search engine
#URI: http://www.webbed.in/
#Version: 1.0
#Author: Amit Yadav
#Author URI: http://www.amityadav.in
#Github URI: https://github.com/am1ty9d9v/webbed


Webbed is a very simple search engine that I built after taking the CS 101 course on http://www.udacity.com, which teaches how to build a web crawler. The course never implemented a search engine that runs in the web browser the way google.com and bing.com do, but using Beautiful Soup I managed to get a basic version working.

To build this search engine, I used OpenShift as the platform to run the application on, Python as the language, MySQL as the backend, and Beautiful Soup as the HTML parser.
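As a rough illustration of how a keyword index can sit in a SQL backend, here is a minimal sketch. It is not the project's actual schema: the table name `keyword_index` and the functions `build_index` and `lookup` are hypothetical, and the standard library's `sqlite3` stands in for MySQL so the example runs without a database server.

```python
import sqlite3


def build_index(conn, pages):
    """Store one (keyword, url) row per distinct word on each page.

    `pages` maps a URL to its plain text. The schema is hypothetical.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS keyword_index ("
        "keyword TEXT, url TEXT, UNIQUE (keyword, url))"
    )
    for url, text in pages.items():
        # Lowercase and de-duplicate so each word is stored once per page.
        for word in set(text.lower().split()):
            conn.execute(
                "INSERT OR IGNORE INTO keyword_index (keyword, url) VALUES (?, ?)",
                (word, url),
            )
    conn.commit()


def lookup(conn, keyword):
    """Return the URLs whose text contained `keyword`, sorted by URL."""
    rows = conn.execute(
        "SELECT url FROM keyword_index WHERE keyword = ? ORDER BY url",
        (keyword.lower(),),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # in-memory stand-in for MySQL
    build_index(conn, {"http://a.example/": "Python web crawler"})
    print(lookup(conn, "crawler"))
```

A search then becomes a single indexed lookup per query word. Ranking results is outside the scope of this sketch.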

The web crawler script final_crawler.py uses BeautifulSoup, an HTML parsing library by Leonard Richardson, and the Robot Exclusion Rules parser by Philip Semanchuk.
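The crawl step described here (parse a page for links, then keep only those that robots.txt allows) can be sketched as follows. This is not the code in final_crawler.py. To stay dependency-free it swaps in the standard library's `html.parser` and `urllib.robotparser` for BeautifulSoup and the Robot Exclusion Rules parser, and the names `LinkCollector`, `allowed_links`, and the `webbed-sketch` user agent are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.robotparser import RobotFileParser


class LinkCollector(HTMLParser):
    """Collect absolute URLs from the href attribute of <a> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the page's own URL.
                self.links.append(urljoin(self.base_url, value))


def allowed_links(html, base_url, robots_lines, agent="webbed-sketch"):
    """Return the links in `html` that the given robots.txt lines permit."""
    robots = RobotFileParser()
    robots.parse(robots_lines)  # parse rules already fetched as a list of lines
    collector = LinkCollector(base_url)
    collector.feed(html)
    return [url for url in collector.links if robots.can_fetch(agent, url)]


SAMPLE_ROBOTS = ["User-agent: *", "Disallow: /private/"]
SAMPLE_HTML = '<a href="/about">About</a> <a href="/private/x">Hidden</a>'

if __name__ == "__main__":
    print(allowed_links(SAMPLE_HTML, "http://example.com/", SAMPLE_ROBOTS))
```

In a real crawler the HTML and the robots.txt lines would come from HTTP responses. They are passed in as plain values here so the filtering logic can be tried without network access.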

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. Please see the included licensing information for further details.

A working version of this search engine is available at http://www.webbed.in

About

This is my final-year graduation project, through which I discovered Django. It does not use many Django features, just the logic behind a search engine.
