Author: Peter Boraros (based on Niklas Baumstark's webkit-server)
A standalone version of the Webkit server included in capybara-webkit. It is rewritten by pborky with PySide QT bindings and the following improvements over the original version from thoughtbot:
Wait
command to wait for the current page to loadSetAttribute
command to configure certainQWebkit
settingsSetHtml
command to load custom HTML into the browser (e.g. to execute scripts on web pages scraped by a static scraper)SetViewportSize
command to set the viewport size of the in-memory browser
If you are interested in web scraping using this server, have a look at dryscrape.
It is recomended to used distribution version of PySide, e.g. for Ubuntu you should install
at least python-pyside.qtwebkit
package. Then
invoke pip install --process-dependency-links git+https://github.com/pborky/webkit-scraper.git
to install the server.
If you have any problems with this software, don't hesitate to open an issue on Github or open a pull request.
This software is based on capybara-webkit. capybara-webkit is Copyright (c) 2011 thoughtbot, inc. It is free software, and may be redistributed under the terms specified in the LICENSE file.