A web crawler for collecting online academic publication metadata.
Built on the Scrapy framework. Run pip install scrapy
to set up the runtime environment.
Spiders for collecting academic publication metadata.
Collects bibliographic data from selected USENIX conferences.
Collects the paper list from all USENIX proceedings since 1993.
Translates crawled data into instances of the pub-owl ontology.
Converts the collected data to the W3C Resource Description Framework (RDF) model using rdflib and the FOAF specification.
...