def medlineParser(xmlDict, saveDict, numOfProcess): ''' Parse medline raw data files ''' path = xml.listXmlPath(xmlDict) pool = cocurrency.processPool(numOfProcess) try: pool.map(medlineEntry, itertools.izip(path, itertools.repeat(saveDict))) except Exception: raise
def test_listXmlPath(self): xmlPath = xml.listXmlPath('../data/pubmedMedline/') for index, url in enumerate(xmlPath): suffix = url.split('.')[1] self.assertEqual(suffix, 'xml')
def test_xmlConvDict(self): xmlPath = xml.listXmlPath('../data/') for index, url in enumerate(xmlPath): conv = xml.xmlConvDict(url) self.assertEqual(type(conv), dict)