from random import shuffle
import sys

from bs4 import BeautifulSoup

# HarParser and APIWriter are project-local modules; these import paths are
# assumptions based on the class names used below.
from harParser import HarParser
from apiWriter import APIWriter


# Method of the crawler class: recursively visits internal URLs, scanning each
# page's HAR capture for API calls, until self.count pages have been visited.
def crawlingScan(self, url, apiCalls=None, allFoundURLs=None):
    # Avoid mutable default arguments, which Python shares across calls
    if apiCalls is None:
        apiCalls = []
    if allFoundURLs is None:
        allFoundURLs = []

    self.count = self.count - 1
    if self.count < 0:
        return apiCalls

    harParser = HarParser(self.harDirectory, searchString=self.searchString,
                          removeParams=self.removeParams)
    # If uncommented, will return as soon as a matching call is found
    # if self.searchString is not None and len(apiCalls) > 0:
    #     return apiCalls
    try:
        print("Scanning URL: " + url)
        html = self.openURL(url)
        if html is not None:
            bsObj = BeautifulSoup(html, "lxml")
            harObj = harParser.getSingleHarFile()
            apiCalls = harParser.scanHarfile(harObj, apiCalls=apiCalls)
            allFoundURLs, newUrls = self.findInternalURLs(bsObj, url, allFoundURLs)
            # Randomize the crawl order so repeated runs explore different paths
            shuffle(newUrls)
            for newUrl in newUrls:
                self.crawlingScan(newUrl, apiCalls, allFoundURLs)
    except (KeyboardInterrupt, SystemExit):
        # On interrupt, write out whatever API calls were collected so far
        print("Stopping crawl")
        self.browser.close()
        apiWriter = APIWriter(apiCalls)
        apiWriter.outputAPIs()
        sys.exit(1)
    return apiCalls
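
# A minimal usage sketch, assuming crawlingScan is a method of a crawler
# class; the class name APIFinder and the constructor parameters shown here
# are hypothetical illustrations, not confirmed parts of this project's API:
#
#     finder = APIFinder(harDirectory="hars", searchString=None,
#                        removeParams=False, count=50)
#     apiCalls = finder.crawlingScan("http://example.com")
#     print("Found %d API calls" % len(apiCalls))
#
# The count limit bounds the recursion, and because apiCalls is threaded
# through every recursive call, results accumulate in a single list that is
# returned when the crawl finishes or is interrupted.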