Esempi in Python per HTMLImageLinkExtractor.HTMLImageLinkExtractor

Linguaggio di programmazione: Python

Spazio dei nomi/nome del pacchetto: scrapy.contrib.linkextractors.image

Classe/tipologia: HTMLImageLinkExtractor

Metodo/funzione: HTMLImageLinkExtractor

Esempi su hotexamples.com: 2

HTMLImageLinkExtractor.HTMLImageLinkExtractor in Python: 2 esempi trovati. Questi sono i migliori esempi reali in Python per scrapy.contrib.linkextractors.image.HTMLImageLinkExtractor.HTMLImageLinkExtractor, estratti da progetti open source. Li puoi valutare, per aiutarci a migliorare la qualità dei nostri esempi.

Metodi utilizzati di frequente

Mostra Nascondi

HTMLImageLinkExtractor(2)

extract_links(2)

Esempio n. 1

Mostra file

File: test_contrib_linkextractors.py Progetto: richard-ma/CodeReading

    def test_extraction(self):
        '''Test the extractor's behaviour among different situations'''

        lx = HTMLImageLinkExtractor(locations=('//img', ))
        links_1 = lx.extract_links(self.response)
        self.assertEqual(links_1, [
            Link(url='http://example.com/sample1.jpg', text=u'sample 1'),
            Link(url='http://example.com/sample2.jpg', text=u'sample 2'),
            Link(url='http://example.com/sample4.jpg', text=u'sample 4')
        ])

        lx = HTMLImageLinkExtractor(locations=('//img', ), unique=False)
        links_2 = lx.extract_links(self.response)
        self.assertEqual(links_2, [
            Link(url='http://example.com/sample1.jpg', text=u'sample 1'),
            Link(url='http://example.com/sample2.jpg', text=u'sample 2'),
            Link(url='http://example.com/sample4.jpg', text=u'sample 4'),
            Link(url='http://example.com/sample4.jpg',
                 text=u'sample 4 repetition')
        ])

        lx = HTMLImageLinkExtractor(locations=('//div[@id="wrapper"]', ))
        links_3 = lx.extract_links(self.response)
        self.assertEqual(links_3, [
            Link(url='http://example.com/sample1.jpg', text=u'sample 1'),
            Link(url='http://example.com/sample2.jpg', text=u'sample 2'),
            Link(url='http://example.com/sample4.jpg', text=u'sample 4')
        ])

        lx = HTMLImageLinkExtractor(locations=('//a', ))
        links_4 = lx.extract_links(self.response)
        self.assertEqual(links_4, [
            Link(url='http://example.com/sample2.jpg', text=u'sample 2'),
            Link(url='http://example.com/sample3.html', text=u'sample 3')
        ])

Esempio n. 2

Mostra file

File: test_contrib_linkextractors.py Progetto: richard-ma/CodeReading

 def test_urls_type(self):
     '''Test that the resulting urls are regular strings and not a unicode objects'''
     lx = HTMLImageLinkExtractor()
     links = lx.extract_links(self.response)
     self.assertTrue(all(isinstance(link.url, str) for link in links))