Python extract_urls Examples

Programming Language: Python

Namespace/Package Name: archivebot.bot

Method/Function: extract_urls

Examples at hotexamples.com: 7

Python extract_urls - 7 examples found. These are the top-rated real-world Python examples of archivebot.bot.extract_urls, extracted from open source projects.

Example #1

File: test-bot.py Project: darricktheprogrammer/reddit-craigslist-archive-bot

 def test_ExtractUrls_GivenCraigslistTermsPage_ReturnsEmptyList(self):
     url = 'https://www.craigslist.org/about/terms.of.use'
     text = self.body.replace('%url%', url)
     self.assertEqual(len(bot.extract_urls(text)), 0)

Example #2

File: test-bot.py Project: darricktheprogrammer/reddit-craigslist-archive-bot

 def test_ExtractUrls_GivenNonCraigslistPageEndingWithHtml_ReturnsEmptyList(
         self):
     url = 'https://www.google.com/about.html'
     text = self.body.replace('%url%', url)
     self.assertEqual(len(bot.extract_urls(text)), 0)

Example #3

File: test-bot.py Project: darricktheprogrammer/reddit-craigslist-archive-bot

 def test_ExtractUrls_GivenCraigslistSearchPage_ReturnsEmptyList(self):
     url = 'https://tampa.craigslist.org/d/for-sale/search/sss'
     text = self.body.replace('%url%', url)
     self.assertEqual(len(bot.extract_urls(text)), 0)

Example #4

File: test-bot.py Project: darricktheprogrammer/reddit-craigslist-archive-bot

 def test_ExtractUrls_GivenCraigslistScamsPageWithRegularHTTP_ReturnsEmptyList(
         self):
     url = 'http://www.craigslist.org/about/scams'
     text = self.body.replace('%url%', url)
     self.assertEqual(len(bot.extract_urls(text)), 0)

Example #5

File: test-bot.py Project: darricktheprogrammer/reddit-craigslist-archive-bot

 def test_ExtractUrls_GivenForumUrl_ReturnsEmptyList(self):
     url = 'https://forums.craigslist.org/?forumID=3'
     text = self.body.replace('%url%', url)
     self.assertEqual(len(bot.extract_urls(text)), 0)

Example #6

File: test-bot.py Project: darricktheprogrammer/reddit-craigslist-archive-bot

 def test_ExtractUrls_GivenMultipleUrls_ReturnsMultipleUrls(self):
     url = 'http://indianapolis.craigslist.org/bar/d/bears/6451661128.html'
     url2 = 'https://dallas.craigslist.org/ftw/zip/d/20000-pounds-free-remotes/6426178725.html'
     text = self.body.replace('%url%', url)
     text = text.replace('%url2%', url2)
     self.assertEqual(len(bot.extract_urls(text)), 2)

Example #7

File: test-bot.py Project: darricktheprogrammer/reddit-craigslist-archive-bot

 def test_ExtractUrls_GivenFullUrlWithOnlyHTTP_ReturnsUrl(self):
     url = 'http://indianapolis.craigslist.org/bar/d/bears/6451661128.html'
     text = self.body.replace('%url%', url)
     self.assertEqual(len(bot.extract_urls(text)), 1)
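
Taken together, the tests above suggest that extract_urls keeps only Craigslist listing pages (a craigslist.org subdomain whose path ends in a numeric post ID plus ".html") and ignores terms, scams, search, and forum pages as well as non-Craigslist sites. The project's actual implementation is not shown here, so the following is only a minimal sketch under that assumption. The name extract_listing_urls and the regular expression are hypothetical, chosen to satisfy the seven cases listed.

```python
import re

# Hypothetical pattern, not taken from the archivebot source:
# an http(s) URL on any craigslist.org subdomain whose last path
# segment is a numeric post ID followed by ".html".
LISTING_URL = re.compile(
    r"https?://[a-z0-9-]+\.craigslist\.org(?:/[\w.~%-]+)*?/\d+\.html",
    re.IGNORECASE,
)


def extract_listing_urls(text):
    """Return every Craigslist listing URL in text, in order of appearance."""
    # The pattern has no capturing groups, so findall returns full matches.
    return LISTING_URL.findall(text)
```

Because the pattern requires both the craigslist.org host and the numeric ".html" ending, a URL such as https://www.google.com/about.html fails on the host and https://www.craigslist.org/about/terms.of.use fails on the ending.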