def test_gen_urls_for_ptt(self):
    """A %d template with a single-number range product yields exactly one URL.

    range(6188, 6189) contains only 6188, so gen_urls must emit the one
    PTT board-index URL with that number substituted.
    """
    # list(...) instead of the original [i for i in ...] wrapper (C416);
    # 'urls' instead of the ambiguous single-letter name 'l' (E741).
    urls = list(
        pycheetah.gen_urls('https://www.ptt.cc/bbs/movie/index%d.html',
                           product=[list(range(6188, 6189))]))
    self.assertEqual(1, len(urls))
    self.assertEqual('https://www.ptt.cc/bbs/movie/index6188.html', urls[0])
def test_get_urls_exception(self):
    """An invalid start date ('2017/1/0' — day 0 does not exist) raises ValueError.

    gen_urls is a generator, so it must be consumed inside the
    assertRaises block for the error to surface.
    """
    # snake_case for a plain local list (was PascalCase 'Classification'),
    # and list(...) to drain the generator instead of a throwaway
    # comprehension whose result was never used.
    classification = ['world']
    with self.assertRaises(ValueError):
        list(
            pycheetah.gen_urls('https://www.theguardian.com/%s/%s/all',
                               '2017/1/0', '2017/1/5',
                               product=[classification, 'date']))
def test_gen_urls_for_nyt(self):
    """A 5-day inclusive date range with date_format expands to 5 URLs."""
    # list(...) instead of the original [i for i in ...] wrapper (C416);
    # 'urls' instead of the ambiguous single-letter name 'l' (E741).
    urls = list(
        pycheetah.gen_urls(
            'http://www.nytimes.com/indexes/%s/todayspaper/index.html',
            '2017/1/1', '2017/1/5',
            date_format='%Y/%m/%d',
            product=['date']))
    self.assertEqual(5, len(urls))
def main():
    """Two-stage PTT crawl as a generator pipeline.

    Stage 1 scrapes board-index pages (6180-6188) for article links;
    stage 2 fetches those articles. Each stage's reduced output is
    yielded to the caller.
    """
    pycheetah.init_logger()
    urls = list(
        pycheetah.gen_urls('https://www.ptt.cc/bbs/movie/index%d.html',
                           product=[list(range(6180, 6189))]))
    result = Board.start(urls)
    urls = result.reduce_by('links')
    yield urls
    # fixed typo: was 'reseult'
    result = Article.start(urls)
    yield result.reduce_by('article')
def test_gen_urls_for_guardian(self):
    """11 categories crossed with a 5-day inclusive range -> 55 URLs."""
    # snake_case for a plain local list (was PascalCase 'Classification');
    # list(...) instead of the [i for i in ...] wrapper (C416);
    # 'urls' instead of the ambiguous single-letter name 'l' (E741).
    categories = [
        'world', 'politics', 'sport', 'football', 'culture', 'business',
        'lifeandstyle', 'fashion', 'environment', 'technology', 'travel'
    ]
    urls = list(
        pycheetah.gen_urls('https://www.theguardian.com/%s/%s/all',
                           '2017/1/1', '2017/1/5',
                           product=[categories, 'date']))
    self.assertEqual(55, len(urls))
def main():
    """Two-stage NYT crawl as a generator pipeline.

    Stage 1 fetches the today's-paper index for a single day (2017-01-01)
    and collects article URLs; stage 2 fetches those pages and reduces
    their titles. Each stage's reduced output is yielded to the caller.
    """
    pycheetah.init_logger()
    daily_urls = list(
        pycheetah.gen_urls(
            'http://www.nytimes.com/indexes/%s/todayspaper/index.html',
            '2017/1/1',
            '2017/1/1',
            date_format='%Y/%m/%d',
            product=['date']))
    daily_result = DailyPage.start(daily_urls)
    article_urls = daily_result.reduce_by('urls')
    yield article_urls
    news_result = NewsPage.start(article_urls)
    yield news_result.reduce_by('title')
def main():
    """Two-stage Guardian crawl as a generator pipeline.

    Stage 1 fetches one day (2017-01-01) of daily pages across eleven
    site sections and collects article URLs; stage 2 fetches those pages
    and reduces their names. Each stage's reduced output is yielded.
    """
    sections = [
        'world', 'politics', 'sport', 'football', 'culture', 'business',
        'lifeandstyle', 'fashion', 'environment', 'technology', 'travel'
    ]
    all_daily_urls = list(
        pycheetah.gen_urls('https://www.theguardian.com/%s/%s/all',
                           '2017/1/1',
                           '2017/1/1',
                           product=[sections, 'date']))
    pycheetah.init_logger()
    daily_result = DailyPage.start(all_daily_urls)
    article_urls = daily_result.reduce_by('urls')
    yield article_urls
    news_result = NewsPage.start(article_urls)
    yield news_result.reduce_by('name')