Python UncheckedExternalURLの例

プログラミング言語: Python

名前空間/パッケージ名: edx.analytics.tasks.url

hotexamples.comのコード掲載数: 6

Python UncheckedExternalURL - 6件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのedx.analytics.tasks.url.UncheckedExternalURLの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

UncheckedExternalURL(6)

よく使われるメソッド

UncheckedExternalURL (6)

コード例 #1

ファイルを表示

    def _get_requirements(self):
        """
        Gather the set of requirements needed to run the task.

        This can be a rather expensive operation that requires usage of the S3 API to list all files in the source
        bucket and select the ones that are applicable to the given date range.
        """
        url_gens = []
        for source in self.source:
            if source.startswith('s3'):
                url_gens.append(self._get_s3_urls(source))
            elif source.startswith('hdfs'):
                url_gens.append(self._get_hdfs_urls(source))
            else:
                url_gens.append(self._get_local_urls(source))

        log.debug('Matching urls using pattern(s)="%s"', self.pattern)
        log.debug('Date interval: %s <= date < %s',
                  self.interval.date_a.isoformat(),
                  self.interval.date_b.isoformat())

        return [
            UncheckedExternalURL(url) for url_gen in url_gens
            for url in url_gen if self.should_include_url(url)
        ]

コード例 #2

ファイルを表示

    def test_requires(self, connect_s3_mock):
        s3_conn_mock = connect_s3_mock.return_value
        bucket_mock = s3_conn_mock.get_bucket.return_value

        class FakeKey(object):
            """A test double of the structure returned by boto when listing keys in an S3 bucket."""
            def __init__(self, path):
                self.key = path
                self.size = 10

        bucket_mock.list.return_value = [
            FakeKey(path) for path in self.SAMPLE_KEY_PATHS
        ]

        task = PathSelectionByDateIntervalTask(
            source=self.SOURCE,
            interval=Month.parse('2014-03'),
            pattern=[r'.*?FakeServerGroup/tracking.log-(?P<date>\d{8}).*\.gz'],
            expand_interval=datetime.timedelta(0),
        )

        expected_paths = [
            'FakeServerGroup/tracking.log-20140318.gz',
            'FakeServerGroup/tracking.log-20140319-1395256622.gz',
        ]

        self.assertItemsEqual(task.requires(), [
            UncheckedExternalURL(source + path) for path in expected_paths
            for source in self.SOURCE
        ])

コード例 #3

ファイルを表示

ファイル: enrollments.py プロジェクト: linearregression/edx-analytics-pipeline

    def downstream_input_tasks(self):
        """
        MultiOutputMapReduceJobTask returns marker as output.
        This method returns the external tasks which can then be used as input in other jobs.
        Note that this method does not verify the existence of the underlying urls. It assumes that
        there is an output file for every date within the interval. Any MapReduce job
        which uses this as input would fail if there is missing data for any date within the interval.
        """

        tasks = []
        for date in self.interval:
            url = self.output_path_for_key(date.isoformat())
            tasks.append(UncheckedExternalURL(url))

        return tasks

コード例 #4

ファイルを表示

ファイル: location_per_course.py プロジェクト: npoed/edx-analytics-pipeline

    def downstream_input_tasks(self):
        """
        Provide a list of tasks that a downstream task would use as input.

        This is necessary because a MultiOutputMapReduceJobTask returns a marker as output.
        Note that this method does not verify the existence of the underlying urls. It assumes that
        there is an output file for every date within the interval. Any MapReduce job
        which uses this as input would fail if there is missing data for any date within the interval,
        so this task will create empty output files for dates with no data.
        """
        tasks = []
        for date in self.interval:
            url = self.output_path_for_key(date.isoformat())
            tasks.append(UncheckedExternalURL(url))

        return tasks

コード例 #5

ファイルを表示

ファイル: test_manifest.py プロジェクト: rogeriofalcone/edx-analytics-pipeline

 def test_requirements(self):
     self.assertItemsEqual(self.task.requires(),
                           [UncheckedExternalURL(self.SOURCE_URL)])

コード例 #6

ファイルを表示

ファイル: manifest.py プロジェクト: rogeriofalcone/edx-analytics-pipeline

 def requires(self):
     return [UncheckedExternalURL(url) for url in self.urls]