Python TextExtractor.get_file_contents_as_array Examples

Programming language: Python

Namespace/package name: text_extractor

Class/type: TextExtractor

Method/function: get_file_contents_as_array

Examples on hotexamples.com: 2

Python TextExtractor.get_file_contents_as_array - 2 examples found. These are the top-rated real-world Python examples of text_extractor.TextExtractor.get_file_contents_as_array, extracted from open source projects.

Frequently used methods

TextExtractor(6)

get_extension(3)

extract(2)

get_file_contents_as_array(2)

export_csv(1)

extract_text(1)

extract_text_from_version(1)

get_item(1)

get_lecturer_info(1)

Example #1

File: test_extractor_test.py Project: stubevan/DTPO-Autoload

    def test_non_ocr_pdf(self):
        """
            Access a valid PDF which hasn't been OCR'd
        """
        file_name = 'non_ocr_file.pdf'

        text_extractor = TextExtractor(
            source_file=file_name,
            source_directory=TextExtractorTest.test_directory,
            working_directory='/tmp',
            testing=True)

        actual_results = text_extractor.get_file_contents_as_array()

        # A PDF without OCR yields no text, so an empty result is expected.
        # assertEqual replaces the deprecated assertEquals alias.
        self.assertEqual(len(actual_results), 0)

Example #2

File: test_extractor_test.py Project: stubevan/DTPO-Autoload

    def test_valid_pdf(self):
        """
            Access a file with a .pdf suffix and compare the extracted lines
            This could well break when we test the file type properly
        """

        # Each entry is one line of extracted text, line ending included
        expected_results = [
            'Test 1\n',
            'Test 2\n',
            '\n'
        ]
        file_name = 'test_file1.pdf'

        text_extractor = TextExtractor(
            source_file=file_name,
            source_directory=TextExtractorTest.test_directory,
            working_directory='/tmp',
            testing=True)

        actual_results = text_extractor.get_file_contents_as_array()

        # assertEqual replaces the deprecated assertEquals alias
        self.assertEqual(expected_results, actual_results)
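
Both tests treat the return value as a list of lines that keep their trailing newline, with an empty list when nothing could be extracted. The sketch below shows one way a method of that shape might behave. It is not the DTPO-Autoload implementation: `read_lines_as_array`, `demo`, and the file names are hypothetical, and it assumes the extracted text already sits in a plain-text file.

```python
import os
import tempfile


def read_lines_as_array(path):
    """Return the lines of a text file as a list, keeping line endings.

    Hypothetical helper for illustration only; not taken from DTPO-Autoload.
    A missing file is treated as "no text extracted" and gives an empty list.
    """
    if not os.path.exists(path):
        return []
    with open(path, encoding="utf-8") as handle:
        # readlines() keeps the trailing '\n' on each line
        return handle.readlines()


def demo():
    """Write a small sample file, then read it and a missing file back."""
    with tempfile.TemporaryDirectory() as tmp:
        text_path = os.path.join(tmp, "extracted.txt")
        with open(text_path, "w", encoding="utf-8", newline="\n") as handle:
            handle.write("Test 1\nTest 2\n\n")
        full = read_lines_as_array(text_path)
        missing = read_lines_as_array(os.path.join(tmp, "missing.txt"))
    return full, missing


if __name__ == "__main__":
    full, missing = demo()
    print(full)
    print(missing)
```

Keeping the line endings is what makes the final blank line show up as a separate `'\n'` entry, which matches the shape of `expected_results` in Example #2.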