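Python count_pages Examples

Programming language: Python

Namespace/package name: util

Method/function: count_pages

Examples on hotexamples.com: 8

Python count_pages - 8 examples found. These are the top-rated real-world Python examples of util.count_pages, extracted from open source projects. Note that util is a different module in each project: in ddohler/webocr the function is called with a document object, while in the other projects it is called with an item count, apparently to compute a number of pagination pages.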

Example #1

File: tasks.py Project: ddohler/webocr

def document_analysis(docid):
    #TODO: Check for multiple objects?
    doc = util.is_valid_doc(docid)
    
    doc.file_format = util.determine_format(doc)
    ### Counting pages and repairing damaged documents ###
    num_pages = util.count_pages(doc)
    #TODO: The repair command doesn't quite work; need to make a copy first
    # or update the object's field.
    #if num_pages == -1 and doc.file_format == 'pdf':
    #    # Try to repair damaged PDF
    #    cmd = ['pdftk', MEDIA_ROOT+doc.doc_file, 'output', MEDIA_ROOT+doc.doc_file]
    #    try:
    #        subprocess.check_call(cmd)
    #    except subprocess.CalledProcessError as e:
    #        print(e)
    #        #TODO: More error handling if necessary
    #
    #    #Try again
    #    num_pages = util.count_pages(doc)
    #    #If it's still undetectable there's not much more we can do
    #    #TODO: Report error, image cannot be processed.

    if doc.file_format == 'pdf':
        #Counting the number of pages may fail; PyPdf doesn't handle corrupt
        #PDFs well.
        num_imgs = util.count_images(doc)
        has_text = util.detect_text(doc)
    else:
        num_imgs = num_pages #For TIFFS num_pages might be >1
        has_text = False

    # Decide what to do
    if not has_text and num_imgs == num_pages: #Simple case
        #print "Pages: %d, Images: %d, Text: %d" %(num_pages,num_imgs,has_text)
        pages_from_images.delay(docid)
    elif has_text and num_imgs == 0: #Nothing to OCR
        #print "Pages: %d, Images: %d, Text: %d" %(num_pages,num_imgs,has_text)
        pages_from_rasterize.delay(docid) #Rasterize and output page images
    elif has_text and num_imgs > 0: #Mixed image / text
        #print "Pages: %d, Images: %d, Text: %d" %(num_pages,num_imgs,has_text)
        pages_from_rasterize.delay(docid) #For now, rasterize pages, then OCR
    else: #Fallback to rasterization
        #print "Pages: %d, Images: %d, Text: %d" %(num_pages,num_imgs,has_text)
        pages_from_rasterize.delay(docid) #rasterize and OCR

    doc.num_pages = num_pages
    doc.save()
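
The dispatch rules at the end of document_analysis can be read as a small pure function. The sketch below restates them under that reading; `choose_pipeline` is a hypothetical name that does not appear in webocr, and it returns task names as strings instead of queueing the tasks.

```python
def choose_pipeline(num_pages, num_imgs, has_text):
    """Return the name of the task the analysis step would dispatch.

    Hypothetical helper: it mirrors the if/elif chain in document_analysis
    but is not part of the webocr project.
    """
    if not has_text and num_imgs == num_pages:
        # No text layer and one image per page: use the images directly.
        return "pages_from_images"
    # Text only, mixed text and images, and the fallback case all
    # rasterize the pages in the listing above.
    return "pages_from_rasterize"
```

For instance, `choose_pipeline(3, 3, False)` selects the image path, while any document with a text layer is rasterized. Keeping the decision separate from the task queue makes the rules easy to check without a running worker.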

Example #2

def document_analysis(docid):
    #TODO: Check for multiple objects?
    doc = util.is_valid_doc(docid)

    doc.file_format = util.determine_format(doc)
    ### Counting pages and repairing damaged documents ###
    num_pages = util.count_pages(doc)
    #TODO: The repair command doesn't quite work; need to make a copy first
    # or update the object's field.
    #if num_pages == -1 and doc.file_format == 'pdf':
    #    # Try to repair damaged PDF
    #    cmd = ['pdftk', MEDIA_ROOT+doc.doc_file, 'output', MEDIA_ROOT+doc.doc_file]
    #    try:
    #        subprocess.check_call(cmd)
    #    except subprocess.CalledProcessError as e:
    #        print(e)
    #        #TODO: More error handling if necessary
    #
    #    #Try again
    #    num_pages = util.count_pages(doc)
    #    #If it's still undetectable there's not much more we can do
    #    #TODO: Report error, image cannot be processed.

    if doc.file_format == 'pdf':
        #Counting the number of pages may fail; PyPdf doesn't handle corrupt
        #PDFs well.
        num_imgs = util.count_images(doc)
        has_text = util.detect_text(doc)
    else:
        num_imgs = num_pages  #For TIFFS num_pages might be >1
        has_text = False

    # Decide what to do
    if not has_text and num_imgs == num_pages:  #Simple case
        #print "Pages: %d, Images: %d, Text: %d" %(num_pages,num_imgs,has_text)
        pages_from_images.delay(docid)
    elif has_text and num_imgs == 0:  #Nothing to OCR
        #print "Pages: %d, Images: %d, Text: %d" %(num_pages,num_imgs,has_text)
        pages_from_rasterize.delay(docid)  #Rasterize and output page images
    elif has_text and num_imgs > 0:  #Mixed image / text
        #print "Pages: %d, Images: %d, Text: %d" %(num_pages,num_imgs,has_text)
        pages_from_rasterize.delay(docid)  #For now, rasterize pages, then OCR
    else:  #Fallback to rasterization
        #print "Pages: %d, Images: %d, Text: %d" %(num_pages,num_imgs,has_text)
        pages_from_rasterize.delay(docid)  #rasterize and OCR

    doc.num_pages = num_pages
    doc.save()
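
The commented-out block in this function notes that repairing the PDF in place does not quite work and that a copy is needed first. One way such a retry could be arranged is sketched below. The names `repair_pdf` and `count_pages_with_repair`, and the `count_fn` and `repair_fn` parameters, are hypothetical and not part of webocr; the -1 failure value is inferred from the commented-out check `num_pages == -1`.

```python
import subprocess
import tempfile
from pathlib import Path


def repair_pdf(src):
    """Ask pdftk to rewrite src into a fresh copy; return the copy's path or None."""
    # Write to a new temporary directory so the original file is never
    # overwritten. The caller is responsible for cleaning it up.
    dst = Path(tempfile.mkdtemp()) / Path(src).name
    try:
        subprocess.check_call(["pdftk", str(src), "output", str(dst)])
    except (OSError, subprocess.CalledProcessError):
        # pdftk is not installed or could not process the file.
        return None
    return dst


def count_pages_with_repair(path, count_fn, repair_fn=repair_pdf):
    """Return (num_pages, path_used); num_pages is -1 if counting still fails.

    count_fn is assumed to return -1 when it cannot determine the count.
    """
    num_pages = count_fn(path)
    if num_pages != -1:
        return num_pages, path
    repaired = repair_fn(path)
    if repaired is None:
        return -1, path
    # Try again on the repaired copy.
    return count_fn(repaired), repaired
```

Returning the path that was actually counted lets the caller decide whether to update the document's file field, which is the second option the TODO mentions.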

Example #3

File: comment.py Project: synee/abillist

def count_pages():
    return util.count_pages(db.Query(Comment).count())
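
Here util.count_pages receives an item count rather than a document, which suggests it converts a number of items into a number of pagination pages. The project's own implementation is not shown, so the following is only a minimal sketch of that idea; the `ITEMS_PER_PAGE` value, the `per_page` parameter, and the choice to return 0 pages for 0 items are all assumptions.

```python
ITEMS_PER_PAGE = 10  # hypothetical page size; the real value is not shown


def count_pages(item_count, per_page=ITEMS_PER_PAGE):
    """Number of pages needed to list item_count items, per_page at a time."""
    if item_count < 0 or per_page <= 0:
        raise ValueError("item_count must be >= 0 and per_page must be > 0")
    # Ceiling division using integers only, so a partly filled last
    # page still counts as a page.
    return (item_count + per_page - 1) // per_page
```

Under these assumptions 10 items fit on one page and 11 items need two. A site that always wants at least one page, even when empty, would wrap the result in `max(1, ...)`.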

Example #4

File: comment.py Project: neuront/nijipress

def count_pages():
    return util.count_pages(db.Query(Comment).count())

Example #5

File: post.py Project: neuront/nijipress

def count_pages_by_tag(t):
    return util.count_pages(db.Query(tag.TagPostR).filter('tag =', t).count())
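
The query above appears to count tag-to-post relation records matching one tag and then pass that count to util.count_pages. The same flow can be sketched without a datastore by filtering an in-memory list. The `TagPostR` namedtuple below is only a stand-in for the project's model, and the `relations` and `per_page` parameters are assumptions.

```python
from collections import namedtuple

# Stand-in for the project's tag/post relation model (hypothetical fields).
TagPostR = namedtuple("TagPostR", ["tag", "post_id"])


def count_pages_by_tag(relations, tag, per_page=10):
    """Pages needed to list every post carrying the given tag."""
    # Equivalent in spirit to filtering on 'tag =' and calling count().
    matching = sum(1 for r in relations if r.tag == tag)
    # Ceiling division: a partly filled last page still counts.
    return (matching + per_page - 1) // per_page
```

With twelve relations tagged "python" and a page size of 10, this yields two pages; a tag with no relations yields zero.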

Example #6

File: post.py Project: neuront/nijipress

def count_pages():
    return util.count_pages(db.Query(Post).count())

Example #7

File: post.py Project: neuront/nijinote

def count_pages_by_tag(t):
    return util.count_pages(db.Query(tag.TagPostR).filter('tag =', t).count())

Example #8

File: post.py Project: neuront/nijinote

def count_pages():
    return util.count_pages(db.Query(Post).count())