import sys

from DB import DB
from URL import URL

# One-shot seeding script: store a single CiteSeerX summary URL
# (e.g. http://citeseerx.ist.psu.edu/viewdoc/summary?cid=16057) in the
# 'link' table so the crawler script can pick it up later.
db = DB('citeseerx.db')
db.create_tables()

if len(sys.argv) == 2:
    url = URL(sys.argv[1])
    url.open()
    # DOI acts as the key for a link row; get_doi()/get_url() are project
    # helpers on URL -- semantics assumed from usage here.
    db.insert('link', {'doi': url.get_doi(), 'url': url.get_url()})
else:
    print('Please supply proper URL.')
    # Exit non-zero so shell callers can detect the usage error.
    sys.exit(1)
from URL import URL
from DB import DB
from bs4 import BeautifulSoup

# Crawler loop: drain the queue of unprocessed links, recording redirect
# targets and scraping title/abstract metadata from each CiteSeerX
# summary page.
db = DB('citeseerx.db')
count = 0  # number of links processed in this run

while db.count_unpr():
    count += 1
    link = db.get_unpr()
    print(link)

    url = URL(link)
    url.open()
    # Mark this link as processed (status code 2 -- meaning assumed;
    # confirm against DB.update_link()).
    db.update_link(url.get_doi(), 2)

    # If the request was redirected, record the final URL under the same
    # DOI so future runs do not refetch the stale address.
    if not db.exists('link', url.get_doi()) and url.redirect_occured():
        db.insert('link', {
            'doi': url.get_doi(),
            'url': url.get_redirect_url(),
        })

    if not db.exists('metadata', url.get_doi()):
        html = url.fetch()
        soup = BeautifulSoup(html, "html.parser")
        # First text node of the first <h2> is taken as the paper title.
        title = soup.find('h2').findAll(text=True)[0]
        abstract_div = soup.find("div", {"id": "abstract"})
        for tag in abstract_div:
            # NOTE(review): iterating the div yields NavigableStrings too;
            # in bs4 their .name is None, so only real <p> tags match.
            if tag.name == 'p':
                abstract = tag.findAll(text=True)