import re

from bs4 import BeautifulSoup as BS


def get_topic_words(topic):
    html_content = get_topic_page(topic)
    more_link_word = get_some_more_links_word(html_content)
    if len(more_link_word):
        # Limit to three links so it runs faster
        for word in more_link_word[0:3]:
            html_content += get_topic_page(word)
    words = re.findall(r"[а-яА-Я\-']{3,}", html_content)
    return words

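# Every snippet here relies on get_topic_page(), which is not defined in this
# collection. A minimal sketch of what it is assumed to do: fetch the raw HTML
# of a Russian Wikipedia article. The URL scheme and the use of urllib are
# assumptions for illustration, not the original implementation.
from urllib.parse import quote
from urllib.request import urlopen


def get_topic_page(topic):
    # Accept either a bare topic name or an already-formed "/wiki/..." path,
    # since some snippets feed links from get_wiki_links() back in
    if topic.startswith("/wiki/"):
        url = "https://ru.wikipedia.org" + topic
    else:
        url = "https://ru.wikipedia.org/wiki/" + quote(topic)
    with urlopen(url) as response:
        return response.read().decode("utf-8")
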
def get_wiki_links(link):
    html_content = get_topic_page(link)
    soup = BS(html_content, 'html.parser')
    links = soup.find_all("a")
    links = [link.get('href', '') for link in links]
    # The unescaped dot in './wiki/' matches any character, so the second check
    # drops links that have anything before '/wiki/' and keeps only relative
    # article links that start with '/wiki/'
    links = [
        link for link in links
        if re.search('/wiki/', link) and not re.search('./wiki/', link)
    ]
    return links

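# Usage sketch (the topic name is only an example): the relative links that
# get_wiki_links() returns can be fed straight back into get_topic_page()
# from the sketch above.
if __name__ == "__main__":
    for wiki_link in get_wiki_links("Дерево")[:3]:
        neighbour_page = get_topic_page(wiki_link)
        print(wiki_link, len(neighbour_page))
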
def get_topic_tables(topic):
    html_content = get_topic_page(topic)
    soup = BS(html_content, "html.parser")
    tables = soup.find_all("table")
    tbs = soup.select("table.standard")
    for t in tbs:
        trs = t.select("tr")
        print(len(trs))
    hrs = [t.get("class", "") for t in tables]
    print(hrs)
    return hrs

def get_topic_text(topic):
    html_content = get_topic_page(topic)
    words = re.findall(r"[а-яА-Я\-']+", html_content)
    text = " ".join(words)
    return text

def get_topic_words(topic):
    html_content = get_topic_page(topic)
    words = re.findall(r"[а-яА-Я\-']+", html_content)
    return words

def get_topic_words(link):
    html_content = get_topic_page(link)
    words = re.findall(r"[а-яА-Я\-']{3,}", html_content)
    return words

def get_topic_words(topic):
    html_content = get_topic_page(topic)
    words = re.findall(r"[а-яёА-Я\-']{3,}", html_content)
    # text = " ".join(words)
    return words

def get_topic_links(topic):
    # Despite the name, this variant only prints the <tr> elements it finds
    # and returns nothing
    html_content = get_topic_page(topic)
    soup = BS(html_content, "html.parser")
    links = soup.find_all("tr")
    print(links)

def get_neighbo_pages(topic):
    nlinks = get_neighbo_links(topic)
    html_pages = [get_topic_page(n) for n in nlinks]
    return html_pages

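# get_neighbo_links() is not shown in this collection. A hedged sketch of what
# it is assumed to do, reusing get_wiki_links() from above; the cap of ten
# links is an arbitrary illustration, not part of the original code.
def get_neighbo_links(topic):
    # Neighbouring pages are taken to be the first internal article links
    return get_wiki_links(topic)[:10]
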
def get_topic_words(topic):
    html_content = get_topic_page(topic)
    words = re.findall(r"[а-яА-Я][а-яА-Я\-']+[а-яА-Я]", html_content)
    # Capitalize each word so that, e.g., "Дерево" and "дерево" are not
    # counted as different words
    return [w.capitalize() for w in words]

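# A small illustration of why the capitalization above matters when counting
# word frequencies; count_topic_words() is a hypothetical helper added here
# only for the example, not part of the original code.
from collections import Counter


def count_topic_words(topic):
    # Frequencies of the capitalized words: "Дерево" and "дерево" now fall
    # into the same bucket
    return Counter(get_topic_words(topic)).most_common(10)
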
def get_topic_links(topic):
    html_content = get_topic_page(topic)
    soup = BS(html_content, "html.parser")
    links = soup.find_all("a")
    hrefs = [n.get("href", "") for n in links]
    return hrefs

def get_topic_words(topic):
    html_content = get_topic_page(topic)
    # keep only words that are at least three letters long
    # (the earlier version matched words of any length)
    # words = re.findall(r"[а-яА-Я\-']+", html_content)
    words = re.findall(r"[а-яА-Я\-']{3,}", html_content)
    return words