Python Page.out_links Examples

Programming Language: Python

Namespace/Package Name: model.page

Class/Type: Page

Method/Function: out_links

Examples at hotexamples.com: 1

Python Page.out_links - 1 examples found. These are the top rated real world Python examples of model.page.Page.out_links extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

title(7)

Page(6)

curnav(5)

content(4)

parent_id(2)

fetchPhotos(2)

start(1)

save_to_db(1)

save(1)

put(1)

get_product(1)

out_links(1)

index(1)

arrange_buttons(1)

get_page_links(1)

get_main_crawling_pages(1)

get(1)

count(1)

url(1)

Example #1

Show file

File: html.py Project: pascalweiss/SearchEngine

    def parse(self, text, url):
        dom = self.parseDocument(text)
        page = Page()
        page.title = self.get_text_from_element('title')
        page.content = self.remove_a_tags(self.get_text_from_element('body'))
        page.url = url

        def read_link(link):
            return URLUtils.join_relurl_to_absurl(url, link['href'])

        page.out_links = [read_link(link) for link in dom.select('a[href]')]
        page.out_links = ListUtil.to_list_without_duplicated_entries(page.out_links)

        return page