Example #1
    def scrape(self):
        """Iterates through a single results page and extracts bids.

        This is implemented as follows:
          1. Download the results page.
          2. Extract the bid identifiers from this page.
          3. Check which of those identifiers are not yet in our database.
          4. For each of the identifiers not yet in our database:
            4.1. Download the detail page for that identifier.
            4.2. Extract the fields we are interested in.
            4.3. Create a Bid object and store it in the database.
        """
        session = Session()
        page = self.scraper.get(self.results_url)
        bid_ids = self.scrape_results_page(page.content)
        log.info("Found bid ids: {}".format(bid_ids))
        new_ids = get_new_identifiers(session, bid_ids, self.get_site())
        arg_tuples = [(self.scrape_bid_page, bid_id) for bid_id in new_ids]
        bids = execute_parallel(arg_tuples)
        session.bulk_save_objects(bids)
        session.commit()
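
Both examples rely on two helpers whose definitions are not shown: execute_parallel and get_new_identifiers. As a rough sketch, execute_parallel could be a thin wrapper around concurrent.futures, assuming it takes (callable, argument) tuples and returns the results in submission order; the thread-pool approach and the max_workers parameter here are assumptions, not the original implementation:

    from concurrent.futures import ThreadPoolExecutor

    def execute_parallel(arg_tuples, max_workers=8):
        # Hypothetical implementation: the real helper is not shown above.
        with ThreadPoolExecutor(max_workers=max_workers) as executor:
            futures = [executor.submit(func, arg) for func, arg in arg_tuples]
            # result() re-raises any exception from a worker, so a failed
            # detail-page scrape aborts the whole batch.
            return [future.result() for future in futures]

Because future.result() re-raises worker exceptions, a failure in any scrape_bid_page call propagates to the caller, which matches the exception-propagation behavior described by the comment in Example #2 below.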
Example #2
    def scrape(self):
        """Iterates through all of Commbuys and extracts bids.

        This is implemented as follows, starting on the first results page:
          1. Download the results page.
          2. Extract the bid identifiers from this page.
          3. Check which of those identifiers are not yet in our database.
          4. For each of the identifiers not yet in our database:
            4.1. Download the detail page for that identifier.
            4.2. Extract the fields we are interested in.
            4.3. Create a Bid object and store it in the database.
          5. Go to the next page. Repeat from step #1.
        """
        current_page = 1
        session = Session()
        while True:
            page = self.scraper.post(self.results_url,
                                     data={
                                         'mode': 'navigation',
                                         'currentPage': current_page
                                     })
            bid_ids = self.scrape_results_page(page.content)
            log.info("Results page {} found bid ids: {}".format(
                current_page, bid_ids))
            if not bid_ids:
                log.info("Page {} has no results. Done scraping.".format(
                    current_page))
                break
            new_ids = get_new_identifiers(session, bid_ids, self.get_site())
            # Scrape the new bid ids in parallel. Any underlying exceptions
            # are allowed to propagate to the caller and will abort the
            # entire scraping process.
            arg_tuples = [(self.scrape_bid_page, bid_id) for bid_id in new_ids]
            bids = execute_parallel(arg_tuples)
            # Save all the new bids from this results page in one db call.
            session.bulk_save_objects(bids)
            session.commit()
            current_page += 1
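
get_new_identifiers is the other undefined helper. Since both examples use a SQLAlchemy session and persist Bid objects, a plausible sketch queries for the identifiers already stored for the given site and returns the rest; the Bid schema here (identifier and site columns) is a hypothetical stand-in for the real model:

    from sqlalchemy import Column, String
    from sqlalchemy.orm import declarative_base

    Base = declarative_base()

    class Bid(Base):
        # Hypothetical schema: the real Bid model is not shown in the examples.
        __tablename__ = 'bids'
        identifier = Column(String, primary_key=True)
        site = Column(String)

    def get_new_identifiers(session, bid_ids, site):
        """Returns the subset of bid_ids not yet stored for this site."""
        existing = {
            row.identifier
            for row in session.query(Bid.identifier).filter(
                Bid.site == site, Bid.identifier.in_(bid_ids))
        }
        # Preserve the order in which identifiers appeared on the results page.
        return [bid_id for bid_id in bid_ids if bid_id not in existing]

Keeping the already-seen identifiers in a set makes the membership check O(1) per identifier, and returning a list preserves page order, so new bids are scraped in the order they were listed.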