Skip to content

yngcan/patents-public-data

 
 

Repository files navigation

Patent analysis using the Google Patents Public Datasets on BigQuery

The contents of this repository are not an official Google product.

Google Patents Public Datasets is a collection of compatible BigQuery database tables from government, research and private companies for conducting statistical analysis of patent data. The data is available to be queried with SQL through BigQuery, joined with private datasets you upload, and exported and processed using many other compatible analysis tools. This repository is a centralized source for examples which use the data.

Currently the repo contains three examples:

  1. Patent Landscaping: A demo of an automated process of finding patents related to a particular topic given an initial seed set of patents. Based on the paper by Dave Feltenberger and Aaron Abood, Automated Patent Landscaping.

  2. Claim Text Extraction: A demo of interacting with patent claim text data using BigQuery and python.

  3. Claim Breadth Model: A machine learning method for estimating patent claim breadth using data from BigQuery.

About

Patent analysis using the Google Patents Public Datasets on BigQuery

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 87.9%
  • Python 12.0%
  • Dockerfile 0.1%