Skip to content

openFDA is a research project to provide open APIs, raw data downloads, documentation and examples, and a developer community for an important collection of FDA public datasets.

License

westurner/openfda

 
 

Repository files navigation

openFDA

Build Status

openFDA is a research project to provide open APIs, raw data downloads, documentation and examples, and a developer community for an important collection of FDA public datasets.

Please note that openFDA is a beta research project and not for clinical use. While we make every effort to ensure that data and logic are accurate, you should assume all results are unvalidated.

Contents

This repository contains the code which powers the api.fda.gov/drug/event.json, api.fda.gov/drug/enforcement.json, api.fda.gov/device/enforcement.json, and api.fda.gov/food/enforcement.json end points:

  • A python pipeline written with Luigi for processing data for each endpoint

  • The event.json pipeline processes the public AERS SGML and FAERS XML Quarterly Data Extracts into an Adverse Event JSON format that can be loaded into Elasticsearch. This process includes de-duping follow-up reports and adding an openfda section to the drug section of the JSON containing additional drug information from other data sources (including FDA, NLM and VA).

  • The enforcement.json pipeline process the public RES HTML Enforcement Reports and RES XML Enforcement Reports into an Enforcement Report JSON format that can be loaded into Elasticsearch. The process includes extracting drug identifiers such as NDC and UPC from free text fields and adding an openfda section to the JSON containing additional drug information from other data sources (including FDA, NLM and VA).

  • Elasticsearch schemas for the Adverse Event and Enforcement Report JSON formats.

  • A Node.js API Server written with Express, Elasticsearch.js and Elastic.js that communicates with Elasticsearch and serves the public end points (documented in detail at http://open.fda.gov).

Prerequisites

  • Elasticsearch 1.2.0 or later
  • Python 2.7.*
  • Node 0.10.*

Packaging

Run bootstrap.sh to download and setup a virtualenv for the openfda python package and to download and setup the openfda-api node package.

About

openFDA is a research project to provide open APIs, raw data downloads, documentation and examples, and a developer community for an important collection of FDA public datasets.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 82.2%
  • JavaScript 15.6%
  • Shell 1.6%
  • Ruby 0.6%