Anonymise relational databases, operate a natural language processing (NLP) pipeline, provide research views on a database, manage consent-for-contact

Phdmani/crate

Clinical Records Anonymisation and Text Extraction (CRATE)

Purpose

  • Anonymises relational databases.
  • Performs some specific preprocessing tasks; e.g.
    • preprocesses some specific databases (e.g. Servelec RiO EMR).
    • fetches some word lists, e.g. forenames/surnames/eponyms.
  • Provides a natural language processing (NLP) pipeline.
  • Provides a web app for
    • querying the anonymised database
    • managing a consent-to-contact process

Documentation

See https://crateanon.readthedocs.io

Sources

Licence

  • Copyright (C) 2015-2019 Rudolf Cardinal (rudolf@pobox.com).
  • Licensed under the GNU GPL v3+: see LICENSE file.
  • Some third-party libraries have slightly different licences:
    • aspects of CamAnonGatePipeline.java are based on demonstration GATE code, copyright (C) University of Sheffield, and licensed under the GNU LGPL; see https://gate.ac.uk/.
