Standardize_Metadata

A collection of tools to download, parse, and standardize sequence metadata from NCBI databases.
Written by Remi Marchand between May 13, 2016 and August 26, 2016.

Scope

This collection of tools, by default, manipulates data from the Sequence Read Archive (SRA) database.
The database can be found here: http://www.ncbi.nlm.nih.gov/sra

Usage

metadata.py

Main program that queries and downloads xml files based on organism name and date.
Usage: metadata.py options (run python metadata.py -h to see options)
Download in Bulk: bash download.sh organism start_date end_date

Standard_Tools/standardize.py

Main program that standardizes relevant columns from input csv files.
Usage: standardize.py csv_files

Installation

You may need to install the following modules

lxml (install via pip as: python -m pip install lxml)
Levenshtein (install from: https://pypi.python.org/pypi/python-Levenshtein/0.12.0)

Add to the python path

If on a Mac: export PYTHONPATH="${PYTHONPATH}:Path_to_Standardize_Metadata"

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
Standard_Tools		Standard_Tools
Xml_Validation		Xml_Validation
src.git		src.git
Collection_Date		Collection_Date
README.md		README.md
__init__.py		__init__.py
download.sh		download.sh
download_metadata.py		download_metadata.py
generate_metadata_csv.py		generate_metadata_csv.py
metadata.py		metadata.py
parse_metadata.py		parse_metadata.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Standard_Tools

Standard_Tools

Xml_Validation

Xml_Validation

src.git

src.git

Collection_Date

Collection_Date

README.md

README.md

init.py

init.py

download.sh

download.sh

download_metadata.py

download_metadata.py

generate_metadata_csv.py

generate_metadata_csv.py

metadata.py

metadata.py

parse_metadata.py

parse_metadata.py

Repository files navigation

Standardize_Metadata

Scope

Usage

metadata.py

Standard_Tools/standardize.py

Installation

You may need to install the following modules

Add to the python path

About

Releases

Packages

Languages

Remimstr/Standardize_Metadata

Folders and files

Latest commit

History

Repository files navigation

Standardize_Metadata

Scope

Usage

metadata.py

Standard_Tools/standardize.py

Installation

You may need to install the following modules

Add to the python path

About

Resources

Stars

Watchers

Forks

Languages