Globus-Catalog-Ingestor

OBTAIN GLOBUS CREDENTIALS

https://www.globus.org/SignUp
You will input this once when you first run this script

INSTALL REQUIREMENTS

Install Globus Catalog-client

Details at (https://github.com/globusonline/catalog-client)

git clone https://github.com/globusonline/catalog-client
cd catalog-client
python setup.py install --user

Clone this Repo

git clone https://github.com/blaiszik/Globus-Catalog-Ingestor.git
cd Globus-Catalog-Ingestor

Edit config.json
- "catalog_id": ID of the catalog you wish to push data into
- "catalog_aliases": If you prefer to work with a catalog by name, add the numeric aliases here
- "endpoint": Specify the endpoint location for your data
- "path": Specify the path to your data
- "files": Specify the location of the files ot parse (relative to the script)
- "rw_users": Indicate which users should be granted read and write privileges
- "r_users": Indicate which users should be granted read-only privileges

Example config.json

{
    "catalog_id" : 137,
    "catalog_aliases" : { "ingestor suresh":137,
                          "other catalog":15},
    "endpoint" : "globus://s8idiuser#snow",
    "path": "/path/to/data",
    "files": ["B001_Eiger_silica150nm_water_test_Fq1_0001-20000.hdf",
              "B001_Eiger_silica150nm_water_test_Fq1_0001-20001.hdf"],
    "rw_users" : ["s8idiuser", "sureshn", "blaiszik"],
    "r_users" : ["bfrosik", "blaiszik"]
}

Edit metadata_map.json (optional)

Command line options

-c specify the catalog name (string) or ID (int) to ingest data into  (this overrrides config.json catalog_id entry)
-d ingest data into a dataset as members
-f specify a filename to be ingested (this overrrides config.json files entry)
-x suppress script console output

Examples

Using config.json (as above example)

python ingest.py

Overriding config.json

Point the ingestor at a catalog and a file for ingesting

python ingest.py -c 'replace with catalog_name or ID' -f 'filename.hdf'

Ingesting into a dataset as members

This will ingest data into a newly created dataset of the specified dataset name

python ingest.py -c 'replace with catalog_name or ID' -d 'New Dataset Name' -f 'filename.hdf'

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

config.json

config.json

ingest.py

ingest.py

metadata_map.json

metadata_map.json

test.sh

test.sh

Repository files navigation

Globus-Catalog-Ingestor

OBTAIN GLOBUS CREDENTIALS

INSTALL REQUIREMENTS

Install Globus Catalog-client

Clone this Repo

Command line options

Examples

Using config.json (as above example)

Overriding config.json

Ingesting into a dataset as members

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
README.md		README.md
config.json		config.json
ingest.py		ingest.py
metadata_map.json		metadata_map.json
test.sh		test.sh

blaiszik/Globus-Catalog-Ingestor

Folders and files

Latest commit

History

Repository files navigation

Globus-Catalog-Ingestor

OBTAIN GLOBUS CREDENTIALS

INSTALL REQUIREMENTS

Install Globus Catalog-client

Clone this Repo

Command line options

Examples

Using config.json (as above example)

Overriding config.json

Ingesting into a dataset as members

About

Resources

Stars

Watchers

Forks

Languages