Skip to content

brian-todd/fx-data-loader

Repository files navigation

Overview

This project serves as an extensible framework for processing and cleaning historical FX data from Dukascopy's data feed.

Usage

The most direct usage is through the load_fx_data.py script. There are several command line options:

python load_fx_data.py --pair=EURUSD --start_date=2019-01-01 --end_date=2019-02-01 --opath=/data/EURUSD/raw --processes=4 --pipeline=tabular

  • pair: Currency pair for historical data.
  • start_date: Starting point for data processing.
  • end_date: (Optional) Ending point for data processing not inclusive of the final date. Default behavior sets end date to current date.
  • opath: (Optional) Output directory to write batches of files.
  • sep: (Optional) Delimiter used in tabular data formats. Default value is '\t'.
  • db: (Optional) Specify path to SQLite database file for sqlite pipeline.
  • table: (Optional) Table to write to for sqlite pipeline.
  • processes: (Optional) Number of processes to run. If too many are run, then the user will start to receive 503 responses from the server. 4-8 processes are normally ideal.
  • pipeline: (Optional) Specify any custom pipelines added to the pipelines/ directory. Default option is tabular.

Layout

  • network: Request handling and parsing.
  • pipelines: Fully connected data processing pipelines.
  • processors: Data processing and cleaning.
  • tests: Unit tests.
  • utils: Utility and tooling functions.

Modifying data pipelines is generally as simple as adding a new class that inherits from the relevant base class.

About

Framework for loading historical FX tick data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published