Skip to content

davidenoma/GPU-GWAS

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

56 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GPU-GWAS

Description and goals

This project aims to develop a fast GWAS data analysis pipeline incorporating GPU acceleration and Machine learning.

Our high level goals for the hackathon include -

  • Recreating the Hail GWAS sample using RAPIDS APIs
  • Wrapping RAPIDS GWAS functions into high level APIs within gpu-gwas framework
  • (if time permits) Scaling to larger VCF dataset (e.g. chr22)

Working doc for our team is here

Workflow

Workflow-diagram

Setup Instructions

pip install -e requirements.txt

Test

To test proper setup, please run the following from the root folder of the repo

python gpugwas_test.py

Package components

The gpugwas package is broken up into multiple independent modules that deal with different components of the GWAS pipeline. The modules are all located under the gpugwas folder.

  1. gpugwas.io - This module contains I/O related functions such as loading a VCF/annotation file into a CUDA dataframe.
  2. gpugwas.algorithms - This module contains ML algorithm implementations in CUDA typically used in GWAS (e.g. linear regression, logistic regression, etc).
  3. gpugwas.viz - This module contains functions used in visualizing the GWAS model outputs (manhattan plots, q-q plots, etc)

Example Use Case

An example use case of the pipeline is available in workflow.py.

To execute, please run

python workflow.py

Results

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%