Skip to content

gilga001/HPCandBigDataPipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HPC and BigDataPipeline

Here is all the code we use in a published paper in PEARC18:

Combining HPC and Big Data Infrastructures in Large-Scale Post-Processing of Simulation Data: A Case Study

ACM Digital Library Link: https://dl.acm.org/citation.cfm?id=3229279

Files Introduction

distance.py and hbond.py are from MDTraj, I only modified these two files

mdtraj.py is for running MDTraj

optSeqCode is the optimized sequential code

spark.ipynb is the latest spark code using optimized code

originalSeqCode.py is the very original code

About

A published paper in PEARC18: Combining HPC and Big Data Infrastructures in Large-Scale Post-Processing of SimulaBon Data: A Case Study

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published