This project involved using validated epitope data to assess the population coverage of a potential HIV-1 vaccine targeting the protease cleavage sites (PCS) (all PCS, Gag PCS, or Pol PCS).
The data on population coverage, average epitope hit per individual, and minimum number of epitopes recognized by 90% of the population (PC90) was obtained from IEDB [http://tools.iedb.org/population/] and can be found in the Country
and Regions
directories; the raw data is not labelled by country or region, however, the output is.
This script conducts ANOVA and Fisher's exact testing to examine for significance between the three epitope sets (all PCS, Gag PCS, and Pol PCS) in regards to coverage (Fisher's), average hit (ANOVA), and PC90 (ANOVA).
- Python 3+
- Pandas
- Numpy
- Scipy
After cloning the repository, navigate to the directory via your terminal and execute analysis.py
Pandas DataFrame with labelled population coverage, average hit, and PC90 data displayed with P values.