GC Predictor Algorithm

Requirements

Barebones

Python 3
Python packages, can be installed with

pip install -r requirements.txt

Using Nix?

nix-shell

Configuration

Parse

// example: parse.json

{
  "name": "benchmarks",
  "dir": {
    "data": "./data",
    "output": "./output"
  },
  "data": [
        {
      "name": "1T-16MB-1",
      "file": "./raw_data/randomize/1T-16MB/1/ucare.log",
      "old_format": true
    },
    {
      "name": "3_30006",
      "file": "./raw_data/stringtable/3_30006/ucare.log",
      "old_format": true
    },
    {
      "name": "3_120026",
      "file": "./raw_data/stringtable/3_120026/ucare.log",
      "old_format": true
    },
    {
      "name": "renaissance",
      "file": "./raw_data/benchmarks/renaissance/ucare.log"
    },
    {
      "name": "dacapo",
      "file": "./raw_data/benchmarks/dacapo/ucare.log"
    },
    {
      "name": "specjvm",
      "file": "./raw_data/benchmarks/specjvm/ucare.log"
    }
  ]
}

old_format key is for backward compatibility with old version of ucare.log

Training

// example: training.json

{
  "name": "benchmarks",
  "skip_value": 7,
  "sm_add_constant": false,
  "dir": {
    "data": "./data/benchmarks",
    "output": "./output"
  },
  "models": [
    "ransac",
    "lreg"
  ],
  "subtitle": "",
  "data": {
    "main": [
      "1T-16MB-1",
      "..."
    ],
    "stringtable": [
      "1",
      "..."
    ]
  }
}

models key can be ransac, lreg, and svr
data consists of two key which entries will be prepended by dir/data key :
- main
- stringtable

Inference

// example: inference.json

{
  "name": "benchmarks",
  "skip_value": 0,
  "sm_add_constant": false,
  "dir": {
    "data": "./data/benchmarks",
    "output": "./output"
  },
  "model": {
    "main": {
      "name": "ransac",
      "file": "./output/benchmarks/train/main/model/ransac.joblib"
    },
    "stringtable": {
      "name": "ransac",
      "file": "./output/benchmarks/train/stringtable/model/ransac.joblib"
    }
  },
  "combined_plot": {
    "max": 500,
    "min": -500,
    "subtitle": "Heap Size 4G"
  },
  "data": [
    {
      "name": "renaissance",
      "color": "green",
      "label": "Renaissance",
      "subtitle": ""
    },
    {
      "name": "dacapo",
      "color": "blue",
      "label": "DaCapo",
      "subtitle": ""
    },
    {
      "name": "specjvm",
      "color": "red",
      "label": "Specjvm2008",
      "subtitle": ""
    }
  ]
}

Notes

dir_output will be appended with name key

the output dir will be ${dir_output}/${name}

skip_value is number of value that will be skipped in cdf plot (in case of anomaly)

Running

Parse

python parse.py -c <parse.json>

Train

python train.py \
    -c <train.json> -t [main|stringtable]
    
# equals with

python train.py \
    --config <train.json> --type [main|stringtable]

Inference

python inference.py -c <inference.json>

Authors

Ray Andrew
Cesar Stuardo

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
config		config
notebooks		notebooks
.envrc		.envrc
.gitignore		.gitignore
.nixpkgs-version.json		.nixpkgs-version.json
Benchmark.md		Benchmark.md
README.md		README.md
default.nix		default.nix
inference_v1.py		inference_v1.py
inference_v2.py		inference_v2.py
inference_v3.py		inference_v3.py
inference_v4.py		inference_v4.py
model.py		model.py
parse_v1.py		parse_v1.py
parse_v2.py		parse_v2.py
parse_v3.py		parse_v3.py
requirements.txt		requirements.txt
shell.nix		shell.nix
train_v2.py		train_v2.py
train_v3.py		train_v3.py
utilities.py		utilities.py

rayandrew/gc-predictor-algorithm

Folders and files

Latest commit

History

Repository files navigation

GC Predictor Algorithm

Requirements

Barebones

Using Nix?

Configuration

Parse

Training

Inference

Notes

Running

Parse

Train

Inference

Authors

About

Resources

Stars

Watchers

Forks

Languages