GitHub - AaronTHolt/curc-bench-1: benchmarking and testing utilities for CU-Boulder Research Computing resources

curc-bench is a regression testing benchmark suite developed at and for University of Colorado Boulder Research Computing. It uses linpack, stream, and osu-micro-benchmarks.

Commands

bench create
bench add
bench submit
bench process
bench reserve
bench update-nodes

Tests available

-n, --node-tests: each node in the test runs a stream and linpack benchmark
-b, --bandwidth-tests: pairs of nodes selected from each switch run osu_bw
-p, --alltoall-pair-tests: pairs of nodes selected from each swtich run osu_alltoall
-s, --alltoall-switch-tests: the nodes connected to each switch run osu_alltoall
-r, --alltoall-rack-tests: the nodes in each rack run osu_alltoall

Session directory

The bench create command creates a new "session directory" which is used to track the state of a given benchmark run. The other commands expect this directory to exist ahead of time.

node_list (1)
bench.log (2)
${test_type}/ (3)
- pass_nodes (4)
- fail_nodes (5)
- error_nodes (6)
- tests/ (7)
  - ${test}/ (8)
    - ${test}.job (9)
    - node_list (10)
    - slurm-*.out (11)
    - ${output_files} (12)

bench create creates the base session directory, and generates the root node_list file (1).

bench add consults the root node_list (1) to generate tests of the requested ${test_type} (3). (Valid types are node, bandwidth, alltoall-pair, alltoall-switch, and alltoall-rack.) Each test generates a ${test}.job (9) Slurm job script in an individual ${test}/ (8) directory, as well as a test-specific node_list file (10).

bench submit submits a Slurm job for each ${test}/ (8) directory. This directory is used as the working directory for the job, which runs the contained ${test}.job (9) script. Because it is the working directory for the job, Slurm and the payload tests are expected to write output to the ${test}/ (8) directory. (11, 12)

bench process inspects the ${test}/ (8) directories for payload test output (12) to evaluate pass/fail for each test. Test results are summarized for each ${test_type}/ (3) in pass_nodes (4), fail_nodes (5), and error_nodes (6) files, which share the same format as node_list (1, 10). (The tests/ directory serves to separate valid ${test}/ (8) directories from these summary files.)

bench reserve and bench update-node create Slurm reservations or mark nodes down, respectively, based on the result summaries generated by bench process. (4, 5, 6)

Running curc-bench at CU-Boulder Research Computing

Load modules and prepare the environment

$ module load slurm python/pyslurm benchmarks/bench 
$ bench create

Node tests

$ bench add --node-tests
$ bench submit --node-tests
$ bench process --node-tests # after all jobs done
$ bench reserve --node-tests

Bandwidth tests

$ bench add --bandwidth-tests
$ bench submit --bandwidth-tests
$ bench process --bandwidth-tests # after all jobs done
$ bench reserve --bandwidth-tests

All-to-all tests: pairs of nodes

$ bench add --alltoall-pair-tests
$ bench submit --alltoall-pair-tests
$ bench process --alltoall-pair-tests # after all jobs done
$ bench reserve --alltoall-pair-tests

All-to-all tests: switch groups

$ bench add --alltoall-switch-tests
$ bench submit --alltoall-switch-tests
$ bench process --alltoall-switch-tests # after all jobs done
$ bench reserve --alltoall-switch-tests

All-to-all tests: rack groups

$ bench add --alltoall-rack-tests
$ bench submit --alltoall-rack-tests
$ bench process --alltoall-rack-tests # after all jobs done
$ bench reserve --alltoall-rack-tests

Running code tests

$ python setup.py test

Non-Python dependencies

IOR-2.10.3
hpl-2.0
hpcc-1.4.1
osu-micro-benchmarks-3.8
- latest version: http://mvapich.cse.ohio-state.edu/benchmarks/
stream.c v5.9
Intel-provided linpack 10.3.9
openmpi

Some of the version numbers were changed in this (come back to this later)

stream.c has been tuned. If you download a fresh copy it needs to be re-tuned.

Name		Name	Last commit message	Last commit date
Latest commit History 263 Commits
curc		curc
man		man
src		src
.gitignore		.gitignore
Non_Python_Dependencies.sh		Non_Python_Dependencies.sh
README.mdwn		README.mdwn
ez_setup.py		ez_setup.py
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

curc

curc

man

man

src

src

.gitignore

.gitignore

Non_Python_Dependencies.sh

Non_Python_Dependencies.sh

README.mdwn

README.mdwn

ez_setup.py

ez_setup.py

setup.cfg

setup.cfg

setup.py

setup.py

Repository files navigation

Commands

Tests available

Session directory

Running curc-bench at CU-Boulder Research Computing

Running code tests

Non-Python dependencies

About

Releases

Packages

Languages

AaronTHolt/curc-bench-1

Folders and files

Latest commit

History

Repository files navigation

Commands

Tests available

Session directory

Running curc-bench at CU-Boulder Research Computing

Running code tests

Non-Python dependencies

About

Resources

Stars

Watchers

Forks

Languages