#treeherder-client
A set of client libraries to support data submission to https://github.com/mozilla/treeherder-service. There are two types of data structures supported: job and resultset collections. Both classes have support methods for building a data structure that https://github.com/mozilla/treeherder-service accepts. Data structures can be extended with new properties as needed, there is a minimal validation protocol applied that confirms the bare minimum parts of the structures are defined.
##Resultset Collection
Resultset collections contain meta data associated with a github pull request or a push to mercurial or any event that requires tests to be run on a repository.
The most critical part of each resultset is the revision_hash
. This is used as an identifier to associate test job data with. It can be any unique string of 50 characters at most. A resultset collection has the following data structure
[
{
# unique identifier for a result set, can be any unique character string no longer than 50 characters
'revision_hash': '45f8637cb9f78f19cb8463ff174e81756805d8cf',
'author': 'somebody@somewhere.com',
'push_timestamp': 1384353511,
'type': 'push',
# any type of artifact data associated with the resultset
'artifact': {
'type': 'extra_info',
'name': 'push_data',
# blob can be any kind of structured data
'blob': { 'stuff': [1, 2, 3, 4, 5] }
},
# a list of revisions associated with the resultset. There should be at least
# one.
'revisions': [
{
'comment': 'Bug 936711 - Fix crash which happened at disabling Bluetooth...',
'files': [
'dom/bluetooth/BluetoothA2dpManager.cpp',
'dom/bluetooth/BluetoothHidManager.cpp',
'dom/bluetooth/linux/BluetoothDBusService.cpp'
],
'revision': 'cdfe03e77e66',
'repository': 'test_treeherder',
'author': 'Some Person <sperson@someplace.com>'
},
...
]
}
}
]
##Job Collection
Job collections can contain test results from any kind of test. The revision_hash
provided should match the associated revision_hash
in the resultset structure. The job_guid
provided can be any unique string of 50 characters at most. A job collection has the following data structure.
[
{
'project': 'mozilla-inbound',
'revision_hash': '4317d9e5759d58852485a7a808095a44bc806e19',
'job': {
'job_guid': 'd22c74d4aa6d2a1dcba96d95dccbd5fdca70cf33',
'product_name': 'spidermonkey',
'reason': 'scheduler',
'who': 'spidermonkey_info__mozilla-inbound-warnaserr',
'desc': 'Linux x86-64 mozilla-inbound spidermonkey_info-warnaserr build',
'name': 'SpiderMonkey --enable-sm-fail-on-warnings Build',
# The symbol representing the job displayed in
# treeherder.allizom.org
'job_symbol': 'e',
# The symbol representing the job group in
# treeherder.allizom.org
'group_symbol': 'SM',
'group_name': 'SpiderMonkey',
'submit_timestamp': 1387221298,
'start_timestamp': 1387221345,
'end_timestamp': 1387222817,
'state': 'completed',
'result': 'success',
'machine': 'bld-linux64-ec2-104',
'build_platform': {
'platform':'linux64', 'os_name': 'linux', 'architecture': 'x86_64'
},
'machine_platform': {
'platform': 'linux64', 'os_name': 'linux', 'architecture': 'x86_64'
},
'build_url': 'https://get.your.builds.here/12345',
'option_collection': {'opt': True},
'log_references': [
{
'url': 'http://ftp.mozilla.org/pub/mozilla.org/spidermonkey/...',
'name': 'builds-4h'
}
],
# The artifact can contain any kind of structured data associated with a test.
'artifacts': [{
'type': 'json',
'name': '',
'blob': { my json content here}
}],
# List of job guids that were coalesced to this job
'coalesced': []
},
...
]
##Artifact Collection
Artifact collections contain arbitrary data associated with a job. This is usually a json blob of structured data produced by the build system during the job execution.
[
{
'type': 'json',
'name': 'my-artifact-name',
# blob can be any kind of structured data
'blob': { 'stuff': [1, 2, 3, 4, 5] },
'job_guid': 'd22c74d4aa6d2a1dcba96d95dccbd5fdca70cf33'
}
]
###Usage
If you want to use TreeherderResultSetCollection
to build up the resultset data structures to send, do something like this.
from thclient import TreeherderRequest, TreeherderResultSetCollection, TreeherderClientError
trsc = TreeherderResultSetCollection()
for data in dataset:
trs = trsc.get_resultset()
trs.add_push_timestamp( data['push_timestamp'] )
trs.add_revision_hash( data['revision_hash'] )
trs.add_type( data['type'] )
trs.add_artifact( 'push_data', 'push', { 'stuff':[1,2,3,4,5] } )
for revision in data['revisions']:
tr = trs.get_revision()
tr.add_revision( revision['revision'] )
tr.add_author( revision['author'] )
tr.add_comment( revision['comment'] )
tr.add_files( revision['files'] )
tr.add_repository( revision['repository'] )
trs.add_revision(tr)
trsc.add(trs)
# Send the collection to treeherder
# The OAuth key and secret for your project should be supplied to you by the
# treeherder-service administrator.
req = TreeherderRequest(
protocol='https',
host='treeherder.mozilla.org',
project='project',
oauth_key='oauth-key',
oauth_secret='oauth-secret',
)
# Post the result collection
#
# data structure validation is automatically performed here, if validation
# fails a TreeherderClientError is raised
req.post(trc)
At any time in building a data structure, you can examine what has been created by looking at the data
property.
You can also call the validate
method at any time before sending a collection.
All treeherder data classes have validate
methods that can be used for testing.
The validate
method is called on every structure in a collection when a send
is called on a TreeherderRequest
.
If validation fails a TreeherderClientError
is raised.
If you want to use TreeherderJobCollection
to build up the job data structures to send, do something like this.
from thclient import TreeherderRequest, TreeherderJobCollection, TreeherderClientError
#####
# TreeherderJobCollection() takes a 'type' parameter that can be set to 'update'
# if the job objects are being used for updating status (status = 'running' | 'pending') and
# don't contain a full data payload. If type is not set, the job object go to the
# objectstore (status = 'completed'). If the collection is passed a type like so,
# TreeherderJobCollection(type='update') the status of the object will be updated in
# the RDBS schema
#####
tjc = TreeherderJobCollection()
for data in dataset:
tj = tjc.get_job()
tj.add_revision_hash( data['revision_hash'] )
tj.add_project( data['project'] )
tj.add_coalesced_guid( data['coalesced'] )
tj.add_job_guid( data['job_guid'] )
tj.add_job_name( data['name'] )
tj.add_job_symbol( data['job_symbol'] )
tj.add_group_name( data['group_name'] )
tj.add_group_symbol( data['group_symbol'] )
tj.add_description( data['desc'] )
tj.add_product_name( data['product_name'] )
tj.add_state( data['state'] )
tj.add_result( data['result'] )
tj.add_reason( data['reason'] )
tj.add_who( data['who'] )
tj.add_submit_timestamp( data['submit_timestamp'] )
tj.add_start_timestamp( data['start_timestamp'] )
tj.add_end_timestamp( data['end_timestamp'] )
tj.add_machine( data['machine'] )
tj.add_build_url( data['build_url'] )
tj.add_build_info(
data['build']['os_name'], data['build']['platform'], data['build']['architecture']
)
tj.add_machine_info(
data['machine']['os_name'], data['machine']['platform'], data['machine']['architecture']
)
tj.add_option_collection( data['option_collection'] )
tj.add_log_reference( 'builds-4h', data['log_reference'] )
# data['artifact'] is a list of artifacts
for artifact_data in data['artifact']:
tj.add_artifact(
artifact_data['name'], artifact_data['type'], artifact_data['blob']
)
tjc.add(tj)
# Send the collection to treeherder
req = TreeherderRequest(
protocol='https',
host='treeherder.mozilla.org',
project='project',
oauth_key='oauth-key',
oauth_secret='oauth-secret',
)
# Post the job collection
#
# data structure validation is automatically performed here, if validation
# fails a TreeherderClientError is raised
req.post(tjc)
If you want to use TreeherderArtifactCollection
to build up the job artifacts data structures to send, do something like this.
from thclient import TreeherderRequest, TreeherderArtifactCollection, TreeherderClientError
tac = TreeherderArtifactCollection()
for data in dataset:
ta = tac.get_artifact()
ta.add_blob( data['blob'] )
ta.add_name( data['name'] )
ta.add_type( data['type'] )
ta.add_job_guid( data['job_guid'] )
tac.add(ta)
# Send the collection to treeherder
req = TreeherderRequest(
protocol='https',
host='treeherder.mozilla.org',
project='project',
oauth_key='oauth-key',
oauth_secret='oauth-secret',
)
# Post the artifact collection
#
# data structure validation is automatically performed here, if validation
# fails a TreeherderClientError is raised
req.post(tac)
If you don't want to use TreeherderResultCollection
or TreeherderJobCollection
to build up the data structure
to send, build the data structures directly and add them to the collection.
from thclient import TreeherderRequest, TreeherderResultSetCollection
trc = TreeherderResultSetCollection()
for resultset in resultset_data:
trs = trc.get_resultset(resultset)
# Add any additional data to trs.data here
# add resultset to collection
trc.add(trs)
req = TreeherderRequest(
protocol='https',
host='treeherder.mozilla.org',
project='project',
oauth_key='oauth-key',
oauth_secret='oauth-secret',
)
# Post the request to treeherder
req.post(trc)
from thclient import TreeherderRequest, TreeherderJobCollection
tjc = TreeherderJobCollection()
for job in job_data:
tj = tjc.get_job(job)
# Add any additional data to tj.data here
# add job to collection
tjc.add(tj)
req = TreeherderRequest(
protocol='https',
host='treeherder.mozilla.org',
project='project',
oauth_key='oauth-key',
oauth_secret='oauth-secret',
)
# Post the request to treeherder
#
# data structure validation is automatically performed here, if validation
# fails a TreeherderClientError is raised
req.post(tjc)
In the same way, if you don't want to use TreeherderArtifactCollection
to build up the data structure to send, build the data structures directly and add them to the collection.
from thclient import TreeherderRequest, TreeherderArtifactCollection
tac = TreeherderArtifactCollection()
for artifact in artifact_data:
ta = tac.get_artifact(artifact)
# Add any additional data to ta.data here
# add artifact to collection
tac.add(ta)
req = TreeherderRequest(
protocol='https',
host='treeherder.mozilla.org',
project='project',
oauth_key='oauth-key',
oauth_secret='oauth-secret',
)
# Post the request to treeherder
req.post(tac)
##Job artifacts format
Artifacts can have name, type and blob. The blob property can contain any valid data structure accordingly to type attribute. For example if you use the json type, your blob must be json-serializable to be valid. The name attribute can be any arbitrary string identifying the artifact. Here is an example of what a job artifact looks like in the context of a job object:
[
{
'project': 'mozilla-inbound',
'revision_hash': '4317d9e5759d58852485a7a808095a44bc806e19',
'job': {
'job_guid': 'd22c74d4aa6d2a1dcba96d95dccbd5fdca70cf33',
# ...
# other job properties here
# ...
'artifacts': [
{
"type": "json",
"name": "my first artifact",
'blob': {
k1:v1,
k2: v2,
...
}
},
{
'type': 'json',
'name': 'my second artifact',
'blob': {
k1:v1,
k2: v2,
...
}
}
]
}
},
...
]
A special case of job artifact is a "Job Info" artifact. This kind of artifact will be retrieved by the UI (see https://github.com/mozilla/treeherder-ui) and rendered in the job detail panel. This is what a Job Info artifact looks like:
{
"blob": {
"job_details": [
{
"url": "https://www.mozilla.org",
"value": "website",
"content_type": "link",
"title": "Mozilla home page"
},
{
"value": "bar",
"content_type": "text",
"title": "Foo"
},
{
"value": "This is <strong>cool</strong>",
"content_type": "raw_html",
"title": "Cool title"
}
],
},
"type": "json",
"name": "Job Info"
}
All the elements in the job_details attribute of this artifact have a mandatory title attribute and a set of optional attributes depending on content_type
.
The content_type
drives the way this kind of artifact will be rendered. Here are the possible values:
-
Text - This is the simplest content type you can render and is the one used by default if the content type specified is not recognised or is missing.
This content type renders as:
<label>{{title}}</label><span>{{value}}</span>
-
Link - This content type renders as an anchor html tag with the following format:
{{title}}: <a title="{{value}}" href="{{url}}" target="_blank">{{value}}</a>
-
Raw Html - The last resource for when you need to show some formatted content.
##Development
To run the treeherder-client
test suite, run python setup.py test
.
If you have python2.5
, python2.6
, or python2.7
available on your system
under those names, you can also pip install tox
and then run tox
to test
treeherder-client
under all of those Python versions.