TESS exoplanet detection filter, crossmatch, classifier, and ranker.
Your transiting planet detection pipeline results in a huge pile of detections. Now What? Do I really have to look at all these manually!?
No! Algorithmic filtering and classifying is available with TESS-ExoClass (TEC).
Christopher J Burke (MIT), Jeffrey Coughlin (SETI/NASA Ames), Susan Mullally (STScI), Aylin Garcia Soto (MIT), Fergal Mullally (SETI), Jessie Christiansen (NExSci)
Sector Number | Detection Source | Result Tables | Summary PDFs |
---|---|---|---|
3 | NASA Ames SPOC TCE | Tables | PDFs |
4 | NASA Ames SPOC TCE | Tables | PDFs |
1-3 | NASA Ames SPOC TCE | Tables | PDFs |
5 | NASA Ames SPOC TCE | Tables | PDFs |
6 | NASA Ames SPOC TCE | Tables | PDFs |
7 | NASA Ames SPOC TCE | Tables | PDFs |
8 | NASA Ames SPOC TCE | Tables | PDFs |
1-6 | NASA Ames SPOC TCE | Tables | PDFs |
9 | NASA Ames SPOC TCE | Tables | PDFs |
1-9 | NASA Ames SPOC TCE | Tables | PDFs |
10 | NASA Ames SPOC TCE | Tables | PDFs |
11 | NASA Ames SPOC TCE | Tables | PDFs |
12 | NASA Ames SPOC TCE | Tables | PDFs |
13 | NASA Ames SPOC TCE | Tables | PDFs |
1-13 | NASA Ames SPOC TCE | Tables | PDFs |
14 | NASA Ames SPOC TCE | Tables | PDFs |
15 | NASA Ames SPOC TCE | Tables | PDFs |
16 | NASA Ames SPOC TCE | Tables | PDFs |
14-16 | NASA Ames SPOC TCE | Tables | PDFs |
17 | NASA Ames SPOC TCE | Tables | PDFs |
18 | NASA Ames SPOC TCE | Tables | PDFs |
19 | NASA Ames SPOC TCE | Tables | PDFs |
14-19 | NASA Ames SPOC TCE | Tables | PDFs |
20 | NASA Ames SPOC TCE | Tables | PDFs |
21 | NASA Ames SPOC TCE | Tables | PDFs |
22 | NASA Ames SPOC TCE | Tables | PDFs |
23 | NASA Ames SPOC TCE | Tables | PDFs |
14-23 | NASA Ames SPOC TCE | Tables | PDFs |
24 | NASA Ames SPOC TCE | Tables | PDFs |
25 | NASA Ames SPOC TCE | Tables | PDFs |
26 | NASA Ames SPOC TCE | Tables | PDFs |
14-26 | NASA Ames SPOC TCE | Tables | PDFs |
27 | NASA Ames SPOC TCE | Tables | PDFs |
28 | NASA Ames SPOC TCE | Tables | PDFs |
29 | NASA Ames SPOC TCE | Tables | PDFs |
30 | NASA Ames SPOC TCE | Tables | PDFs |
31 | NASA Ames SPOC TCE | Tables | PDFs |
32 | NASA Ames SPOC TCE | Tables | PDFs |
33 | NASA Ames SPOC TCE | Tables | PDFs |
34 | NASA Ames SPOC TCE | Tables | PDFs |
35 | NASA Ames SPOC TCE | Tables | PDFs |
36 | NASA Ames SPOC TCE | Tables | PDFs |
37 | NASA Ames SPOC TCE | Tables | PDFs |
1-36 | NASA Ames SPOC TCE | Tables | PDFs |
38 | NASA Ames SPOC TCE | Tables | PDFs |
39 | NASA Ames SPOC TCE | Tables | PDFs |
40 | NASA Ames SPOC TCE | Tables | PDFs |
1-39 | NASA Ames SPOC TCE | Tables | PDFs |
41 | NASA Ames SPOC TCE | Tables | PDFs |
TEC gets you from your pile of signal detections to your best transiting planet candidate science faster. TEC builds off the highly successful Kepler Robovetter classification process (Coughlin et al. 2016, Thompson et al. 2018) as well as the DAVE K2 vetting tools (Kostov et al. 2019). TEC is actually neither of those more mature efforts (yet). TEC is actually the foundation underlying those classifiers. In other words TEC is the database of attributes/metrics that generically feed any kind of machine learning or classification process. For instance, the Kepler Robovetter examined a database of ~50 metrics in order for it to carry out its decision tree based classifications. TEC is currently focused on developing the database of attributes/metrics that can provide more sophisticated classifications using machine learning techniques. TEC currently uses a crude decision tree like classifier that is tuned manually in order to match the efficacy of the current manual vetting effort to build the MIT TOI alerts.
- Observing the best candidates: TEC provides a filtered and ranked list of the most viable detections to get you to the 'gem' planet candidates quickly. In practice, TEC has demonstrated that the number of viable detections can be reduced by a factor of six from the original detections.
- Machine learning attributes and training sets: All machine learning algorithms begin with a database of attributes/metrics. You can use TEC to enhance the current set of attributes that you have developed or you can use the currently crude classifications from TEC to enhance your training set.
- Statistical Validation: Do you want to statistically validate your lowest signal to noise ratio (SNR) detections? Then you need to quantify the reliability of your detections and classification. Mullally et al. (2018) and Burke et al. (2019) recently have shown that important Kepler statistical validations (such as Kepler-452b and Kepler-186f) are not as statistically significant as previously thought without taking into account the possibility that they could be false alarm/systematic detections. TEC enables automating the classification thus one can provide TEC examples of systematics in order to demonstrate whether reliability is an issue for the lowest SNR detections.
- Planet Occurrence Rates: Are you one of the brave souls that wants to calculate unbiased planet occurrence rates based on TESS data? You need automated classifications in order to quantify how this step influences the recovered planets.
- Your Pipeline Here: I have my own planet search pipeline, and my own large pile of detections. Help! Without too much work providing a flux-time series and detection ephemeris you can run TEC for your own pipeline.
TEC is a very, very much work in progress from a coding and workflow standpoint. However, TEC is producing filtered and ranked list of candidates from the TESS produced NASA Ames SPOC Threshold Crossing Events (TCEs) along with summary documents for each detection that TEC deems worthy of attention. So, even if you ignore the codes, TEC currently can provide a high fidelity list of the most promising planet candidates for observing follow-up, machine learning training sets, or benchmark to your own efforts. The TOI alerts (focused on providing the most reliable candidates for the TESS prime mission) are a heavily filtered set of candidates. TEC provides a more thorough search to lower SNR levels and classifications for all TCEs from the NASA Ames pipeline.
From an attribute/metric standpoint most of the flux-level metrics from the Kepler Robovetter are available with TEC. The notable exceptions are LPP (Thompson et al. 2015) and Marshall (Mullally et al. 2016). Those missing metrics do have publicly available algorithms under the DAVE project and will potentially be implemented in future TEC updates. Pixel-level centroid analysis does not exist in TEC yet, and won't for sometime. There is an effort to independently calculate in-transit difference images, but that is under developement. TEC relies on the centroid analysis from the NASA Ames pipeline to warn about possible centroid shifts during transit. The DAVE project is potentially gearing up for more detailed pixel level analysis that I encourage those interested to follow the DAVE project.
TEC currently consists of 5 primary outputs that are of most interest to the community. The columns and format of the files are described in doc/TESS-ExoClass_File_Guide.txt
- Triage list - Based upon a limited set of metrics that are simple, efficient, and robustly deal with 'junky' candidates, TEC does a triage filter on the NASA Ames TCEs. This is a hard filter in that TCEs that do not pass the TEC triage filter do not get a TEC summary report and do not have the more detailed metrics calculated. From a classification standpoint, the TCEs that do not pass this filter should be considered instrumental and/or stellar variability in nature. The TCEs that pass the triage are considered viable transiting planet candidates and/or detached stellar eclipsing binaries. Contact binaries that are not detached (detached EBs have a flat stable normalized flux time series in between the eclipses) tend to not pass the triage filter stage.
- Tier 1 list - List of most promising planet candidates that pass the triage filter and all the additional metrics indicate that the signals are consistent with a planet candidate on the target. In addition, the Tier 1 list is ranked by a heuristic value that gives higher rank to planet candidates that have brighter hosts, smaller radii, lower insolation flux, and etc...(quantities most suitable for follow up). In addition, the Tier 1 list provides a flag as to whether the planet candidate is previously known from the confirmed planet list hosted by the NASA Exoplanet Archive or a previously known TOI. End users (especially observers making plans for a telescope) are still encouraged to perform due diligence and manually examine the summary report pdf in order to confirm that the planet candidate is viable.
- Tier 2 list - List of planet candidates that pass the triage filter, but one or more metrics indicate an issue with the signal that raises concern the signal is not a planet candidate or the signal is not on the primary target of interest (evidence for a centroid offset). The Tier 2 list is a similarly ranked list as Tier 1, but contains a flag string to indicate which metric has raised concern. End users are encouraged to examine the summary report pdf in order to provide a judgement as to whether the metric of concern is significant and calculated in a valid manner. Some of the metrics can have false negatives and be good clean planet candidates.
- Tier 3 list - List of probable stellar eclipsing binaries. In particular, this list contains signals with eclipses and a significant secondary that is not due to a planet.
- Summary report - Users of the Kepler TCERT Vetting Reports from the Kepler Science Office will feel right at home with the TEC summary reports. The TEC summary report repeats the SPOC DV summary and includes the additional information calculated in TEC such as the very helpful model shift test, SPOC centroid information, and a prototype for an independent difference image calculation.
The doc folder has documentation that describes the columns for the TEC outputs, a semi-thorough step-by-step guide for running TEC on the NASA Ames SPOC TCEs along with hints for implementing TEC codes for your own pipeline results.
TEC code is in the code folder. It is primarily python. See doc/tess-exoclass_opsnotes_s04.txt for the guide for running the code for the NASA Ames SPOC TCEs with data products available at MAST.