The Cannon 2: The Compressed Sensing Edition

If we take The Cannon to large numbers of labels (say chemical abundances), the model complexity grows very fast. At the same time, we know that most chemicals affect very few wavelengths in the spectrum; that is, we know that the problem is sparse. Here we try to use standard methods to discover and enforce sparsity.

Authors

Andy Casey (Cambridge)
David W. Hogg (NYU) (MPIA) (SCDA)
Melissa K. Ness (MPIA)
Hans-Walter Rix (MPIA)
Anna Y. Q. Ho (Caltech)
Gerry Gilmore (Cambridge)

License

Installation

To install:

pip install https://github.com/andycasey/AnniesLasso/archive/master.zip

Getting Started

Let us assume that you have rest-frame continuum-normalized spectra for a set of stars for which the stellar parameters and chemical abundances (which we will collectively call labels) are known with high fidelity. The labels for those stars (and the locations of the spectrum fluxes and inverse variances) are assumed to be stored in a table. In this example all stars are assumed to be sampled on the same wavelength (dispersion) scale.

Here we will create and train a 3-label (effective temperature, surface gravity, metallicity) quadratic (e.g., Teff^2) model:

import numpy as np
from astropy.table import Table

import AnniesLasso as tc

# Load the table containing the training set labels, and the spectra.
training_set = Table.read("training_set_labels.fits")

# Here we will assume that the flux and inverse variance arrays are stored in
# different ASCII files. The end goal is just to produce flux and inverse
# variance arrays of shape (N_stars, N_pixels).
normalized_flux = np.array([np.loadtxt(star["flux_filename"]) for star in training_set])
normalized_ivar = np.array([np.loadtxt(star["ivar_filename"]) for star in training_set])

# Providing the dispersion to the model is optional, but handy later on.
dispersion = np.loadtxt("common_wavelengths.txt")

# Create the model that will run in parallel using all available cores.
model = tc.CannonModel(training_set, normalized_flux, normalized_ivar,
    dispersion=dispersion, threads=-1)

# Specify the complexity of the model:
model.vectorizer = tc.vectorizer.NormalizedPolynomialVectorizer(labelled_set,
    tc.vectorizer.polynomial.terminator(("TEFF", "LOGG", "FEH"), 2))

# Train the model!
model.train()

You can follow this example further in the complete Getting Started tutorial.

Name		Name	Last commit message	Last commit date
Latest commit History 324 Commits
AnniesLasso		AnniesLasso
docs		docs
papers		papers
sandbox-scripts		sandbox-scripts
.coveragerc		.coveragerc
.gitignore		.gitignore
.scrutinizer.yml		.scrutinizer.yml
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
getting_started.py		getting_started.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AnniesLasso

AnniesLasso

docs

docs

papers

papers

sandbox-scripts

sandbox-scripts

.coveragerc

.coveragerc

.gitignore

.gitignore

.scrutinizer.yml

.scrutinizer.yml

.travis.yml

.travis.yml

LICENSE

LICENSE

README.md

README.md

getting_started.py

getting_started.py

setup.py

setup.py

Repository files navigation

The Cannon 2: The Compressed Sensing Edition

Authors

License

Installation

Getting Started

About

Releases

Packages

Languages

License

peraktong/AnniesLasso

Folders and files

Latest commit

History

Repository files navigation

The Cannon 2: The Compressed Sensing Edition

Authors

License

Installation

Getting Started

About

Resources

License

Stars

Watchers

Forks

Languages