Pipeline¶

This class provides methods to load and interact with results from the VAST Pipeline. There are ready made functions to:

Load the pipeline outputs into dataframes ready for analysis.
Explore individual sources.
Run provided transient and variability searches.
Check for the Sun, Moon and planets in the pipeline run.
Recalculate the sources data after filters have been applied.
Create a MOC of the pipeline run.

Warning: Data Access

It is assumed that the machine that is running VAST Tools has access to the pipeline output and the images that were used in the pipeline. Refer to the Configuration & Data Access page for more information.

Using the Pipeline Component¶

Info: VAST Pipeline Example Notebook

An example notebook of using the Pipeline component can be found in the example notebooks section here. Using the pipeline results to crossmatch to external catalogues is also demonstrated in the catalogue crossmatch example notebook.

The first step is to initialise a Pipeline instance from vasttools.pipeline:

Example

from vasttools.pipeline import Pipeline

pipe = Pipeline()

If configured correctly, VAST Tools should automatically detected where the Pipeline directory is located on the system. If this is not the case then the variable project_dir can be passed when initialising the instance to define where the pipeline outputs are located. Refer to the Configuration & Data Access page for more information.

Example: Defining the Project Directory

pipe = Pipeline(project_dir='/path/to/the/pipeline-runs/')

Available Pipeline Methods¶

The following methods are available with the Pipeline instance.

Info: Code Reference

Each method below has a link to the Code Reference section which provides full details of the method, including the arguments.

id	wavg_ra	wavg_dec	avg_compactness	min_snr	max_snr	wavg_uncertainty_ew	wavg_uncertainty_ns	avg_flux_int	avg_flux_peak	max_flux_peak	max_flux_int	min_flux_peak	min_flux_int	min_flux_peak_isl_ratio	min_flux_int_isl_ratio	v_int	v_peak	eta_int	eta_peak	new	new_high_sigma	n_neighbour_dist	vs_abs_significant_max_peak	m_abs_significant_max_peak	vs_abs_significant_max_int	m_abs_significant_max_int	n_measurements	n_selavy	n_forced	n_siblings	n_relations
1	321.973	0.699851	1.19165	50.0035	50.0035	0.000282565	0.000282565	17.161	14.401	14.401	17.161	14.401	17.161	1	1	0	0	0	0	False	0	0.0797685	0	0	0	0	1	1	0	0	0
2	323.714	-2.60374	0.984136	35.6238	50.4427	0.000115396	0.000115396	14.8353	15.056	16.293	18.188	14.278	10.097	1	1	0.220432	0.0483071	34.0201	4.77011	False	0	0.0671826	4.37252	0.131824	9.7928	0.572105	6	6	0	0	0
3	322.062	-3.65218	1.10679	28.9505	51.573	0.00011714	0.00011714	11.2827	10.2657	14.492	17.725	7.602	8.53	1	1	0.318601	0.320678	44.727	144.446	False	0	0.0483475	18.271	0.623699	12.2216	0.700438	6	6	0	0	0
4	316.332	-2.60898	1.11456	52.4078	64.4311	0.000114767	0.000114767	15.6558	14.0617	14.497	19.35	13.364	12.002	1	1	0.166578	0.0334684	35.3403	3.70799	False	0	0.0724847	0	0	10.9505	0.468742	6	6	0	0	0
5	323.796	1.83018	1.0673	40.236	40.236	0.000284353	0.000284353	15.288	14.324	14.324	15.288	14.324	15.288	1	1	0	0	0	0	False	0	0.0485506	0	0	0	0	1	1	0	0	0
...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...

Attribute Name	Description
`associations`	The pipeline output `associations.parquet` loaded into a pandas dataframe.
`images`	The pipeline output `images.parqeut` loaded into a pandas dataframe.
`measurements`	The pipeline output `measurements.parquet` or `measurements.arrow` loaded into a pandas dataframe or vaex dataframe, respectively.
`relations`	The pipeline output `relations.parqeut` loaded into a pandas dataframe.
`skyregions`	The pipeline output `skyregions.parqeut` loaded into a pandas dataframe.
`sources`	The pipeline output `sources.parqeut` loaded into a pandas dataframe.
`sources_skycoord`	A `astropy.coords.SkyCoordinate` instance of the all the `sources` positions for convenience.

Pipeline¶

Using the Pipeline Component¶

Available Pipeline Methods¶

list_images¶

list_piperuns¶

load_run¶

load_runs¶

PipeAnalysis Instances¶

Accessing PipeAnalysis Run Data¶

Available PipeAnalysis Methods¶

check_for_planets¶

combine_with_run¶

create_moc¶

filter_by_moc¶

get_source¶

get_sources_skycoord¶

load_two_epoch_metrics¶

recalc_measurement_pairs_df¶

recalc_sources_df¶

Available PipeAnalysis Transient Analysis Methods¶

eta_v_diagnostic_plot¶

plot_two_epoch_pairs¶

run_eta_v_analysis¶

run_two_epoch_analysis¶