SWAS Data Pipeline

1.0 Overview of SWAS Data

While in orbit, SWAS makes continuous spectroscopic measurements. The AOS is read out every 10 milliseconds, regardless of the actual operation mode and pointing of the telescope. These 10 millisecond scans are co-averaged on-board, generating one full 1,450-channel scan every 2 seconds. All the 2-second co-adds are stored on-board, time-tagged, and downlinked to the ground stations (downlinks require approximately 1 hour per day) twice per day. The downlinked data are played back to the NASA data facilities twice-daily, and are subsequently passed on to the SWAS Science Operation Center (SOC) at SAO in Cambridge, Mass. In addition to the spectroscopic output which make up most of the SWAS science data, there are also the "housekeeping" data which include a large number of instrument and spacecraft status parameters.

The data stream is pre-processed using a modified NASA software program ("tcw") upon arrival at SOC. This process decompresses, decodes, re-formats and partitions the science data packets into a "segmental" form, where data file boundaries are established based on the observational segments, as implemented by the SWAS Planning and Scheduling System (each segment usually observes a single target, and lasts up to 45 minutes). At this stage, SWAS science data become individual files in FITS (binary table and image extensions) format, and are referred to as "level-0.5" (or LH) data in our discussions. Essentially, the LH data are identical to the SWAS instrument output in terms of the science content, with the only exception that the 1,450 AOS channels are broken into two halves (corresponding to the 490 and 550GHz bands, respectively). In other words, the pre-processing mostly takes out effects related to the packaging the transmission of the data from the spacecraft to the SOC.

The SWAS Pipeline consists of a series of operations, each of which can be considered as a distinctive step in the data processing. Results of each processing step are recorded into intermediate data files. There is also a data reduction log file automatically generated in the process. The products of the pipeline are written into a new output data file in the final step of the processing. In its entire operation the Pipeline accesses the LH data file on a "read-only" basis, i.e. no information in the raw data is altered as a result of the pipeline processing. This ensures the consistency of SWAS data when re-processing becomes necessary.

The output of the pipeline are co-added spectra with a new set of header parameters generated during the processing. Currently, there are two parallel "flavors" of the output data : an IRAF-flavored FITS data file format, to suit those users who are familiar with tools within the IRAF environment; and a CLASS-flavored FITS format, which is intended for the general user community who are presumably more accustomed to the popular radio astronomy data analysis tools package CLASS from Observatory de Grenoble, Spain. These data products are referred to as "level-1" data.

2.0 Summary of Processing Steps

There are three basic SWAS science observing modes: nodding, chopping and mapping. The SWAS Pipeline reads the header parameters in the LH data file that identifies the observing mode, and takes actions accordingly. The following descriptions summarize each of the main Pipeline reduction steps. Special reductions for data of other modes (e.g. planetary cal and matrix tests) are discussed elsewhere.

2.1 Step one (checking):

This task performs data validation and integrity checks on the raw (LH) data file. The validations are based on the "header" parameters that come with the spectral data and specify the instrument setup and conditions of the data taking. Potential problems such as internal inconsistencies of the instrumental parameters and observing time gaps are reported and, flagged. Input data files that are not of the three basic observing modes are noted and excluded from the following normal processing steps. In addition, necessary conversions of parameters into more useful units (such as times, frequencies, temperatures and celestial coordinates) are performed. However, there is no direct access or manipulation of the spectral (pixel) data at this stage.

2.2 Step two (ON/OFF pairing and subtraction):

This step mainly performs the standard (ON-OFF)/(OFF-ZERO) calculations of the spectral data. This involves pairing and evaluating ON, OFF and ZERO or CAL scans. The scans are pairied on a one-to-one basis for Nod and Chop modes, usually with the ON and OFF scans taken within 30 seconds in time. For mapping data where unequal numbers of ON and OFF scans are sometimes taken, a procedure similar to the so-called "on-the-fly" mapping processing is adopted, where each ON is paired with an average of a small number of OFF scans. Because of the Doppler effects of the orbiting spacecraft, stringent limits are put on the time intervals over which the pairing can be accepted.

Similar operations are performed on the spectral data for internal CAL sequences, where the ONs correspond to scans of the "hot load", while OFFs are those of the blank sky. Results of the CAL data are used in latter steps for passband and flux calibrations.

SWAS ZERO scans correspond to data taken while the blanking switch is engaged. However, since it has been discovered that engaging the blanking switch causes a time-dependent hysteresis effect on the AOS continuum levels, SWAS has been using ZERO scans taken at irregular time intervals. Tests have shown that the ZERO scans are statistically stable over a long (months) time scale.

In addition to the spectral AOS data, there are also the individual data points from the two SWAS continuum detectors, which are processed in the same way in this step.

Frequency calibration is performed via an on-board COMB generator. The pipeline selects unsaturated COMB lines, performs a line center fitting for each, and calculates a dispersion relation based on the results. As a rule of thumb, both CAL and COMB scans are taken during the same observing segment as the science data to which the calibrations are applied.

2.3 Step three (flagging):

This task allows an adjustable control over the criteria by which individual ON-OFF scans are allowed to enter the final coadds. It is kept flexible so that flagging criteria can be added or changed depending on the actual requirements and types of observation. Basically, scans that exhibit unusual levels of noise, data spikes, and baseline variations are flagged. The statistics of these flagging operations are kept in log files.

This step differs from the flagging in step one (above) in that the criteria imposed here are based on "derived" quantities from after the ON-OFF pairing and subtraction step.

2.4 Step four (frequency calibration and co-adding):

This step performs frequency shifts and coadds based on the nature of the data and requirements. Nominal co-adds consist of data within a 3-minute (or less) on-target time interval. For each 2-second scan that enters the co-add, a velocity shift is evaluated and added to compensate for the Doppler effect of the spacecraft's motion around the Earth. This effectively normalizes the velocity scale for all scans taken at different positions of the orbit to the line-of-sight velocity of the center of the Earth. Also, this shift and co-add operation is performed on each spectral scan in two opposite directions, because the Doppler effect is in reversed directions depending on whether one is interested in the upper or lower sidebands of the SWAS receivers. It is at this step that the SWAS data are split into four different frequency ranges.

The absolute frequency calibration is based on the fits to the COMB scans taken during the same segment as the science data. If more than one set of the COMB scans are taken, then their results are averaged. Finally, the VLSR velocity scales for each line are added based on the time and target coordinates of the observations.

For each coadd, several header parameters such as observation time are also summed or averaged as appropriate. For mapping mode data, the coadds are grouped according to their actual pointing offsets.

2.5 Step five (Flux Calibration and System Temperature Evaluation):

In standard calibration sequences, SWAS takes CAL data with its hot load at a known temperature, and compares that with signal from the empty sky. This calibration step extracts this information, evaluates the gain profile of the SWAS passband and the flux conversion factor, and apply these to the data for flux calibration. The continuum detector data are also used to provide the "Y-factor" measurement of the receiver and, thus, the corresponding system temperature. Note that there are only two double-sidband system temperatures derived for the four SWAS bands, i.e the upper and lower sidebands of the same frequency range (490 or 550 GHz) share the same system temperature. The cosmic background with a blackbody temperature of a constant 2.97K in the SWAS continuum bands is assumed in these calculations.

The Pipeline generates both IRAF and CLASS flavored data output at this point which are identical except for some minor differences in parameter settings. However, in the CLASS flavored data files, a total co-add spectrum per band, per pointing position (in case of mapping data), per segment is also provided in addition to the nominal (3-minute) averaged spectra. Note that each output spectrum has its own set of header parameters that specify, among other things, the integration time, antenna temperature and frequency scales, target coordinates, time of observation and data reduction, etc.

At this stage (known as Level-1), the data products are still organized according to observing segments, which is also chronologically ordered. One of the last operations of the SWAS Pipeline is to sort the data files (one per segment) according to the target being observed. For example, most of Orion data files are copied to a data directory "omc-1" (but see notes about different LO settings below).

3.0 Additional Notes

The "integration time" recorded in the SWAS data product is only the sum of ON scan times. The actual observing time, which include the OFF scans, is of course usually twice as long.

The amount of data flagged and excluded from co-adding in the Pipeline processing has been monitored throughout the SWAS mission. There are a variety of reasons for flagging the data, but the ultimate goal is to obtain the highest possible signal-to-noise ratios from the existing data without introducing any artifacts. The total amount of excluded data has been shown to be only a small fraction of the available observing time. The overall stability of the SWAS instrument has been found to be very good, as evidenced by spectra of the same target taken over 6-month intervals that are essentially indistinguishable.

By a convention established in the SWAS mission, each unique "target name" has a single (0,0) sky coordinate position. Mapping positions of the same target are expressed as angular offsets from that central position.

In some cases where a target has been observed with several different local oscillator (LO) frequency settings, the data are separated using different target names. For example, target names "sgra" and "sgra-150" are of the same target, but of different LO settings (i.e. sgra-150 had the LO shifted by -150 km/s relative to the LO setting used for sgra). One could, if necessary, combine these data in further data reductions, for example, within the CLASS package.

The co-added spectra are each of 725 channels, with pixel-to-pixel velocity scales of approximately 0.6 km/sec. However, because of the shift-and-coadd operation in the calibration process as well as the "edge-effect" of the AOS bands, the approximately 25 pixels at each end of a co-added spectrum suffer from much higher uncertainties and cannot be taken at face value for scientific analysis purposes. Also, since the shift-and-coadd operation was performed on integer number of pixel basis, (i.e. no sub-pixel interpolation is used), the effective velocity resolution of the co-added SWAS spectra is about 1.5 pixels, or approximately 0.9 km/sec.

In the header information of the SWAS Pipeline data product, one of the items that are unique to SWAS (as opposed to ground-based radio telescopes) is the "roll angle" of the spacecraft. This parameter can be important in some cases, especially because the SWAS beam is not perfectly circular. Since this is not a standard FITS spectral header item and thus could be missed by the analysis tools a user might have, we have decided to use the parameter "azimuth angle" (which is meaningless in the context of SWAS) to register the value of roll angle. Similarly, in the ASCII string of FITS header parameter "telescope" we record information regarding the observing segment. For example, "s506006 NOD" means that the data were obtained on Day of Mission number 506, of observing segment 6, and was taken in nodding mode.

Data Segments with Illegal System Temperatures

For ground support events, SWAS engages a blanking switch. Because no data are recorded from the AOS when the blanking switch is engaged, ground supports appears as time gaps (a jump in the time index) in the science data. The use of the blanking switch has been found to induce a hysteresis effect in the AOS after it is disengaged, with the response of the AOS returning to normal levels in about 100 seconds. If a ground support event occurs entirely within a science segment, the pipeline software detects the event and automatically disregards the next 50 scans (100 seconds of data) after the event for processing.

Under certain (rare) circumstances, the engagement of the blanking switch will occur such that the calibration scans for a science segment are subsequently disregarded by the pipeline after the blanking switch is disengaged. A bug in the pipeline software allows these segments to be processed to level-one in an uncalibrated state and assigns system temperature values of "9999" to channels one and two.

Segments in the 1st and 2nd public SWAS data releases that are affected by the illegal system temperature problem are listed below. These segments have been deleted from the post-pipeline data products at the SWAS Science Operations Center and are not used for scientific analysis. In the Class-format archive, the affected science segments are identified in the "all.dat" file by "mission day_segment number" for the targets listed below. For the third public data release (August 2001), both the Class-format archives and the FITS-format archives have had the affected segments removed.

segment ID    target

0241_032      ON2S
0052_036      S140
0075_030      S140
0265_064      HELIX
0099_062      OMC-1
0117_012      OMC-1
0320_021      OMC-1
0320_056      OMC-1
0320_061      OMC-1
0396_004      TMC-1
0310_016      SATURN
0239_042      CRL2591
0306_027      JUPITER
0112_053      NGC1333
0113_048      NGC1333
0059_043      NGC7538
0067_046      NGC7538
0070_046      NGC7538
0384_010      IC443B-60
0385_013      IC443B-60
0386_011      IC443B-60

Reprocessing of Data Taken in Chop_Nod Observing Mode

Early in the SWAS mission, it was discovered that observations made using the chopping secondary (chop mode) have a standing wave in the spectral baseline. This standing wave is likely due to the presence of a reflection in the chopped off-beam somewhere in the telescope. For this reason, the vast majority of SWAS observations are made by nodding the spacecraft itself to the off position (nod mode).

A hybrid observing mode called "chop_nod" (dual beam switching or balanced beam switching) was introduced on mission day 348. For this mode to be effective, the pipeline software was modified to apply the Doppler correction to the AOS data after the on-minus-off pairs are averaged. This observing mode, processed in this way, removes the standing wave from the spectral baseline. Data taken in the chop_nod mode that were previously distributed in the second SWAS public data release have been reprocessed with the modified pipeline software and have been updated for the third SWAS public data release (August 2001). Observing segments affected by this reprocessing, listed as "mission day_segment number" and target are listed below.

segment ID    target

0348_041      VY_CMA
0306_027      JUPITER
0369_006      JUPITER
0369_010      JUPITER
0369_014      JUPITER
0369_018      JUPITER
0369_022      JUPITER
0369_026      JUPITER