Normalized to: Pence, W.
[1]
oai:arXiv.org:1201.1345 [pdf] - 460571
FITS Checksum Proposal
Submitted: 2012-01-05
The checksum keywords described here provide an integrity check on the
information contained in FITS HDUs. (Header and Data Units are the basic
components of FITS files, consisting of header keyword records followed by
optional associated data records). The CHECKSUM keyword is defined to have a
value that forces the 32-bit 1's complement checksum accumulated over all the
2880-byte FITS logical records in the HDU to equal negative 0. (Note that 1's
complement arithmetic has both positive and negative zero elements). Verifying
that the accumulated checksum is still equal to -0 provides a fast and fairly
reliable way to determine that the HDU has not been modified by subsequent data
processing operations or corrupted while copying or storing the file on
physical media.
[2]
oai:arXiv.org:1201.1340 [pdf] - 460570
A Tiled-Table Convention for Compressing FITS Binary Tables
Submitted: 2012-01-05
This document describes a convention for compressing FITS binary tables that
is modeled after the FITS tiled-image compression method (White et al. 2009)
that has been in use for about a decade. The input table is first optionally
subdivided into tiles, each containing an equal number of rows, then every
column of data within each tile is compressed and stored as a variable-length
array of bytes in the output FITS binary table. All the header keywords from
the input table are copied to the header of the output table and remain
uncompressed for efficient access. The output compressed table contains the
same number and order of columns as in the input uncompressed binary table.
There is one row in the output table corresponding to each tile of rows in the
input table. In principle, each column of data can be compressed using a
different algorithm that is optimized for the type of data within that column,
however in the prototype implementation described here, the gzip algorithm is
used to compress every column.
[3]
oai:arXiv.org:1201.1336 [pdf] - 460569
Tiled Image Convention for Storing Compressed Images in FITS Binary
Tables
Submitted: 2012-01-05
This document describes a convention for compressing n-dimensional images and
storing the resulting byte stream in a variable-length column in a FITS binary
table. The FITS file structure outlined here is independent of the specific
data compression algorithm that is used. The implementation details for 4
widely used compression algorithms are described here, but any other
compression technique could also be supported by this convention. The general
principle used in this convention is to first divide the n-dimensional image
into a rectangular grid of subimages or 'tiles'. Each tile is then compressed
as a block of data, and the resulting compressed byte stream is stored in a row
of a variable length column in a FITS binary table. By dividing the image into
tiles it is generally possible to extract and uncompress subsections of the
image without having to uncompress the whole image.
[4]
oai:arXiv.org:1112.2671 [pdf] - 450245
Fpack and Funpack User's Guide: FITS Image Compression Utilities
Submitted: 2011-12-12
Fpack is a utility program for optimally compressing images in the FITS
(Flexible Image Transport System) data format (see http://fits.gsfc.nasa.gov).
The associated funpack program restores the compressed image file back to its
original state (if a lossless compression algorithm is used). (An experimental
method for compressing FITS binary tables is also available; see section 7).
These programs may be run from the host operating system command line and are
analogous to the gzip and gunzip utility programs except that they are
optimized for FITS format images and offer a wider choice of compression
options.
[5]
oai:arXiv.org:1007.1179 [pdf] - 225958
Optimal Compression of Floating-point Astronomical Images Without
Significant Loss of Information
Submitted: 2010-07-07
We describe a compression method for floating-point astronomical images that
gives compression ratios of 6 -- 10 while still preserving the scientifically
important information in the image. The pixel values are first preprocessed by
quantizing them into scaled integer intensity levels, which removes some of the
uncompressible noise in the image. The integers are then losslessly compressed
using the fast and efficient Rice algorithm and stored in a portable FITS
format file. Quantizing an image more coarsely gives greater image compression,
but it also increases the noise and degrades the precision of the photometric
and astrometric measurements in the quantized image. Dithering the pixel values
during the quantization process can greatly improve the precision of
measurements in the images. This is especially important if the analysis
algorithm relies on the mode or the median which would be similarly quantized
if the pixel values are not dithered. We perform a series of experiments on
both synthetic and real astronomical CCD images to quantitatively demonstrate
that the magnitudes and positions of stars in the quantized images can be
measured with the predicted amount of precision. In order to encourage wider
use of these image compression methods, we have made available a pair of
general-purpose image compression programs, called fpack and funpack, which can
be used to compress any FITS format image.
[6]
oai:arXiv.org:0910.3733 [pdf] - 29700
Optimal DN encoding for CCD detectors
Submitted: 2009-10-19
Image compression has been a frequent topic of presentations at ADASS.
Compression is often viewed as just a technique to fit more data into a smaller
space. Rather, the packing of data - its "density" - affects every facet of
local data handling, long distance data transport, and the end-to-end
throughput of workflows. In short, compression is one aspect of proper data
structuring. For example, with FITS tile compression the efficient
representation of data is combined with an expressive logistical paradigm for
its manipulation.
A deeper question remains. Not just how best to represent the data, but which
data to represent. CCDs are linear devices. What does this mean? One thing it
does not mean is that the analog-to-digital conversion of pixels must be stored
using linear data numbers (DN). An alternative strategy of using non- linear
representations is presented, with one motivation being to magnify the
efficiency of numerical compression algorithms such as Rice.
[7]
oai:arXiv.org:0903.2140 [pdf] - 1001622
Lossless Astronomical Image Compression and the Effects of Noise
Submitted: 2009-03-12
We compare a variety of lossless image compression methods on a large sample
of astronomical images and show how the compression ratios and speeds of the
algorithms are affected by the amount of noise in the images. In the ideal case
where the image pixel values have a random Gaussian distribution, the
equivalent number of uncompressible noise bits per pixel is given by Nbits
=log2(sigma * sqrt(12)) and the lossless compression ratio is given by R =
BITPIX / Nbits + K where BITPIX is the bit length of the pixel values and K is
a measure of the efficiency of the compression algorithm.
We perform image compression tests on a large sample of integer astronomical
CCD images using the GZIP compression program and using a newer FITS
tiled-image compression method that currently supports 4 compression
algorithms: Rice, Hcompress, PLIO, and GZIP. Overall, the Rice compression
algorithm strikes the best balance of compression and computational efficiency;
it is 2--3 times faster and produces about 1.4 times greater compression than
GZIP. The Rice algorithm produces 75%--90% (depending on the amount of noise in
the image) as much compression as an ideal algorithm with K = 0.
The image compression and uncompression utility programs used in this study
(called fpack and funpack) are publicly available from the HEASARC web site. A
simple command-line interface may be used to compress or uncompress any FITS
image file.
[8]
oai:arXiv.org:astro-ph/0210407 [pdf] - 52445
Automated object classification with ClassX
Suchkov, A. A.;
McGlynn, T. A.;
Angelini, L.;
Corcoran, M. F.;
Drake, S. A.;
Pence, W. D.;
White, N.;
Winter, E. L.;
Hanisch, R. J.;
White, R. L.;
Postman, M.;
Donahue, M. E.;
Genova, F.;
Ochsenbein, F.;
Fernique, P.;
Derriere, S.
Submitted: 2002-10-17
ClassX is a project aimed at creating an automated system to classify X-ray
sources and is envisaged as a prototype of the Virtual Observatory. As a
system, ClassX integrates into a pipeline a network of classifiers and an
engine that searches and retrieves for a given target multi-wavelength
counterparts from the worldwide data storage media. It applies machine learning
methods to `train' different classifiers using different `training' data sets.
In ClassX, each classifier can make its own class (object type) assignment and
is optimized for handling different tasks and/or different object types. A user
would generally select a certain classifier to make, for instance, a most
complete list of candidate QSOs, but a different classifier would be used to
make a most reliable list of candidate QSOs. Still different classifiers would
be selected to make similar lists for other object types. Along with the class
name assignment, a network classifier outputs the probability for a source to
belong to the assigned class as well as probabilities that the source belongs
in fact to other classes. We illustrate the current capabilities of ClassX and
the concept of a classifiers network with the results obtained with classifiers
trained using ROSAT data. ~
[9]
oai:arXiv.org:astro-ph/0209166 [pdf] - 51572
Chandra Observation of Luminous and Ultraluminous X-ray Binaries in M101
Submitted: 2002-09-09
X-ray binaries in the Milky Way are among the brightest objects on the X-ray
sky. With the increasing sensitivity of recent missions, it is now possible to
study X-ray binaries in nearby galaxies. We present data on six luminous
sources in the nearby spiral galaxy, M101, obtained with the Chandra ACIS-S. Of
these, five appear to be similar to ultraluminous sources in other galaxies,
while the brightest source, P098, shows some unique characteristics. We present
our interpretation of the data in terms of an optically thick outflow, and
discuss implications.
[10]
oai:arXiv.org:astro-ph/0107133 [pdf] - 43504
Chandra X-ray Sources in M101
Submitted: 2001-07-06
A deep (98.2 ks) Chandra Cycle-1 observation has revealed a wealth of
discrete X-ray sources as well as diffuse emission in the nearby face-on spiral
galaxy M101. From this rich dataset we have created a catalog of the 110
sources from the S3 chip detected with a significance of >3 sigma,
corresponding to a flux of ~1.0E-16 ergs/cm/cm/s and a luminosity of 1.0E36
ergs/s for a distance to M101 of 7.2 Mpc. The sources display a distinct
correlation with the spiral arms and include a variety of X-ray binaries,
supersoft sources, supernova remnants, and other objects of which only ~27 are
likely to be background sources. There are only a few sources in the interarm
regions, and most of these have X-ray colors consistent with that of background
AGNs. The derived log N-log S relation for the sources in M101 (background
subtracted) has a slope of -0.80+/-0.05 over the range of 1.0E36 - 1.0E38
ergs/s. The nucleus is resolved into 2 nearly identical X-ray sources, each
with a 0.5-2.0 keV flux of 4.0E37 ergs/s. One of these sources coincides with
the optical nucleus, and the other coincides with a cluster of stars 110 pc to
the south.
[11]
oai:arXiv.org:astro-ph/0102408 [pdf] - 41119
Reconsidering the Identification of M101 Hypernova Remnant Candidates
Submitted: 2001-02-23
Using a deep Chandra AO-1 observation of the face-on spiral galaxy M101, we
examine three of five previously optically-identified X-ray sources which are
spatially correlated with optical supernova remnants (MF54, MF57, and MF83).
The X-ray fluxes from these objects, if due to diffuse emission from the
remnants, are bright enough to require a new class of objects, with the
possible attribution by Wang to diffuse emission from hypernova remnants. Of
the three, MF83 was considered the most likely candidate for such an object due
to its size, nature, and close positional coincidence. However, we find that
MF83 is clearly ruled out as a hypernova remnant by both its temporal
variability and spectrum. The bright X-ray sources previously associated with
MF54 and MF57 are seen by Chandra to be clearly offset from the optical
positions of the supernova remnants by several arc seconds, confirming a result
suggested by the previous work. MF54 does have a faint X-ray counterpart,
however, with a luminosity and temperature consistent with a normal supernova
remnant of its size. The most likely classifications of the sources are as
X-ray binaries. Although counting statistics are limited, over the 0.3--5.0 keV
spectral band the data are well fit by simple absorbed power laws with
luminosities in the 10^38 to 10^39 ergs/s range. The power law indices are
softer than those of Milky Way LMXB of similar luminosities, and are more
consistent with those of the Large Magellanic Cloud. Both the high luminosity
and the soft spectral shape favor these being accreting black hole binaries in
high soft states.