Normalized to: Laurino, O.
[1]
oai:arXiv.org:1502.00996 [pdf] - 934374
Learning from FITS: Limitations in use in modern astronomical research
Thomas, Brian;
Jenness, Tim;
Economou, Frossie;
Greenfield, Perry;
Hirst, Paul;
Berry, David S.;
Bray, Erik;
Gray, Norman;
Muna, Demitri;
Turner, James;
de Val-Borro, Miguel;
Santander-Vela, Juande;
Shupe, David;
Good, John;
Berriman, G. Bruce;
Kitaeff, Slava;
Fay, Jonathan;
Laurino, Omar;
Alexov, Anastasia;
Landry, Walter;
Masters, Joe;
Brazier, Adam;
Schaaf, Reinhold;
Edwards, Kevin;
Redman, Russell O.;
Marsh, Thomas R.;
Streicher, Ole;
Norris, Pat;
Pascual, Sergio;
Davie, Matthew;
Droettboom, Michael;
Robitaille, Thomas;
Campana, Riccardo;
Hagen, Alex;
Hartogh, Paul;
Klaes, Dominik;
Craig, Matthew W.;
Homeier, Derek
Submitted: 2015-02-03, last modified: 2015-02-10
The Flexible Image Transport System (FITS) standard has been a great boon to
astronomy, allowing observatories, scientists and the public to exchange
astronomical information easily. The FITS standard, however, is showing its
age. Developed in the late 1970s, the FITS authors made a number of
implementation choices that, while common at the time, are now seen to limit
its utility with modern data. The authors of the FITS standard could not
anticipate the challenges which we are facing today in astronomical computing.
Difficulties we now face include, but are not limited to, addressing the need
to handle an expanded range of specialized data product types (data models),
being more conducive to the networked exchange and storage of data, handling
very large datasets, and capturing significantly more complex metadata and data
relationships.
There are members of the community today who find some or all of these
limitations unworkable, and have decided to move ahead with storing data in
other formats. If this fragmentation continues, we risk abandoning the
advantages of broad interoperability, and ready archivability, that the FITS
format provides for astronomy. In this paper we detail some selected important
problems which exist within the FITS standard today. These problems may provide
insight into deeper underlying issues which reside in the format and we provide
a discussion of some lessons learned. It is not our intention here to prescribe
specific remedies to these issues; rather, it is to call attention of the FITS
and greater astronomical computing communities to these problems in the hope
that it will spur action to address them.
[2]
oai:arXiv.org:1407.6916 [pdf] - 856852
Iris: an Extensible Application for Building and Analyzing Spectral
Energy Distributions
Laurino, Omar;
Budynkiewicz, Jamie;
D'Abrusco, Raffaele;
Bonaventura, Nina;
Busko, Ivo;
Cresitello-Dittmar, Mark;
Doe, Stephen M.;
Ebert, Rick;
Evans, Janet D.;
Norris, Patrick;
Pevunova, Olga;
Refsdal, Brian;
Thomas, Brian;
Thompson, Randy
Submitted: 2014-07-25
Iris is an extensible application that provides astronomers with a
user-friendly interface capable of ingesting broad-band data from many
different sources in order to build, explore, and model spectral energy
distributions (SEDs). Iris takes advantage of the standards defined by the
International Virtual Observatory Alliance, but hides the technicalities of
such standards by implementing different layers of abstraction on top of them.
Such intermediate layers provide hooks that users and developers can exploit in
order to extend the capabilities provided by Iris. For instance, custom Python
models can be combined in arbitrary ways with the Iris built-in models or with
other custom functions. As such, Iris offers a platform for the development and
integration of SED data, services, and applications, either from the user's
system or from the web. In this paper we describe the built-in features
provided by Iris for building and analyzing SEDs. We also explore in some
detail the Iris framework and software development kit, showing how astronomers
and software developers can plug their code into an integrated SED analysis
environment.
[3]
oai:arXiv.org:1206.6161 [pdf] - 1124420
Managing Distributed Software Development in the Virtual Astronomical
Observatory
Evans, Janet D.;
Plante, Raymond L.;
Bonaventura, Nina;
Busko, Ivo;
Cresitello-Dittmar, Mark;
D'Abrusco, Raffaele;
Doe, Stephen;
Ebert, Rick;
Laurino, Omar;
Pevunova, Olga;
Refsdal, Brian;
Thomas, Brian
Submitted: 2012-06-26
The U.S. Virtual Astronomical Observatory (VAO) is a product-driven
organization that provides new scientific research capabilities to the
astronomical community. Software development for the VAO follows a lightweight
framework that guides development of science applications and infrastructure.
Challenges to be overcome include distributed development teams, part-time
efforts, and highly constrained schedules. We describe the process we followed
to conquer these challenges while developing Iris, the VAO application for
analysis of 1-D astronomical spectral energy distributions (SEDs). Iris was
successfully built and released in less than a year with a team distributed
across four institutions. The project followed existing International Virtual
Observatory Alliance inter-operability standards for spectral data and
contributed a SED library as a by-product of the project. We emphasize lessons
learned that will be folded into future development efforts. In our experience,
a well-defined process that provides guidelines to ensure the project is
cohesive and stays on track is key to success. Internal product deliveries with
a planned test and feedback loop are critical. Release candidates are measured
against use cases established early in the process, and provide the opportunity
to assess priorities and make course corrections during development. Also key
is the participation of a stakeholder such as a lead scientist who manages the
technical questions, advises on priorities, and is actively involved as a lead
tester. Finally, frequent scheduled communications (for example a bi-weekly
tele-conference) assure issues are resolved quickly and the team is working
toward a common vision
[4]
oai:arXiv.org:1206.2919 [pdf] - 1124099
CLaSPS: a new methodology for Knowledge extraction from complex
astronomical dataset
Submitted: 2012-06-13
In this paper we present the Clustering-Labels-Score Patterns Spotter
(CLaSPS), a new methodology for the determination of correlations among
astronomical observables in complex datasets, based on the application of
distinct unsupervised clustering techniques. The novelty in CLaSPS is the
criterion used for the selection of the optimal clusterings, based on a
quantitative measure of the degree of correlation between the cluster
memberships and the distribution of a set of observables, the labels, not
employed for the clustering. In this paper we discuss the applications of
CLaSPS to two simple astronomical datasets, both composed of extragalactic
sources with photometric observations at different wavelengths from large area
surveys. The first dataset, CSC+, is composed of optical quasars
spectroscopically selected in the SDSS data, observed in the X-rays by Chandra
and with multi-wavelength observations in the near-infrared, optical and
ultraviolet spectral intervals. One of the results of the application of CLaSPS
to the CSC+ is the re-identification of a well-known correlation between the
alphaOX parameter and the near ultraviolet color, in a subset of CSC+ sources
with relatively small values of the near-ultraviolet colors. The other dataset
consists of a sample of blazars for which photometric observations in the
optical, mid and near infrared are available, complemented for a subset of the
sources, by Fermi gamma-ray data. The main results of the application of CLaSPS
to such datasets have been the discovery of a strong correlation between the
multi-wavelength color distribution of blazars and their optical spectral
classification in BL Lacs and Flat Spectrum Radio Quasars and a peculiar
pattern followed by blazars in the WISE mid-infrared colors space. This pattern
and its physical interpretation have been discussed in details in other papers
by one of the authors.
[5]
oai:arXiv.org:1205.2419 [pdf] - 515115
Iris: The VAO SED Application
Doe, Stephen;
Bonaventura, Nina;
Busko, Ivo;
D'Abrusco, Raffaele;
Cresitello-Dittmar, Mark;
Ebert, Rick;
Evans, Janet;
Laurino, Omar;
McDowell, Jonathan;
Pevunova, Olga;
Refsdal, Brian
Submitted: 2012-05-10, last modified: 2012-05-22
We present Iris, the VAO (Virtual Astronomical Observatory) application for
analyzing SEDs (spectral energy distributions). Iris is the result of one of
the major science initiatives of the VAO, and the first version was released in
September 2011. Iris combines key features of several existing software
applications to streamline and enhance SED analysis. With Iris, users may read
and display SEDs, select data ranges for analysis, fit models to SEDs, and
calculate confidence limits on best-fit parameters. SED data may be uploaded
into the application from IVOA-compliant VOTable and FITS format files, or
retrieved directly from NED. Data written in unsupported formats may be
converted using SedImporter, a new application provided with Iris. The
components of Iris have been contributed by members of the VAO. Specview,
contributed by STScI, provides a GUI for reading, editing, and displaying SEDs,
as well as defining models and parameter values. Sherpa, contributed by the
Chandra project at SAO, provides a library of models, fit statistics, and
optimization methods; the underlying I/O library, SEDLib, is a VAO product
written by SAO to current IVOA (International Virtual Observatory Alliance)
data model standards. NED is a service provided by IPAC for easy location of
data for a given extragalactic source, including SEDs. SedImporter is a new
tool for converting non-standard SED data files into a format supported by
Iris. We demonstrate the use of SedImporter to retrieve SEDs from a variety of
sources--from the NED SED service, from the user's own data, and from other VO
applications using SAMP (Simple Application Messaging Protocol). We also
demonstrate the use of Iris to read, display, select ranges from, and fit
models to SEDs. Finally, we discuss the architecture of Iris, and the use of
IVOA standards so that Specview, Sherpa, SEDLib and SedImporter work together
seamlessly.
[6]
oai:arXiv.org:1204.3055 [pdf] - 1886350
IVOA Recommendation: Spectrum Data Model 1.1
McDowell, Jonathan;
Tody, Doug;
Budavari, Tamas;
Dolensky, Markus;
Kamp, Inga;
McCusker, Kelly;
Protopapas, Pavlos;
Rots, Arnold;
Thompson, Randy;
Valdes, Frank;
Skoda, Petr;
Rino, Bruno;
Derriere, Sebastien;
Salgado, Jesus;
Laurino, Omar;
Layer, the IVOA Data Access;
Groups, Data Model Working
Submitted: 2012-04-13
We present a data model describing the structure of spectrophotometric
datasets with spectral and temporal coordinates and associated metadata. This
data model may be used to represent spectra, time series data, segments of SED
(Spectral Energy Distributions) and other spectral or temporal associations.
[7]
oai:arXiv.org:1112.0742 [pdf] - 447224
The DAME/VO-Neural Infrastructure: an Integrated Data Mining System
Support for the Science Community
Brescia, M.;
Corazza, A.;
Cavuoti, S.;
d'Angelo, G.;
D'Abrusco, R.;
Donalek, C.;
Djorgovski, S. G.;
Deniskina, N.;
Fiore, M.;
Garofalo, M.;
Laurino, O.;
Mahabal, G. Longo A.;
Manna, F.;
Nocella, A.;
Skordovski, B.
Submitted: 2011-12-04
Astronomical data are gathered through a very large number of heterogeneous
techniques and stored in very diversified and often incompatible data
repositories. Moreover in the e-science environment, it is needed to integrate
services across distributed, heterogeneous, dynamic "virtual organizations"
formed by different resources within a single enterprise and/or external
resource sharing and service provider relationships. The DAME/VONeural project,
run jointly by the University Federico II, INAF (National Institute of
Astrophysics) Astronomical Observatories of Napoli and the California Institute
of Technology, aims at creating a single, sustainable, distributed
e-infrastructure for data mining and exploration in massive data sets, to be
offered to the astronomical (but not only) community as a web application. The
framework makes use of distributed computing environments (e.g. S.Co.P.E.) and
matches the international IVOA standards and requirements. The integration
process is technically challenging due to the need of achieving a specific
quality of service when running on top of different native platforms. In these
terms, the result of the DAME/VO-Neural project effort will be a
service-oriented architecture, obtained by using appropriate standards and
incorporating Grid paradigms and restful Web services frameworks where needed,
that will have as main target the integration of interdisciplinary distributed
systems within and across organizational domains.
[8]
oai:arXiv.org:1112.0750 [pdf] - 447227
DAME: A Distributed Data Mining & Exploration Framework within the
Virtual Observatory
Submitted: 2011-12-04
Nowadays, many scientific areas share the same broad requirements of being
able to deal with massive and distributed datasets while, when possible, being
integrated with services and applications. In order to solve the growing gap
between the incremental generation of data and our understanding of it, it is
required to know how to access, retrieve, analyze, mine and integrate data from
disparate sources. One of the fundamental aspects of any new generation of data
mining software tool or package which really wants to become a service for the
community is the possibility to use it within complex workflows which each user
can fine tune in order to match the specific demands of his scientific goal.
These workflows need often to access different resources (data, providers,
computing facilities and packages) and require a strict interoperability on (at
least) the client side. The project DAME (DAta Mining & Exploration) arises
from these requirements by providing a distributed WEB-based data mining
infrastructure specialized on Massive Data Sets exploration with Soft Computing
methods. Originally designed to deal with astrophysical use cases, where first
scientific application examples have demonstrated its effectiveness, the DAME
Suite results as a multi-disciplinary platform-independent tool perfectly
compliant with modern KDD (Knowledge Discovery in Databases) requirements and
Information & Communication Technology trends.
[9]
oai:arXiv.org:1107.3160 [pdf] - 1078014
Astroinformatics of galaxies and quasars: a new general method for
photometric redshifts estimation
Submitted: 2011-07-15
With the availability of the huge amounts of data produced by current and
future large multi-band photometric surveys, photometric redshifts have become
a crucial tool for extragalactic astronomy and cosmology. In this paper we
present a novel method, called Weak Gated Experts (WGE), which allows to derive
photometric redshifts through a combination of data mining techniques.
\noindent The WGE, like many other machine learning techniques, is based on the
exploitation of a spectroscopic knowledge base composed by sources for which a
spectroscopic value of the redshift is available. This method achieves a
variance \sigma^2(\Delta z)=2.3x10^{-4} (\sigma^2(\Delta z) =0.08), where
\Delta z = z_{phot} - z_{spec}) for the reconstruction of the photometric
redshifts for the optical galaxies from the SDSS and for the optical quasars
respectively, while the Root Mean Square (RMS) of the \Delta z variable
distributions for the two experiments is respectively equal to 0.021 and 0.35.
The WGE provides also a mechanism for the estimation of the accuracy of each
photometric redshift. We also present and discuss the catalogs obtained for the
optical SDSS galaxies, for the optical candidate quasars extracted from the DR7
SDSS photometric dataset {The sample of SDSS sources on which the accuracy of
the reconstruction has been assessed is composed of bright sources, for a
subset of which spectroscopic redshifts have been measured.}, and for optical
SDSS candidate quasars observed by GALEX in the UV range. The WGE method
exploits the new technological paradigm provided by the Virtual Observatory and
the emerging field of Astroinformatics.
[10]
oai:arXiv.org:1010.4843 [pdf] - 275635
DAME: A Web Oriented Infrastructure for Scientific Data Mining &
Exploration
Brescia, Massimo;
Longo, Giuseppe;
Djorgovski, George S.;
Cavuoti, Stefano;
D'Abrusco, Raffaele;
Donalek, Ciro;
Di Guido, Alessandro;
Fiore, Michelangelo;
Garofalo, Mauro;
Laurino, Omar;
Mahabal, Ashish;
Manna, Francesco;
Nocella, Alfonso;
d'Angelo, Giovanni;
Paolillo, Maurizio
Submitted: 2010-10-23, last modified: 2010-12-07
Nowadays, many scientific areas share the same need of being able to deal
with massive and distributed datasets and to perform on them complex knowledge
extraction tasks. This simple consideration is behind the international efforts
to build virtual organizations such as, for instance, the Virtual Observatory
(VObs). DAME (DAta Mining & Exploration) is an innovative, general purpose,
Web-based, VObs compliant, distributed data mining infrastructure specialized
in Massive Data Sets exploration with machine learning methods. Initially fine
tuned to deal with astronomical data only, DAME has evolved in a general
purpose platform which has found applications also in other domains of human
endeavor. We present the products and a short outline of a science case,
together with a detailed description of main features available in the beta
release of the web application now released.
[11]
oai:arXiv.org:0807.0967 [pdf] - 14254
Astrophysics in S.Co.P.E
Submitted: 2008-07-07
S.Co.P.E. is one of the four projects funded by the Italian Government in
order to provide Southern Italy with a distributed computing infrastructure for
fundamental science. Beside being aimed at building the infrastructure,
S.Co.P.E. is also actively pursuing research in several areas among which
astrophysics and observational cosmology. We shortly summarize the most
significant results obtained in the first two years of the project and related
to the development of middleware and Data Mining tools for the Virtual
Observatory.
[12]
oai:arXiv.org:0806.1144 [pdf] - 13320
GRID-Launcher v.1.0
Submitted: 2008-06-06
GRID-launcher-1.0 was built within the VO-Tech framework, as a software
interface between the UK-ASTROGRID and a generic GRID infrastructures in order
to allow any ASTROGRID user to launch on the GRID computing intensive tasks
from the ASTROGRID Workbench or Desktop. Even though of general application, so
far the Grid-Launcher has been tested on a few selected softwares
(VONeural-MLP, VONeural-SVM, Sextractor and SWARP) and on the SCOPE-GRID.
[13]
oai:arXiv.org:0806.1006 [pdf] - 13294
The VO-Neural project: recent developments and some applications
Brescia, M.;
Cavuoti, S.;
d'Angelo, G.;
D'Abrusco, R.;
Deniskina, N.;
Garofalo, M.;
Laurino, O.;
Longo, G.;
Nocella, A.;
Skordovski, B.
Submitted: 2008-06-05
VO-Neural is the natural evolution of the Astroneural project which was
started in 1994 with the aim to implement a suite of neural tools for data
mining in astronomical massive data sets. At a difference with its ancestor,
which was implemented under Matlab, VO-Neural is written in C++, object
oriented, and it is specifically tailored to work in distributed computing
architectures. We discuss the current status of implementation of VO-Neural,
present an application to the classification of Active Galactic Nuclei, and
outline the ongoing work to improve the functionalities of the package.