Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Presentation . 2021
License: CC BY
Data sources: Datacite
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Other literature type . 2021
License: CC BY
Data sources: ZENODO
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Presentation . 2021
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Training an interpretable ML algorithm with only a dab of real data: An extragalactic perspective

Authors: Ghosh, Aritra; Urry, Meg;

Training an interpretable ML algorithm with only a dab of real data: An extragalactic perspective

Abstract

In the last decade, convolutional neural networks (CNNs) have revolutionized the field of image processing and have become increasingly popular among astronomers for morphological analysis of galaxies. This push has been driven by the fact that they are the perfect alternative to the traditional techniques of obtaining morphological classifications --- expert visual classification, citizen science projects, and fitting light profiles, none of which is easily scalable to large data volumes. However, most previous applications of CNNs to morphological analysis have required a large training set of real galaxies with pre-determined classifications. However, if CNNs are to become the method of choice for analyzing unclassified data from future surveys, this necessitates an algorithm that does not require a large pre-classified training set of real galaxies from the same survey. The challenge of training a machine learning algorithm to classify brand new data, which has not been manually/previously looked at, is not unique to astronomy and is applicable to many other scientific fields which use large amounts of data such as the biomedical sciences. In this talk, I will outline how we have successfully trained a Bayesian CNN called Galaxy Morphology Network (GaMorNet) with a very small amount of real data and used it to extract morphological parameters of galaxies at a variety of redshifts from different surveys. We first trained GaMorNet on a large simulation suite of galaxies and then used a small amount of real data to perform transfer-learning/domain adaptation. We have already demonstrated that a preliminary classification-version of GaMorNet (Ghosh et. al. 2020) can be successfully applied to data from different surveys with misclassification rates of < 5%. We have also used GaMorNet to study the morphology and quenching of ~100,000$ (z~0) SDSS and ~20,000 (z~1) CANDELS galaxies using morphology-separated color-mass diagrams. Using the GaMorNet classifications, we find that bulge- and disk-dominated galaxies have distinct color-mass diagrams with separate evolutionary pathways. For both datasets, disk-dominated galaxies peak in the blue cloud, across a broad range of masses, consistent with the slow exhaustion of star-forming gas. In contrast, bulge-dominated galaxies are mostly red, with much smaller numbers down toward the blue cloud, suggesting rapid quenching and fast evolution across the green valley. GaMorNet is one of the very few publicly available CNNs in astronomy, complete with trained models. I will also outline in this talk why GaMorNet is not a black-box and how the representations learned by the network are highly amenable to visual interpretation. We have used a combination of different CNN visualization techniques to investigate and shed light on GaMorNet’s decision-making process, making our results interpretable, reproducible, and robust.

Related Organizations
Keywords

machine learning, convolutional neural networks, galaxies, morphology, gramornet, AGN, CNN

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
    OpenAIRE UsageCounts
    Usage byUsageCounts
    visibility views 7
    download downloads 1
  • 7
    views
    1
    downloads
    Powered byOpenAIRE UsageCounts
Powered by OpenAIRE graph
Found an issue? Give us feedback
visibility
download
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
views
OpenAIRE UsageCountsViews provided by UsageCounts
downloads
OpenAIRE UsageCountsDownloads provided by UsageCounts
0
Average
Average
Average
7
1
Green