SkyShot - Volume 1, Issue 1: Autumn 2020

a binary star system, eclipsing binaries may result, in which a star’s brightness varies periodically as one star passes in front of the other, causing the observed dip. Such a phenomenon requires extended analysis of the target star’s flux lightcurve, which shows changes in brightness. In the case of background object interference, a background eclipsing binary or planet may blend with the target star, requiring researchers to look for any offset between the target star and the transit signal. [4]

As a result, researchers use a planetary validation process to estimate the statistical probability that a transit arose from a false positive, in which no planet was present. [5] A common tool for validating some of the approximately 4,000 known exoplanets has been the vespa algorithm and open source code library. The procedure, detailed in a 2012 paper by Morton, accounts for factors such as features of the signal, the target star, follow-up observations, and assumptions regarding field stars. [6] However, as Armstrong, Gamper, and Damoulas explain in their abstract published in August 2020, a catalogue of known exoplanets should not depend on a single method. [5]

Previous machine learning strategies have often generated rankings of potential candidates based on their relative likelihoods of truly being planets, but these approaches have not provided an explicit probability for any given candidate. For example, in 2017, Shallue and Vanderburg developed a model that produced such a ranking; in their test set, plausible planet signals were ranked higher than false positive signals 98.8% of the time. [7]

However, a probabilistic framework is a key component of the planetary validation process.
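To illustrate what a ranking-only score conveys, the following minimal Python sketch computes the fraction of (planet, false positive) pairs in which the planet receives the higher score, which is the same kind of quantity as the 98.8% figure quoted above. The scores and the function name `pairwise_ranking_fraction` are hypothetical and are not taken from Shallue and Vanderburg's model.

```python
from itertools import product


def pairwise_ranking_fraction(planet_scores, false_positive_scores):
    """Fraction of (planet, false positive) pairs where the planet scores higher.

    Ties count as half a correct ordering, a common convention for this metric.
    """
    if not planet_scores or not false_positive_scores:
        raise ValueError("both score lists must be non-empty")
    total = 0.0
    for p, f in product(planet_scores, false_positive_scores):
        if p > f:
            total += 1.0
        elif p == f:
            total += 0.5
    return total / (len(planet_scores) * len(false_positive_scores))


# Hypothetical model outputs: a higher score means "more planet-like".
planets = [0.9, 0.8, 0.4]
false_positives = [0.7, 0.3]

score = pairwise_ranking_fraction(planets, false_positives)
print(f"planets outrank false positives in {score:.1%} of pairs")

# Halving every score changes none of the orderings, so the metric is unchanged.
halved = pairwise_ranking_fraction(
    [s / 2 for s in planets], [s / 2 for s in false_positives]
)
print(halved == score)
```

Because only the ordering of the scores matters, a metric like this describes how well candidates are sorted but not how likely any single candidate is to be a planet.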
Thus, by employing a Gaussian Process Classifier along with other models, the University of Warwick researchers could compute an explicit statistical probability that a specific exoplanet candidate is a false positive, not merely a relative ranking. In general, a Gaussian Process generates a probabilistic prediction, which allows researchers to incorporate prior knowledge, potentially find confidence intervals and uncertainty values, and make decisions about refitting. [8] If the probability of a candidate being a false positive is less than 1%, it is considered a validated planet under their approach. Trained using two samples from Kepler, one of confirmed planets and one of false positives, the model was tested on unconfirmed Kepler candidates and confirmed 50 new planets with a wide range of sizes and orbital periods. [3]

[Figure: A depiction of an exoplanet transit lightcurve; the Gaussian Process Classifier prioritizes the ingress and egress regions, indicated by the 2 dotted lines, when classifying exoplanets [5].]

Although the computational complexity of training the model is higher than that of traditional methods, and certain discrepancies with vespa were found, this approach demonstrates clear potential for efficient automated techniques to classify future exoplanet candidates, becoming more accurate with each new dataset the model learns from. In fact, the researchers aim to apply this technique to data from the PLATO and TESS missions; TESS has already identified over 2,000 potential exoplanet candidates. [9]

Machine Learning and Deep Learning for Galaxy Identification and Classification

Another area of artificial intelligence growing in popularity is image classification and object detection, with common applications in autonomous vehicles and medical imaging. A powerful technique in this field is the convolutional neural network, a form of deep learning loosely inspired by the structure and function of the human brain. Each layer of the network serves a unique purpose: convolution layers generate feature maps from the image, pooling layers downsample those maps while retaining key features such as edges, dense layers combine features, and dropout layers prevent overfitting to the training set. [10]

This method was applied to galaxy classification by researchers at the National Astronomical Observatory of Japan (NAOJ). The Subaru Telescope, an 8.2-meter optical-infrared telescope at Maunakea, Hawaii, serves as a robust source of data and images of galaxies due to its wide coverage, high resolution, and high sensitivity. [11] In fact, earlier this year, astronomers used Subaru Telescope data to train an algorithm to learn theoretical galaxy colors and search for specific spectroscopic signatures, or light frequency combinations. The algorithm was used to identify galaxies in the early stage of formation from data containing over 40 million objects. Through this study, a relatively young galaxy, HSC J1631+4426, was discovered, breaking the previous record for lowest oxygen abundance. [12]

In addition, NAOJ researchers have been able to detect nearly 560,000 galaxies in the images and have had access to big data from the Subaru/Hyper Suprime-Cam (HSC) Survey, which contains deeper band images and has a higher spatial resolution than images from the Sloan Digital Sky Survey. Using a convolutional neural network (CNN) with 14 layers, they could classify galaxies as non-spirals, Z-spirals, or S-spirals. [10]

[Figure: An example of data augmentation for galaxy images using rotation and flipping [10].]

This application presents several important takeaways for computational astrophysics. The first is the augmentation of data in the training set. Since the number of non-spiral galaxies was significantly greater than the number of spiral galaxies, the researchers needed more training set images for Z-spiral and S-spiral galaxies.
To achieve this without acquiring new images from scratch, they flipped, rotated, and rescaled the existing images of Z-spiral and S-spiral galaxies, generating a training set with roughly similar numbers of all galaxy types.

Second, it is important to note that the accuracy of AI models may decrease when working with celestial bodies or phenomena that are rare, because the training set is smaller. The galaxy classification CNN originally achieved an accuracy of 97.5%, identifying spirals in over 76,000 galaxies in a testing dataset. However, this value decreased to only 90% when the model was trained on a set with fewer than 100 images per galaxy type, demonstrating the potential for concern if rarer galaxy types were to be used.

A final important takeaway concerns the impact of misclassification and of differences between the training dataset and the testing dataset. When applied to the testing set of galaxy images, the model found roughly equal numbers of S-spirals and Z-spirals. This contrasted with the training set, in which S-spiral galaxies were more common. Although this may appear concerning, as one would expect the distribution of galaxy types to remain consistent, the training set may not have been representative, likely due to human selection and visual inspection bias. In addition, the authors point out that the criterion for what constitutes a clear spiral is ambiguous, and that the training set images were classified by human eye. As a result, while the training set only included images that had unambiguous spirals, the validation set may have included more ambiguous cases, causing the model to classify them incorrectly.

Several strategies can be used to combat such issues in scientific machine learning research.
In terms of datasets, possible options include creating a new, larger training sample or employing numerical simulations to create mock images. On the other hand, a completely different machine learning approach, unsupervised learning, could be used. Unsupervised learning would not require humans to visually classify the training dataset, as the learning model would identify patterns and create classes on its own. [10]

In fact, researchers at the Computational Astrophysics Research Group at the University of California, Santa Cruz have taken a very similar approach to the task of galaxy classification, focusing on galaxy morphologies, such as amorphous elliptical or spheroidal. Their deep learning framework, named Morpheus, takes in image data from astronomers and, uniquely, performs pixel-level classification for various features of the image, allowing it to discern distinct objects within the same image rather than merely classifying the image as a whole (like the models used by the NAOJ researchers). A notable benefit of this approach is that Morpheus can discover galaxies by itself and would not require as much visual inspection or human involvement, which can be fairly high for traditional deep learning approaches: the NAOJ researchers worked with a dataset that required nearly 100,000 volunteers. [13] This is crucial, given that Morpheus could be used to analyze very large surveys, such as the Legacy Survey of Space and Time, which would capture over 800 panoramic images per night. [13]

[Figure: Examples of a Hubble Space Telescope image and its classification results using Morpheus [13].]
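The contrast between classifying a whole image and classifying each pixel can be sketched in a few lines of Python. This is an illustrative toy, not Morpheus itself: the class names, the per-pixel scores, and the helper functions `classify_pixels` and `classify_image` are all hypothetical, and a real framework would obtain the scores from a trained neural network.

```python
from collections import Counter

# Hypothetical class names; the real framework's categories may differ.
CLASSES = ("background", "spheroid", "disk")


def classify_pixels(score_map):
    """Assign each pixel the class with the highest score (pixel-level output)."""
    return [
        [CLASSES[max(range(len(CLASSES)), key=lambda k: pixel[k])] for pixel in row]
        for row in score_map
    ]


def classify_image(score_map):
    """Collapse the image to one label: the most common non-background class."""
    labels = [
        label
        for row in classify_pixels(score_map)
        for label in row
        if label != "background"
    ]
    return Counter(labels).most_common(1)[0][0] if labels else "background"


# Made-up scores for a 2x4 image; each tuple follows the order of CLASSES.
scores = [
    [(0.1, 0.8, 0.1), (0.2, 0.7, 0.1), (0.9, 0.05, 0.05), (0.1, 0.2, 0.7)],
    [(0.2, 0.6, 0.2), (0.8, 0.1, 0.1), (0.9, 0.05, 0.05), (0.1, 0.1, 0.8)],
]

label_map = classify_pixels(scores)
whole_image_label = classify_image(scores)

for row in label_map:
    print(row)
print("single label for the whole image:", whole_image_label)
```

In this toy input, the pixel-level map keeps a spheroid-like region on the left and a disk-like region on the right separated by background, whereas the single whole-image label reports only the more common of the two.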