Machine Learning Uncovers the Universe’s Hidden Gems: A Comprehensive Catalogue of C ivAbsorption Lines in SDSS DR12

Monthly Notices of the Royal Astronomical Society(2023)

Cited 0|Views0
No score
Abstract
ABSTRACT We assemble the largest C iv absorption line catalogue to date, leveraging machine learning, specifically Gaussian processes, to remove the need for visual inspection for detecting C iv absorbers. The catalogue contains probabilities classifying the reliability of the absorption system within a quasar spectrum. Our training set was a sub-sample of DR7 spectra that had no detectable C iv absorption in a large visually inspected catalogue. We used Bayesian model selection to decide between our continuum model and our absorption-line models. Using a random hold-out sample of 1301 spectra from all of the 26 030 investigated spectra in DR7 C iv catalogue, we validated our pipeline and obtained an 87 per cent classification performance score. We found good purity and completeness values, both $\sim 80{{\ \rm per\ cent}}$, when a probability of $\sim 95{{\ \rm per\ cent}}$ is used as the threshold. Our pipeline obtained similar C iv redshifts and rest equivalent widths to our training set. Applying our algorithm to 185 425 selected quasar spectra from SDSS DR12, we produce a catalogue of 113 775 C iv doublets with at least 95 per cent confidence. Our catalogue provides maximum a posteriori values and credible intervals for C iv redshift, column density, and Doppler velocity dispersion. We detect C iv absorption systems with a redshift range of 1.37–5.1, including 33 systems with a redshift larger than 5 and 549 absorbers systems with a rest equivalent width greater than 2 Å at more than 95 per cent confidence. Our catalogue can be used to investigate the physical properties of the circumgalactic and intergalactic media.
More
Translated text
Key words
<scp>iv</scp>absorption lines,sdss dr12,machine learning
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined