<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Dark Energy Survey Deep Field photometric redshift performance and training incompleteness assessment</title></titleStmt>
			<publicationStmt>
				<publisher>EDP Sciences</publisher>
				<date>06/01/2024</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10559381</idno>
					<idno type="doi">10.1051/0004-6361/202348956</idno>
					<title level='j'>Astronomy &amp; Astrophysics</title>
<idno>0004-6361</idno>
<biblScope unit="volume">686</biblScope>
<biblScope unit="issue"></biblScope>					

					<author>L Toribio_San_Cipriano</author><author>J De_Vicente</author><author>I Sevilla-Noarbe</author><author>W G Hartley</author><author>J Myles</author><author>A Amon</author><author>G M Bernstein</author><author>A Choi</author><author>K Eckert</author><author>R A Gruendl</author><author>I Harrison</author><author>E Sheldon</author><author>B Yanny</author><author>M Aguena</author><author>S S Allam</author><author>O Alves</author><author>D Bacon</author><author>D Brooks</author><author>A Campos</author><author>A Carnero_Rosell</author><author>J Carretero</author><author>F J Castander</author><author>C Conselice</author><author>L N da_Costa</author><author>M_E S Pereira</author><author>T M Davis</author><author>S Desai</author><author>H T Diehl</author><author>P Doel</author><author>I Ferrero</author><author>J Frieman</author><author>J García-Bellido</author><author>E Gaztañaga</author><author>G Giannini</author><author>S R Hinton</author><author>D L Hollowood</author><author>K Honscheid</author><author>D J James</author><author>K Kuehn</author><author>S Lee</author><author>C Lidman</author><author>J L Marshall</author><author>J Mena-Fernández</author><author>F Menanteau</author><author>R Miquel</author><author>A Palmese</author><author>A Pieres</author><author>A A Plazas_Malagón</author><author>A Roodman</author><author>E Sanchez</author><author>M Smith</author><author>M Soares-Santos</author><author>E Suchyta</author><author>M_E C Swanson</author><author>G Tarle</author><author>M Vincenzi</author><author>N Weaverdyck</author><author>P Wiseman</author><author>DES_Collaboration</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[<p><italic>Context.</italic> The determination of accurate photometric redshifts (photo-<italic>zs</italic>) in large imaging galaxy surveys is key for cosmological studies. One of the most common approaches is machine learning techniques. These methods require a spectroscopic or reference sample to train the algorithms. Attention has to be paid to the quality and properties of these samples since they are key factors in the estimation of reliable photo-<italic>zs</italic>.</p> <p><italic>Aims.</italic> The goal of this work is to calculate the photo-<italic>zs</italic> for the Year 3 (Y3) Dark Energy Survey (DES) Deep Fields catalogue using the Directional Neighborhood Fitting (DNF) machine learning algorithm. Moreover, we want to develop techniques to assess the incompleteness of the training sample and metrics to study how incompleteness affects the quality of photometric redshifts. Finally, we are interested in comparing the performance obtained by DNF on the Y3 DES Deep Fields catalogue with that of the EAzY template fitting approach.</p> <p><italic>Methods.</italic> We emulated – at a brighter magnitude – the training incompleteness with a spectroscopic sample whose redshifts are known, in order to have a measurable view of the problem. We used a principal component analysis to graphically assess the incompleteness and relate it with the performance parameters provided by DNF. Finally, we applied the results on the incompleteness to the photo-<italic>z</italic> computation on the Y3 DES Deep Fields with DNF and estimated its performance.</p> <p><italic>Results.</italic> The photo-<italic>zs</italic> of the galaxies in the DES deep fields were computed with the DNF algorithm and added to the Y3 DES Deep Fields catalogue. We have developed some techniques to evaluate the performance in the absence of “true” redshifts and to assess the completeness. 
We have studied the trade-off in the training sample between the highest spectroscopic redshift quality and completeness. We found some advantages in relaxing the highest-quality spectroscopic redshift requirements at fainter magnitudes in favour of completeness. The results achieved by DNF on the Y3 Deep Fields are competitive with the ones provided by EAzY, showing notable stability at high redshifts. It should be noted that the good results obtained by DNF in the estimation of photo-<italic>zs</italic> in deep field catalogues make DNF suitable for the future Legacy Survey of Space and Time (LSST) and <italic>Euclid</italic> data, which will have similar depths to the Y3 DES Deep Fields.</p>]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The arrival of large photometric galaxy surveys such as the Sloan Digital Sky Survey (SDSS, <ref type="bibr">York et al. 2000)</ref>, the Dark Energy Survey (DES, <ref type="bibr">Flaugher et al. 2015)</ref>, Physics of the Accelerating Universe (PAU, <ref type="bibr">Castander et al. 2012)</ref>, or future projects such as the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST, LSST Science Collaboration 2009), and Euclid (Euclid Collaboration 2020), all capable of collecting huge amounts of data, is providing invaluable insights about the Universe. One of the crucial elements for cosmological and astrophysical studies is the estimation of accurate redshifts from photometric information, which are essential for many cosmological probes such as baryon acoustic oscillations (BAO), weak lensing, or galaxy clustering. Spectroscopic surveys, which measure the difference in the wavelength of some spectral lines with respect to their rest-frame wavelength, provide high-precision redshifts, but obtaining spectroscopic redshifts of large samples of astronomical objects is very expensive in terms of observing time. Currently, the Dark Energy Spectroscopic Instrument (DESI) project (DESI Collaboration 2016) is capable of measuring thousands of galaxy spectra every night, reducing telescope time. Despite this great advantage, long exposure times are still required to obtain good signal-to-noise spectra of faint objects, and photometric data are still needed for target selection. An alternative is to measure the fluxes of galaxies with a set of broadband or narrowband filters within an imaging survey; that is, using photometric techniques. These measurements allow us to compute the photometric redshift (photo-z) of a large number of galaxies per image, reducing the telescope time at the cost of lower precision.</p><p>The two main approaches to determining photo-zs are template fitting and machine learning methods. 
Template methods compare the spectral energy distribution (SED) of each galaxy with that of a set of redshifted rest-frame templates, looking for the best match (e.g., <ref type="bibr">Arnouts et al. 1999;</ref><ref type="bibr">Ben&#237;tez 2000;</ref><ref type="bibr">Bolzonella et al. 2000;</ref><ref type="bibr">Ilbert et al. 2006)</ref>. Machine learning approaches use reference or training galaxy samples whose spectroscopic redshifts are known in order to learn the relationship between magnitudes, colours, and redshifts. With this information, machine learning methods can predict the photometric redshift of a set of target galaxies (e.g., <ref type="bibr">Collister &amp; Lahav 2004;</ref><ref type="bibr">Sadeh et al. 2016;</ref><ref type="bibr">Carrasco Kind &amp; Brunner 2013;</ref><ref type="bibr">De Vicente et al. 2016)</ref>. Neither method is free of difficulties. Template methods depend on synthetic models and the completeness of the template library used in the fitting, while machine learning methods depend on the quality and variety of the training samples. Specifically, the selection of the spectroscopic training sample is one of the most important decisions in obtaining accurate photometric redshift estimates with the machine learning approach. Ideally, the spectroscopic sample should be representative of the whole target galaxy sample, covering the same colour-magnitude space. Unfortunately, the galaxy samples whose photometric redshift is to be determined typically include galaxies with deeper magnitudes that are not included in the spectroscopic sample. <ref type="bibr">Hartley et al. (2020)</ref> studied the impact of using incomplete spectroscopic samples on the redshift distribution using the <ref type="bibr">Lima et al. (2008)</ref> algorithm. They show that an incomplete spectroscopic training sample could bias the galaxy redshifts. Moreover, the studies of <ref type="bibr">Hildebrandt et al. (2010)</ref>, <ref type="bibr">Beck et al. 
(2017)</ref>, <ref type="bibr">S&#225;nchez et al. (2014)</ref>, <ref type="bibr">Schmidt et al. (2020)</ref>, <ref type="bibr">Bonnett et al. (2016)</ref>, <ref type="bibr">Abdalla et al. (2011)</ref> and <ref type="bibr">Brescia et al. (2021)</ref> compare different methods of photo-z estimation. These works suggest that machine learning methods provide more accurate photo-zs than template methods as long as a sufficiently adequate training sample is available. Outside the magnitude and colour space covered by the training sample, template methods seem to perform better than machine learning methods because they can generate synthesised spectra without redshift constraints. The evidence suggests that combining template and machine learning methods is the best option for maximising the photo-z accuracy of a sample (e.g., <ref type="bibr">Tanaka et al. 2018;</ref><ref type="bibr">Salvato et al. 2019)</ref>.</p><p>In this work, we study how incompleteness in the spectroscopic training sample affects the photo-zs produced by the Directional Neighborhood Fitting (DNF) algorithm <ref type="bibr">(De Vicente et al. 2016</ref>), as estimated for the Dark Energy Survey (DES) Year 3 Deep Field sample. The DNF algorithm is a nearest-neighbour approach to photometric redshift estimation that has become a reference within the DES collaboration and has been included as one of the five methods to be prioritised by the Vera C. Rubin Observatory. To assess the effects of incompleteness, we first derive the relevant parameters to characterise incompleteness, demonstrating how these parameters affect photo-z performance. Then, we show how DNF accounts for incompleteness in the photo-z errors it provides. Finally, we study the incompleteness of the training sample in the Y3 DES Deep Fields and compare our results with those obtained with the EAzY template method <ref type="bibr">(Brammer et al. 2008)</ref>.</p><p>The rest of the paper is organised as follows. In Sect. 
2, we describe the sample selection, and in Sect. 3 the metrics used and the DNF algorithm. We analyse the effects of incomplete training samples on the estimation of photometric redshifts in Sect. 4. In Sect. 5, we estimate photometric redshifts for the Y3 DES Deep Fields with different training samples. We compare the photo-zs determined by DNF and EAzY in Sect. 6. Finally, we enumerate the conclusions of this work in Sect. 7.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Data</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Spectroscopic sample</head><p>We used the spectroscopic sample defined by <ref type="bibr">Gschwend et al. (2018)</ref>. This sample contains spectroscopic redshifts of galaxies from 34 surveys (see Appendix A) and the photometric information for each of them. The quality of the spectroscopic redshift is flagged by the label FLAG_DES (with FLAG_DES = 4 as certain redshift, FLAG_DES = 3 as probable redshift, FLAG_DES = 2 as possible redshift, and FLAG_DES = 1 as unknown redshift). For this work, we only selected those objects marked in the catalogue with the best spectroscopic redshift determination; that is, those galaxies with flags of levels three and four (FLAG_DES &#8805; 3). In addition, we excluded those galaxies with mag(i) &#8805; 28. After these cuts, our spectroscopic sample contains a total of 55 601 galaxies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Year 3 Deep Fields catalogue</head><p>The Y3 DES Deep Fields catalogue<ref type="foot">foot_0</ref> used in this work is part of the DES. The observations were taken using the Dark Energy Camera <ref type="bibr">(DECam, Flaugher et al. 2015)</ref> on the Victor M. Blanco 4 m telescope at the Cerro Tololo Inter-American Observatory (CTIO) in Chile. The DES covered 5000 deg&#178; in the grizY bands with approximately ten overlapping dithered exposures in each filter (90 s in griz, 45 s in Y) covering the survey footprint. The Y3 DES Deep Fields catalogue comprises four fields measured with eight bands (ugrizJHKs), covering an area of &#8764;5.88 deg&#178; where the integrated exposure time per pixel is approximately ten times longer than in the main DES area (see details in <ref type="bibr">Hartley et al. 2022)</ref>. This catalogue contains around 2.8 million galaxies. We selected those galaxies that have flux measurements in the eight filters and with mag(i) &lt; 28, resulting in a catalogue that contains around 1.5 million galaxies. The mag(i) &lt; 28 limit, still suitable for weak lensing applications, was adopted because at fainter magnitudes the errors in the photometry are large and the data become unreliable.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Metrics and algorithm</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Metrics</head><p>This section describes the metrics used in this work to assess the quality of the photo-z estimates, where z_spec, z_phot, and N represent the spectroscopic redshift, the photometric redshift, and the number of objects in the sample, respectively. We define the following metrics to quantify the degree of precision of the photo-z and its scatter:</p><p>-Bias: the overall photo-z accuracy is assessed through the mean bias, that is, the mean of &#916;z over the N objects, where &#916;z = z_spec &#8722; z_phot.</p><p>-Mean absolute deviation of the &#916;z values.</p><p>-&#963;68(&#916;z): the half-width of the central 68% percentile range of the galaxies' &#916;z values.</p><p>-Outlier fraction: the ratio N_out/N, where N is the total number of objects and N_out is the number of outliers, defined relative to &#963;, the standard deviation of the &#916;z distribution.</p></div>
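<div xmlns="http://www.tei-c.org/ns/1.0"><p>As an illustration, the metrics of this section can be sketched in a few lines of Python with NumPy. This is a minimal sketch under stated assumptions, not the code used for the analysis: the function name photoz_metrics is hypothetical, the mean absolute deviation is taken here as the mean of |&#916;z|, and the outlier threshold n_sigma (in units of the standard deviation of &#916;z) is a free parameter whose default value is only a placeholder.</p>
<p><![CDATA[
```python
import numpy as np


def photoz_metrics(z_spec, z_phot, n_sigma=3.0):
    """Summary statistics of dz = z_spec - z_phot.

    n_sigma is a hypothetical outlier threshold in units of the standard
    deviation of dz; its default is an assumption of this sketch.
    """
    z_spec = np.asarray(z_spec, dtype=float)
    z_phot = np.asarray(z_phot, dtype=float)
    dz = z_spec - z_phot

    bias = dz.mean()                      # mean bias
    mad = np.abs(dz).mean()               # assumed form: mean of |dz|
    p16, p84 = np.percentile(dz, [16.0, 84.0])
    sigma68 = 0.5 * (p84 - p16)           # half-width of the central 68% range
    sigma = dz.std()                      # standard deviation of dz
    outlier_fraction = np.mean(np.abs(dz) > n_sigma * sigma)

    return {
        "bias": bias,
        "mad": mad,
        "sigma68": sigma68,
        "outlier_fraction": outlier_fraction,
    }
```
]]></p>
<p>Passing a nearest-neighbour redshift in place of z_spec gives the proxy version of the same statistics for catalogues without spectroscopy.</p></div>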
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">The DNF algorithm</head><p>Directional Neighborhood Fitting (DNF, <ref type="bibr">De Vicente et al. 2016</ref>) is a nearest-neighbour algorithm for estimating the redshift of a sample of galaxies. The DNF algorithm uses the colours and magnitudes or fluxes as a measurement of closeness to a reference sample composed of galaxies whose spectroscopic redshifts are known. The DNF algorithm provides the main photo-z value and its error estimation along with a secondary value intended for photo-z distribution estimation:</p><p>-DNF_Z: the main photo-z estimate determined by the fit of a number of neighbour galaxies to a hyperplane in the magnitude space. The process is iterated to remove outliers. In addition, the algorithm can provide individual photo-z probability density functions (PDFs). -DNF_ZSIGMA: an indicator of photo-z quality computed from the quadratic sum of the error due to photometry plus the error due to the fit. DNF_ZSIGMA takes the value -99 when DNF does not estimate the photo-z of a galaxy because there is no neighbour galaxy within a given radius. -DNF_ZN: a secondary photo-z determined by the single nearest neighbour galaxy, which is valuable in redshift distribution estimation.</p><p>The algorithm provides three alternative metrics for the assessment of closeness: Euclidean, angular, and directional. While the Euclidean and angular metrics account for magnitude and colour, respectively, the directional metric combines both into a single number. The present work takes advantage of the combination of five optical plus three near-infrared filters to define non-degenerate colours within the angular metric.</p></div>
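<div xmlns="http://www.tei-c.org/ns/1.0"><p>The two kinds of estimate described above (a hyperplane fit to several neighbours and the redshift of the single nearest neighbour) can be illustrated with a toy nearest-neighbour estimator. The sketch below is not the DNF implementation: it assumes a plain Euclidean distance in magnitude space, a fixed number of neighbours k, and no iterative outlier removal or error estimate. The function name neighbour_photoz and all parameter names are hypothetical.</p>
<p><![CDATA[
```python
import numpy as np


def neighbour_photoz(train_mags, train_z, target_mags, k=5):
    """Toy neighbour-based photo-z (illustrative only, not DNF itself).

    Returns (z_fit, z_nn): z_fit comes from a least-squares hyperplane
    z = a . m + b fitted to the k nearest training galaxies, and z_nn is
    the redshift of the single nearest training galaxy.
    """
    train_mags = np.asarray(train_mags, dtype=float)
    train_z = np.asarray(train_z, dtype=float)
    target_mags = np.atleast_2d(np.asarray(target_mags, dtype=float))

    z_fit = np.empty(len(target_mags))
    z_nn = np.empty(len(target_mags))
    for i, mags in enumerate(target_mags):
        # Euclidean distance in magnitude space (an assumption of this sketch)
        dist = np.linalg.norm(train_mags - mags, axis=1)
        idx = np.argsort(dist)[:k]
        z_nn[i] = train_z[idx[0]]

        # Design matrix with a constant column for the hyperplane offset
        design = np.hstack([train_mags[idx], np.ones((len(idx), 1))])
        coef, *_ = np.linalg.lstsq(design, train_z[idx], rcond=None)
        z_fit[i] = np.append(mags, 1.0) @ coef
    return z_fit, z_nn
```
]]></p>
<p>In this toy version, k should exceed the number of bands so that the hyperplane fit is well constrained.</p></div>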
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Effect of training incompleteness on photometric redshift estimation</head><p>We studied the effect of using an incomplete spectroscopic training sample to determine the photo-zs with the DNF algorithm. We call a training sample incomplete when it does not cover the same range of magnitudes and/or colours as the target sample for which we want to determine the photo-z. The spectroscopic sample, in addition to being used to train the algorithm, allows us to study the accuracy and precision of the photo-z estimation. For this purpose, the spectroscopic sample is usually split into two samples: one used to train the algorithm and the other one to validate the photo-zs (known as the training and validation sample, respectively). However, we must be careful when extrapolating the results obtained in the validation sample to the galaxies in the scientific target sample. The scientific sample may well contain galaxies at deeper magnitudes or in a different colour range that are not represented in the training sample, and their photo-zs may not be correctly estimated.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Incompleteness emulation with the spectroscopic sample</head><p>With the goal of learning how incompleteness affects photometric redshift performance, we used the spectroscopic sample to emulate, at brighter magnitudes, two different scenarios: a case in which the training sample covers the same magnitude and colour range as the target sample (the complete case), and another in which it does not (the incomplete case).</p><p>We split the spectroscopic catalogue into two sub-samples of equal size (with 27 801 galaxies each) and equal magnitude-colour distribution. We took one of these sub-samples as a validation sample and the other as a training sample. We selected the galaxies of the training sample in two different ways to emulate the scenarios mentioned. On the one hand, we took all of the galaxies in the training sample (the 27 801 galaxies) to emulate a complete training set; that is, a training sample that covers the same colour-magnitude space as the validation sample. On the other hand, we used the training sample to construct an incomplete version. In this second case, we wanted to emulate, at brighter magnitudes, the incompleteness observed in the Y3 DES Deep Fields catalogue when the training sample is formed by galaxies of the spectroscopic sample with FLAG_DES = 4. In principle, faint galaxies could be removed manually from the training sample until the desired incompleteness is reached. To automate this process, we employed the following method. We first calculated, for each band, the difference between the mean magnitude of the objects in the spectroscopic sample and in the Y3 DES Deep Fields photometric catalogue. Then, we subtracted this per-band difference from the magnitudes of every galaxy in the training sample to emulate a similar incompleteness at brighter magnitudes. 
Applying this leftward magnitude shift, we achieved a magnitude incompleteness at the expense of decoupling galaxies from their own redshift. To solve this issue, we used a nearest-neighbour algorithm to find real galaxies with magnitudes similar to the shifted ones. The algorithm assigns to each left-shifted set of magnitudes a real galaxy from the training sample, so many galaxies are selected repeatedly. After applying this procedure and dropping the repeated galaxies, the new training sample, hereafter referred to as the incomplete training sample, is reduced to 5336 galaxies out of the original 27 801. In this way, we now have a galaxy sample that simulates incompleteness in a magnitude range for which we have information about the spectroscopic redshift, enabling us to study the effects of incompleteness.</p><p>Fig. <ref type="figure">3</ref>. Density map as a function of the first and second principal components of the galaxies of the validation sample. In the bottom panel, we included the galaxies of the training sample (red dots) and the limit in the principal components of the galaxies for which DNF provides a value of photo-z with an uncertainty, DNF_ZSIGMA &lt; 1.0 (orange line) and DNF_ZSIGMA &lt; 0.1 (bold black line), with this training sample.</p><p>Figure <ref type="figure">1</ref> shows the magnitude and colour distributions (upper and lower plots, respectively) for the incomplete training sample (dashed red lines) and the validation sample (blue lines). We have not included the comparison to the complete training sample since its distributions overlap perfectly with those of the validation sample by construction.</p></div>
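<div xmlns="http://www.tei-c.org/ns/1.0"><p>The shift-and-match procedure of this section can be sketched as follows. This is a simplified illustration under stated assumptions, not the code used in this work: the function name emulate_incomplete_training is hypothetical, delta_band is taken as a positive per-band offset (mean magnitude of the deep catalogue minus mean magnitude of the spectroscopic sample), and the nearest neighbour is found with a brute-force Euclidean search.</p>
<p><![CDATA[
```python
import numpy as np


def emulate_incomplete_training(train_mags, delta_band):
    """Return indices of the training galaxies kept in the incomplete sample.

    train_mags : (n_gal, n_band) magnitudes of the complete training sample
    delta_band : (n_band,) positive per-band offset (sign convention assumed)
    """
    train_mags = np.asarray(train_mags, dtype=float)
    # Shift every galaxy towards brighter (numerically smaller) magnitudes
    shifted = train_mags - np.asarray(delta_band, dtype=float)

    matched = np.empty(len(shifted), dtype=int)
    for i, mags in enumerate(shifted):
        # Real galaxy whose magnitudes are closest to the shifted ones
        matched[i] = np.argmin(np.linalg.norm(train_mags - mags, axis=1))

    # Drop repeated galaxies; each kept galaxy retains its own redshift
    return np.unique(matched)
```
]]></p>
<p>The brute-force search scales quadratically with the sample size; for tens of thousands of galaxies a tree-based neighbour search would be the natural replacement.</p></div>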
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Incompleteness assessment</head><p>We determined the DNF photometric redshifts for the validation sample with both the complete and incomplete training sets, defined as in Sect. 4.1. We selected objects meeting the conditions DNF_Z &gt; 0, DNF_ZN &gt; 0, and DNF_ZSIGMA &lt; 1.0 to ensure the quality of the sample. The cut-off of DNF_ZSIGMA was defined by taking into account the analysis carried out in Appendix B, which studies the possible biases that DNF_ZSIGMA may have as a quality estimator of DNF photo-z. The number of galaxies after these cuts is 26 882 (96.7% of the sample) using the complete training sample and 22 617 (81.3% of the sample) using the incomplete training sample. We note the improvement for z &gt; 1 in the complete case.</p><p>Figure <ref type="figure">2</ref> shows the magnitude and colour distributions of the galaxies in the validation sample (blue lines) versus the distributions of their nearest-neighbour galaxies (dashed orange lines) determined from the incomplete training sample. We note that while the nearest-neighbour magnitude distributions fail to match the fainter magnitudes in all the filters, the colour distributions are close to being recovered in comparison with those shown in Fig. <ref type="figure">1</ref>. The matching of the colour distributions between the validation sample and their nearest neighbours in the incomplete training sample may be a necessary condition to produce a reliable photometric redshift distribution. However, it may not be sufficient, because there may be galaxies with colour combinations not covered by the training sample.</p><p>In order to study the effect of incompleteness and how to detect it, we carried out a principal component analysis (PCA). The PCA was performed with the magnitudes in the ugrizJHKs bands. 
The first principal component (PC1) of this sample represents 92.8% of the variance of the validation sample in the magnitude space, while the percentage increases to 98.1% with the second component (PC2). We represent the density map of the validation sample for the principal components in the upper panel of Fig. <ref type="figure">3</ref>. We have also stored the first two eigenvectors obtained for the validation sample in order to represent the training sample on the same basis. In the bottom panel of Fig. <ref type="figure">3</ref>, the red dots show the scatter of the incomplete training sample represented using the same eigenvectors as the validation sample. Comparing both panels of Fig. <ref type="figure">3</ref>, it can be seen that the incomplete training sample does not cover the full validation sample, but this red area is well delimited by the inner bold black line that corresponds to the region for which DNF_ZSIGMA &lt; 0.1. We note that this plot shows the limitations in determining the photo-zs using the DNF algorithm with an incomplete training sample, but it also shows how DNF_ZSIGMA signals this fact. The outer orange line corresponds to galaxies with DNF_ZSIGMA &lt; 1.0 (that is, 81.3% of the sample). Therefore, we can identify three groups of galaxies. Those galaxies covered by the red dots will have precise photo-zs, since for these galaxies the training sample covers the full range of principal components. On the other hand, DNF tags those galaxies outside the orange limit as having unreliable photo-zs. Therefore, we must study the quality of the photo-zs of the third group: galaxies that are located inside the orange limit but not covered by the training sample (red area).</p></div>
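<div xmlns="http://www.tei-c.org/ns/1.0"><p>The PCA step described above (fit the eigenvectors on the validation sample, then project the training sample onto the same basis) can be sketched with NumPy alone. This is a minimal sketch rather than the analysis code: the function names fit_pca and project are hypothetical, and the decomposition is done with a singular value decomposition of the mean-centred magnitudes.</p>
<p><![CDATA[
```python
import numpy as np


def fit_pca(mags, n_components=2):
    """Fit a PCA on an (n_gal, n_band) magnitude array.

    Returns the per-band mean, the leading eigenvectors (one per row) and
    the fraction of the variance explained by each of them.
    """
    mags = np.asarray(mags, dtype=float)
    mean = mags.mean(axis=0)
    _, sing, vt = np.linalg.svd(mags - mean, full_matrices=False)
    explained = sing**2 / np.sum(sing**2)
    return mean, vt[:n_components], explained[:n_components]


def project(mags, mean, components):
    """Project magnitudes onto previously fitted principal components."""
    return (np.asarray(mags, dtype=float) - mean) @ components.T
```
]]></p>
<p>Calling fit_pca on the validation magnitudes and project on both the validation and training magnitudes places the two samples in the same (PC1, PC2) plane, so that their coverage can be compared directly.</p></div>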
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">Photo-z performance estimation</head><p>We compared the photo-z estimation given by DNF in the validation sample using the incomplete and complete training samples.</p><p>Figure <ref type="figure">4</ref> shows the comparison between the spectroscopic redshift (z_spec) and the photometric redshift (DNF_Z) for the incomplete training sample (left panel) and the complete training sample (right panel). We can see that the complete training sample not only determines photo-z values for a larger number of galaxies compared to the incomplete case (26 883 galaxies vs. 22 651), but also presents a lower bias and Norm 68 (see Table <ref type="table">1</ref>). In addition to the completeness, the number of galaxies in the training sample is a factor that influences the quality of the photo-zs. In Appendix C we have included the results of calculating the photometric redshift using a complete sample with the same number of galaxies as the incomplete sample. The results show that completeness allows more accurate photo-zs to be calculated than in the incomplete case for comparable training sample sizes.</p><p>We studied the behaviour of the photo-z estimation through the mean absolute deviation, the Norm 68, and the outliers. The incompleteness of the training sample affects the metrics. We cannot assume, then, that the metrics (mean absolute deviation, Norm 68, or any others chosen in a study) will have the same behaviour in the validation sample and in the target sample if the training sample exhibits incompleteness. We must keep in mind that the z_spec value of each galaxy is not available when we are calculating the photo-zs for a catalogue, so we will not have these measurements to estimate the precision of the photo-zs. 
Nevertheless, we note that DNF_ZN (nearest-neighbour photo-z) is able to reproduce the z_spec distribution for moderate training incompleteness in the same way that colour distributions are well recovered in the case of incomplete training (Fig. <ref type="figure">2</ref>). In this way, statistical metrics involving z_spec are well represented by DNF_ZN. For this reason, we calculated the mean absolute deviation and the Norm 68, replacing z_spec with DNF_ZN (dashed lines). In both plots, the behaviour of the mean absolute deviation and the Norm 68 can be considered a good approximation of the real value, which changes depending on the training sample. We can take these metrics calculated with DNF_ZN as an upper limit of the real ones. Figure <ref type="figure">6</ref> shows the outliers as a function of the i-band magnitude, mag(i), in the complete training case (blue lines) and the incomplete training case (magenta lines). The fraction of outliers is less than 4% in both cases up to mag(i) = 24, beyond which it starts to increase in the incomplete training case. Finally, we complete this study with the behaviour of the photo-z estimation as a function of the spectroscopic redshift in Appendix D.</p></div>
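<div xmlns="http://www.tei-c.org/ns/1.0"><p>The magnitude-binned comparison described above can be sketched as a single helper that accepts either z_spec or a nearest-neighbour redshift such as DNF_ZN as the reference. The sketch is illustrative only: the function name binned_scatter and the bin edges are hypothetical, the mean absolute deviation is taken as the mean of |&#916;z|, and the second statistic is the plain half-width of the central 68% range (any normalisation applied in the analysis is not reproduced here).</p>
<p><![CDATA[
```python
import numpy as np


def binned_scatter(mag_i, z_ref, z_phot, edges):
    """Scatter statistics of dz = z_ref - z_phot in bins of mag(i).

    z_ref may be z_spec (validation) or a nearest-neighbour proxy.
    Returns an (n_bins, 2) array: mean |dz| and the half-width of the
    central 68% range per bin; empty bins are filled with NaN.
    """
    mag_i, z_ref, z_phot = (
        np.asarray(a, dtype=float) for a in (mag_i, z_ref, z_phot)
    )
    dz = z_ref - z_phot
    which = np.digitize(mag_i, edges) - 1   # bin index of every galaxy

    rows = []
    for b in range(len(edges) - 1):
        sel = dz[which == b]
        if sel.size == 0:
            rows.append((np.nan, np.nan))
            continue
        p16, p84 = np.percentile(sel, [16.0, 84.0])
        rows.append((np.abs(sel).mean(), 0.5 * (p84 - p16)))
    return np.array(rows)
```
]]></p>
<p>Running the helper twice on a validation sample, once with z_spec and once with the nearest-neighbour redshift, shows how closely the proxy curves track the true ones.</p></div>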
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Photometric redshift Deep Fields catalogue</head><p>We want to study the effects of using different training samples on the quality of the photo-zs in the Y3 DES Deep Fields catalogue. To this end, we applied the same analysis developed in Sect. 4 using two training samples. The first training sample contains only galaxies with the highest quality of spectroscopic redshift determination (i.e., with FLAG_DES = 4). In this case, the training sample does not contain galaxies with magnitudes as deep as in the Y3 DES Deep Fields catalogue. In other words, this training sample is of the highest quality but shows a certain incompleteness with respect to the science sample. The second training sample contains galaxies labelled with the spectroscopic redshift quality FLAG_DES &#8805; 3. The inclusion in this training sample of galaxies whose spectroscopic redshift quality is not optimal but still good reduces the problem of incompleteness. In this second case, the training sample reaches the deepest magnitudes of the Y3 DES Deep Fields catalogue. Figure <ref type="figure">7</ref> shows the magnitude and colour distributions of both training samples (in the left panels for the incomplete sample and in the right panels for the semi-complete sample). In order to carry out a similar analysis to that performed in Sect. 4, we selected those galaxies of the Y3 DES Deep Fields catalogue with mag(i) &lt; 28.0 and with a positive flux measurement in the eight filters. This sample contains 1 478 705 galaxies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Assessment of high quality but incomplete training</head><p>For this study, the incomplete training sample is limited to 38 123 galaxies for which the spectroscopic redshift has been determined with very high quality. As we can see in the left panels of Fig. <ref type="figure">7</ref>, this spectroscopic sample is shallower than the Y3 DES Deep Fields catalogue (red lines and blue lines, respectively). We want to know how this incompleteness affects the photometric redshift calculation. Using DNF and selecting galaxies with DNF_Z &gt; 0, DNF_ZN &gt; 0 and DNF_ZSIGMA &lt; 1.0, we have determined the photometric redshift of 1 254 981 galaxies (84.9%) in the Deep Fields catalogue using this training sample.</p><p>Fig. <ref type="figure">8</ref>. Density map as a function of the first and second principal components of the galaxies of Y3 DES Deep Fields (density plot in green and yellow), the galaxies of the training sample (red dots), and the limit in the principal components of the galaxies for which DNF provides a value of photo-z with an uncertainty, DNF_ZSIGMA &lt; 1.0 (orange line) and DNF_ZSIGMA &lt; 0.1 (bold black line), with this training sample. The red blob is less extensive than the green-yellow blob (where the highest density of galaxies is located) when we select only the galaxies with FLAG_DES = 4.</p><p>The left panel of Fig. <ref type="figure">8</ref> shows the density map as a function of the first and second principal components of the Y3 DES Deep Fields catalogue and the galaxies of the training sample (red dots). The orange line encloses the 84.9% of galaxies that pass the cuts defined above. In addition, we overplot another limit (bold black line) using a more stringent cut, namely DNF_ZSIGMA &lt; 0.1, which contains 441 144 galaxies; that is, 29.8%.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Assessment of medium-quality but semi-complete training</head><p>The second training sample used to determine the photometric redshift of the Y3 DES Deep Fields catalogue contains 55 601 galaxies that have magnitudes as deep as in the Y3 DES Deep Fields catalogue but with a different distribution, as is shown in the right panels of Fig. <ref type="figure">7</ref>. We can see in the right panel of Fig. <ref type="figure">8</ref>, corresponding to the principal components (the first two eigenvectors account for 93.59% of the variance of the sample), that the spectroscopic training sample is located in the area where the density of galaxies is higher. Although it does not cover the entire principal component area of the field, DNF provides photo-zs for almost all galaxies in the sample (1 318 960 galaxies, 91.67%), delimited by the orange line in the figure. We plot another limit with a bold black line that represents galaxies with a more stringent cut of DNF_ZSIGMA &lt; 0.1, as we did before (405 854 galaxies, i.e., 28.2%).</p></div>
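<div xmlns="http://www.tei-c.org/ns/1.0"><p>The quality cuts used in Sects. 5.1 and 5.2 amount to a boolean mask over the DNF output columns. The sketch below is a hedged illustration: the function name quality_mask is hypothetical, the column arrays are assumed to be already loaded as NumPy arrays, and the explicit check that DNF_ZSIGMA is non-negative is an extra guard added here so that the -99 sentinel never passes an upper-limit cut.</p>
<p><![CDATA[
```python
import numpy as np


def quality_mask(dnf_z, dnf_zn, dnf_zsigma, max_sigma=1.0):
    """Boolean mask for DNF_Z > 0, DNF_ZN > 0 and DNF_ZSIGMA < max_sigma."""
    dnf_z, dnf_zn, dnf_zsigma = (
        np.asarray(a, dtype=float) for a in (dnf_z, dnf_zn, dnf_zsigma)
    )
    valid_sigma = (dnf_zsigma >= 0) & (dnf_zsigma < max_sigma)  # guard vs -99
    return (dnf_z > 0) & (dnf_zn > 0) & valid_sigma
```
]]></p>
<p>The mean of the mask gives the fraction of the catalogue retained, for instance with max_sigma=1.0 for the orange contour and max_sigma=0.1 for the bold black one.</p></div>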
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.">Performance and comparison of science sets with different training samples</head><p>According to the results obtained, based on the cuts defined in Sect. 4, DNF determines photometric redshifts for slightly fewer galaxies when using the incomplete but high-quality training sample than in the semi-complete case. The question is how the quality of these photometric redshift estimates compares. In other words, is it more important to have high-quality spectroscopic redshifts, or can we slightly relax that condition to cover as much of the magnitude-colour space as possible?</p><p>Fig. <ref type="figure">9</ref> shows the precision of the photo-z estimation by DNF for the Y3 DES Deep Fields catalogue, defined by mean absolute deviation (left panel) and Norm 68 (right panel), as a function of mag(i). It should be noted that the z_spec of each galaxy is not available in the Y3 DES Deep Fields catalogue, so to estimate the mean absolute deviation and Norm 68 we replaced z_spec with DNF_ZN, following the analysis done in Sect. 4.3. We can see that the results obtained by the incomplete, high-quality training (dashed purple lines) and the semi-complete training (blue lines) samples follow a similar behaviour for mag(i) &lt; 24, although they are slightly better for the incomplete, high-quality training. In this case, we obtain a lower error for magnitude-colour areas covered by the spectroscopic sample.</p><p>We have also seen the same behaviour in Sects. 5.1 and 5.2, where the incomplete, high-quality training contains more galaxies with DNF_ZSIGMA &lt; 0.1 even though, globally, the semi-complete training generates more precise photo-zs. For mag(i) &gt; 24, the semi-complete training sample, formed by galaxies with slightly lower-quality spectroscopic redshifts, outperforms the photo-zs of the incomplete training sample formed by the highest-quality spectroscopic redshift galaxies.
The results indicate that completeness plays an important role in determining higher-quality photometric redshift values, as was expected. However, the results also suggest that for specific studies focused on brighter galaxies we may be more interested in using only the redshifts of the highest possible quality in our training.</p><p>Finally, we studied the absolute median deviation and Norm 68 as a function of the redshift for the two training samples; more details can be seen in Appendix D.</p></div>
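<div xmlns="http://www.tei-c.org/ns/1.0"><p>The binned scatter metrics discussed in this section can be sketched as follows. The definitions used here are assumptions, since the metrics are defined in Sect. 4 and not repeated in this section: the residual is taken as the plain difference between the photo-z and the reference redshift, MAD as the median of the absolute deviations from the median residual, and Norm 68 as half the width of the interval between the 16th and 84th percentiles. The reference redshift can be z_spec or, when it is unavailable, DNF_ZN. All function names and toy values are hypothetical.</p>
<eg xml:space="preserve"><![CDATA[
```python
import statistics


def percentile(values, q):
    """Percentile with linear interpolation between order statistics (q in 0-100)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("need at least one value")
    pos = (len(ordered) - 1) * q / 100.0
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] * (1.0 - frac) + ordered[upper] * frac


def mad(deltas):
    """Median absolute deviation around the median (assumed definition)."""
    centre = statistics.median(deltas)
    return statistics.median([abs(d - centre) for d in deltas])


def sigma68(deltas):
    """Half-width of the central 68% interval (assumed definition)."""
    return 0.5 * (percentile(deltas, 84.0) - percentile(deltas, 16.0))


def scatter_by_magnitude(mag_i, z_phot, z_ref, edges):
    """Return (low, high, count, MAD, sigma68) for each non-empty mag(i) bin."""
    results = []
    for low, high in zip(edges[:-1], edges[1:]):
        deltas = [zp - zr for m, zp, zr in zip(mag_i, z_phot, z_ref)
                  if low <= m < high]
        if deltas:
            results.append((low, high, len(deltas), mad(deltas), sigma68(deltas)))
    return results


# Toy values (hypothetical); z_ref stands in for z_spec or DNF_ZN.
mag_i = [22.0, 22.5, 23.0, 24.5, 25.0]
z_phot = [0.50, 0.62, 0.71, 1.10, 1.45]
z_ref = [0.52, 0.60, 0.70, 1.00, 1.60]

binned = scatter_by_magnitude(mag_i, z_phot, z_ref, edges=[22.0, 24.0, 26.0])
for low, high, count, mad_value, s68 in binned:
    print(f"{low:.0f} <= mag(i) < {high:.0f}: N={count}, "
          f"MAD={mad_value:.3f}, sigma68={s68:.3f}")
```
]]></eg>
<p>Running the same function once per training sample and plotting the two sets of bins against mag(i) gives a comparison of the kind shown in Fig. <ref type="figure">9</ref>.</p></div>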
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Comparison between DNF and EAzY</head><p>We estimated the photo-zs of the whole deep fields and added this information to the Y3 DES Deep Fields data<ref type="foot">foot_2</ref>. The training sample used to estimate the photo-zs contains galaxies with spectroscopic redshift information labelled with FLAG_DES &#8805; 3, corresponding to the semi-complete training sample in Sect. 5.2. It is important to note that, when computing DNF in this case, we ignored the spectroscopic redshift of the target galaxy in the training sample in order to provide a homogeneous comparison of all estimates. In addition to the DNF photo-zs <ref type="bibr">(De Vicente et al. 2016)</ref>, the Y3 DES Deep Fields catalogue contains photo-zs determined with the EAzY algorithm <ref type="bibr">(Hartley et al. 2022;</ref><ref type="bibr">Brammer et al. 2008)</ref>. These two methods approach the photometric redshift problem from different perspectives: EAzY determines the photo-zs by fitting a linear combination of template components, while DNF is a machine learning code.</p><p>We analysed the photo-zs obtained using both methods. Firstly, we selected from the Y3 Deep Fields catalogue those galaxies with spectroscopic redshift information, mag(i) &lt; 28.0, and flux measurements in all eight filters. This sample contains 55 198 galaxies and covers a large portion of the total sample, as we can see in the right panel of Fig. <ref type="figure">8</ref>. Figure <ref type="figure">10</ref> (bottom) shows the photo-z values of DNF (x axis) versus EAzY (y axis). We can see that the same bias appears in the right panel; that is, EAzY with respect to SPEC_Z. Therefore, this behaviour seems to come from the EAzY estimation. It may be due to the lack of Y band data. The 4000 &#197; break is poorly constrained from z &#8764; 1 until it starts to enter the J band.
The prior tends to favour a lower redshift and so the point estimates are pulled slightly towards lower redshift. This would be partially alleviated with the full EAzY PDFs.</p><p>In Fig. <ref type="figure">11</ref> (left), we compare the photo-zs provided by both methods for the Y3 DES Deep Fields catalogue. We focus on galaxies with flux measurements in all eight filters and mag(i) &lt; 28.0. This corresponds to a sample of 1 473 381 galaxies. For z &gt; 1 we can see a similar behaviour to that observed with spectroscopic redshifts (right panel of Fig. <ref type="figure">10</ref>). Therefore, this behaviour seems to come from the EAzY estimation. On the other hand, there is a cloud of points below the diagonal around EAzY_Z &#8764; 0.5 that extends along several values of DNF_Z. We can see in Fig. <ref type="figure">11</ref> (right) that the cloud can be removed by applying the quality cut DNF_ZSIGMA &lt; 0.5. In general, DNF_ZSIGMA allows us to detect galaxies with large errors due to bad photometry, degeneracies, or incompleteness.</p><p>Determining the best method to be applied to a scientific sample is non-trivial. <ref type="bibr">Salvato et al. (2019)</ref> point out that machine learning methods outperform template approaches when the training survey is sufficiently complete. However, template methods are more favourable when spectroscopic samples are limited. In the case of DNF and EAzY, the biggest differences appear for z &gt; 1, when the completeness of the training sample is poorer. Nevertheless, the photo-zs generated by DNF present better metrics than those provided by EAzY. According to <ref type="bibr">Salvato et al. (2019)</ref>, template methods work best for high redshifts because of the lack of photometric information with which to construct training samples for machine learning methods.
Likewise, the templates are built on physical assumptions that may not be entirely correct or that have incomplete coverage in certain areas.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Conclusions</head><p>This study is an analysis of how the completeness and spectroscopic quality of the training sample affect the photometric redshift determination using the DNF algorithm. The conclusions are the following:</p><p>1. We have emulated the problem of an incomplete training sample for DNF with the goal of measuring its effects and taking them into account with regard to the photo-z performance. The principal component analysis provides a graphical method of assessing completeness, and DNF_ZSIGMA turns out to be a reliable parameter for separating out the set of galaxies computed with a complete training sample.</p><p>2. We analysed the possibility of substituting z_spec with DNF_ZN to assess &#916;z in the scatter metrics of DNF_Z (MAD(&#916;z) and Norm 68). The results show that DNF_ZN provides an upper limit of the real values. Using this method, the photo-z quality can be estimated when no spectroscopic information is available.</p><p>3. We determined the photo-zs of the Y3 DES Deep Fields catalogue using both a semi-complete training sample with high- and medium-quality redshift spectroscopy and an incomplete training sample with the highest-quality redshift spectroscopy. The results are globally better for the semi-complete sample despite its slightly lower quality. However, the photo-zs improve for the sub-sample whose principal component space is covered by the high-quality incomplete training sample. For faint magnitudes, it seems better to use a training sample with medium-quality spectroscopic redshifts covering deeper magnitudes. This result argues for prioritising training completeness at the expense of slightly sacrificing the quality of the spectroscopic redshifts. The results also suggest that for specific studies focused on brighter galaxies, we may be more interested in using only redshifts of the highest possible quality in our training.</p><p>4. We have compared the photometric redshifts of the Y3 DES Deep Fields catalogue determined with DNF and EAzY. Both methods show a similar behaviour up to z &#8764; 1. For z &gt; 1, DNF outperforms EAzY, which shows some bias at higher redshifts.</p><p>In addition, we studied the behaviour of the photo-z estimation as a function of the redshift for the galaxies of the Y3 Deep Fields catalogue using the incomplete and semi-complete training samples defined in Sect. 5. In this case, as we lack information on the spectroscopic redshift, we have replaced z_spec with DNF_Z. The results of Figs. D.3 and D.4 show that MAD(&#916;z) and Norm 68 get worse for higher redshifts. Both training samples have similar results for z &lt; 1.4. After this value, the semi-complete training sample works better than the incomplete one.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0"><p>Available at https://des.ncsa.illinois.edu/releases/y3a2/Y3deepfields</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_2"><p>Available at https://des.ncsa.illinois.edu/releases/y3a2/Y3deepfields</p></note>
		</body>
		</text>
</TEI>
