<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Environmental control on the distribution of metabolic strategies of benthic microbial mats in Lake Fryxell, Antarctica</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>04/13/2020</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10231810</idno>
					<idno type="doi">10.1371/journal.pone.0231053</idno>
					<title level='j'>PLOS ONE</title>
<idno>1932-6203</idno>
<biblScope unit="volume">15</biblScope>
<biblScope unit="issue">4</biblScope>					

					<author>Megan L. Dillon</author><author>Ian Hawes</author><author>Anne D. Jungblut</author><author>Tyler J. Mackey</author><author>Jonathan A. Eisen</author><author>Peter T. Doran</author><author>Dawn Y. Sumner</author><author>Steven Arthur Loiselle</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Ecological theories posit that heterogeneity in environmental conditions greatly affects community structure and function. However, the degree to which ecological theory developed using plant-and animal-dominated systems applies to microbiomes is unclear. Investigating the metabolic strategies found in microbiomes are particularly informative for testing the universality of ecological theories because microorganisms have far wider metabolic capacity than plants and animals. We used metagenomic analyses to explore the relationships between the energy and physicochemical gradients in Lake Fryxell and the metabolic capacity of its benthic microbiome. Statistical analysis of the relative abundance of metabolic marker genes and gene family diversity shows that oxygenic photosynthesis, carbon fixation, and flavin-based electron bifurcation differentiate mats growing in different environmental conditions. The pattern of gene family diversity points to the likely importance of temporal environmental heterogeneity in addition to resource gradients. Overall, we found that the environmental heterogeneity of photosynthetically active radiation (PAR) and oxygen concentration ([O 2 ]) in Lake Fryxell provide the framework by which metabolic diversity and composition of the community is structured, in accordance with its phylogenetic structure. The organization of the resulting microbial ecosystems are consistent with the maximum power principle and the species sorting model.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Introduction</head><p>The microbial components of ecological communities (the microbiome) provide a large proportion of the genetic novelty and perform a large proportion of the functions of an ecosystem (for example, <ref type="bibr">[1]</ref><ref type="bibr">[2]</ref><ref type="bibr">[3]</ref>). However, many of the methods that are used to explore microbiomes were developed by investigating plant-and animal-dominated ecosystems <ref type="bibr">[4]</ref>. The composition, assembly, and function of microbiomes are considerably more complex than macroscopic processes because there are more individuals, more populations, and thus more possible interactions. Further, the phylogenetic relationships and metabolic capacity of microorganisms are often fundamentally different from plants and animals (e.g., horizontal gene transfer and mixotrophy). If the ecological theories developed from well-studied macroscopic ecosystems are universal, they should apply to microbial ecosystems. Therefore, we can gain insights into the general applicability of ecological theories by studying microbiomes <ref type="bibr">[5]</ref><ref type="bibr">[6]</ref><ref type="bibr">[7]</ref>.</p><p>In all ecosystems, heterogeneity in environmental conditions can greatly affect community membership. The extent to which this holds depends on the extent to which niche selection, drift, speciation or mutation, and dispersal affect taxonomic and functional richness, evenness, and composition <ref type="bibr">[8]</ref>. The explicit effects of these processes have been developed into a metacommunity framework, applicable across spatial scales, and give rise to alternative models (species sorting, neutral theory, patch dynamics, and mass effects), depending on the degree to which each is relevant for a given ecosystem <ref type="bibr">[9,</ref><ref type="bibr">10]</ref>.</p><p>In the species sorting model <ref type="bibr">[10]</ref>, fitness advantages of existing community members limit the survival and growth of immigrants to only those that are most competitive. Therefore, environments with greater habitat heterogeneity have more diverse fitness landscapes and are thus inhabited by a more diverse community than homogenous habitats. Communities that conform to the species sorting model are ones in which species are distributed according to local environmental conditions and community heterogeneity matches environmental heterogeneity. Drift, speciation or mutation, and dispersal effects are damped out by niche selection. Thus, under the species sorting model, we expect to observe community composition changing in response to habitat features, and total habitat heterogeneity in a landscape directly influences community diversity. The species sorting model is the most widely cited as important in shaping microbial community dynamics, especially in aquatic ecosystems <ref type="bibr">[11]</ref><ref type="bibr">[12]</ref><ref type="bibr">[13]</ref>.</p><p>In contrast, the neutral theory model assumes that all organisms in a community are equally suited for their habitat. Therefore drift, speciation or mutation, and dispersal processes dominate over niche selection and community variations do not reflect variations in environment. Thus, neutral theory models produce a wide range of communities that are randomly distributed across heterogeneous habitats. Neutral dynamics have successfully described microbial community assembly in host-associated microbiome studies <ref type="bibr">[14]</ref><ref type="bibr">[15]</ref><ref type="bibr">[16]</ref>.</p><p>The patch dynamics model assumes homogeneous local environments where species coexist due to stochastic extinction and advantageous dispersal. In this model, community composition depends on early colonizers, which induce priority effects <ref type="bibr">[17]</ref>. Differences in niche and fitness may exist among community members, but drift and dispersal dominate, leading to a community that does not depend on environmental heterogeneities. Under the patch dynamics model, populations are randomly distributed across relatively homogenous habitats, with greater diversity than expected from the homogenous landscape. Patch dynamics models are consistent with some studies of the human microbiome, which show significant diversity across individual patients or body sites that are interpreted as due to colonization history <ref type="bibr">[18]</ref>.</p><p>Finally, the mass effects model applies when dispersal overwhelms and masks selection, drift, and speciation or mutation, creating uniform communities composed of the same dominant organisms irrespective of environment. The mass effects model produces ecosystems in which community composition varies across different habitats and geography according to dispersal from parent communities. For example, the microbial composition of arctic streams are similar to the soils from which they originate near headwaters due to mass effects, but change as geographic distance and environmental heterogeneity increase <ref type="bibr">[19]</ref>.</p><p>The metacommunity framework explicitly describes community membership, but necessarily applies to functional aspects of communities as well. The means by which selection, drift, mutation, and dispersal affect community membership are through individuals' traits. The species sorting, neutral theory, patch dynamics, and mass effects models all hinge on the relative fitness of community members, which is determined by how well their phenotypes (functions) allow them to survive and reproduce in a given habitat or under specific environmental conditions.</p><p>These metacommunity models can be used to understand ecological processes in microbial communities that lack macroscopic organisms. Specifically, microbial ecosystems in ice-covered lakes in the McMurdo Dry Valleys (MDVs), Antarctica, serve as natural laboratories to test the extent to which these models can explain community variations as a function of environmental gradients in photosynthetically active radiation (PAR) and oxygen concentration ([O 2 ]). The MDV lake environments are stable on decade-long timescales <ref type="bibr">[20,</ref><ref type="bibr">21]</ref>, containing well characterized PAR and slowly changing [O 2 ] gradients, which lead to predictable habitat heterogeneity <ref type="bibr">[22,</ref><ref type="bibr">23]</ref>. PAR and [O 2 ] gradients are particularly prominent in Lake Fryxell, a perennially ice-covered, density-stratified lake in the Taylor Valley, Antarctica.</p><p>Our prior investigations into the relationships between the phylogenetic structure and taxonomic composition of Lake Fryxell's benthic microbial mats and local environmental conditions demonstrated that PAR and [O 2 ] affect local community membership differently at mmand m-scales <ref type="bibr">[23]</ref>. At the mm-scale, phototrophs dominate top mat layers where they maximize conversion of PAR into chemical energy and suppress &#945;-diversity due to their high population <ref type="bibr">[23]</ref>. The phylogenetic diversity of the underlying non-phototrophic layers increases with depth into the mat, consistent with the maximum power principle, which predicts that communities are structured to optimize energy consumption over time <ref type="bibr">[23,</ref><ref type="bibr">24]</ref>. In mat layers where [O 2 ] was saturating, PAR structured the community. At the m-scale however, [O 2 ] positively correlated with diversity and affected the distribution of dominant populations across the three habitats. This suggests that meter-scale diversity is structured by PAR, as predicted by species-energy theory, which posits that areas with greater net primary productivity have more diverse habitats <ref type="bibr">[4,</ref><ref type="bibr">25]</ref>.</p><p>Because both the maximum power principle and species-energy theory require niche selection, prior results suggest that the species sorting model may be most appropriate for describing the benthic mat structure in Lake Fryxell across large-and small-scale PAR and [O 2 ] gradients. Neutral theory models are not appropriate because the communities systematically vary along environmental gradients. Similarly, the stratification of lake water means that the transport of organisms within Lake Fryxell is likely too low for populations to be controlled by mass effects. Finally, since the landscape features (PAR, [O 2 ]) are heterogeneous, the patch dynamics model does not apply <ref type="bibr">[23]</ref>.</p><p>Because species sorting was found to be an appropriate model for the phylogenetic diversity and taxonomic composition of Lake Fryxell's benthic microbial mats, we tested whether patterns of metabolic capacity reflect the environmental conditions in Lake Fryxell across lake depth and through mat layers, also consistent with the species sorting model. Recent work has found that different ecological processes may influence phylogenetic and metabolic composition and diversity in microbial communities <ref type="bibr">[26]</ref>. Indeed, due to the modular structure of cellular biochemistry <ref type="bibr">[27]</ref>, it may be the case that metabolic structure is more directly affected by environmental conditions than phylogenetic structure, which is additionally influenced by species-species interactions <ref type="bibr">[28]</ref>. Application of the species sorting model to metabolic capacities would mean that the local distributions of PAR and [O 2 ] dictate the local metabolic capacity of the mats, similar to the distribution of species.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Materials and methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Site description</head><p>Lake Fryxell (77&#730;36&#180;S 162&#730;6&#180;E) is a physically stratified, low-nutrient habitat in the McMurdo Dry Valleys (MDV), Antarctica. It is 5 km x 1.5 km in extent and the maximum depth is approximately 20 m <ref type="bibr">[20]</ref>. Water is supplied to Lake Fryxell by 13 glacial melt-water streams primarily sourced from the Canada and Commonwealth glaciers <ref type="bibr">[29]</ref>. Water balance is achieved by evaporation and ablation from the surface; there are no out-flowing streams <ref type="bibr">[30]</ref>.</p><p>Environmental conditions in Lake Fryxell are strongly affected by a 4-5 m thick perennial ice cover <ref type="bibr">[31]</ref>. During the summer, the ice cover transmits approximately 1% of incident irradiance <ref type="bibr">[22]</ref>, which provides the lake's primary energy influx. Light reaching the benthic surface of Lake Fryxell declines with increasing depth in the water column but is adequate to support photosynthesis in surface layers under anoxic water to depths of 10.4 m during the summer months (Fig <ref type="figure">1</ref>). The ice cover inhibits wind mixing and gas equilibration between lake water and the atmosphere. The lack of mixing produces stable density stratification, as demonstrated by conductivity profiles <ref type="bibr">[21,</ref><ref type="bibr">22]</ref> (Fig <ref type="figure">1</ref>). The stratification limits the transport of nutrients and redox pairs to diffusion and creates stable redox and nutrient gradients in the water column <ref type="bibr">[22,</ref><ref type="bibr">32]</ref>. Temperature varies from 2.4 to 2.7&#730;C and pH varies from 7.50 to 7.52 along a lake-bottom transect through the oxycline <ref type="bibr">[22]</ref>. As lake water freezes during winter, oxygen and other gases are excluded from the underside of the ice cover, building to gas supersaturation in shallow waters. Oxygen concentration declines with depth, and oxygen is absent from the water column below approximately 9.8 m. The oxygen limit is therefore partially determined by the ice cover. Lake Fryxell's robust planktonic microbial community thrives near the oxic-anoxic transition (9-10 m), coincident with the deep chlorophyll maximum and the nutricline <ref type="bibr">[33]</ref>. Centimeter-to-decimeter-scale thick microbial mats exhibiting a variety of pigments and morphologies grow on the benthic surface of the lake to depths of at least 10.5 m <ref type="bibr">[22]</ref>, affecting the seasonal redox conditions near the oxycline via oxygenic photosynthesis and respiration. In late spring, a seasonal oxygen oasis forms at approximately 9.8 m and [O 2 ] varies significantly through the microbial mats by lake depth, according to microelectrode profiles: 650-825 &#956;mol O 2 / L to at 9.0 m, and 0-50 &#956;mol O 2 / L at 9.8 m <ref type="bibr">[34]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Sampling</head><p>The benthic microbial mats in Lake Fryxell were sampled in November 2012 as permitted by the New Zealand Minister of Foreign Affairs, described by <ref type="bibr">Jungblut et al. (2016)</ref>. Sampling was performed at 9.0, 9.3, and 9.8 m depths along a transect that was installed in 2006 <ref type="bibr">[37]</ref>. At 9.0 m, top layers were exposed to PAR, and middle and bottom layers were not exposed to PAR; [O 2 ] was saturated in all layers <ref type="bibr">[22,</ref><ref type="bibr">34]</ref>. At 9.3 m, all layers were exposed to PAR due to mat topography; top and middle layers were exposed to oxygenated water, but the bottom layers were anoxic <ref type="bibr">[22]</ref>. At 9.8 m, film and top layers were exposed to PAR; film and top samples were seasonally exposed to O 2 <ref type="bibr">[22,</ref><ref type="bibr">34]</ref>. All sampling and dissection were performed using sterile technique. Divers retrieved samples from the bottom of the lake by cutting samples out of in situ mats using a spatula and lifting them into plastic boxes underwater. Upon delivery to the surface, multiple samples from each depth were dissected according to layer pigmentation and morphology. The samples were preserved in the field immediately after sampling using an Xpedition Soil/Fecal DNA MiniPrep kit (Zymo Research, Irvine, CA), stored on ice for the remainder of the field season, and shipped frozen to University of California, Davis where they were stored at -80&#730;C until DNA was extracted <ref type="bibr">[22]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Metagenomic sequencing</head><p>DNA was extracted using an Xpedition Soil/Fecal DNA MiniPrep kit (Zymo Research, Irvine, CA) as per manufacturer instructions from biological and technical replicates of 10 sample types (S1 Table; <ref type="bibr">[23]</ref>). Metagenomic sequencing was performed at the University of California, Davis Genome Center DNA Technologies Core (<ref type="url">http://dnatech.genomecenter.ucdavis.edu/</ref>) using the Illumina HiSeq 2500, PE 250 platform. Library preparation was performed using Illumina's Nextera DNA Kit (Oligonucleotide sequences &#169; 2007-2013 Illumina, Inc.). Reads were quality filtered to Q20, and forward and reverse reads were joined using PEAR v0.9.6 <ref type="bibr">[38]</ref>. Downstream analyses included only biological replicates with greater than 10,000 reads.  <ref type="bibr">[36]</ref>. C) Oxygen concentration, conductivity, PAR, and oxygen saturation at 0&#730;C along a benthic mat transect in Lake Fryxell in November 2012 <ref type="bibr">[22]</ref>. The linear increase in conductivity indicates stably density-stratified waters, and the oxygen saturation line shows areas of the lake that are oxygen-supersaturated. <ref type="url">https://doi.org/10.1371/journal.pone.0231053.g001</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Bioinformatics</head><p>Humann2 <ref type="bibr">[39]</ref> was used to characterize metabolic genes from all domains, using the Cho-coPhlan and UniRef databases. The comprehensive UniRef50 clusters <ref type="bibr">[40]</ref> were used within Humann2 to identify proteins. Gene families discovered using Humann2 were normalized using copies per million (CPM), which allows a direct comparison across samples <ref type="bibr">[39]</ref>.</p><p>The distribution of specific metabolic pathways was evaluated by comparing the proportion of metabolic marker genes mapping to each community. Microbial metabolism drives the biogeochemical cycles of all major elements on Earth, including the oxygen, carbon, nitrogen, and sulfur cycles <ref type="bibr">[41]</ref><ref type="bibr">[42]</ref><ref type="bibr">[43]</ref><ref type="bibr">[44]</ref><ref type="bibr">[45]</ref><ref type="bibr">[46]</ref><ref type="bibr">[47]</ref><ref type="bibr">[48]</ref>. We chose genes within these pathways as representative of major metabolic processes (nitrogen fixation, the Calvin Cycle, oxygenic photosynthesis, etc). Genes marking metabolisms of interest (Table <ref type="table">1</ref>) were chosen for their lack of pathway ambiguity, phylogenetic breadth, and importance in major element cycles. Gene families were regrouped and assigned to their Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology (KO) <ref type="bibr">[49]</ref>.</p><p>When calculating CPM, unmapped and ungrouped reads were carried forward. Unmapped reads are those which did not align during either nucleotide or translated searches. Ungrouped reads are those that did not match any features in KEGG <ref type="bibr">[39]</ref>. CPM for reads that both mapped and grouped was then normalized to percent grouped for downstream analyses.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Statistical analyses</head><p>Alpha diversity was calculated using Simpson's index of diversity directly on gene families, as called by Humann2. Significant differences in metabolic marker genes, and gene family alpha diversity between samples were determined using Permutational Multiple Analysis of Variance (PERMANOVA) in R v3.3.2 <ref type="bibr">[50]</ref><ref type="bibr">[51]</ref><ref type="bibr">[52]</ref> using R package vegan v2.5-5 <ref type="bibr">[23,</ref><ref type="bibr">53]</ref>. Samples determined to differ significantly in alpha diversity, as per PERMANOVA implemented via the adonis function, were then subjected to Tukey's Honest Significant Difference (Tukey's HSD) test <ref type="bibr">[54]</ref> in R v3.3.2 <ref type="bibr">[52]</ref> to establish which genes differed between depths and between layers at each depth.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Bioinformatics</head><p>Metagenomic sequencing yielded approximately 5 x 10 9 bp per sample type (S2 Table ). On average, approximately 34% of the metagenomic reads mapped to the RefSeq50 database. Of the reads that mapped, approximately 74% were grouped in KEGG as KOs (Table <ref type="table">2</ref>). Approximately 8% of total reads mapped to the RefSeq50 database and grouped as KOs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Gene family diversity</head><p>Gene family diversity varied with lake depth and mat layer, from approximately 0.6 to 0.95, as measured by Simpson's Index of Diversity (Fig <ref type="figure">2</ref>). ANOVA demonstrated several key differences in the diversity of genes present across depths and layers (Table <ref type="table">3</ref>). At 9.0 m, the top layer is significantly less diverse than all other samples except the top layer at 9.3 m. At 9.8 m, the film and top layers are significantly more diverse than all other samples. Gene family diversity increased with lake depth. At 9.0 and 9.3 m, alpha diversity increased from the top to bottom layers, whereas at 9.8 m, alpha diversity decreased through mat layers (Fig <ref type="figure">2</ref>). Phylogenetic and metabolic diversity are correlated in only three samples (S3 Table ).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Metabolic marker gene presence, absence, and relative abundance</head><p>To explore how gene family diversity correlated with environmental parameters, specific metabolic marker genes (Table <ref type="table">1</ref>) were chosen to represent distinct metabolic strategies. Some samples lacked one or more of the genes representing these strategies, and where metabolic genes were present, their relative abundances varied among depths and mat layers (Table <ref type="table">4</ref>).</p><p>The predicted capacity to perform photosynthesis and fix carbon decreases through mat layers at all depths (Fig <ref type="figure">3</ref>). The potential for oxygenic photosynthesis (psbA) was present in all sample types; however, the relative abundance of this gene was greater in top, illuminated layers than in dark bottom layers at all depths (Fig 3 <ref type="figure"/>and<ref type="figure">Table 4</ref>). Similarly, Calvin cycle carbon fixation (rbcL) was consistently present in all samples and decreased in relative abundance through the layers at all depths (Table <ref type="table">4</ref>). The capacity for oxygenic photosynthesis and carbon fixation were strongly correlated (Fig <ref type="table">3</ref> and<ref type="table">Table 5</ref>). This correlation is unsurprising considering that both genes are often present in organisms capable of oxygenic photosynthesis <ref type="bibr">[55]</ref>, though rbcL is not found exclusively in oxygenic phototrophs. Little evidence of the capacity for anoxygenic photosynthesis (pufL) was found; pufL was only identified in middle layers at 9.0 and 9.3 m where PAR is very low (Table <ref type="table">4</ref>). The capacity for alternative anoxygenic photosynthesis strategies (pscA) were absent from all samples. Additionally, the capacity for polysaccharide hydrolysis (amyA) was present in all samples and had the highest relative abundance where PAR was highest (Table <ref type="table">4</ref>).</p><p>With the exception of aerobic respiration (ccoNO), the relative abundances of genes encoding respiration and major nutrients such as nitrogen, phosphorus, and sulfur assimilation functions correlated with [O 2 ] (Table <ref type="table">5</ref>). The capacity for aerobic respiration was consistently high at all depths and in all mat layers, irrespective of environmental availability of O 2 (Fig 4  and Table <ref type="table">4</ref>). Anaerobic respiration genes increased in relative abundance through layers at all depths (Fig 5 <ref type="figure"/>and<ref type="figure">Table 4</ref>). Dissimilatory nitrate reduction (nrfA) and denitrification (nosZ) genes were the most abundant genes encoding the use of electron acceptors other than oxygen (Fig <ref type="figure">5</ref> and Table <ref type="table">4</ref>). The capacity for sulfate respiration (as indicated by the relative abundance of soxC) increased through mat layers at all lake depths (Fig <ref type="figure">5</ref>). The relative abundance of sulfate reduction via aprB was more variable, but also increased through mat layers (Table <ref type="table">4</ref>); aprB was found in far greater relative abundance in the bottom layer at 9.8 m than in any other sample type (Fig 5 <ref type="figure"/>and<ref type="figure">Table 4</ref>). The gene hdrB is generally associated with methanogenesis but possibly also indicates a capacity for flavin-based electron bifurcation <ref type="bibr">[56]</ref>. It was relatively abundant at 9.8 m and also detectable in 9.3 m samples (Fig 6 <ref type="figure"/>and<ref type="figure">Table 4</ref>) even though methanogens were not identified in these samples <ref type="bibr">[23]</ref>. Tukey's post-hoc test revealed that hdrB relative abundance varied significantly between the 9.8 m film and all layers from 9.0 m and 9.3 m (Table <ref type="table">6</ref>). The relative abundance of hdrB strongly co-varied with genes for oxygenic photosynthesis and carbon fixation: Pearson's correlation coefficients between hdrB and psbA or rbcL are 0.897 and 0.877, respectively. The capacity for methanogenesis (hdrD) was absent from all samples. Methanotrophy genes (mdh2) were only detected in the film at 9.8 m (Table <ref type="table">3</ref>).</p><p>Nutrient assimilation trends were specific to lake depth and mat layer. Nitrogen fixation capacity (nifH) was absent in 9.0 m samples but present in some 9.3 m and 9.8 m mat layers (Table <ref type="table">4</ref> and <ref type="bibr">Fig 7)</ref>. Assimilatory nitrate (nasA) and sulfate reduction (cysI) genes were found in consistent relative abundance throughout all mat layers (Fig 7 <ref type="figure"/>and<ref type="figure">Table 4</ref>). Higher relative abundance of the capacity to substitute nitrogenous groups into membrane lipids (btaA; <ref type="bibr">[57]</ref>), were found in the film, top, and middle layers at 9.8 m (Fig 7 <ref type="figure"/>and<ref type="figure">Table 4</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Discussion</head><p>Photosynthetically active radiation correlated with key metabolic genes in Lake Fryxell, specifically the capacity for oxygenic photosynthesis and carbon fixation. Oxygenic photosynthesis genes are most abundant in the top layers at each depth, consistent with greater PAR at mat surfaces and prior studies of phylogenetic data <ref type="bibr">[22,</ref><ref type="bibr">23]</ref>. Photosynthesis requires PAR, so the decreasing relative abundances of psbA with layers into the mat and from 9.0 to 9.3 m is consistent with the utility of photosynthesis where there is light (Figs <ref type="figure">1</ref> and<ref type="figure">3</ref>). However, the proportion of psbA in surface mat layers did not correlate directly with PAR across all lake depths. The amount of PAR reaching the mats growing at 9.8 m is just above the threshold for net photosynthetic production <ref type="bibr">[22,</ref><ref type="bibr">34]</ref>, yet samples from the film and top mat layers have the highest relative abundance of psbA of all depths (Fig <ref type="figure">3</ref>). The single Cyanobacterial lineage Phormidium  pseudopristleyi dominates these samples <ref type="bibr">[22,</ref><ref type="bibr">23]</ref>. The high population density of this organism likely explains the disproportionate representation of psbA. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Energy capture and use: Photosynthesis, respiration, and flavin-based electron bifurcation</head><p>The high relative abundance of the capacity for oxygenic photosynthesis overall supports previous studies indicating that oxygenic photosynthesis is the most ecologically important energy  capture mechanism available to the communities in Lake Fryxell at depths where PAR is available <ref type="bibr">[22,</ref><ref type="bibr">23]</ref>. The relative abundances of psbA and rbcL genes have a Pearson's correlation coefficient of 0.998 (Table <ref type="table">5</ref>), which is consistent with them being hosted in the same organisms, likely Cyanobacteria which are fixing the most carbon and generating the most biomass in the lake. The relative abundance of the capacity for polysaccharide hydrolysis (amyA) correlated with those for oxygenic photosynthesis and carbon fixation at only 9.0 m (S4 Table ), where mats are oxygenated to a greater extent than at any other depth, and likely throughout the year <ref type="bibr">[34]</ref>. Psychrophilic organisms that encode amyA are generally aerobes <ref type="bibr">[58]</ref><ref type="bibr">[59]</ref><ref type="bibr">[60]</ref> and may be more efficient at polysaccharide hydrolysis in oxic environments <ref type="bibr">[61]</ref>.</p><p>The relative scarcity of genes encoding anoxygenic photosynthesis (absence of pscA and very low relative abundance of pufL) is interesting in the context of previous work indicating that anoxygenic phototrophs are often abundant in low-light environments (e.g., <ref type="bibr">[62,</ref><ref type="bibr">63]</ref>). Anoxygenic phototrophs that use pufL are part of the planktonic community in Lake Fryxell <ref type="bibr">[64,</ref><ref type="bibr">65]</ref>, and also have been detected in MDV Lake Vanda <ref type="bibr">[66]</ref>, but appear to be absent from MDV Lake Joyce <ref type="bibr">[67]</ref>. The low relative abundances of pufL and absence of pscA may be related to the spectrum of light reaching the benthic surface of Lake Fryxell. The absorption spectrum of bacteriochlorophyll is near 700 nm <ref type="bibr">[68]</ref> and the majority of light reaching the mats in icecovered lakes is shorter wavelength due to increasing attenuation of longer wavelengths with depth <ref type="bibr">[69,</ref><ref type="bibr">70]</ref>. The paucity of light at wavelengths suitable for anoxygenic phototrophs may render anoxygenic phototrophy an ineffective metabolic strategy, consistent with both the paucity of pufL and pscA genes in general, as well as their absence at 9.8 m. In Lake Joyce, the penetration of irradiance through the ice cover is also low, between approximately 0.4% and 4% <ref type="bibr">[67]</ref>. In Vanda, approximately 16% of incident irradiance penetrates the ice cove <ref type="bibr">[71]</ref>. Thus, it appears that PAR wavelength attenuation contributes to habitat suitability for anoxygenic phototrophs in MDV lakes.</p><p>Where sufficient O 2 is available, aerobic respiration is the most efficient means of ATP generation for organisms. In Fryxell's benthic mats, no statistically significant difference in the capacity for aerobic respiration, as measured by ccoNO relative abundance, exists between habitats where oxygen is constantly available, those where it is seasonally available, and those where it is constantly absent (Fig <ref type="figure">4</ref>). The widespread capacity for aerobic respiration across [O 2 ] in Fryxell mats may be attributable to the fact that bacteria can perform aerobic respiration at nanomolar concentrations of O 2 using terminal oxidases with a high-affinity for O 2 (ccoNO) <ref type="bibr">[72]</ref>. Although the heterogeneity of anoxic environments has not been directly characterized in Fryxell mats, it is likely that micro-oxic and anoxic sub-habitats are more common as oxygen declines with depth in the lake and into the mats <ref type="bibr">[34]</ref>. In such habitats, genes for both aerobic and anaerobic respiration are likely maintained because enough oxygen heterogeneity exists both spatially and temporally to make both strategies valuable. Anaerobic respiration using nitrate and sulfate appear to be viable strategies at all depths (Fig <ref type="figure">5</ref>). The greater relative abundance of nitrogen respiration genes over assimilatory nitrate reduction genes in Fryxell (Table <ref type="table">4</ref>) may indicate the importance of nitrogen species as electron acceptors. Testing expression patterns of nitrogen cycling genes in shoulder and winter seasons would allow a better understanding of the effects of strong seasonality, especially availability of PAR and [O 2 ], has on these communities.</p><p>While photosynthesis and aerobic respiration are the dominant energy metabolisms in Lake Fryxell, mats at 9.8 m show an interesting possible alternative metabolic strategy, as represented by the relative abundance of hdrB genes. hdrB encodes a subunit of a cytoplasmic complex that reduces two thiol coenzymes <ref type="bibr">[73]</ref>, which is crucial to methane production in methanogens that have been found in Fryxell's planktonic community <ref type="bibr">[74,</ref><ref type="bibr">75]</ref>. hdrB is strictly inhibited by oxygen <ref type="bibr">[76]</ref>. However, in Fryxell mats, hdrB homologs were found in statistically higher relative abundance in the 9.8 m film sample type (Table <ref type="table">6</ref>) where the mats are anoxic only during the winter months <ref type="bibr">[34]</ref>. Phylogenetic markers of methanogens are absent in samples with high relative abundances of hdrB <ref type="bibr">[22,</ref><ref type="bibr">23]</ref>, suggesting hdrB is hosted in non-methanogens. Interestingly, hrdB is present in some sulfate reducing bacteria <ref type="bibr">[77]</ref><ref type="bibr">[78]</ref><ref type="bibr">[79]</ref> and may be necessary for energy generation among diverse anaerobes <ref type="bibr">[56]</ref>. In these organisms, hdrB is part of an enzyme complex called flavin-based electron bifurcation that acts as an alternative to both substrate level phosphorylation (fermentation) and electron transport <ref type="bibr">[80]</ref>. In Fryxell mats, hdrB appears to mark capacity for flavin-based electron bifurcation in sulfate reducers rather than methane production, the first ecological evidence of this function of hdrB to our knowledge.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Nutrient cycling and limitation</head><p>Nitrogen fixation capacity in Lake Fryxell appears to be limited by local [O 2 ] as nifH is absent from mats continuously exposed to oxic water <ref type="bibr">(Fig 4)</ref>. Typically in microbial mats, nitrogen fixation and ammonium and nitrate assimilation are performed by community members living near the surface of a mat that is illuminated and oxygenated <ref type="bibr">[81]</ref>, particularly by Nostoc spp. <ref type="bibr">[82]</ref>. Many Antarctic mat ecosystems have a greater apparent capacity for nitrogen fixation than we found here, especially where Nostoc spp are in high abundance <ref type="bibr">[81]</ref>. However, Nostoc spp. are rare in Fryxell's mats <ref type="bibr">[22,</ref><ref type="bibr">23]</ref>. Nitrogen fixation in non-heterocystous cyanobacteria occurs at night, when oxygen is no longer being generated and depleted from the cells <ref type="bibr">[83,</ref><ref type="bibr">84]</ref>. The absence of dark conditions during the Antarctic summer leads to the continuous production of oxygen by cyanobacteria, which inhibits nitrogen fixation. Thus the polar latitude of Lake Fryxell may significantly limit nitrogen fixation above the oxycline even if the communities contained the capability to do so, consistent with previous metagenomic results <ref type="bibr">[85]</ref>. Further, Fryxell's water column above the oxycline contains less than 1 &#956;g / L nitrate or ammonium <ref type="bibr">[22]</ref>, leading to the hypothesis that the planktonic microbial community is also limited by nitrogen <ref type="bibr">[20]</ref>. Given the low relative abundance of nifH, Lake Fryxell mats above the oxycline are also likely nitrogen limited, whereas water column nitrate and ammonium levels rise below the oxycline <ref type="bibr">[22]</ref>. In contrast to the likely inhibition of nitrogen fixation in the O 2 supersaturated mats at 9.0 m, the absence of nifH in the top layer at 9.8, where mats are only weakly oxic seasonally, may be due to the high population density of the Phormidium, which often lacks the ability to fix nitrogen <ref type="bibr">[86]</ref>. In the bottom layer at 9.8 m, where the capacity for nitrogen fixation could be attributable to heterotrophic bacteria <ref type="bibr">[87]</ref>, the absence of nifH is likely due to low availability of energy for nitrogen fixation, which requires an abundance of ATP <ref type="bibr">[88]</ref>. In contrast, the low-light environment in the bottom layers at 9.3 m may provide enough PAR to support nitrogen fixation, and nifH is detectable in this layer ( <ref type="bibr">Fig 7)</ref>.</p><p>Nitrogen and phosphorus cycling in planktonic communities in Lake Fryxell were recently investigated by <ref type="bibr">[89]</ref>, who found evidence that nitrogen and phosphorus are co-limiting. The relative availability of nitrogen versus phosphorus can affect the substitution of nitrogenous groups for phosphate groups in membrane lipids <ref type="bibr">[57]</ref>, a process that requires the gene btaA. The increased relative abundance of membrane phosphorus substitution genes at 9.8 m relative to samples with lower predicted nitrogen availability may indicate a switch in nutrient limitation from nitrogen to phosphorus at the oxycline. Mats growing below the oxycline in Fryxell have nitrogen available to them both through nitrogen fixation via nifH and water column nitrate and ammonium levels rise faster than dissolved reactive phosphorus below the oxycline <ref type="bibr">[22]</ref>. Thus, variations in water column chemistry and the distribution of btaA indicate that there is likely spatial variability in nutrient availability.</p><p>In contrast to nitrogen cycling, microbial sulfur cycling occurs across a range of oxygen concentrations, and sulfur oxidation and reduction are typically performed throughout microbial mats <ref type="bibr">[83]</ref>. Assimilatory sulfate reduction is required for incorporation of sulfur into amino acids (biomass) in the absence of sulfide, whereas dissimilatory sulfate reduction is a means of anaerobic respiration. In general, dissimilatory sulfate reduction is an important anaerobic metabolism in microbial mats, especially where cyanobacteria generate low molecular-weight organics as substrates <ref type="bibr">[84]</ref>. However, assimilatory sulfate reduction genes are found in greater relative abundance than dissimilatory sulfate reduction genes in Lake Fryxell (Figs 5 and 7 and Table <ref type="table">4</ref>). The difference in relative abundance of sulfate reduction genes in Fryxell mats may indicate that sulfate is primarily used for biomass generation rather than respiration.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>The species sorting model applied to metabolic composition and diversity</head><p>Analyses of taxonomic composition and phylogenetic diversity suggested that the species sorting model is the most appropriate for describing benthic mat structure in Lake Fryxell across large-and small-scale PAR and [O 2 ] gradients <ref type="bibr">[23]</ref>. Therefore, we expected the metabolic strategies of the mat communities to also closely match the local heterogeneity of PAR and [O 2 ] at the millimeter-and meter-scales. Understanding the metabolic capacity of the Fryxell's mat communities across the gradients of PAR input and [O 2 ] is crucial to understanding the processes driving community composition because fitness is dictated by individuals' traits.</p><p>Gene family diversity trends support the hypothesis that the species sorting model can be appropriately applied to the communities in Lake Fryxell. We found that gene family diversity increased at the meter-scale across the lake floor and at the millimeter-scale through mat layers at 9.0 and 9.3 m, negatively correlating with PAR. Likely, the genes needed for oxygenic phototrophy, the dominant metabolic strategy in the top layers at 9.0 and 9.3 m (Table <ref type="table">4</ref> and<ref type="table">Fig 3)</ref>, suppress gene family diversity, which is relieved as phototrophy becomes less dominant through mat layers. This is consistent with phylogenetic and taxonomic results of these samples <ref type="bibr">[22,</ref><ref type="bibr">23]</ref>, and supports the interpretation that the communities through the layers at 9.0 and 9.3 m are organized to maximize energy capture <ref type="bibr">[23,</ref><ref type="bibr">24]</ref>. The proportions of metabolic genes change as PAR decreases, indicating that the metabolic capacity of the mats at 9.0 and 9.3 m is structured by the local environmental conditions. In contrast, gene family diversity decreased through mat layers at 9.8 m, where [O 2 ] varies the most seasonally. Gene family diversity is also greatest in the film at 9.8 m. Samples from the top layer at 9.8 m show strong negative correlation between phylogenetic diversity and gene family diversity (Pearson correlation coefficient -0.790). The phylogenetic diversity in this habitat is quite low, likely due to the highly selective environmental conditions <ref type="bibr">[23]</ref>. This implies that in this seasonally illuminated, seasonally oxic, low-energy, sulfidic environment, gene family diversity is important for survival as habitat conditions change throughout the year. Future investigation into how gene family diversity is distributed among community members in the film and top layers at 9.8 m will likely provide further insight into tradeoffs between fitness and diversity in this habitat.</p><p>The metabolic marker genes that varied significantly between different local [O 2 ] and PAR input are those most important for optimization of energy capture. The relative abundances of genes encoding oxygenic photosynthesis (psbA) and carbon fixation (rbcL) at 9.8 m are greatest where high populations of Cyanobacteria capture the energy available at the mat surface. Cyanobacteria produce O 2 , which drives aerobic respiration and supports other, lower energy metabolisms when the mats become anoxic over winter. For example, organic carbon fixed by photoautotrophs likely supplies the substrates required by organisms using flavin-based electron bifurcation (hrdB), which is O 2 -inhibited and would be active only in the winter. The potential metabolic strategies of Fryxell mats across environments with different energy inputs suggest that they have maximized energy capture consistent with the maximum power principle <ref type="bibr">[24,</ref><ref type="bibr">90]</ref> and the species sorting model.</p><p>Alternative models within the metacommunity framework do not explain the patterns of metabolic diversity and composition in Fyrxell's benthic mats. The patch dynamics model is inappropriate to Lake Fryxell because it requires local habitats conditions to be uniform, which does not conform to variability in PAR and [O 2 ] with depth in Lake Fryxell. The mass effects model would suggest that the metabolic composition of communities on the surface of the mats at each depth would be similar to that of the nearby lake water due to the settling of microorganisms. However, the benthic community is strikingly different from the planktonic community; specifically, the planktonic community contains abundant and diverse purple phototrophic bacteria <ref type="bibr">[65]</ref>, which are absent from the benthic microbial mats. The neutral model would be expected to produce communities that might vary in their metabolic diversity but without any relationship to environmental conditions, and therefore fails to explain the patterns of marker gene distribution along the PAR and [O 2 ] gradients in Lake Fryxell.</p><p>Self-organizing systems such as these microbial communities are structured by their environment across both spatial and temporal scales; the relative abundances of species housing specific metabolic strategies adjust in population to achieve maximum power input given average energy availability throughout the year, with depth into the lake and through mat layers. Phototrophic and heterotrophic populations in Lake Fryxell's benthic community likely change differently over the course of the annual PAR cycle because they occupy different niches. Phototrophs require PAR, and so likely increase in activity in the spring and summer. In the winter, phototrophs generally respond by a combination of entering dormant states, enduring reduced population abundances and loss of biomass via cell death, and shifting to heterotrophy or fermentation <ref type="bibr">[91]</ref>; in MDV lakes, phototrophs may also be buried in mat over years rather than seasons <ref type="bibr">[67,</ref><ref type="bibr">92]</ref>. Heterotrophs, and mixotrophs (seasonally), rely on organic carbon reservoirs built up over the years by the autotrophs. Heterotroph and mixotroph populations in the benthic mats likely shift according to organic carbon quality and quantity throughout the summer and winter, as do populations in the pelagic community <ref type="bibr">[33,</ref><ref type="bibr">93]</ref>. Additionally, both phototrophic and heterotrophic populations living at 9.0 m likely change differently than those at 9.8 m. At 9.0 m, the O 2 saturation of the mats makes aerobic respiration available year-round. But at 9.8 m, the mats are predicted to become anoxic during winter, so other electron acceptors then become important. The increased relative abundance of extremely low-energy strategies such as flavin-based electron bifurcation via hdrB at 9.8 m (Fig <ref type="figure">6</ref>) are evidence that annual variation in PAR further affects the metabolic strategies found in the mats according to local environmental heterogeneity, in this case seasonal energy availability. The metabolic patterns uncovered here are consistent with the species sorting model because spatial and temporal heterogeneity of physicochemical characteristics (PAR, [O 2 ], nitrate, phosphorus, etc.) explain patterns of metabolic genes in Fryxell's benthic mats. Independent evidence suggests that OTU abundances optimize energy capture in Fryxell's planktonic community <ref type="bibr">[75]</ref>, and the same is true for Fryxell's benthic community. An even more extreme example of the applicability of the species sorting model to microbial communities may be found in hot springs in Yellowstone National Park. The hot springs are considerably more constrained than Lake Fryxell, both phylogenetically and metabolically, where the dominant phylogenetic lineage may compose between 63 and 100% by SSU amplicon analyses and [O 2 ] limitation favors hydrogen metabolisms <ref type="bibr">[94,</ref><ref type="bibr">95]</ref>. In contrast, the microbial mats growing in Guerrero Negro are phylogenetically stratified, likely according to PAR and geochemical gradients <ref type="bibr">[96,</ref><ref type="bibr">97]</ref>. At Guerrero Negro, the chemical complexity of the habitat allowed the phylogenetic diversity to map onto environmental heterogeneity. The Guerrero Negro mats are therefore more similar to the stratified and stably heterogeneous environment of Lake Fryxell. These habitats differ in environmental conditions, but all demonstrate the applicability of the species sorting model, and metacommunity theory generally, to frame future research in extreme environments and microbial mat ecosystems.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Conclusions</head><p>Assessment of the gene family diversity and metabolic marker genes indicates that PAR and [O 2 ] control the distribution of potential metabolic strategies in Lake Fryxell. A multivariate statistical analysis of the relative abundance of metabolic marker genes shows that oxygenic photosynthesis, carbon fixation, and flavin-based electron bifurcation are the key metabolic strategies that differentiate mats growing in different environmental sub-habitats. Metabolic marker genes for anaerobic respiration likely result from spatial and temporal heterogeneity in [O 2 ] in Lake Fryxell. Further, the high relative abundance of btaA suggests that microbial mats in Fryxell appear to be phosphorus-, not nitrogen-limited in the anoxic portion of the lake, consistent with water column concentrations of nitrite, nitrate, and soluble reactive phosphorus. Attenuation of red light with depth may explain the dearth of anoxygenic photosynthesis genes. Finally, the pattern of gene family diversity through the mat layers and metabolic marker gene relative abundances of psbA, rbcL, and hdrB correlate strongly with PAR and [O 2 ] and point to the importance of their seasonal fluctuation.</p><p>The spatial heterogeneity of PAR and [O 2 ] in Lake Fryxell provide the foundation for the organisms in Lake Fryxell to organize according to metabolic diversity and composition, similar to their phylogenetic structure <ref type="bibr">[23]</ref>, supporting the maximum power principle as applicable in this microbial ecosystem. More broadly, the species sorting model appears to be applicable to the metacommunity in Lake Fryxell as regards both phylogenetic lineages <ref type="bibr">[23]</ref> and metabolic traits because niche selection (via the maximum power principle) governs which lineages and metabolic marker genes are found in which habitats.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>PLOS ONE | https://doi.org/10.1371/journal.pone.0231053April 13, 2020  </p></note>
		</body>
		</text>
</TEI>
