<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>The genetic architecture of gene expression regulation in a Citrus x Poncirus hybrid tolerant to Huanglongbing</title></titleStmt>
			<publicationStmt>
				<publisher>Frontiers Media SA</publisher>
				<date>09/04/2025</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10656451</idno>
					<idno type="doi">10.3389/fpls.2025.1627531</idno>
					<title level='j'>Frontiers in Plant Science</title>
<idno>1664-462X</idno>
<biblScope unit="volume">16</biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Isaac A Diaz</author><author>Omar Zayed</author><author>Emmanuel Ávila_De_Dios</author><author>Claire Jiang</author><author>Kim D Bowman</author><author>Danelle K Seymour</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[<p>Interspecific hybridization is a common and effective strategy for producing disease resilient citrus cultivars, including those with tolerance to Huanglongbing (HLB) disease. Several HLB-tolerant cultivars have been developed through hybridization of mandarins (<italic>Citrus reticulata</italic>) with their wild relative<italic>Poncirus trifoliata</italic>. One such cultivar, ‘US-897’, exhibits robust tolerance to the bacteria causing HLB disease,<italic>Candidatus Liberibacter asiaticus</italic>(<italic>C</italic>Las). To explore the genetic architecture of the early transcriptional response to<italic>Candidatus</italic>Liberibacter asiaticus (<italic>C</italic>Las) infection in ‘US-897’, we performed transcriptomic analysis of the hybrid and its parents, ‘Cleopatra’ (<italic>C. reticulata</italic>) and ‘Flying Dragon’ (<italic>P. trifoliata</italic>). A haplotype-resolved genome for ‘US-897’ was generated using PacBio HiFi sequencing reads to support quantification of the expression of both the<italic>Citrus and Poncirus</italic>alleles. By profiling gene expression in this parent-offspring trio, we were able to determine the mode of inheritance for genes differentially expressed between parents (‘Cleopatra’ and ‘Flying Dragon’) and their interspecific hybrid (‘US-897’), with the majority genes exhibiting non-additive patterns of gene expression inheritance. Additionally, analysis of allele-specific expression in the hybrid ‘US-897’ revealed the contribution of cis- versus trans-acting regulatory variants on genes with additive and non-additive modes of inheritance. A strong correlation between differential expression between parents and allele-specific expression in ‘US-897’ suggests that cis-regulatory variation is a significant source of expression divergence between species. Finally, genes responsive to infection with<italic>C</italic>Las were identified to explore how gene regulation associated with tolerance to HLB was rewired between<italic>Citrus</italic>and its relative<italic>Poncirus</italic>.</p>]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Introduction</head><p>Interspecific hybridization can improve disease resilience by introducing beneficial alleles from the relatives of cultivated crop species. In long-lived perennial tree crops, like those in the genus Citrus, interspecific hybridization has led to the development of many commercial cultivars with enhanced disease resistance and improved productivity in disease endemic regions. A current major challenge to citrus production is the bacterial disease Huanglongbing (HLB). The most common cause of HLB in the United States is the phloemrestricted bacterial species Candidatus Liberibacter asiaticus (CLas) which infects all commercial citrus types and has major impacts on fruit production (Bove,2 0 0 6 ). CLas is vectored by the Asian citrus psyllid (Diaphorina citri), which first arrived in Florida in 1998 <ref type="bibr">(Hall, 2008)</ref>. HLB has devastated the citrus industry in Florida <ref type="bibr">(Court and Rahmani, 2017)</ref> and genetic solutions, including the development of disease resilient cultivars, are one of the many tools being used to combat HLB disease <ref type="bibr">(Albrecht and Bowman, 2012a;</ref><ref type="bibr">Bowman and McCollum, 2015;</ref><ref type="bibr">Albrecht et al., 2020;</ref><ref type="bibr">Bowman and Albrecht, 2020;</ref><ref type="bibr">Peng et al., 2020)</ref>.</p><p>Poncirus trifoliata is a relative of species in the genus Citrus with tolerance to HLB <ref type="bibr">(Albrecht and Bowman, 2012a)</ref>. Several Citrus x Poncirus hybrids have been shown to have levels of tolerance to CLas similar to P. trifoliata <ref type="bibr">(Albrecht and</ref><ref type="bibr">Bowman, 2011, 2012a;</ref><ref type="bibr">Bowman and McCollum, 2015)</ref>, including reduced bacterial titers and limited development of disease symptoms. One hybrid, 'US-897', developed from a cross between 'Cleopatra' mandarin (C. reticulata)a n d'Flying Dragon',( P. trifoliata) exhibits robust tolerance to CLas infection as an ungrafted tree <ref type="bibr">(Albrecht and Bowman, 2011)</ref>. Although the hybrid 'US-897' can still be infected by the bacterium, it only exhibits subtle symptoms of infection in both greenhouse studies and field evaluations <ref type="bibr">(Albrecht and Bowman, 2011)</ref>. In contrast, CLas titer rapidly increases in 'Cleopatra', the susceptible maternal parent of 'US-897',a f t e r infection and associated disease symptoms include chlorosis, blotchy mottling of leaves, and reduced overall size. The strong resilience of 'US-897' to CLas provides an opportunity to characterize the basis of HLB tolerance in an interspecific hybrid.</p><p>Improved hybrid performance can result from genetic interactions between parental genomes, including those that may alter patterns of gene expression <ref type="bibr">(Chen, 2013)</ref>. In the analysis of parent-offspring trios, gene expression can be classified into additive effects, where hybrid expression is intermediate between the two parents, and non-additive effects such as expression-level dominance and transgressive expression including over-and underdominance <ref type="bibr">(Rapp et al., 2009)</ref>. There has been a long debate over the genetic architecture underlying hybrid traits <ref type="bibr">(Schwartz and Laughner, 1969;</ref><ref type="bibr">Semel et al., 2006;</ref><ref type="bibr">Krieger et al., 2010;</ref><ref type="bibr">Seymour et al., 2016)</ref>, and the relative contribution of loci with dominant <ref type="bibr">(Hua et al., 2003;</ref><ref type="bibr">Garcia et al., 2008;</ref><ref type="bibr">Xiao et al., 2021)</ref>o r overdominant <ref type="bibr">(Redei, 1962;</ref><ref type="bibr">Vrebalov et al., 2002;</ref><ref type="bibr">Krieger et al., 2010;</ref><ref type="bibr">Guo et al., 2014)</ref> contributions to hybrid vigor <ref type="bibr">(Paril et al., 2024)</ref>. Examples of loci with dominant and overdominant effects on hybrid vigor have been identified across many plant species <ref type="bibr">(Semel et al., 2006;</ref><ref type="bibr">Li et al., 2008;</ref><ref type="bibr">Hashimoto et al., 2021)</ref>. Overall, the contribution of dominant and overdominant loci to hybrid vigor seems to vary by species and mating system <ref type="bibr">(Huang et al., 2016;</ref><ref type="bibr">Tian et al., 2019;</ref><ref type="bibr">Hashimoto et al., 2021;</ref><ref type="bibr">Xiao et al., 2021)</ref>. Ultimately, the genetic architecture and patterns of inheritance are important for understanding the regulatory mechanisms underlying hybrid phenotypes, including gene expression.</p><p>The genetic architecture of gene regulatory variation in hybrids includes whether genes are cis-or trans-regulated and can provide insight into the mechanistic basis of hybrid phenotypes <ref type="bibr">(Rockman and Kruglyak, 2006)</ref>. Cis-regulatory variants alter expression of a gene by disrupting regulatory sequences in close proximity to that gene and may include genetic variation in promoters and enhancers. In contrast, trans-regulatory variants perturb the expression of a gene by altering the activity of regulatory molecules, such as transcription factors and regulatory RNAs <ref type="bibr">(Hill et al., 2021)</ref>. Unlike cis-regulatory variants, trans-regulatory variants may not be located in close physical proximity to their target. Profiling gene expression in parent-offspring trios is commonly used to separate cis-and trans-regulatory effects on expression because both parental alleles, and any associated cisregulatory variants, experience a shared trans-regulatory environment <ref type="bibr">(Wittkopp et al., 2004;</ref><ref type="bibr">Goncalves et al., 2012;</ref><ref type="bibr">Pereira and Kuzmin, 2024)</ref>. Cis-regulated genes will exhibit allele-specific expression (ASE) in the hybrid, while trans-regulatory variants will alter the expression of both alleles concordantly <ref type="bibr">(Hill et al., 2021)</ref>. Dissecting the genetic architecture of gene expression can connect expression divergence to phenotypic divergence and can provide valuable insight into the mechanisms underlying hybrid phenotypes.</p><p>Here, we assembled a high-quality, haplotype-resolved reference genome for the HLB-tolerant hybrid 'US-897' and performed transcriptomic analyses of the hybrid and its parents, 'Cleopatra' (C. reticulata)and'Flying Dragon' (P. trifoliata), to investigate the genetic architecture of gene expression regulation, including in response to CLas infection. Haplotype phasing of the 'Cleopatra' and 'Flying Dragon' chromosomes in the diploid genome of 'US-897' enabled quantification of allele-specific expression (ASE) through accurate assignment of RNA-sequencing reads to either the Citrus or Poncirus haplotype, which is superior to relying on a haploid reference genome <ref type="bibr">(Wu et al., 2023)</ref>. By profiling gene expression in this parent-offspring trio, we were able to determine the mode of inheritance for genes differentially expressed between parents ('Cleopatra' and 'Flying Dragon') and their interspecific hybrid ('US-897'). Additionally, analysis of allele-specific expression in the hybrid 'US-897' revealed the contribution of cis-versus trans-acting regulatory variants on genes with additive, dominant, and transgressive inheritance. Finally, genes responsive to infection with CLas were identified to explore the regulatory architecture of pathogen response.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Assembly of a diploid reference genome of F 1 hybrid 'US-897'</head><p>To quantify the expression of both the Citrus and Poncirus alleles in the interspecific hybrid, a haplotype-resolved chromosome-level genome of 'US-897' was assembled from 65.57 Gb of PacBio HiFi sequencing reads (108.75X coverage; mean read length = 15.68 Kb). Parent-specifick -m e r s( k = 1 9 ) ,o r"hap-mers",w e r ei d e n t i fied from whole-genome sequence data for 'Cleopatra' mandarin (C. reticulata) and 'Flying Dragon' (P. trifoliata) and used to phase the Citrus and Poncirus haplotypes in 'US-897' <ref type="bibr">(Wu et al., 2018;</ref><ref type="bibr">Peng et al., 2020;</ref><ref type="bibr">Rhie et al., 2020;</ref><ref type="bibr">Rautiainen et al., 2023)</ref>. For both haplotypes, the nine largest scaffolds had BUSCO scores of 99.0%, and captured 99.2% and 99.8% of parental hap-mers, an indication of assembly completeness (Supplementary Table <ref type="table">S1</ref>). There was no evidence of haplotype switching within scaffolds, confirming that each haplotype was correctly phased in the diploid assembly of 'US-897' (Figure <ref type="figure">1A</ref>). Haplotype 1 scaffolds were inherited from 'Flying Dragon' (P. trifoliata) (282.4 Mb, N50 = 29.9 Mb) and haplotype 2 scaffolds from 'Cleopatra' (C. reticulata) (321.4 Mb, N50 = 34.1 Mb) (Figure <ref type="figure">1A</ref>; Supplementary Table <ref type="table">S1</ref>). A similar number of genes were annotated in each haplotype (Haplotype 1: 33,431; Haplotype 2: 31,602 genes) and, based on BUSCO scores, the annotations included a majority of single copy orthologous genes used to benchmark genome annotations (Haplotype 1: 98.8%; Haplotype 2: 98.4%) (Supplementary Table <ref type="table">S1</ref>). Orthology between the Citrus and Poncirus gene models was inferred based on protein sequence identity and collinearity using GeneTribe <ref type="bibr">(Chen et al., 2020)</ref>. This identified 22,419 reciprocal-best hits that were used for comparisons of allele-specific expression, along with 13,439 orthologous genes (single-best hits) and 6,756 singletons unique to one of the two parental haplotypes (Figure <ref type="figure">1B</ref>). Analysis of allelespecific expression requires identification of gene orthology and this set of 22,419 genes were used to dissect the regulatory architecture of gene expression divergence between Citrus and Poncirus.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Expression level dominance is pervasive in 'US-897'</head><p>To better understand tolerance to HLB disease in 'US-897',we profiled the early transcriptional response of 'Cleopatra', 'Flying Dragon', and 'US-897' to CLas infection. The three genotypes were each infested by Asian citrus psyllids reared either on CLas-infected o rc l e a nc i t r o np l a n t s( C. medica). Infection with CLas was confirmed for five 'Cleopatra', two 'Flying Dragon', and five 'US-897' individuals (Supplementary Table <ref type="table">S2</ref>). To capture an early transcriptional response to CLas infection, leaf samples were collected from newly emerged branches 18 days after bud initiation. Previous work has shown early cellular and biochemical responses to CLas infection, including reactive oxygen species (ROS) production and callose deposition, occur as early as 15 days after inoculation <ref type="bibr">(Ma et al., 2022)</ref>. It is important to note that this sampling scheme captures initial host responses and the relationship between initial response and the robust tolerance of 'US-897' to Huanglongbing disease is unknown. An average of 28.3 million paired-end RNA-sequencing reads were generated per sample (Supplementary Table <ref type="table">S3</ref>). RNA-seq reads were aligned to the haplotype-resolved genome of 'US-897' and the level of expression of the Citrus and Poncirus alleles of each gene were quantified using Salmon <ref type="bibr">(Patro et al., 2017)</ref>. In addition to allelespecific expression, levels of overall gene expression were determined by summing read counts for the Citrus and Poncirus alleles of each orthologous gene. Genes with low levels of expression were removed (CPM less than 2) and the remaining set of 20,981 genes were the basis of differential gene expression analysis. A haplotype-resolved assembly of the interspecifich y b r i d'US-897'. (A) The abundance of parent-specific "hap-mers" (k=19) in each haplotype of the 'US-897' diploid reference genome. Hap-mers were identified from 'Cleopatra' and 'Flying Dragon' whole-genome sequencing reads <ref type="bibr">(Wu et al., 2018;</ref><ref type="bibr">Peng et al., 2020)</ref>. (B) Classification of genes based on their sequence homology and collinearity between Citrus and Poncirus haplotypes of 'US-897'. Reciprocal best hits (RBH) are allelic between haplotypes, single best hits (SBH) are orthologous sequences that are not reciprocal best hits, and singletons are genes that are unique to a haplotype.</p><p>Theanalysisofgeneexpressioninparent-offspring trios is essential for determining patterns of gene expression inheritance. Whether a gene exhibits additive, dominant, or transgressive patterns of inheritance in an F 1 hybrid compared to its two parents can provide mechanistic insights into gene expression divergence between parental alleles. A linear model was constructed to explain overall gene expression by a genotype effect ('Cleopatra', 'Flying Dragon', 'US-897', and mid-parent value), a treatment effect (CLas-inoculated, mock-inoculated), and the interaction between genotype and treatment. A total of 9,522 differentially expressed genes (DEGs) with a significant genotype effect were identified ('Cleopatra' versus 'Flying Dragon', 'Cleopatra' or 'Flying Dragon' versus 'US-897',and'US-897' versus estimated midparent value) (Figure <ref type="figure">2A</ref>, FDR adjusted p-value &lt; 0.05). A majority of these genes <ref type="bibr">(8,</ref><ref type="bibr">807)</ref> were differentially expressed between the parental genotypes (Figure <ref type="figure">2A</ref>). For genes with additive expression the level of expression in the hybrid is similar to the average of parental allelic expression, or the mid-parent expression value. Comparison of gene expression between 'US-897' and the mid-parent value revealed that gene expression patterns in the hybrid deviated from additivity for a set of 784 genes (Figure <ref type="figure">2A</ref>).</p><p>The relative abundance of gene expression across all three genotypes was used to further categorize DEGs with significant genotype effects based on inheritance patterns <ref type="bibr">(Rapp et al., 2009;</ref><ref type="bibr">Almeida-Silva et al., 2023)</ref>. Genes with additive gene expression in the hybrid will be intermediate between the parental levels of expression (similar to the mid-parent value). Genes with nonadditive gene expression in the hybrid will be biased towards expression of one of the two parental alleles (dominant) or is significantly higher or lower than expression of both parental alleles (transgressive). Gene expression patterns in each of these three categories can be further partitioned based on which parental allele is expressed at a higher level. DEGs were grouped into five categories (1 additive, 2 dominant, 2 transgressive) and 12 subcategories based on relative expression differences between the parental alleles (Figures <ref type="figure">2B</ref>, <ref type="figure">C</ref>). For example, subcategories 1 and 12 both include genes with additive inheritance, but category 1 contains genes with reduced expression of the 'Flying Dragon' allele and category 12 includes genes with increased expression of the 'Flying Dragon' allele (Figure <ref type="figure">2C</ref>). The majority of DEGs (80.83%) display expression level dominance, indicating that one of the two parental alleles is commonly favored in the hybrid 'US-897' (Figure <ref type="figure">2C</ref>). Dominance of the maternal 'Cleopatra' allele was more common (43.45% of DEGs) than dominance of the paternal 'Flying-Dragon' allele (37.38% of DEGs) (Figure <ref type="figure">2C</ref>). In Arabidopsis thaliana hybrids, expression level dominance in F 1 hybrids was also common, affecting 90% of DEGs <ref type="bibr">(Yuan et al., 2023)</ref>. The preference for maternal expression bias has also been observed in other studies of parent-offspring trios hybrids <ref type="bibr">(Videvall et al., 2016)</ref>. Examples of DEGs with transgressive patterns of expression were rare (3.18%). Comparison of gene expression patterns in this parent-offspring trio revealed genes with additive, dominant, and transgressive expression patterns, but that expression level dominance was the most pervasive pattern of gene expression in 'US-897'.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Genes with additive and dominant patterns of gene expression are regulated in cis</head><p>To better understand the evolution of gene regulation it is crucial to determine whether gene expression is controlled by cisacting or trans-acting variants. Classification of cis-and transregulatory effects on gene expression can be determined by comparing the expression of parental alleles in an F 1 hybrid. If differential expression between parental species (i.e. C. reticulata and P. trifoliata) is caused by a cis-regulatory variant, allele-specific expression of this gene is expected in the F 1 hybrid. In contrast, a trans-regulatory variant would cause differential gene expression between parental species, but concordant expression of both parental alleles in the hybrid. Cis-regulatory variants are generally considered to be major contributors to expression divergence between different species, while trans-regulatory variants are more commonly associated with expression differences within species <ref type="bibr">(Emerson et al., 2010;</ref><ref type="bibr">Metzger et al., 2017;</ref><ref type="bibr">Signor and Nuzhdin, 2018;</ref><ref type="bibr">Zhang and Emerson, 2019;</ref><ref type="bibr">Coolon et al., 2014)</ref>. To determine if gene expression is controlled in cis or trans we compared DEGs with significant genotype effects to allele-specific expression (ASE) of parental alleles in 'US-897'. Genes with ASE in 'US-897' were identified using the same set of 22,419 orthologous genes between Citrus and Poncirus haplotypes. After removing genes with low expression, monoallelice x p r e s s i o n ,a n di n s u f ficient genetic variation (a limitation in the analysis of ASE), ASE could be evaluated for 15,549 genes. Allele-specifice x p r e s s i o nw a s common in 'US-897' (9,189 genes, FDR-corrected p &lt; 0.05), indicating that cis-regulatory variation has a major role in regulating hybrid gene expression patterns. On average, there was a 97% difference in expression between parental alleles for genes with significant ASE (log 2 (Haplotype 1 (P. trifoliata)/Haplotype 2 (C. reticulata)) = 0.98). Notably, allele-specific expression in 'US-897' explains 41.6% of differential gene expression variation between 'Cleopatra' and 'Flying Dragon' (Figure <ref type="figure">3A</ref>; Spearman's rho = 0.74, R 2 = 0.416). Parental haplotypes have diverged for ~9 million years <ref type="bibr">(Wu et al., 2018;</ref><ref type="bibr">Huang et al., 2023b)</ref>, and our results are consistent with the role of cis-regulatory variation in divergence of gene expression patterns.</p><p>Next, we explored if cis-regulated genes (i.e. ASE genes in 'US-897') corresponded to additive, dominant, or transgressive patterns of expression inheritance. On average, the allele expressed more highly in the hybrid was consistent with parental expression patterns. For example, additively expressed genes with higher expression in 'Cleopatra' (Category 1) exhibit allele expression bias towards the 'Cleopatra' allele in 'US-897' (Figure <ref type="figure">3B</ref>). This trend is consistent for all patterns of additive and dominant gene expression categories (Figure <ref type="figure">3B</ref>). Allele-specific expression was not detected for genes with transgressive expression in the F 1 hybrid consistent with trans-regulation of this gene set (Figure <ref type="figure">3B</ref>). Additionally, genes with the greatest magnitude of allele-specific expression are additively inherited, perhaps with cis-regulatory variants of large effect. The abundance of genes with additive and dominant patterns of expression inheritance and their allelic dynamics in 'US-897' is consistent with the major contribution of cis-regulatory variation to gene expression divergence between Citrus and Poncirus.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Genes responsive to CLas infection have reduced signature of cis-regulation</head><p>T h er e l a t i v ei m p o r t a n c eo fc i s -a n dt r a n s -r e g u l a t i o nt o environmental response is still unclear. There is evidence of both cis-and trans-acting regulation of gene expression in response to environmental signals <ref type="bibr">(Snoek et al., 2017;</ref><ref type="bibr">Reynoso et al., 2019;</ref><ref type="bibr">Zeng et al., 2019;</ref><ref type="bibr">Zhou et al., 2021)</ref>, although some suspect that transregulators may be more critical for environmental response <ref type="bibr">(Signor and Nuzhdin, 2018)</ref>. To gain insight into the role of cis-versus trans-regulation of genes conferring HLB-tolerance in 'US-897',we first identified genes with significant treatment effects and genotype x treatment interactions. Only 629 genes were identified with a significant treatment effect (FDR corrected p &lt; 0.05, Supplementary Table <ref type="table">S4</ref>). This set of genes is responsive to CLas infection regardless of genotype. Of these 629 genes, 245 were upregulated, and 384 were downregulated (Supplementary Figure <ref type="figure">S1A</ref>). The few genes responding to CLas infection here may be due to the early sampling only 18 days after bud initiation. Previous gene expression analysis comparing 'Cleopatra' to 'US-897' identified many more differentially expressed genes, but plants were sampled 7 to 8 months after infection <ref type="bibr">(Albrecht and Bowman, 2012b)</ref>. While early transcriptomic changes are important for understanding the initial stages of host-pathogen interactions, they do not explain the full suite of mechanisms underlying tolerance to Huanglongbing disease. Gene ontology enrichment analysis revealed that upregulated genes were enriched for several biological processes related to regulation of the cell cycle, while down-regulated genes were enriched for processes related to cutin biosynthesis, cell wall organization, and lipid metabolism (Supplementary Table <ref type="table">S5</ref>; Fisher's exact test, p &lt; 0.001). A larger group of genes with a significant genotype x treatment interaction were identified. Of the 1,071 genes with significant genotype-specific responses to HLB disease (FDR corrected p &lt; 0.05), a majority (972) exhibited differential response to CLas infection between the two parental genotypes, 'Cleopatra' and 'Flying-Dragon'. A smaller number of genes (490) were differentially responsive to CLas between one parent, 'Flying-Dragon', and 'US-897' (Figure <ref type="figure">4A</ref>; FDR corrected p &lt; 0.05). There was some overlap between genes with significant genotype x treatment interactions between contrasts. Surprisingly, no genes were differentially responsive to CLas between the second parent, 'Cleopatra' and 'US-897'. The two parents, 'Flying Dragon' and 'Cleopatra', are responding differently to infection with CLas, at least in the early stages of infection. Few genes are differentially responsive to CLas in the hybrid compared to its two parents, suggesting that much of the parental response to infection was inherited in the hybrid.</p><p>To assess the contribution of cis-regulation to CLas responsive genes we quantified the response in allele-specific expression in 'US-897' (DASE) compared to expression divergence between the infected parents. Of the 1,071 genes with significant genotype x treatment effects, 975 could be assessed for allele-specific expression in 'US-897'. For these genes there is a moderate correlation between DASE and differential expression between infected parents (Figure <ref type="figure">4C</ref>,S p e a r m a n 's rho = 0.48, R 2 = 0.264) with D ASE explaining 26.4% of variation in divergent expression between 'Flying Dragon' and 'Cleopatra'. We also examined the relationship between DASE and differential expression between infected parents for 536 of the 629 genes with significant treatment effects. For these genes, DASE explained an even smaller proportion of variation in divergent expression between parents (R 2 = 0.134; Supplementary Figure <ref type="figure">S1C</ref>). This suggests that cis-regulatory variants are also a component of divergence in disease response, but that the signature of cis-regulation is reduced.</p><p>To further investigate cis-regulation of genes responding to</p><p>CLas infection, the allelic response to pathogen infection was evaluated in 'US-897'. Genes were partitioned into quadrants that were based on whether they were up-or down-regulated between 'US-897' and its parents (Figure <ref type="figure">4B</ref>). Allele-specific responses to pathogen infection were observed for three of the four quadrants for genes with a significant genotype x treatment interaction (Figure <ref type="figure">4D</ref>; paired t-test, p &lt; 0.01) and two of four quadrants for genes with significant treatment effect (Supplementary Figure <ref type="figure">S1D</ref>; paired t-test, p &lt; 0.01). Overall, the signature of cis-regulation of pathogen-responsive genes is reduced compared to genes with significant genotype effects (Figure <ref type="figure">3B</ref>), but cis-regulation still contributes to the transcriptional response to CLas infection.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Cis-regulation of genes involved in plant immunity</head><p>Genes with the largest magnitude of allele-specific expression in 'US-897' were additively inherited, possibly due to large-effect cisregulatory variants. Notably, genes with additive inheritance favoring expression of the 'Cleopatra' allele were significantly enriched for genes involved in systemic acquired resistance (SAR) (two out of seven annotated genes; p &lt; 0.01; Supplementary Table <ref type="table">S6</ref>). These two genes are homologs of NONEXPRESSOR OF PATHOGENESIS- RELATED3 (AtNPR3) and NONEXPRESSOR OF PATHOGENESIS-RELATED4 (AtNPR4) in Arabidopsis thaliana and are negative regulators of immunity and critical components of plant defense against pathogens <ref type="bibr">(Ding et al., 2018;</ref><ref type="bibr">Zhang et al., 2006)</ref>. NPR3 has also been shown to negatively regulate immunity in sweet orange, where silencing of NPR3 expression increased basal callose levels and reduced pathogen-induced callose deposition and ROS accumulation <ref type="bibr">(Sarkar et al., 2024)</ref>. The expression of both NPR3 and NPR4 in 'US-897' was intermediate between 'Cleopatra' and 'Flying Dragon' (Figure <ref type="figure">5A</ref>). We reasoned that reduced expression of NPR3 and NPR4 in 'US-897' could enhance basal immunity in the hybrid compared to its susceptible parent 'Cleopatra'. To understand how novel combinations of cis-regulatory alleles affect expression of these genes, we evaluated allele-specific expression in 'US-897'. Surprisingly, intermediate expression of NPR3 in 'US-897' results from high expression of the 'Flying Dragon' allele and low expression of the maternal 'Cleopatra' allele (Figure <ref type="figure">5B</ref>, FDR corrected p &lt; 0.01). This contrasts with the parental expression patterns, where 'Cleopatra' expression is high and 'Flying Dragon' expression is low (Figure <ref type="figure">5A</ref>). The patterns of allelic expression differ at NPR4 with low expression of the 'Flying Dragon' allele and high expression of the 'Cleopatra' allele resulting in intermediate expression of the gene in 'US-897' compared t oi t st w op a r e n t s( Figure <ref type="figure">5C</ref>). Patterns of allelic expression of NPR3 and NPR4 in 'US-897' are consistent for uninfected and CLas infected treatments. NPR3 and NPR4 negatively regulate NPR1, but because this regulation occurs post-transcriptionally <ref type="bibr">(Fu et al., 2012)</ref>, Only genes with significant genotype x treatment effects that could be tested for ASE are included (n=975). (D) The response of each allele in 'US-897' to pathogen infection for genes in each of the quadrants in 4B. Only genes with significant genotype x treatment effects that could be tested for ASE are included (n=975).</p><p>no relationship between levels of gene expression between NPR1 and NPR3/4 are expected. We did find that 'Flying Dragon' and 'US-897' have significantly lower NPR1 expression than 'Cleopatra' (Supplementary Figure <ref type="figure">S3</ref>). Overall, expression of NPR3 and NPR4 is reduced in 'US-897' compared to its susceptible parent, 'Cleopatra', and both genes have signatures of cis-regulation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Discussion</head><p>Parent-offspring trios are commonly used to infer patterns of inheritance and, in the case of gene expression, are essential for dissecting cis-and trans-gene regulatory architecture <ref type="bibr">(Hill et al., 2021)</ref>. We selected a parent-offspring trio with relevance to a current major disease impacting worldwide production of citrus <ref type="bibr">(Albrecht and Bowman, 2011;</ref><ref type="bibr">Court and Rahmani, 2017)</ref>. The interspecifich y b r i d ,'US-897' is a commercial cultivar with tolerance to HLB disease <ref type="bibr">(Albrecht and Bowman, 2011)</ref>. Analysis of differential gene expression between parents and hybrid and allele-specific expression in the hybrid was made possible by the production of a phased, diploid assembly of 'US-897'.T h e combination of PacBio HiFi sequencing with kmer-based triobinning resolved both parental haplotypes of the 'US-897' genome, resulting in the first published reference genome containing both Citrus and Poncirus ancestry.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Genes with additive and dominant expression are cis-regulated</head><p>Expression level dominance was the most prevalent pattern of gene expression in 'US-897' compared to its parents. More than 80% of genes with significant differential expression between genotypes, in both untreated (Figure <ref type="figure">2C</ref>) and treated samples (Supplementary Figure <ref type="figure">S2</ref>), were dominantly expressed in the hybrid. This is consistent with other parent-offspring trios, including in Arabidopsis thaliana F 1 hybrids, where expression dominance was also detected at the majority of genes differentially expressed between hybrid and parents <ref type="bibr">(Yuan et al., 2023)</ref>. Interestingly, in untreated samples, dominant gene expression was slightly biased towards expression of the maternal allele ('Cleopatra')( Figure <ref type="figure">2C</ref>), but this trend reversed in CLas treated samples, instead favoring dominant expression of the paternal allele ('Flying Dragon')(Supplementary Figure <ref type="figure">S2</ref>).</p><p>Allele-specific expression of parental alleles in hybrid genomes is an indicator of cis-regulatory control of gene expression. Analysis of allele-specific expression in 'US-897' revealed that a majority of genes differentially regulated between 'Cleopatra' and 'Flying Dragon' are regulated in cis. In addition to the large number of genes expressed in 'US-897' with ASE (59%), there was a strong correlation between ASE in 'US-897' and differential expression between the two parents (Figure <ref type="figure">3A</ref>). Together this suggests that divergence in gene expression between the Citrus reticulata ('Cleopatra') and Poncirus trifoliata ('Flying Dragon') primarily results from cis-regulatory variation. Genes with the greatest magnitude of ASE in 'US-897' were those genes with additive inheritance and dominant inheritance with higher expression of the 'Flying Dragon' allele (Figure <ref type="figure">3B</ref>). Strong allele-specific expression of these genes could be caused by large-effect mutations in cis-regulatory regions, and mining the regulatory regions of these genes could reveal regulatory motifs that have diverged between</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Citrus and Poncirus.</head><p>We focused on additively inherited genes with strong allele specific expression and discovered significant enrichment for genes involved in systemic acquired resistance (SAR), including homologs of Arabidopsis thaliana NONEXPRESSOR OF PATHOGENESIS-RELATED3 (AtNPR3) and NONEXPRESSOR OF PATHOGENESIS-RELATED3 (AtNPR4). These genes have established roles as negative regulators of plant immune responses <ref type="bibr">(Ding et al., 2018;</ref><ref type="bibr">Zhang et al., 2006)</ref>, including recently in citrus where silencing of NPR3 increased tolerance to HLB in sweet orange, suggesting that modulating NPR3 activity could tune the immune response to mitigate disease <ref type="bibr">(Sarkar et al., 2024)</ref>. Reduced expression of NPR3 and NPR4 in 'US-897' compared to susceptible 'Cleopatra' suggests that the hybrid may contain a favorable immune status compared to its susceptible parent. Further dissecting allele-specific expression (ASE) of NPR3 and NPR4 in 'US-897' revealed complex underlying regulatory architectures. For NPR4, we found evidence of a classic example of cis-regulation, with low-expression of the 'Flying Dragon' allele and higher expression of the 'Cleopatra' allele resulting in intermediate expression of NPR4 in the hybrid. In contrast, the regulatory control of NPR3 was more complex with intermediate expression of NPR3 in 'US-897' resulting from unexpectedly high expression of the 'Flying Dragon' and low expression of the 'Cleopatra' allele. This suggests an interplay of cis-and transacting factors, but the expression of each allele would need to be evaluated in both parental backgrounds to further dissect the regulation of NPR3. The precise implications and physiological impact of cis-regulation of NPR3 and NPR4 and their effect on HLB tolerance will require further functional evaluation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Transgressive gene expression was limited in 'US-897'</head><p>Genes with transgressive expression should not exhibit ASE, because trans-regulators will affect the expression of both parental alleles in the hybrid. Indeed, genes with transgressive expression in 'US-897' were not expressed in an allele-specificmanner(Figure <ref type="figure">3B</ref>). Overall, very few genes were transgressively expressed in both untreated and CLas-treated samples (Figure <ref type="figure">2C</ref>; Supplementary Figure <ref type="figure">S2</ref>). Transgressive expression could be a consequence of hemizygosity, with the hybrid inheriting two or zero copies of a gene and resulting in transgressive up or downregulation in 'US-897', respectively. In fact, 10 -15% of genes are hemizygous in the clonally propagated grapevine (Vitis vinifera) and cassava (Manihot esculenta) <ref type="bibr">(Zhou et al., 2019;</ref><ref type="bibr">Peng et al., 2025)</ref>. Similarly, in 'US-897',morethan 6,000 genes were present in only one of the two parental haplotypes, making this a promising avenue for future research. Genes with transgressive expression have long been of interest for their potential contribution to hybrid vigor, but overall we find that transgressive expression is limited in this interspecifichy br i d.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Genes responsive to CLas infection have a reduced signature of cis-regulation</head><p>Cis-regulatory sequences are also important for regulating gene expression in response to environmental signals <ref type="bibr">(Reynoso et al., 2019;</ref><ref type="bibr">Zeng et al., 2019;</ref><ref type="bibr">Zhou et al., 2021)</ref>. For example, in two species of Arabidopsis more than 6,000 drought responsive cisregulatory variants were identified <ref type="bibr">(He et al., 2021)</ref>. To explore the contribution of cis-regulation to the response to CLas infection, genes with a significant treatment effect (n=629) or genotype x treatment interaction (n=1,071) were identified. Compared to previous studies of gene expression in 'US-897' and its maternal parent 7-8 months after infection <ref type="bibr">(Albrecht and Bowman, 2012b)</ref>, relatively few CLas responsive genes were identified at just 18 days after bud initiation.</p><p>Cis-regulation of genes responsive to CLas infection was limited compared to cis-regulation of gene expression divergence between 'Cleopatra' and 'Flying Dragon'. This is supported by the reduced correlation of DASE in 'US-897' (the allelic response to CLas infection) to parental responses to CLas infection (Figure <ref type="figure">4C</ref>; Supplementary Figure <ref type="figure">S1C</ref>). Also, evaluation of allelic response to CLas revealed that there were significant differences between the response of the maternal or paternal allele in 'US-897', although the differences were subtle (Figure <ref type="figure">4D</ref>). This suggests that cisregulatory variants are important for the transcriptional response to pathogen infection, but their contribution may be reduced for pathogen-responsive genes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Relevance of genetic architecture for breeding new citrus cultivars</head><p>Many citrus rootstocks in current use for citriculture throughout the world are hybrids of Citrus species with Poncirus trifoliata <ref type="bibr">(Bowman and Joubert, 2020)</ref>. Poncirus trifoliata has been highly valued as a parent to create new rootstocks because of several important traits that are transmitted to progeny, including nucellar polyembryony <ref type="bibr">(Kepiro and Roose, 2009)</ref>, tolerance of citrus nematodes <ref type="bibr">(Ling et al., 2000;</ref><ref type="bibr">Imran Hamid et al., 2025)</ref>, tolerance of Phytophthora root rot <ref type="bibr">(Graham, 1995)</ref>, resistance to citrus tristeza virus <ref type="bibr">(Fang et al., 1998)</ref>, and cold tolerance <ref type="bibr">(Yelenosky, 2011)</ref>. More recently, P. trifoliata has also been recognized as transmitting CLas-resistance and tolerance to progeny that are useful as rootstocks <ref type="bibr">(Albrecht and</ref><ref type="bibr">Bowman, 2011, 2012b)</ref>, and breeding work has expanded to include P. trifoliata ancestry in advanced-generation rootstock hybrids with tolerance to CLas <ref type="bibr">(Bowman et al., 2021;</ref><ref type="bibr">Bowman, 2023)</ref>.</p><p>Although less widely recognized, P. trifoliata has also been incorporated in citrus scion breeding for decades, to introduce resistance to tristeza virus and better cold hardiness into new scions <ref type="bibr">(Young et al., 1982;</ref><ref type="bibr">Tignor et al., 1998)</ref>. More recently, the aim of such efforts has shifted to include breeding new scion hybrids that contain tolerance to CLas from P. trifoliata ancestry <ref type="bibr">(Huang et al., 2023a)</ref>. In citrus scion breeding, first generation hybrids with trifoliate orange typically have significant undesirable flavor components in fruit juice <ref type="bibr">(Deterre et al., 2021)</ref>. However, backcrossing of these P. trifoliata orange hybrids to citrus for one or more generations, combined with progeny selection, can eliminate these off-flavors, and recover new hybrids that combine the desired CLas-tolerance traits with good-flavored fruit <ref type="bibr">(Fang et al., 1998;</ref><ref type="bibr">Jeffries et al., 2024)</ref>. In breeding of both citrus rootstocks and citrus scions, information about regulation of gene expression for critical traits is severely limited, and lack of knowledge about transmission of key traits from parents to progeny greatly hampers progress in developing new cultivars with those essential traits. A clear understanding about genetic architecture and the regulation of gene expression associated with CLas infection will dramatically improve opportunities to make meaningful gains in the breeding of both rootstocks and scions with tolerance to CLas. While our transcriptomic analyses provide insights into the regulation of gene expression in response to CLas infection, the molecular basis of HLB tolerance is still unknown and will require validation of the roles of specific regulatory variants and genes in conferring HLB tolerance.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Conclusion</head><p>Cis-regulatory variants are a common and significant contributor to expression divergence between Citrus and its wild relative Poncirus. These cis-acting variants are associated with additive and dominant gene expression inheritance. But, genes responding to CLas shortly after infection have a reduced signature of cis-regulation, suggesting a dynamic interplay between cis-and trans-regulation of response to CLas infection in the interspecific hybrid, 'US-897'. The high-quality, haplotypephased genome assembly of 'US-897', alongside the insights gained from analysis of allele-specific expression in the 'US-897', provide a critical resource for uncovering the genetic basis of HLB tolerance in Citrus x Poncirus hybrids.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Materials and methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Plant cultivation and CLas inoculation</head><p>Candidatus liberibacter asiaticus (CLas) is a quarantined pathogen in California and all plant cultivation and psyllidmediated inoculations were performed in the California Citrus Research Foundation containment facility located in Riverside, CA. Seeds of 'Cleopatra' mandarin (Citrus reticulata), 'Flying Dragon' trifoliate orange (Poncirus trifoliata), and their F 1 hybrid 'US-897' were sterilized prior to germination. The primary seed coats were removed, and seeds were washed with 70% isopropanol for 10 minutes. Next, seeds were treated with a 10% bleach solution for 30 minutes and rinsed three times with sterile water. Seeds were placed in 15 ml culture tubes containing seed germination media (0.22% Murashige and Skoog basal salt mixture <ref type="bibr">(Murashige and Skoog, 1962)</ref>, 2.5% sucrose, 0.005% myo-inositol, 0.50% ethylenediaminetetraacetic acid ferric salt, and 0.70% agar) and grown at 30&#176;C in the dark for two weeks. After germination, seeds were transplanted to cone-tainers (164 ml) with coconut coir and placed in a greenhouse with 12 hour supplemental light:dark cycles with light intensity maintained at ~500 &#956;mol/m&#178;/s. Both 'Cleopatra' and 'Flying Dragon' produce nucellar seeds. All transplanted seedlings were genotyped with 24 LGC KASP &#8482; markers (Hoddesdon, UK) and zygotic seedlings were culled.</p><p>For CLas inoculation, Asian citrus psyllids (Diaphorina citri) were reared on either confirmed CLas-infected citron (Citrus medica) plants or CLas-free citron plants, thereby providing CLas-positive (CLas+) and mock-inoculation (CLas-) psyllid populations, respectively. Nine month old plants (6 replicates) of each genotype ('Cleopatra', 'Flying Dragon', US-897) were infested with 15-20 CLas+ psyllids per plant for a period of four weeks. Concurrently, three replicate plants of each genotype were similarly infested with 15-20 clean (CLas-) psyllids per plant to serve as mock-inoculated controls. Psyllids were contained on the plants using small mesh cages enclosing a single branch during the infestation period. In some rare cases, smaller plants were completely caged. Plants were nine months-old at the time of infestation with psyllids. Before tissue sampling for RNAsequencing, all experimental plants were pruned to stimulate new flush. At this time, the systemic CLas infection status of each individual plant was confirmed by quantitative PCR of the nrdB gene of CLas as performed in <ref type="bibr">(Zheng et al., 2016)</ref>. The leaf sampled to confirm CLas infection was from the single, caged branch (or small plant). This branch was trimmed and then bud initiation was tracked, with sampling of leaf tissue for RNA-seq occurring 18 days after bud initiation. Based on qPCR results, five 'Cleopatra', two 'Flying Dragon',a n dfive US-897 individuals from the CLas+ treatment group were confirmed as infected and used for subsequent RNA-seq analysis (Supplementary Table <ref type="table">S2</ref>). All plants, mock-and CLas-infected, were randomized in both the plant growth chamber, where psyllid inoculations occurred, and in the greenhouse.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Sampling scheme and RNA-sequencing</head><p>Plants were transferred from the growth chamber used for psyllid-mediated CLas (and mock) inoculations to the greenhouse two months after their initial exposure to psyllids. Plants continued to grow for an additional two months in the greenhouse and then, after adjusting to greenhouse conditions, leaf samples were collected to assay for CLas. This enables sufficient establishment of the bacteria in the host plants. Additionally, CLas detection in new shoots has been shown to occur as early as 15 days post-bud initiation <ref type="bibr">(Ma et al., 2022)</ref>. After sampling, a single branch on each plant was trimmed to induce new flush and plants were monitored daily for bud initiation. Next, leaf tissue from the newly emerging flush was sampled 18 days post bud initiation to capture early responses to pathogen infection as performed in <ref type="bibr">(Ma et al., 2022)</ref>. While the exact newly emerging shoots sampled for RNA-seq were not individually quantified for bacterial titer, CLas is known to be transported to and can be detected in newly developing flush tissues within 15 days of bud-initiation <ref type="bibr">(Ma et al., 2022)</ref>. Therefore, RNA-seq was performed on these newly emerged shoots from systemically infected plants. All sampling was performed from 10-11AM Pacific Standard Time (PST). After sampling, leaf tissue was flash frozen in liquid nitrogen and quarantined at -80&#176;C for 14 days. RNA extraction was then performed using the Qiagen RNeasy Plant kit (Hilden, Germany). Sequencing libraries were prepared using the Illumina Stranded mRNA prep, Ligation kit (Pleasanton, California). Libraries were sequenced on two lanes of a NovaSeq X Plus (PE150) by Novogene (Beijing, China) generating an average of 28.3 million reads per sample (Supplementary Tables <ref type="table">S3</ref>, <ref type="table">S4</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>PacBio sequencing</head><p>Nuclei extraction from ~3 grams of young leaf tissue from a single individual of the F 1 hybrid 'US-897' was performed using PacBio's nuclei extraction protocol (PacBio, 2022b). High molecular weight DNA (HMW DNA) was then extracted from nuclei using PacBio's Nanobind plant kit (PacBio, 2022a). An extended lysis step was included, increasing from 30 minutes to 2 hours. A second lysis step was performed to ensure sufficient extraction of HMW DNA. HMW DNA was then sent to Novogene (Beijing, China) for library preparation and sequencing on one SMRT cell of the PacBio Revio (Menlo Park, California). This generated 65 Gb of HiFi sequencing reads, with an average read length of 15.6 Kb.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Genome-assembly and annotation</head><p>PacBio HiFi sequencing reads were used to assemble the diploid genome of the F 1 hybrid 'US-897'. First, Illumina whole-genome sequencing reads from the parental genotypes, 'Cleopatra' mandarin and 'Flying Dragon' trifoliate orange <ref type="bibr">(Wu et al., 2018;</ref><ref type="bibr">Peng et al., 2020)</ref>, were used to identify parent-specific k-mers (k=19), or hapmers, using Merqury <ref type="bibr">(Rhie et al., 2020)</ref>. PacBio HiFi reads were error-corrected and used to construct an assembly graph using verkko <ref type="bibr">(Rautiainen et al., 2023)</ref>. Next, paths in this assembly graph were haplotype-resolved using parental hap-mer profiles (verkko trio), and a consensus algorithm was applied to construct the final diploid assembly <ref type="bibr">(Rautiainen et al., 2023)</ref>. Before genome annotation, repetitive sequences were soft-masked using a custom library of repeat sequences with RepeatMasker v 4.1.5 <ref type="bibr">(Smit et al., 2013</ref><ref type="bibr">(Smit et al., -2015))</ref>. Genome annotation was then performed with BRAKER v3.0.3, using a custom database of proteins, including those specific to the genus Citrus. Only the 9 largest scaffolds of each haplotype were annotated. BUSCO scores were calculated for the genome assembly (-m geno) and the genome annotation (-m prot) using the eudicots_db10 database and Busco v5.8.0 <ref type="bibr">(Manni et al., 2021)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Differential gene expression analysis</head><p>Genome annotations for each haplotype of 'US-897' were used to construct a diploid transcriptome using Salmon v1.7.0 <ref type="bibr">(Patro et al., 2017)</ref>. Homology of transcripts between haplotypes was inferred based on protein sequence identity and collinearity using GeneTribe <ref type="bibr">(Chen et al., 2020)</ref>. Transcripts were classified based on whether they are reciprocal best hits (RBH) that are allelic between haplotypes, single best hits (SBH) are orthologous sequences, and singletons are genes that are unique to a haplotype. For ortholog identification, the gene IDs from the 'Flying Dragon' haplotype were used, unless the gene was only present in Cleopatra, then the gene IDs from the 'Cleopatra' haplotype were used. Next transcripts were quantified using Salmon v1.7.0 <ref type="bibr">(Patro et al., 2017)</ref> with the following parameters: -validateMappings -numBootstraps 30 -recoverOrphans -writeUnmappedNames -allowDovetail -softclip -minScoreFraction 0.6. Bootstrapping (n=30) was performed to generate inferential replicates for accurate quantification of transcripts from RNA-sequencing reads. For analyses of overall gene-expression (as opposed to allele-specific expression), read counts for transcripts were summed for the set of orthologous genes. Gene expression estimates (Counts-Per-Million or CPM) were log 2 -transformed and weights were assigned to each observation based on the estimated mean-variance trend across genes using voom transformation <ref type="bibr">(Law et al., 2014)</ref>. Next, gene expression estimates for the estimated mid-parent values for each gene were calculated by averaging read counts of parental samples that were randomly selected from replicates of each treatment. This process was repeated to generate four replicates of mid-parent expression per gene. Genes with low levels of expression were removed (CPM less than 2) and the remaining set of 20,981 genes were the basis of differential gene expression analysis. Gene expression estimates were modeled using limma-voom <ref type="bibr">(Law et al., 2014)</ref>, including fixed effects for genotype ('Cleopatra', 'Flying Dragon', 'US-897', mid-parent), treatment (CLas-inoculated, mockinoculated), and the genotype x treatment interaction. This midparent value-based classification, performed using limma'sl i n e a r models and statistical comparisons <ref type="bibr">(Ritchie et al., 2015)</ref>, serves as a standard approach for broadly categorizing gene expression inheritance patterns <ref type="bibr">(Rapp et al., 2009;</ref><ref type="bibr">Huang et al., 2016)</ref>. While the midparent value establishes a null hypothesis for additive inheritance for each gene, the use of limma accounts for intragenotypic variability in its variance estimation, thereby providing robust statistical significance for deviations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Classification of hybrid gene expression inheritance patterns</head><p>Specific contrasts were evaluated to identify differentially expressed genes (DEGs) between genotypes ('Cleopatra' versus 'Flying Dragon', 'Cleopatra' or 'Flying Dragon' versus 'US-897',a n d'US-897' versus estimated mid-parent value). Gene expression inheritance patterns were determined for genes with differential expression in any of these contrasts using the R package "HybridExpress" <ref type="bibr">(Almeida-Silva et al., 2023)</ref>. Significant differences based on Benjamini-Hochberg FDR &lt; 0.05 and the log 2 (fold-change) for each contrast were used to classify geneexpression patterns. Genes were classified as having "additive" inheritance if there was no significant difference in expression (Benjamini-Hochberg FDR &gt; 0.05) between the expected mid-parent value and 'US-897'. Genes were classified as "dominant" if there was a significant difference between the hybrid and one parent, but not the other. Lastly, genes with significantly higher or lower gene expression in the 'US-897' compared to both parents were classified as transgressive. This same classification scheme was used to assess gene expression inheritance patterns for genotype x treatment contrasts.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Identification of allele-specific expression in 'US-897'</head><p>Gene expression was quantified as described above using Salmon, with bootstrapping (n=30) to generate inferential replicates for accurate quantification of alleles from RNAsequencing reads. Gene expression was quantified for 22,419 genes, and genes with read counts greater than ten in at least six samples were retained (n=17,487). Genes for which inferential replicates of each allele were almost identical were filtered from downstream analyses (n=2), as this indicates there is no information about allelic expression in the sequencing reads. For each gene, the ratio of allele 1 ('Flying Dragon'): allele 2 ('Cleopatra') expression was averaged across all samples, and genes with a ratio greater than 0.95 (n=371) or less than 0.05 (n=467) were classified as having monoallelic expression. Monoallelic genes were excluded from modeling of allele-specific expression resulting in 15,549 genes that could be evaluated for allele-specific expression.</p><p>that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>Frontiers in Plant Science frontiersin.org</p></note>
		</body>
		</text>
</TEI>
