<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Analysis of essential genes in &lt;i&gt;Clostridioides difficile&lt;/i&gt; by CRISPRi and Tn-seq</title></titleStmt>
			<publicationStmt>
				<publisher>Journal of Bacteriology</publisher>
				<date>10/23/2025</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10665323</idno>
					<idno type="doi">10.1128/jb.00220-25</idno>
					<title level='j'>Journal of Bacteriology</title>
<idno>0021-9193</idno>
<biblScope unit="volume">207</biblScope>
<biblScope unit="issue">10</biblScope>					

					<author>Maia E Alberts</author><author>Micaila P Kurtz</author><author>Ute Müh</author><author>Jonathon P Bernardi</author><author>Kevin W Bollinger</author><author>Horia A Dobrila</author><author>Leonard Duncan</author><author>Hannah M Laster</author><author>Andres J Orea</author><author>Anthony G Pannullo</author><author>Juan G Rivera-Rosado</author><author>Facundo V Torres</author><author>Craig D Ellermeier</author><author>David S Weiss</author><author>Tina M Henkin</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[<title>ABSTRACT</title> <sec><p>Essential genes are interesting in their own right and as potential antibiotic targets. To date, only one report has identified essential genes on a genome-wide scale in <italic toggle='yes'>Clostridioides difficile</italic>, a problematic pathogen for which treatment options are limited. That foundational study used large-scale transposon mutagenesis to identify 404 protein-encoding genes as likely to be essential for vegetative growth of the epidemic strain R20291. Here, we revisit the essential genes of strain R20291 using a combination of CRISPR interference (CRISPRi) and transposon insertion site sequencing (Tn-seq). First, we targeted 181 of the 404 putatively essential genes with CRISPRi. We confirmed essentiality for >90% of the targeted genes and observed morphological defects for >80% of them. Second, we conducted a new Tn-seq analysis, which identified 346 genes as essential, of which 283 are in common with the previous report and might be considered a provisional essential gene set that minimizes false positives. We compare the list of essential genes to those of other bacteria, especially <italic toggle='yes'>Bacillus subtilis</italic>, highlighting some noteworthy differences. Finally, we used fusions to red fluorescent protein (RFP) to identify 18 putative new cell division proteins, 3 of which are conserved in Bacillota but of largely unknown function. Collectively, our findings provide new tools and insights that advance our understanding of <italic toggle='yes'>C. difficile</italic>.</p><sec><title>IMPORTANCE</title><p><italic toggle='yes'>Clostridioides difficile</italic> is an opportunistic pathogen for which better antibiotics are sorely needed. Most antibiotics target pathways that are essential for viability. Here, we use saturation transposon mutagenesis and gene silencing with CRISPR interference to identify and characterize genes required for growth on laboratory media. 
Comparison to the model organism <italic toggle='yes'>Bacillus subtilis</italic> revealed many similarities and a few striking differences that warrant further study and may include opportunities for developing antibiotics that kill <italic toggle='yes'>C. difficile</italic> without decimating the healthy microbiota needed to keep <italic toggle='yes'>C. difficile</italic> in check.</p></sec></sec>]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Transposon insertion site sequencing (Tn-seq) identifies essential genes on a genome-wide scale based on the absence of insertions following saturation transposon mutagenesis <ref type="bibr">(5,</ref><ref type="bibr">6)</ref>. However, several caveats must be kept in mind when interpreting the output of a Tn-seq experiment. For instance, insertion mutants that are viable but grow slowly will be lost from the mutant pool during outgrowth, so some apparently essential genes can be deleted. This caveat underscores the fact that binary categorization of genes as essential or non-essential is useful but an oversimplification. Tn-seq might also erroneously classify non-essential genes as essential due to polarity onto bona fide essential genes or because the random nature of Tn insertions means genes might be missed for stochastic reasons. Finally, Tn-seq does not provide insight into the actual function of essential genes because the phenotypic defects of the corresponding insertion mutants are not observed. Despite these caveats and limitations, Tn-seq is a powerful tool for prioritizing genes to investigate by more laborious methods.</p><p>CRISPR interference (CRISPRi) is a complementary approach for genome-wide interrogation of essential genes in bacteria <ref type="bibr">(7)</ref><ref type="bibr">(8)</ref><ref type="bibr">(9)</ref><ref type="bibr">(10)</ref><ref type="bibr">(11)</ref><ref type="bibr">(12)</ref>. CRISPRi uses a single-guide RNA (sgRNA) to direct a catalytically inactive Cas9 protein (dCas9) to a gene of interest, thereby repressing transcription <ref type="bibr">(13)</ref>. As the organism continues to grow and divide, it becomes depleted of the targeted protein, potentially revealing phenotypic changes that precede cell death. Thus, CRISPRi provides functional information that Tn-seq cannot. 
However, CRISPRi shares with Tn-seq the problem of polarity, which has to be taken into consideration when interpreting phenotypes.</p><p>In 2015, Dembek et al. used Tn-seq to identify 404 protein-encoding genes as essential for vegetative growth in C. difficile strain R20291 on BHI media <ref type="bibr">(14)</ref>. As expected, most of these genes encode proteins involved in core biological processes and cell surface biogenesis, but some are of unknown function or not expected to be essential. Here, we revisit the essential genes of strain R20291 using a combination of CRISPRi and Tn-seq. First, we targeted 181 of the 404 putatively essential genes with CRISPRi to vet essentiality and identify terminal phenotypes. We confirmed essentiality for &gt;90% of the targeted genes and observed morphological defects for &gt;80% of them. Second, we conducted a new and more thorough Tn-seq analysis to identify genes essential for vegetative growth on TY media. We classified 346 protein-coding genes as essential, of which 283 (~80%) were also essential in the previous study. Finally, we conducted a microscopy-based screen to identify potential cell division proteins. We discuss our findings in light of what is known about essential genes and cell division in other bacteria, particularly Bacillus subtilis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>RESULTS AND DISCUSSION</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A library for CRISPRi knockdown of 181 putative essential genes</head><p>Our C. difficile CRISPRi plasmid has been described <ref type="bibr">(15)</ref>. It expresses a codon-optimized dCas9 from a xylose-inducible promoter (Pxyl) and a sgRNA from a constitutively active glutamate dehydrogenase promoter (Pgdh). Constructing a knockdown library involved several steps: selecting the genes to be targeted, designing the sgRNAs, cloning those sgRNAs into the CRISPRi plasmid, and moving the finished plasmids from E. coli into C. difficile by conjugation. Because conjugation efficiencies are low, plasmids have to be moved from E. coli into C. difficile one by one. This step imposes a bottleneck that makes it impractical to target all 404 essential genes identified previously. We therefore trimmed the gene list by excluding all transposon and phage-related genes (because these are not part of the core genome), most genes for tRNA synthetases and ribosomal proteins (to limit redundancy), and most genes for small proteins, defined here as fewer than 80 amino acids (240 nucleotides). Short genes are small targets for Tn insertion, so a disproportionate fraction is likely to be false positives. At this point, we were left with 252 genes. Because CRISPRi is polar <ref type="bibr">(7,</ref><ref type="bibr">13,</ref><ref type="bibr">16,</ref><ref type="bibr">17)</ref>, there is little to be gained by targeting multiple genes in an operon, so, in most cases, we targeted only one gene per transcription unit as annotated in BioCyc (v28.5, release Dec 2024) <ref type="bibr">(18)</ref>.</p><p>In the end, we selected a total of 181 putatively essential genes for CRISPRi knockdown (Table <ref type="table">S1</ref>). We constructed a library of individual sgRNA clones, using two sgRNAs per gene for a total of 362 CRISPRi plasmids (Table <ref type="table">S2</ref>). 
Although our experiments were restricted to strain R20291, 91% of sgRNAs are perfect matches to the corresponding genes in strain 630 as well (Table <ref type="table">S2</ref>). As negative controls, we constructed 20 CRISPRi plasmids with scrambled sgRNAs that do not target anywhere in the R20291 genome (Table <ref type="table">S2</ref>). Plasmids were confirmed by sequencing across the Pgdh::sgRNA element in E. coli and after conjugation into C. difficile. Of the genes targeted for knockdown, 86 have an essential ortholog in Bacillus subtilis, 62 have a non-essential ortholog in B. subtilis, and 33 have no B. subtilis ortholog, including four hypothetical genes. However, the number of genes of unknown function is larger than four because many of the non-hypotheticals have homology to domains with such broadly or ill-defined functions that it is not obvious what these genes do or why they would be essential (e.g., "glycosyltransferase," "two-component response regulator," or "DUF1846"). Considering that 110 of the targeted genes are predicted to be in operons with other apparently essential genes, our study encompasses 281 of the 404 genes identified as essential by Tn mutagenesis, close to 70% of the total <ref type="bibr">(14)</ref>.</p></div>
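<div xmlns="http://www.tei-c.org/ns/1.0"><p>The bookkeeping behind the library can be checked with a few lines of Python. This is only an illustrative sketch: the counts are taken from the text above, whereas the variable names are hypothetical and introduced here for convenience.</p><ab><![CDATA[
```python
# Counts quoted in the text; all names are illustrative, not from a published pipeline.
PUTATIVE_ESSENTIAL = 404   # genes called essential by Dembek et al.
TARGETED_GENES = 181       # genes selected for CRISPRi knockdown
SGRNAS_PER_GENE = 2

# Two sgRNAs per targeted gene gives the number of targeting plasmids.
targeting_plasmids = TARGETED_GENES * SGRNAS_PER_GENE

# Breakdown of the targeted genes by B. subtilis ortholog status.
ortholog_classes = {
    "essential ortholog": 86,
    "non-essential ortholog": 62,
    "no ortholog": 33,
}
targeted_from_classes = sum(ortholog_classes.values())

# Genes covered once operon partners of targeted genes are included.
COVERED_WITH_OPERON_PARTNERS = 281
coverage = COVERED_WITH_OPERON_PARTNERS / PUTATIVE_ESSENTIAL

print(targeting_plasmids, targeted_from_classes, f"{coverage:.1%}")
```
]]></ab><p>The three ortholog classes sum to the 181 targeted genes, and the operon-aware coverage is just under 70%, in line with the figures stated above.</p></div>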
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Essentiality determined by CRISPRi knockdown largely agrees with Tn-seq data</head><p>The entire CRISPRi library was screened for viability defects by conducting spot titer assays on TY plates containing thiamphenicol at 10 &#181;g/mL (hereafter TY-Thi10) and 1% xylose. Control plates lacked xylose. Viability defects were scored as strong, moderate, weak, or none based on growth at different dilutions (Fig. <ref type="figure">1A</ref> and <ref type="figure">B</ref>). Using a 10-fold viability defect or small colony phenotype with at least one sgRNA as the cut-off, 167 of the 181 genes (92%) were confirmed as essential by CRISPRi, while 14 were not essential (Fig. <ref type="figure">1B</ref>; Table <ref type="table">S1</ref>). Similar results were obtained with both sgRNAs for 174 of the 181 genes tested (Table <ref type="table">S1</ref>). None of the 20 non-targeting control sgRNAs caused a growth defect, indicating off-target effects are rare. We conclude that the vast majority of the genes Dembek et al. identified as essential by Tn-seq are also essential by CRISPRi <ref type="bibr">(14)</ref>.</p><p>Figure legend (fragment): … Table <ref type="table">S1</ref>. Tn-seq calls come from Table <ref type="table">S3</ref>.</p></div>
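<div xmlns="http://www.tei-c.org/ns/1.0"><p>The cut-off described above (a 10-fold viability defect or a small-colony phenotype with at least one sgRNA) amounts to a simple decision rule. The sketch below is one possible encoding of that rule, not the scoring procedure actually used; the record layout, the definition of the fold defect, the function names, and the example values are all assumptions made for illustration.</p><ab><![CDATA[
```python
from dataclasses import dataclass


@dataclass
class SpotTiter:
    """Outcome for one sgRNA (hypothetical record layout)."""
    fold_defect: float    # assumed: plating efficiency without xylose / with 1% xylose
    small_colonies: bool  # True if colonies on xylose were visibly smaller


def guide_shows_defect(result: SpotTiter, min_fold: float = 10.0) -> bool:
    """A guide counts if it meets the 10-fold cut-off or gives small colonies."""
    return result.fold_defect >= min_fold or result.small_colonies


def confirmed_essential(results: list[SpotTiter]) -> bool:
    """A gene is confirmed if at least one of its sgRNAs shows a defect."""
    return any(guide_shows_defect(r) for r in results)


# Made-up example values, not data from the study.
example = {
    "geneA": [SpotTiter(1000.0, False), SpotTiter(100.0, False)],
    "geneB": [SpotTiter(1.0, True), SpotTiter(1.0, False)],
    "geneC": [SpotTiter(2.0, False), SpotTiter(1.0, False)],
}
calls = {gene: confirmed_essential(res) for gene, res in example.items()}

# Fraction of targeted genes confirmed, using the counts reported in the text.
confirmed_fraction = 167 / 181
print(calls, f"{confirmed_fraction:.0%}")
```
]]></ab><p>Requiring only one of two sgRNAs to show a defect tolerates an occasional ineffective guide, at the cost of being more permissive than a rule that demands agreement between both guides.</p></div>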
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Terminal phenotypes due to CRISPRi knockdown of genes of known function</head><p>To look for morphological abnormalities that might facilitate the provisional assignment of essential genes to functional pathways, cells were scraped from the last culture dilution that grew on the 1% xylose plates and examined by phase-contrast microscopy.</p><p>As the project progressed, we added staining with FM4-64 to visualize the cytoplasmic membrane and Hoechst 33342 to visualize DNA. The morphological defects associated with CRISPRi silencing of all 181 genes are listed in Table <ref type="table">S1</ref>.</p><p>CRISPRi knockdown of genes of known function often provoked expected morphological defects, such as filamentation in the case of cell division genes and aberrant nucleoid staining in the case of DNA replication genes (Fig. <ref type="figure">2</ref>; Fig. <ref type="figure">S1</ref>, Table <ref type="table">1</ref>, Table <ref type="table">S1</ref>). Also, as expected, knockdown of DNA replication genes sometimes resulted in filamentation, presumably due to induction of the SOS response <ref type="bibr">(19,</ref><ref type="bibr">20)</ref>. However, we also observed morphological defects that were not expected and are difficult to rationalize. For instance, knockdown of rpoB (&#946; subunit of RNA polymerase) or era (GTPase involved in ribosome assembly) caused severe filamentation, while knockdown of guaA (synthesis of guanosine ribonucleotides) caused a mild chaining phenotype. 
To address whether the unexpected morphological abnormalities are an artifact of working with cells scraped from plates, we reexamined the filamentation phenotype of four non-division genes in broth about six doublings after inducing CRISPRi: dnaX, rpoB, prfB, and tilS. We observed elongated cells in each case (Fig. <ref type="figure">S2</ref>). Thus, at least for this phenotype and these four genes, morphologies determined using plates are reliable. Because morphological defects were only loosely associated with the function of well-studied genes, we conclude that CRISPRi is not sufficient for assigning genes of unknown function to physiological pathways. We are not the first to report unanticipated complexity among terminal phenotypes in a CRISPRi screen. For example, CRISPRi knockdown of the RNA polymerase gene rpoC and the phospholipid synthesis genes psd and plsB caused filamentation in E. coli <ref type="bibr">(8)</ref>. In addition, knockdown of multiple genes with no direct role in envelope biogenesis caused morphological defects in B. subtilis <ref type="bibr">(7)</ref>. These reports contrast with the narrower spectrum of morphological defects induced by antibiotics that target specific pathways <ref type="bibr">(21)</ref><ref type="bibr">(22)</ref><ref type="bibr">(23)</ref>. Antibiotics might be less subject to secondary effects because cells are visualized at early times after exposure, and polarity is not an issue.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Terminal phenotypes due to CRISPRi knockdown of genes of unknown function</head><p>Our CRISPRi library targeted 11 genes that could not be assigned to a functional category and were confirmed as essential in our own Tn-seq analysis, as will be described below. CRISPRi caused a viability defect in nine cases, often accompanied by abnormal morphologies (Table <ref type="table">2</ref>). Examples include cdr20291_0481 and cdr20291_0828 (elongation), the cdr20291_1053-1057 cluster (short, swollen, phase-bright cells, and chaining), cdr20291_1124 (chaining and many misshapen phase-bright cells), and cdr20291_2526 (a few misshapen cells). The phenotype resulting from knockdown of cdr20291_1124 could be due to reverse polarity onto the upstream gene alaS, which encodes an alanyl-tRNA synthetase. These genes warrant further investigation.</p><p>Table footnotes (fragment): Phenotypes reported encompass the range observed across the genes listed. The phenotypic defects often differed for different genes from the same functional pathway. Major phenotypes caused by repression of each gene are listed in Table <ref type="table">S1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Rationale for the decision to conduct Tn-seq</head><p>As noted above, our CRISPRi screen was based on the only published genome-wide analysis of gene essentiality in C. difficile. That study made essentiality calls based on a single pool of mutants containing ~77,000 unique Tn insertions plated on BHIS media <ref type="bibr">(14)</ref>. We reasoned that an independently derived list of essential genes based on multiple biological replicates and larger insertion libraries would serve as a useful resource to the C. difficile community. We also thought a new Tn-seq data set might serve as a "tie-breaker" for the 14 putatively essential genes that did not appear to be essential by CRISPRi, that is, failure to recover insertions in those genes would suggest our sgRNAs were ineffective, while recovery of insertions would suggest the genes are non-essential and were missed in the previous study for stochastic reasons.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Generation of Tn insertion libraries and identification of essential genes</head>
<figure type="table"><head>TABLE 2 Essential genes not assigned to a physiological pathway (a)</head>
<table>
<row role="label"><cell>Locus tag</cell><cell>Annotation</cell><cell>Size</cell><cell>CRISPRi viability defect</cell><cell>CRISPRi terminal morphology</cell></row>
<row><cell>CDR20291_0351</cell><cell>Phosphoesterase</cell><cell>230 a.a. (b)</cell><cell>Weak</cell><cell>Normal</cell></row>
<row><cell>CDR20291_0481</cell><cell>Sugar isomerase/endonuclease</cell><cell>251 a.a.</cell><cell>Weak</cell><cell>Elongated</cell></row>
<row><cell>CDR20291_0828</cell><cell>DUF1846 domain</cell><cell>501 a.a.</cell><cell>Strong</cell><cell>Elongated</cell></row>
<row><cell>CDR20291_1053</cell><cell>Pyrophosphokinase</cell><cell>373 a.a.</cell><cell>Strong</cell><cell>Chaining, short cells, swollen cells, phase-bright</cell></row>
<row><cell>CDR20291_1054</cell><cell>Putative exported protein</cell><cell>291 a.a.</cell><cell>Strong</cell><cell>See CDR20291_1053</cell></row>
<row><cell>CDR20291_1055</cell><cell>Family 2 glycosyl transferase</cell><cell>230 a.a.</cell><cell>Strong</cell><cell>See CDR20291_1053</cell></row>
<row><cell>CDR20291_1056</cell><cell>Glycosyl transferase family protein</cell><cell>274 a.a.</cell><cell>Strong</cell><cell>See CDR20291_1053</cell></row>
<row><cell>CDR20291_1057</cell><cell>DUF3866 domain</cell><cell>355 a.a.</cell><cell>Not targeted</cell><cell></cell></row>
<row><cell>CDR20291_1124</cell><cell>Putative membrane protein</cell><cell>723 a.a.</cell><cell>Moderate</cell><cell>Chaining, curved cells, phase-bright</cell></row>
<row><cell>CDR20291_1171</cell><cell>UvrD/REP type DNA helicase</cell><cell>593 a.a.</cell><cell>None</cell><cell>Normal</cell></row>
<row><cell>CDR20291_1418B</cell><cell>None</cell><cell>113 a.a.</cell><cell>Not targeted</cell><cell>Not done</cell></row>
<row><cell>CDR20291_2521</cell><cell>PDZ, radical SAM, and DUF512 domains</cell><cell>466 a.a.</cell><cell>Not targeted</cell><cell>Not done</cell></row>
<row><cell>CDR20291_2526</cell><cell>Two-component response regulator</cell><cell>230 a.a.</cell><cell>Moderate</cell><cell>Mostly normal, a few curved</cell></row>
<row><cell>CDR20291_2569</cell><cell>Putative calcium-chelating exported protein</cell><cell>308 a.a.</cell><cell>None</cell><cell>Normal</cell></row>
<row><cell>CDR20291_3525</cell><cell>Conserved hypothetical protein</cell><cell>61 a.a.</cell><cell>Not targeted</cell><cell>Not done</cell></row>
</table>
<note>(a) These proteins were classified as essential in our Tn-seq and either essential or ambiguous by Dembek et al., 2015. CDR20291_3519 and CDR20291_3520 are omitted because their essentiality is likely due to polarity onto dnaC and/or rplI. (b) a.a., amino acids.</note>
</figure>
<p>We used the same R20291 strain and mariner-based transposon as in the previous study <ref type="bibr">(14)</ref>. Mariner is a good choice for C. difficile because it inserts at TA dinucleotides, and the genome G + C content is 29% <ref type="bibr">(24,</ref><ref type="bibr">25)</ref>. However, our experimental design differed from Dembek et al. in several noteworthy respects: (i) we used TY media, (ii) we constructed three independent insertion libraries, (iii) we used a different algorithm for classifying genes as essential or non-essential, and (iv) we determined insertion profiles at both an early and a late timepoint because gradual loss of slow-growing mutants from the pools influences perceptions of gene essentiality. Our early timepoint consisted of primary insertion libraries recovered directly from selection plates after ~18 hours of incubation. For a later timepoint, libraries were sub-cultured in duplicate into TY and harvested after seven generations of outgrowth. Genomic DNA was extracted from libraries, and transposon insertion sites were identified by DNA sequencing following linear PCR, C-tailing, and the addition of barcodes and adaptors as described <ref type="bibr">(26)</ref>. This method was originally referred to as Tn-seq, and we have adopted that terminology here. Insertion profiles were analyzed using TRANSIT2 and the C. difficile R20291 reference genome NC_013316.1 <ref type="bibr">(27,</ref><ref type="bibr">28)</ref>. Depending on the experimental replicate, insertions were identified in 117,217-204,061 of the 502,945 unique TA dinucleotides in the R20291 genome (Table <ref type="table">3</ref>). A total of 289,505 TA sites sustained at least one Tn insertion across the three libraries. TRANSIT2 makes essentiality calls by comparing the observed frequency of Tn insertions to the availability of potential TA insertion sites. Genes are classified as essential (E or EB, depending on the model for statistical analysis), not essential (NE), or unclear (U) <ref type="bibr">(29)</ref>. 
Genes with too few TA sites for statistical analysis are designated short (S). After inspecting the output from TRANSIT2, we manually reclassified 11 NE or U genes as essential, giving them the designation Ei for "essential by inspection." Ten of these genes had a large number of TA sites but very few insertions. An example is the tRNA synthetase valS (CDR20291_3114), with insertions in only four of the possible 266 TA dinucleotides after outgrowth (Table <ref type="table">S3A</ref>). For comparison, TRANSIT2 scored the cell division gene ftsZ as essential even though there were insertions in 3 out of 110 TA sites.</p><p>All 10 genes that we moved to Ei based on a few insertions are considered essential in C. difficile and B. subtilis <ref type="bibr">(14,</ref><ref type="bibr">30)</ref>. The final Ei gene, murJ2 (CDR20291_3335), had a large number of insertions, but almost all of these were at the 3&#8242; end of the gene, suggesting a functional protein is still produced (Fig. <ref type="figure">3A</ref>). murJ2 was previously classified as essential in C. difficile by Dembek et al., but its ortholog is not essential in B. subtilis due to functional redundancy <ref type="bibr">(30,</ref><ref type="bibr">31)</ref>. Of the 3,673 annotated protein-coding genes in R20291, 346 were scored as essential for vegetative growth in the initial libraries and/or after outgrowth (Table <ref type="table">S3A</ref>). We grouped these genes into functional categories similar to those used in previous studies of B. subtilis and S. aureus (Table <ref type="table">4</ref> and Table <ref type="table">S3B</ref>) <ref type="bibr">(32,</ref><ref type="bibr">33)</ref>. As expected, over half are involved in DNA metabolism (25 genes), RNA metabolism (24 genes), protein synthesis (113 genes), or cell envelope biogenesis (76 genes). Also, as expected, the majority of C. difficile's essential genes are conserved; BioCyc assigned a B. 
subtilis ortholog for 272 of the 346 genes, of which 169 are essential (Table <ref type="table">S3A</ref> and <ref type="table">B</ref>) <ref type="bibr">(30)</ref>.</p><p>The above numbers are derived from analyzing all experimental replicates together, raising questions about how having three independent libraries and an outgrowth step influenced essentiality calls. TRANSIT2 flagged 283, 370, and 328 genes as essential when the libraries were analyzed individually, but that number dropped to 263 in the combined analysis (Table <ref type="table">S3C</ref>). Of these 263 genes, 219 were essential in all three libraries, 19 in two libraries, and 15 in only one library. Ten genes were classified as essential in the combined analysis, even though they were not essential in any individual library. Thus, overall, there was good agreement across the three primary insertion libraries, and the primary effect of combining three libraries was to reduce the number of genes classified as essential. This makes biological sense because as the number of Tn insertions goes up, the number of non-essential genes that lack insertions for stochastic reasons goes down. Conversely, 72 genes were newly classified as essential based on the outgrowth experiments, bringing the total to 335 (263 + 72 = 335; Table <ref type="table">S3A</ref>). This increase makes biological sense because mutants are lost from the pool during outgrowth for two reasons: bottlenecks at subculturing (a source of false positives) and gradual loss of slow-growing mutants from the pool (genes that are truly essential or quasi-essential). Inclusion of 11 genes that were classified as essential by inspection brings the final overall tally to 346.</p></div>
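<div xmlns="http://www.tei-c.org/ns/1.0"><p>The argument that denser libraries reduce stochastic false positives, and the tally of essential genes, can both be illustrated numerically. The sketch below is a back-of-the-envelope model and not the statistical method implemented in TRANSIT2: it assumes every TA site is hit independently with the same probability, which real libraries do not satisfy. The function and variable names are hypothetical; the site and gene counts are those quoted in the text.</p><ab><![CDATA[
```python
TOTAL_TA_SITES = 502_945          # unique TA dinucleotides in the R20291 genome
HIT_SMALLEST_REPLICATE = 117_217  # TA sites hit in the least saturated replicate
HIT_ALL_LIBRARIES = 289_505       # TA sites hit at least once across the three libraries


def p_no_insertion(n_ta_sites: int, sites_hit: int,
                   total_sites: int = TOTAL_TA_SITES) -> float:
    """Chance that a non-essential gene with n TA sites has zero insertions,
    under the simplifying assumption of uniform, independent site usage."""
    saturation = sites_hit / total_sites
    return (1.0 - saturation) ** n_ta_sites


for n in (5, 10, 20):
    single = p_no_insertion(n, HIT_SMALLEST_REPLICATE)
    pooled = p_no_insertion(n, HIT_ALL_LIBRARIES)
    print(f"{n:>2} TA sites: single replicate {single:.2e}, pooled {pooled:.2e}")

# Tally of essentiality calls, using the counts given in the text.
agreement = {
    "all three libraries": 219,
    "two libraries": 19,
    "one library": 15,
    "combined analysis only": 10,
}
combined_primary = sum(agreement.values())
after_outgrowth = combined_primary + 72    # genes added by the outgrowth data
final_tally = after_outgrowth + 11         # genes added by manual inspection (Ei)
print(combined_primary, after_outgrowth, final_tally)
```
]]></ab><p>Under this simplified model, the chance that a gene escapes insertion purely by chance falls steeply with both gene size and library saturation, which is consistent with the smaller number of essential calls in the combined analysis.</p></div>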
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Comparison of our Tn-seq data to Dembek et al.</head><p>There is good overall agreement between our Tn-seq essentiality calls and those made previously. Of the 346 genes identified as essential in our experiments, 283 (82%) were also essential for Dembek et al. (Fig. <ref type="figure">4</ref>; Table <ref type="table">S3A</ref>). Overall, however, we scored fewer genes as essential, 346 versus 404 (Fig. <ref type="figure">4</ref>). This is a difference of 58 genes and comprises 121 genes uniquely essential for Dembek et al. and 63 genes uniquely essential for us (404 - 121 + 63 = 346). These differences could reflect both differences in experimental design and the randomness inherent in transposon insertion sequencing, and both data sets are expected to include some misclassified genes. Distinguishing between these sources of variation in the case of specific genes would require follow-up experiments, which we did not undertake. Nevertheless, we can speculate. In particular, we suspect the primary reason we identified fewer genes as essential is that our insertion libraries were much larger. If the primary driver were the use of different media, we would expect discrepancies to be skewed toward more essentiality of metabolism genes. Of the 63 genes uniquely essential to our study, only 14 appear to be directly involved in metabolism (brnQ1, ctfA, cmk, dpaL, fchA, gutB, hisC, ilvB, ilvC, panB, pspA, rpe, thiG, and yidE). If the primary driver of differences were our reanalysis of the mutant pools after outgrowth, we should have classified more genes as essential than did Dembek et al., but the opposite is true. Moreover, of the 72 genes that made our list because they became essential during outgrowth, only 25 were non-essential in Dembek et al., and inspection of this gene list does not reveal any obvious reasons why these 25 genes would be uniquely important during prolonged growth.</p><p>A further potential source of differences is the use of different algorithms for making essentiality calls. Naively, one might suppose that the presence of even a single Tn insertion in a gene would be sufficient to score that gene as non-essential, in which case the algorithm should not matter. But, in practice, insertions can be mapped to genes by errors in work-up steps. In addition, some regions of the chromosome are more accessible to transposons than others. Lastly, insertions into the 3&#8242; end of a gene might not inactivate it. For these and other considerations, a variety of algorithms have been developed for making essentiality calls based on the availability of TA sites and the local density of insertions observed. We do not know to what extent use of different algorithms explains differences in essentiality calls between the two C. difficile studies, but one potential example is the DNA polymerase gene polA, which was classified as essential by us but not by Dembek et al. Although polA sustained Tn insertions at 205 of its 312 TA sites in our experiments, these insertions were restricted to the 3&#8242; end of the gene, implying polA has an essential N-terminal domain (Fig. <ref type="figure">3B</ref>; discussed below).</p></div>
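<div xmlns="http://www.tei-c.org/ns/1.0"><p>Two pieces of reasoning in this section lend themselves to a short sketch: the overlap arithmetic between the two essential gene lists, and the idea that a gene with many insertions can still be called essential when those insertions avoid its 5&#8242; end. The code below is illustrative only. The helper names, the toy inputs, and the convention that TA-site positions are given in gene coordinates running 5&#8242; to 3&#8242; are assumptions made here, and the second function is not the algorithm used by TRANSIT2.</p><ab><![CDATA[
```python
def venn_counts(ours, previous):
    """Sizes of the shared and study-specific parts of two gene lists."""
    ours, previous = set(ours), set(previous)
    return {
        "shared": len(ours & previous),
        "only_ours": len(ours - previous),
        "only_previous": len(previous - ours),
    }


def insertion_free_5prime_fraction(ta_sites, hit_sites):
    """Fraction of a gene's TA sites, counted from the 5' end, that lie
    upstream of the first TA site carrying an insertion."""
    hits = set(hit_sites)
    ordered = sorted(ta_sites)
    for index, position in enumerate(ordered):
        if position in hits:
            return index / len(ordered)
    return 1.0  # no insertions anywhere in the gene


# Counts from the text: 346 essential genes here, 404 previously, 283 shared.
OURS, PREVIOUS, SHARED = 346, 404, 283
only_previous = PREVIOUS - SHARED
only_ours = OURS - SHARED

# Toy inputs (not real data) to show both helpers in use.
toy_overlap = venn_counts({"a", "b", "c"}, {"b", "c", "d", "e"})
toy_fraction = insertion_free_5prime_fraction([10, 20, 30, 40], [30, 40])
print(only_previous, only_ours, toy_overlap, toy_fraction)
```
]]></ab><p>A large insertion-free fraction at the 5&#8242; end of a gene that otherwise tolerates insertions is the pattern described above for polA; a per-gene insertion count alone would not reveal it.</p></div>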
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Comparison of our Tn-seq data to results obtained by CRISPRi</head><p>There is good overall agreement between our CRISPRi and Tn-seq data sets. Of the 141 genes for which CRISPRi elicited a strong or moderate viability defect, 129 (~90%) scored as essential in our Tn-seq (Fig. <ref type="figure">1C</ref>; Table <ref type="table">S3A</ref>). Conversely, only 4 out of 14 genes (~30%) that appeared to be non-essential by CRISPRi nevertheless scored as essential in our Tn-seq. These four genes are an uncharacterized DNA helicase (CDR20291_1171), a sporulation-associated phosphatase (ptpB), an acetyl-CoA thiolase (thlA2), and a putative exported Ca2+-chelating protein (ykwD). None of these has an essential ortholog in B. subtilis. Two labs have constructed null mutants of ptpB, indicating it is not essential <ref type="bibr">(34,</ref><ref type="bibr">35)</ref>. One study reported a growth defect <ref type="bibr">(35)</ref>, which might explain why ptpB appears to be essential by Tn-seq.</p><p>Figure legend (fragment): Short refers to genes that are too short to be called by TRANSIT2. Note that 12 genes identified as essential by Dembek could not be mapped to our data set as they were not present in the genome annotation used here. Category "ambiguous" combines unclear with unclear/NE from Table <ref type="table">S3</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DNA metabolism</head><p>Some DNA replication proteins have different names in B. subtilis and E. coli. Where there are conflicts, we adopted the names used in B. subtilis, which, in some cases, differ from the names used in BioCyc. We identified 16 widely conserved DNA replication genes as essential in C. difficile. All but pcrA and polA were previously classified as essential in C. difficile, and all but polA are essential in B. subtilis <ref type="bibr">(14,</ref><ref type="bibr">30)</ref>. Interestingly, polA is domain essential in C. difficile-Tn insertions were recovered in the C-terminal 3&#8242; to 5&#8242; exonuclease and DNA polymerase domains but not in the N-terminal 5&#8242; to 3&#8242; exonuclease domain, which removes Okazaki fragments (Fig. <ref type="figure">3B</ref>). Similar restricted essentiality of the polA 5&#8242; to 3&#8242; exonuclease domain has been reported in Streptococcus and Haemophilus <ref type="bibr">(36,</ref><ref type="bibr">37)</ref>. Organisms like B. subtilis, in which the entire polA gene is dispensable, have an RNAse H that can remove Okazaki fragments <ref type="bibr">(38)</ref>. Interestingly, C. difficile lacks dnaB <ref type="bibr">(39)</ref>. DnaB is an essential protein in B. subtilis, where it works together with DnaD and DnaI to load the replicative helicase DnaC onto oriC DNA <ref type="bibr">(40)</ref>. DnaB and DnaD are structurally related. It has been proposed that in C. difficile, the DnaD ortholog (CDR20291_3512) fulfills the functions of both DnaB and DnaD <ref type="bibr">(39)</ref>.</p><p>C. difficile has four essential DNA packaging and segregation genes, all of which are also essential in B. subtilis. In addition, there are three essential DNA recombination and repair genes, none of which are essential in B. subtilis.</p><p>LexA, which represses genes involved in the SOS response, is required for viability in C. difficile but not in B. subtilis. A C. 
difficile lexA ClosTron insertion mutant has been described and grows poorly, so its apparent essentiality by Tn-seq may be due to slow growth rather than lack of viability per se <ref type="bibr">(19)</ref>. However, the strong viability defect we observed upon CRISPRi knockdown of lexA (Table <ref type="table">S1</ref>) raises the possibility that the reported mutant retains partial function or acquired a suppressor.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>RNA metabolism</head><p>As expected, the core subunits and major sigma factor (&#963;70) of RNA polymerase are all essential. Surprisingly, the omega subunit (rpoZ) is also essential according to Tn-seq, even though it is not essential in B. subtilis, S. aureus, or E. coli <ref type="bibr">(30,</ref><ref type="bibr">41,</ref><ref type="bibr">42)</ref>. The apparent essentiality of rpoZ is likely to be an artifact of polarity because it is predicted to be co-transcribed with three widely conserved essential genes: dapF, gmk, and coaBC. The elongation factor greA and three termination/anti-termination factors (nusA, nusG, and rho) are essential. Of these, only nusA is essential in B. subtilis <ref type="bibr">(30,</ref><ref type="bibr">43)</ref>. In C. difficile, rho mutations have been reported, including an early frameshift, but the gene could not be deleted, possibly because the mutant is too sick <ref type="bibr">(44)</ref>.</p><p>In all, 15 genes for enzymes that modify RNA were essential in our analysis, of which 12 were essential or ambiguous for Dembek et al., but only 8 are considered essential</p><p>Five widely conserved small GTPases involved in ribosome assembly are essential, as are 10 translation factors, including smpB, which encodes a component of the SsrA tagging complex that rescues stalled ribosomes by trans-translation <ref type="bibr">(45)</ref>. We confirmed the essentiality of smpB by CRISPRi (Table <ref type="table">S1</ref>; "weak" viability defect). The essentiality of smpB is unlikely to be an artifact of polarity because it is not predicted to be co-transcribed with any other genes. SmpB is essential in S. aureus <ref type="bibr">(33,</ref><ref type="bibr">46)</ref> but not in E. coli, Streptococcus sanguinis, or B. subtilis <ref type="bibr">(30,</ref><ref type="bibr">47,</ref><ref type="bibr">48)</ref>. 
An interesting omission from the list of essential translation factors is elongation factor Tu (EF-Tu), which is essential in B. subtilis <ref type="bibr">(30)</ref>. This difference can be explained by the presence of two EF-Tu genes in C. difficile, tufA and tufB, which are 100% identical at the DNA level. Simultaneous knockdown of tufA and tufB with CRISPRi caused a strong viability defect, demonstrating EF-Tu is indeed required for viability (Table <ref type="table">S1</ref>).</p><p>We identified 24 essential tRNA synthetases, all of which are also essential according to Dembek et al. There are several noteworthy differences in comparison to B. subtilis. First, synthetases for asparagine (asnS), threonine (thrS), and tyrosine (tyrS) are essential in C. difficile but not B. subtilis, which has alternative routes for generating the corresponding charged tRNAs <ref type="bibr">(49)</ref><ref type="bibr">(50)</ref><ref type="bibr">(51)</ref>. Second, although glnS is essential in C. difficile, this gene does not exist in B. subtilis or most other Gram-positive bacteria, which generate Gln-tRNA(Gln) by a different route. Namely, C. difficile charges tRNA(Gln) directly with glutamine, as in E. coli, while most Gram-positive bacteria generate glutaminyl-tRNA(Gln) by (mis)charging tRNA(Gln) with glutamate, which is then amidated to glutamine <ref type="bibr">(52,</ref><ref type="bibr">53)</ref>. Lastly, C. difficile has two annotated genes for ligating proline to tRNA(Pro), the essential gene proS1 (CDR20291_0038) and the non-essential gene proS2 (CDR20291_0039). According to RNA sequencing, both are expressed during vegetative growth <ref type="bibr">(54)</ref>. B. subtilis has only a single proS gene, which is essential and more similar to C. difficile proS1 than proS2.</p><p>Five proteases appear to be important for viability in C. difficile: clpX, htrA, lon, prp, and the M16 family protease cdr20291_1161. Of these, only prp is essential in B. subtilis. 
Prp is a cysteine protease needed to remove an N-terminal extension from ribosomal protein L27 <ref type="bibr">(55)</ref>. The apparent essentiality of lon and cdr20291_1161 in C. difficile is likely to be an artifact of polarity onto engB and dapG, respectively. ClpX is a component of the ClpXP protease complex, one of the major housekeeping proteases in bacteria <ref type="bibr">(56)</ref>. C. difficile has only one clpX gene but two genes for ClpP, which might explain why clpX is essential but clpP1 and clpP2 are not. HtrA proteases are involved in protein quality control <ref type="bibr">(57)</ref>. TRANSIT2 scored htrA as essential despite a high number of Tn insertions (67 out of 127 TA sites), and this gene was not essential for Dembek et al. Moreover, htrA has been inactivated in strain 630&#916;erm, further indicating it is not truly essential <ref type="bibr">(58)</ref>.</p><p>In bacteria, protein synthesis begins with N-formyl methionine (fMet). Peptide deformylase (def) and methionine aminopeptidase (map) are essential enzymes that work sequentially to remove the formyl group from about 90% of proteins and the initiating methionine from about half of proteins. E. coli has only one def and one map gene, both of which are essential <ref type="bibr">(59)</ref>. C. difficile has two predicted map genes and two predicted def genes. Of these, only map1 is essential by Tn-seq. This situation is reminiscent of B. subtilis, which also has two def and two map genes. The def genes are functionally redundant and at least one must be present for viability <ref type="bibr">(60,</ref><ref type="bibr">61)</ref>. The essentiality of the map genes in B. subtilis is less clear. One study found mapA is essential, but mapB is not (62), while another found neither is individually essential <ref type="bibr">(30)</ref>. 
Bacteria have a plethora of systems for exporting proteins out of the cytoplasm, of which the three most important are the General Secretion (Sec) system, the Twin Arginine Translocation (Tat) system, and the Signal Recognition Particle (SRP) system <ref type="bibr">(63)</ref>. There is no Tat system in C. difficile, but the genes for the Sec and SRP systems are present and essential. The Sec system uses an ATPase named SecA to power the export of proteins through a membrane channel composed of SecEYG. Interestingly, C. difficile has two secA paralogs, which handle different protein substrates and are both essential <ref type="bibr">(64)</ref>. The SRP system works together with SecEYG to integrate proteins into the cytoplasmic membrane. Three genes associated with the SRP system (ffh, ftsY, and srpM) were scored as essential, although the apparent essentiality of srpM might result from polarity onto ffh; srpM is not essential in B. subtilis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Cell envelope</head><p>Numerous genes involved in membrane biogenesis are essential in C. difficile. An unexpected exception is the accBCDA gene cluster for the synthesis of malonyl-CoA, the substrate for fatty acid synthesis. This result is difficult to explain and probably incorrect because the acc cluster is essential according to Dembek et al., and we confirmed essentiality by CRISPRi (Table <ref type="table">S1</ref>). Moreover, acc genes are also essential in B. subtilis <ref type="bibr">(30)</ref>. Nevertheless, the acc cluster sustained numerous Tn insertions in our study (e.g., 10 of the 48 TA sites in accB, the first gene in the operon). We identified three membrane biogenesis genes that are essential in C. difficile but not in B. subtilis: fabH, yqhY, and gpsA. The fabH discrepancy can be explained by the presence of two fabH genes in B. subtilis <ref type="bibr">(65)</ref>. B. subtilis &#916;yqhY mutants are not stable <ref type="bibr">(66)</ref>, implying yqhY is quasi-essential in that organism. Regarding gpsA, although Koo et al. reported it is dispensable in B. subtilis <ref type="bibr">(30)</ref>, an earlier study found it is essential <ref type="bibr">(67)</ref>, which agrees with what we see in C. difficile.</p><p>C. difficile synthesizes isoprenoids via the methylerythritol (MEP) pathway ( <ref type="formula">68</ref>). Accordingly, dxr and ispDEFGH were all essential by Tn-seq. Isoprenoids are essential in bacteria because they are precursors for quinones and carrier lipids, such as undecaprenyl phosphate (Und-P) required for the synthesis of peptidoglycan and teichoic acids <ref type="bibr">(69)</ref>. C. difficile lacks quinones <ref type="bibr">(70)</ref>, so the essentiality of the MEP pathway presumably reflects the importance of Und-P. 
Consistent with this inference, the predicted undecaprenyl pyrophosphate synthase UppS1 is essential, although that conclusion comes with a caveat because insertions in uppS1 are probably polar onto the essential phospholipid biosynthesis gene cdsA <ref type="bibr">(71)</ref>. Interestingly, C. difficile has a non-essential uppS paralog called uppS2 that might be involved in the synthesis of a secondary cell wall polymer called PS-II <ref type="bibr">(72)</ref>. UppS2 is not essential by Tn-seq, and RNA sequencing implies expression of uppS2 is ~60-fold lower in vegetative cells compared to uppS1 <ref type="bibr">(54)</ref>.</p><p>The C. difficile cell has a unique proteinaceous surface layer (S-layer) and, as mentioned above, a unique wall polymer called PS-II, which is thought to function like teichoic acids found in Gram-positive model organisms but whose structure is quite different <ref type="bibr">(73)</ref>. Both the S-layer and PS-II are essential by Tn-seq, although the existence of (unhealthy) null mutants of slpA indicates the S-layer is not strictly required for viability <ref type="bibr">(74,</ref><ref type="bibr">75)</ref>. Multiple studies point to the essentiality of PS-II <ref type="bibr">(72,</ref><ref type="bibr">76,</ref><ref type="bibr">77)</ref>. Whether PS-II is essential because it plays a critical role in cell envelope integrity or because disruption of the PS-II gene cluster depletes the pool of Und-P needed for peptidoglycan synthesis remains to be determined <ref type="bibr">(78,</ref><ref type="bibr">79)</ref>.</p><p>The universal precursor for peptidoglycan synthesis is lipid II, a disaccharide-pentapeptide attached to Und-P (80). As expected, many lipid II genes are essential, including six dap genes for biosynthesis of lysine and diaminopimelic acid, and nine mur genes for various steps in lipid II assembly. 
Lipid II is transported across the cytoplasmic membrane by flippases, of which there are two known families, MurJ and Amj <ref type="bibr">(31,</ref><ref type="bibr">81)</ref>. BLAST searches indicate that C. difficile lacks Amj but has two MurJ homologs, both of which are essential. MurJ1 is part of the PS-II gene cluster and is proposed to transport a lipid-linked precursor for PS-II synthesis <ref type="bibr">(76)</ref>, which leaves MurJ2 as the likely lipid II flippase for peptidoglycan synthesis. Some non-essential proteins distantly related to MurJ can be identified using HHPred and could also potentially transport lipid II <ref type="bibr">(82,</ref><ref type="bibr">83)</ref>. Further work is needed to establish the functions of the two clear MurJ homologs and rule out the presence of alternative or additional lipid II transporters <ref type="bibr">(31,</ref><ref type="bibr">84,</ref><ref type="bibr">85)</ref>. The final steps of peptidoglycan synthesis involve incorporation of new disaccharide-pentapeptide subunits into the existing wall by sequential glycosyltransferase (GTase) and transpeptidase (TPase) reactions <ref type="bibr">(86,</ref><ref type="bibr">87)</ref>. These reactions are catalyzed by two types of penicillin-binding proteins (PBPs) <ref type="bibr">(88)</ref>. Class A PBPs (aPBPs) are bifunctional enzymes with both a GTase domain and a TPase domain, while class B PBPs (bPBPs) have a TPase domain and form a complex with a SEDS-family GTase <ref type="bibr">(89)</ref><ref type="bibr">(90)</ref><ref type="bibr">(91)</ref>. C. difficile encodes one aPBP (PBP1), three bPBPs (PBP2, PBP3, and SpoVD), and two SEDS proteins (RodA and SpoVE). Of these proteins, we confirmed by Tn-seq that PBP1, PBP2, and RodA are essential for vegetative growth <ref type="bibr">(14)</ref>. 
Although spoVE was also classified as essential, it sustained Tn insertions in about half the available TA sites (Table <ref type="table">S3A</ref>), and the gene has been deleted previously <ref type="bibr">(92)</ref>. In confirmation and extension of previous reports <ref type="bibr">(15,</ref><ref type="bibr">93)</ref>, CRISPRi knockdown of PBP1 caused filamentation, while CRISPRi knockdown of PBP2 and RodA resulted in the formation of short, swollen, phase-bright cells, with some chaining (Fig. <ref type="figure">S1</ref>). These morphologies implicate PBP1 in cell division and PBP2 in elongation, respectively. We also examined red fluorescent protein (RFP) fusions to the PBPs and observed that both localize to division sites (Fig. <ref type="figure">5</ref>). Septal localization of PBP1 has been reported by Shen's group, who showed that it is the primary synthase for septal peptidoglycan <ref type="bibr">(93)</ref>. Septal localization of PBP2 suggests the RodA/PBP2 complex might also contribute to cell division, as further suggested by the mild chaining phenotypes caused by CRISPRi knockdown. Both RFP-PBP1 and RFP-PBP2 exhibited some fluorescence along the cell cylinder, which could indicate they contribute to elongation, especially in the case of PBP2. However, localization to the cell cylinder is not diagnostic of a function in elongation because this is the default location of divisome proteins when they are not at the septum. Finally, it should be noted that non-canonical 3-3 crosslinks made by L,D-transpeptidases (LDTs) are essential for vegetative growth in C. difficile, but none of the five LDTs in the C. difficile genome is individually essential owing to functional redundancy <ref type="bibr">(94)</ref>.</p><p>Our Tn-seq identified two cell envelope-related regulatory loci as essential: walRK and ddlR. These regulators were also essential for Dembek et al. 
walRK is a two-component system known to be essential for cell wall homeostasis and viability in numerous Bacillota, including C. difficile <ref type="bibr">(54,</ref><ref type="bibr">95)</ref>. DdlR is essential for peptidoglycan synthesis because it activates expression of the D-alanyl-D-alanine ligase ddl <ref type="bibr">(96)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Cell shape and division</head><p>In rod-shaped bacteria, the essential peptidoglycan synthases work in the context of loosely defined complexes known as the elongasome and the divisome <ref type="bibr">(86,</ref><ref type="bibr">87)</ref>. The C. difficile elongasome appears to comprise the RodA/PBP2 bipartite peptidoglycan synthase and four Mre proteins (MreB1, MreB2, MreC, and MreD). All of these are essential by Tn-seq and CRISPRi, although this inference will need to be revisited with non-polar deletions. CRISPRi knockdown implicates these genes primarily in elongation, because the predominant terminal morphologies include short, swollen cells with some chaining (Fig. <ref type="figure">S1</ref>; Table <ref type="table">S1</ref>). Among canonical divisome proteins, only ftsZ and its assembly factors sepF and zapA are essential in C. difficile. Neither sepF nor zapA is essential in B. subtilis <ref type="bibr">(97)</ref><ref type="bibr">(98)</ref><ref type="bibr">(99)</ref>. The greater importance of sepF and zapA in C. difficile might be due to the absence of an ftsA ortholog (100). As noted above, the primary septal peptidoglycan synthase is the class A enzyme PBP1 <ref type="bibr">(93)</ref>. Consistent with that inference, CRISPRi against pbp1 induces filamentation; however, additional morphological defects such as bending and chaining suggest PBP1 might contribute to elongation as well <ref type="bibr">(15)</ref>. Curiously, the division site placement genes minCDE are essential in C. difficile. This result might be an artifact of polarity onto the essential SEDS gene rodA, because the Min system is not essential in B. subtilis or most other bacteria <ref type="bibr">(101)</ref>. Tn-seq identified maf as essential. Maf is a nucleotide pyrophosphatase whose overproduction causes filamentation in both B. subtilis and E. 
coli, but Maf is not essential in either organism <ref type="bibr">(102)</ref><ref type="bibr">(103)</ref><ref type="bibr">(104)</ref>. The DNA-binding protein WhiA was essential for Dembek et al., and we observed a weak viability defect and modest cell elongation by CRISPRi, but whiA is not essential in our Tn-seq experiments. WhiA is conserved in monoderms and essential in Mycobacterium tuberculosis but not Streptomyces or B. subtilis, where it has been linked to cell division and chromosome segregation <ref type="bibr">(105)</ref><ref type="bibr">(106)</ref><ref type="bibr">(107)</ref><ref type="bibr">(108)</ref><ref type="bibr">(109)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Use of RFP fusions to identify new divisome proteins</head><p>We have a long-standing interest in bacterial cell division, so we extended our studies to include a screen for divisome proteins <ref type="bibr">(110)</ref><ref type="bibr">(111)</ref><ref type="bibr">(112)</ref><ref type="bibr">(113)</ref><ref type="bibr">(114)</ref>. Using CRISPRi knockdown to identify divisome proteins by screening for a filamentous phenotype comes with two major caveats: polarity onto a bona fide division gene will generate false positives, and depletion of non-essential divisome proteins might not cause cells to become longer than normal. A more direct approach is to use fluorescent tags to screen for proteins that localize to the division site. Here, the major caveat is that the tag might interfere with proper localization. We used BLAST searches to identify homologs of known morphogenesis proteins, which were fused to a codon-optimized RFP and produced from a plasmid under control of the xylose-inducible promoter, P xyl <ref type="bibr">(15)</ref>. Some of these proteins are encoded in (predicted) operons with proteins of unknown function, so we constructed RFP fusions to several of these as well. Although septal localization is strong evidence for a role in cell division, the lack of septal localization is uninformative because we did not test whether our RFP fusions are functional. We screened a total of 25 proteins, of which 18 localized and are discussed below (Fig. <ref type="figure">5</ref>). The seven that did not localize are MreB1, MreB2, FtsL, FtsB, SpoVE, CDR_3330, and CDR_2504.</p><p>Seven enzymes for peptidoglycan synthesis exhibited convincing midcell localization, including the two essential PBPs (PBP1 and PBP2), one essential SEDS protein (RodA), one non-essential monofunctional glycosyltransferase related to PBPs (Mgt), and three non-essential LDTs (Ldt1, Ldt4, and Ldt5). 
Of these, PBP1 was already known to localize to sites of cell division <ref type="bibr">(93)</ref>, but septal localization of the remaining enzymes is new and suggests they too contribute to the synthesis of septal peptidoglycan. Somewhat surprisingly, the canonical elongasome proteins MreC and MreD localized strongly to the midcell, even though our fusions to MreB1 and MreB2 did not. Mre proteins have been reported to localize transiently at or near the midcell in a few other bacteria <ref type="bibr">(115)</ref><ref type="bibr">(116)</ref><ref type="bibr">(117)</ref><ref type="bibr">(118)</ref>. Further work is warranted to investigate the role of the Mre proteins in C. difficile and the possibility that MreC and MreD localize independently of MreB, for which there is precedent from non-rod-shaped bacteria that have MreC and MreD but lack MreB <ref type="bibr">(118,</ref><ref type="bibr">119)</ref>. C. difficile orthologs of five widely conserved divisome proteins localized to the midcell: FtsZ, FtsK, FtsQ, SepF, DivIVA, as did CDR_3331, a unique protein with limited structural similarity to both FtsL and FtsB, which in C. difficile are used for asymmetric division during sporulation <ref type="bibr">(14,</ref><ref type="bibr">93)</ref>. Septal localization of C. difficile FtsZ has been reported previously <ref type="bibr">(120)</ref>. Septal localization of FtsQ is new but probably misleading because C. difficile ftsQ is a sporulation gene and not expressed during vegetative growth <ref type="bibr">(14,</ref><ref type="bibr">54,</ref><ref type="bibr">92)</ref>, whereas we produced RFP-FtsQ from P xyl . Immediately downstream of ftsQ are two genes of unknown function, ylxW and ylxX, that, according to RNA sequencing, are expressed in vegetative cells <ref type="bibr">(54)</ref>. YlxW and YlxX are encoded downstream of ftsQ in many Bacillota and have been proposed on this basis to play a role in envelope biogenesis <ref type="bibr">(121)</ref>. 
Our observation that these proteins localize to the midcell argues that they are involved in cell division. Another novel divisome protein identified in our screen is YlmG, a small membrane protein encoded in the sepF operon of many Gram-positive bacteria and Cyanobacteria <ref type="bibr">(100)</ref>. Mutants of ylmG have been constructed in several organisms and exhibit thin septa, poor sporulation, and/or aberrant nucleoid compaction and segregation, depending on the species (100). In closing, and for completeness, we note that four additional proteins have been shown previously to localize to the division site in C. difficile: ZapA, MldA, MldB, and MldC <ref type="bibr">(114,</ref><ref type="bibr">122)</ref>. This brings the total number of documented divisome proteins to 22.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Metabolism</head><p>For an insightful overview of energy metabolism in C. difficile, readers are referred to a review by Neumann-Schaal et al. <ref type="bibr">(123)</ref>. Briefly, C. difficile is an obligate anaerobe that generates energy through fermentation of sugars and amino acids, the latter by a process known as Stickland reactions <ref type="bibr">(124,</ref><ref type="bibr">125)</ref>. There is no electron transport chain. Hence, the five genes that are essential for menaquinone biosynthesis in B. subtilis are not found in C. difficile's genome. The TCA cycle is incomplete and is used to generate precursor metabolites rather than energy. Fermentation pathways generate ATP directly by substrate-level phosphorylation but can also be used via electron bifurcation and the Rnf complex to generate a motive force across the cytoplasmic membrane <ref type="bibr">(126,</ref><ref type="bibr">127)</ref>. Whether this is a proton or a sodium-ion motive force is not yet known; we will assume protons for simplicity, that is, a PMF. C. difficile has an F0F1-type ATP synthase, which, depending on the needs of the organism, can consume the PMF to generate ATP or hydrolyze ATP to generate a PMF.</p><p>Few of the genes involved in these various pathways scored as essential by Tn-seq. Genes for the TCA cycle, acetate kinase, and the major Stickland reductases for glycine, proline, and leucine are all non-essential, as are the genes for the Rnf complex and three electron bifurcation complexes (etf genes). The essentiality of genes for glycolysis is less clear because eight of these were essential for Dembek et al., but only two (eno, tpiA) were essential in our experiments. Glycolysis might have more of a contribution to growth on BHIS, which contains glucose, than to growth on TY. Differences in slow growth and statistical cutoffs that impact essentiality calls may also factor into the discrepancies. 
In support of this explanation, we observed a small colony phenotype when we used CRISPRi to knock down expression of four glycolysis genes (fba, gapB, pgi, and pfkA) that were essential for Dembek et al. but not in our Tn-seq (Table <ref type="table">S1</ref>). A further point to keep in mind is that glycolysis genes could be more important for supplying precursor metabolites rather than energy in C. difficile.</p><p>A noteworthy discrepancy concerns the 10-gene operon for the F-type ATPase. Dembek et al. scored 9 of the genes as essential, but all 10 were non-essential in our Tn-seq experiments. This gene cluster is too large to have escaped Tn insertions by chance. The most likely explanation for this discrepancy has to do with how slow growth affects perceptions of essentiality. In support of this interpretation, we observed that CRISPRi knockdown of atpB and atpD resulted in a small colony phenotype (Fig. <ref type="figure">3C</ref>). We also tested the effect of knockdowns in TY broth using one sgRNA that caused a small colony phenotype (atpD) and one that did not (atpF). Interestingly, both knockdowns caused a strong growth defect in broth, but only if cultures were pre-grown overnight in 1% xylose to deplete the AtpD or AtpF proteins before sub-culturing (Fig. <ref type="figure">3D</ref>). As an aside, we found that all four atp operon knockdowns were sensitized to subinhibitory concentrations of the uncoupler carbonyl cyanide m-chlorophenylhydrazone (CCCP, Fig. <ref type="figure">3E</ref>), which hints at the potential for using our CRISPRi library to study drug targets in C. difficile <ref type="bibr">(7,</ref><ref type="bibr">12)</ref>. Three genes (hisC, ilvB, and ilvC) involved in amino acid biosynthesis were identified as essential despite the utilization of a growth medium rich in tryptone. These genes were not essential for Dembek et al. 
Note, however, that there are seven essential lysine biosynthesis genes, which we categorized under cell envelope rather than metabolism owing to their role in the synthesis of diaminopimelate for peptidoglycan. Finally, the global regulator CodY is essential by Tn-seq. CodY is widely conserved in Bacillota and senses GTP and branched-chain amino acids to regulate gene expression in response to the energetic and nutritional needs of the cell. In C. difficile, CodY represses hundreds of genes during exponential growth, and a codY null mutant grows poorly upon entry into stationary phase <ref type="bibr">(128)</ref><ref type="bibr">(129)</ref><ref type="bibr">(130)</ref>, which likely explains the Tn-seq result.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Nucleotides and cofactors</head><p>We identified 11 genes essential for nucleotide biosynthesis. All 11 were also essential or ambiguous for Dembek et al., and 5 are essential in B. subtilis as well. One interesting difference is that an anaerobic ribonucleotide reductase encoded by nrdD and nrdG is essential in C. difficile, but these genes are not found in B. subtilis, which has instead an aerobic ribonucleotide reductase encoded by nrdE and nrdF that are not found in C. difficile <ref type="bibr">(131)</ref>. Two additional exceptions are guaA (GMP synthase) and thyA (thymidylate synthase), which are essential in C. difficile and S. aureus but not B. subtilis <ref type="bibr">(14,</ref><ref type="bibr">30,</ref><ref type="bibr">33)</ref>. Two genes for regulatory nucleotides appear to be essential in C. difficile, the cyclic-di-AMP phosphodiesterase yybT and the bifunctional (pp)pGpp synthase/hydrolase relA. In C. difficile, relA is also called rsh and synthesizes exclusively pGpp <ref type="bibr">(132,</ref><ref type="bibr">133)</ref>. The essentiality of relA was confirmed by CRISPRi (Table <ref type="table">S1</ref>). The B. subtilis orthologs of yybT and relA are not essential <ref type="bibr">(30)</ref>, and their apparent essentiality in C. difficile could be due to polarity. However, in addition to the putatively essential bifunctional enzyme RelA, C. difficile contains a non-essential monofunctional pGpp synthetase RelQ <ref type="bibr">(134)</ref>. Based on analogy to other organisms, RelA could be essential in C. difficile because its hydrolase activity is needed to counterbalance the pGpp synthetase activity of RelQ <ref type="bibr">(135,</ref><ref type="bibr">136)</ref>. As an aside, we note that cyclic-di-AMP is essential in C. 
difficile growing on rich media, but c-di-AMP synthases were not identified by Tn-seq because there are two of them, neither of which is individually essential <ref type="bibr">(137)</ref>.</p><p>In all, 24 genes are essential for the synthesis of cofactors, despite the utilization of media containing tryptone and yeast extract. All but two of these were also essential or ambiguous for Dembek et al., and 14 have an essential ortholog in B. subtilis. Curiously, neither we nor Dembek et al. scored dihydrofolate reductase (dfrA) as essential. Dihydrofolate reductase is the target of several important antibiotics and is essential in E. coli, B. subtilis, S. sanguinis, and S. aureus <ref type="bibr">(30,</ref><ref type="bibr">33,</ref><ref type="bibr">47,</ref><ref type="bibr">48)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Phage and transposon-related genes</head><p>The C. difficile genome has a remarkably high content of mobile genetic elements <ref type="bibr">(25,</ref><ref type="bibr">138)</ref>. Mobile genetic elements are not part of the core genome and thus should not be essential for viability. Nevertheless, 21 genes classified as essential appear to reside on a prophage or a transposon. Some of these might be false positives because only eight were also essential or ambiguous for Dembek et al. Even the eight genes classified as essential in both studies are likely due to indirect effects such as the induction of a lytic prophage.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Transporters</head><p>Six genes for transporters were classified as essential in our Tn-seq, three of which were also essential for Dembek et al. and were confirmed by CRISPRi (Table <ref type="table">S1</ref>). These encode a predicted Ktr potassium transporter and a predicted CorA-like divalent metal ion transporter. In B. subtilis, there are two Ktr systems, which are not essential but improve growth at high osmolarity <ref type="bibr">(139)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Sporulation</head><p>Curiously, both we and Dembek et al. classified the sporulation-associated phosphatases ptpA and ptpB as ambiguous or essential for vegetative growth. Two labs have reported null mutants of these genes, so they are not formally essential <ref type="bibr">(34,</ref><ref type="bibr">140,</ref><ref type="bibr">141)</ref>. Loss of ptpA or ptpB enhances sporulation, which we confirmed using CRISPRi against ptpB (Table <ref type="table">S1</ref>). We presume that ptp genes are essential by Tn-seq because enhanced sporulation reduces vegetative growth.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Genes of unknown function</head><p>Our Tn-seq analysis identified 28 putatively essential genes that could not be assigned to a functional pathway. None of these genes have an essential ortholog in B. subtilis, although in five cases BioCyc identified a non-essential ortholog. Eleven of these genes were not essential for Dembek et al. and in two cases (cdr20291_3519 and cdr20291_3520) essentiality is likely due to polarity. That leaves 15 genes that are essential or ambiguous in two independent Tn-seq studies and are therefore likely to be bona fide essential genes. As noted above in the discussion of our CRISPRi experiments, we silenced expression of 11 of these genes and observed a viability defect for 9 of them, often accompanied by abnormal morphologies (Table <ref type="table">2</ref>, Table <ref type="table">S1</ref>). The apparently essential genes of unknown function constitute a high-value gene set from the perspectives of bacterial physiology and antibiotic development.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Conclusions</head><p>In summary, we identified 346 protein-encoding genes that, by Tn mutagenesis, are essential for vegetative growth of C. difficile strain R20291 on TY media. Of these, 283 were also identified as essential by Tn mutagenesis in a previous study <ref type="bibr">(14)</ref>, and 169 have an essential ortholog in B. subtilis <ref type="bibr">(30)</ref>. Overall, these results are broadly consistent with studies of gene essentiality in model organisms such as E. coli, B. subtilis, and S. aureus <ref type="bibr">(30,</ref><ref type="bibr">33,</ref><ref type="bibr">46,</ref><ref type="bibr">47,</ref><ref type="bibr">59)</ref>. The 283 C. difficile genes identified as essential in two independent Tn mutagenesis studies can be regarded as a consensus "essentialome" that minimizes false positives. Most of these genes play key roles in foundational cellular processes such as DNA replication, transcription, translation, and cell envelope biogenesis. But the consensus essentialome also includes 15 genes that could not be assigned to any functional pathway (Table <ref type="table">2</ref>, Table <ref type="table">S3A</ref> and <ref type="table">B</ref>). These genes might be targets for antibiotics that kill C. difficile without decimating the healthy microbiota needed to keep C. difficile in check.</p><p>We also used CRISPRi knockdown to investigate 181 genes that had been identified as essential in a previous transposon mutagenesis analysis <ref type="bibr">(14)</ref>. Our goals were to vet essentiality and screen for morphological defects that would facilitate assigning genes of unknown function to physiological pathways. Our CRISPRi platform used a plasmid that expresses dCas9 from a xylose-inducible promoter (P xyl ) and an sgRNA from a strong constitutive promoter (P gdh ) <ref type="bibr">(15)</ref>. 
CRISPRi resulted in reduced plating efficiencies and/or small colony phenotypes on TY-xylose plates for 167 of the 181 genes targeted, a very high confirmation rate of 92%. The 14 genes for which no viability defect was observed could be false positives from the previous report or genes for which our sgRNAs were ineffective. Of these genes, 10 sustained insertions in our Tn-seq experiments, so we infer they are non-essential. Four did not sustain Tn insertions and are therefore likely to be essential genes that were poorly repressed by our sgRNAs. Importantly, no growth defects were observed using 20 control sgRNAs that did not target anywhere in the genome, indicating off-target effects are rare. Microscopy of surviving cells scraped from the TY-xylose plates revealed that most knockdowns resulted in morphological abnormalities (151 out of 181 genes, 83%). Disappointingly, however, the utility of these defects for making functional assignments was limited by the observation that repressing genes of known function often resulted in non-intuitive defects. For example, repressing RNA polymerase gene rpoB resulted in severe filamentation suggestive of a cell division defect, while repressing the nucleotide biosynthesis gene guaA caused a chaining phenotype suggestive of a daughter cell separation defect. Non-intuitive phenotypes have also been reported in other CRISPRi screens <ref type="bibr">(7,</ref><ref type="bibr">8)</ref>.</p><p>The findings and resources presented here should help guide future studies of C. difficile. First, our results can be used to prioritize genes for more rigorous but labor-intensive investigation using depletion strains with in-frame deletions <ref type="bibr">(142)</ref>. The 15 apparently essential genes that could not be assigned to a functional pathway seem like a good place to start. 
Second, our CRISPRi library can be leveraged to investigate antibiotic sensitivities <ref type="bibr">(7,</ref><ref type="bibr">12,</ref><ref type="bibr">143)</ref>, which might illuminate gene function and reveal vulnerabilities that can be exploited to improve treatment of C. difficile infections. Third, the identification of 18 proteins that localize to the midcell raises new questions related to C. difficile morphogenesis. For example, septal localization of the canonical elongation proteins MreC and MreD suggests they contribute to cell division, and/or C. difficile elongates by inserting new peptidoglycan near the midcell. In addition, our discovery that YlmG, YlxW, and YlxX localize to the division site provides the most direct evidence to date that these conserved but enigmatic proteins play a role in cell division.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>MATERIALS AND METHODS</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Strains, media, and growth conditions</head><p>Most bacterial strains used in this study are listed in Table <ref type="table">S4</ref>. Strains and plasmids constructed for the CRISPRi library are summarized separately in Table <ref type="table">S2</ref>. C. difficile strains were derived from R20291 <ref type="bibr">(144)</ref>. C. difficile was routinely grown in tryptone-yeast extract (TY) medium, supplemented as needed with thiamphenicol at 10 &#181;g/mL (TY-Thi10). TY medium consisted of 3% tryptone, 2% yeast extract, and 2% agar (for plates). Brain heart infusion (BHI) medium was prepared per manufacturer's (DIFCO) instructions. C. difficile strains were maintained at 37&#176;C in an anaerobic chamber (Coy Laboratory Products) in an atmosphere of 2% H₂, 5% CO₂, and 93% N₂. Escherichia coli strains were grown in LB medium at 37&#176;C with chloramphenicol at 10 &#181;g/mL and/or ampicillin at 100 &#181;g/mL as needed. LB medium contained 1% tryptone, 0.5% yeast extract, 0.5% NaCl, and 1.5% agar (for plates). OD₆₀₀ measurements were made with the WPA Biowave CO8000 tube reader in the anaerobic chamber.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Plasmid and strain construction</head><p>Plasmids are listed in Table <ref type="table">S5</ref> and were constructed with HiFi DNA Assembly from New England Biolabs (Ipswich, MA). Oligonucleotide primers (Table <ref type="table">S6</ref>) were synthesized by Integrated DNA Technologies (Coralville, IA). CRISPRi plasmids were constructed as described in reference <ref type="bibr">(15)</ref>. Regions constructed by PCR were verified by DNA sequencing. Plasmids were propagated in E. coli HB101/pRK24 and conjugated into C. difficile R20291 according to reference <ref type="bibr">(54)</ref>. Final R20291 CRISPRi strains were verified by PCR amplifying and sequencing the guide region. Details relevant to other plasmid construction are provided in Table <ref type="table">S5</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>CRISPRi screen</head><p>Overnight cultures grown in TY-Thi10 were serially diluted 10-fold in TY, and 5 &#181;L were spotted on TY-Thi10 and TY-Thi10 1% (wt/vol) xylose plates. Plates were incubated at 37&#176;C overnight and imaged the following morning (~18 h). Cells were scraped from select spots (usually the last spot with growth) and resuspended in 50 &#181;L TY. Cell suspensions were supplemented with 5 &#181;g/mL FM4-64 (red fluorescent membrane stain, Thermo Scientific) and 15 &#181;g/mL Hoechst 33342 (blue fluorescent DNA stain, Invitrogen) and imaged by phase-contrast and fluorescence microscopy.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Protein localization</head><p>R20291 harboring plasmids that expressed RFP-tagged proteins under xylose control were grown in TY-Thi10 overnight, subcultured into TY-Thi10 with 0.1% or 1% xylose, grown to an OD₆₀₀ of about 0.6, and fixed with 4% buffered paraformaldehyde as described <ref type="bibr">(54,</ref><ref type="bibr">122,</ref><ref type="bibr">145)</ref>. Fixed cells were photographed under phase-contrast and (red) fluorescence. Septal localization was scored manually by inspecting cells for the presence of a fluorescent band near the midcell. MicrobeJ was used to keep track of cells that scored positive or negative for septal localization <ref type="bibr">(146)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Microscopy</head><p>Cells were immobilized using thin agarose pads (1% wt/vol agarose). Phase-contrast micrographs were recorded on an Olympus BX60 microscope equipped with a 100&#215; UPlanApo objective (numerical aperture, 1.35). Micrographs were captured with a Hamamatsu Orca Flash 4.0 V2+ complementary metal oxide semiconductor (CMOS) camera. Excitation light was generated with an X-Cite XYLIS LED light source. Red fluorescence was detected with the Chroma filter set 49008 (538-582 nm excitation filter, 587 nm dichroic mirror, and a 590-667 nm emission filter). Blue fluorescence was detected with the Olympus filter set U-MWU (330-385 nm excitation filter, 400 nm dichroic mirror, and a 420 nm barrier emission filter).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Transposon library construction</head><p>Plasmid pRPF215 is a quasi-suicide plasmid that harbors the Himar1 mariner transposase gene under control of Ptet <ref type="bibr">(14)</ref>. The gene for TetR does not have a terminator, and transcription reads through into the origin of replication, presumably disrupting plasmid replication. The addition of anhydrotetracycline therefore both induces the transposase and causes plasmid loss. A single colony of R20291/pRPF215 was used to inoculate a 2 mL overnight culture in TY-Thi10. In total, 20 independent overnight cultures were grown for each transposon library construction. After overnight growth, each was then sub-cultured 1:50 into 2 mL TY and grown to an OD₆₀₀ of 0.3. From each subculture, an aliquot was removed and spread onto two large (15 cm diameter) plates of TY agar with 80 &#181;g/mL lincomycin (RPI) and 100 ng/mL anhydrotetracycline (Sigma), for a total of 40 plates. We used higher concentrations of lincomycin than originally published (14) because we found 80 &#181;g/mL lincomycin decreased the number of false positives. The amount of subculture to plate was experimentally determined to give roughly 5,000-8,000 colonies. Typically, we used 220 &#181;L of subculture diluted with TY to 600 &#181;L, a volume suitable for spreading evenly on a large plate. A dilution series of one subculture was also plated on TY to calculate plating efficiency. Selection plates typically grew one colony for every 500 plated (i.e., an efficiency of about 2 &#215; 10⁻³). Plates were incubated for 20 hours at 37&#176;C. Cells were then scraped off the plates with 5 mL TY each, pooled, amended to 10% DMSO, aliquoted, and stored at -80&#176;C. This material was referred to as the primary transposon library. Suspensions of the primary libraries typically had an OD₆₀₀ of about 6. 
The concentration of viable cells was quantified by plating aliquots on TY plates and was typically around 3 &#215; 10⁸ CFU/mL. Three independent libraries were constructed on different days.</p></div>
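<div xmlns="http://www.tei-c.org/ns/1.0"><p>The plating arithmetic for library construction can be sketched in a few lines of Python. This is an illustrative calculation, not part of the published workflow: the function and constant names are hypothetical, and the only inputs are the figures quoted above (40 plates per library, roughly 5,000-8,000 colonies per plate, and about one colony for every 500 cells plated).</p><eg><![CDATA[
```python
# Illustrative plating arithmetic; all names here are hypothetical.
PLATES_PER_LIBRARY = 40              # 20 subcultures x 2 large plates each
CELLS_PER_COLONY = 500               # about one colony per 500 cells plated
COLONIES_PER_PLATE = (5_000, 8_000)  # typical range per large plate


def pooled_colonies(colonies_per_plate, plates=PLATES_PER_LIBRARY):
    """Total colonies scraped and pooled into one primary library."""
    return colonies_per_plate * plates


def cells_plated(colonies, cells_per_colony=CELLS_PER_COLONY):
    """Viable cells that must be plated to obtain a given colony count."""
    return colonies * cells_per_colony


library_low, library_high = (pooled_colonies(n) for n in COLONIES_PER_PLATE)
cells_low, cells_high = (cells_plated(n) for n in COLONIES_PER_PLATE)

print(f"Colonies per library: {library_low:,} to {library_high:,}")
print(f"Cells plated per plate: {cells_low:,} to {cells_high:,}")
```
]]></eg><p>Under these assumptions, one library pools on the order of 200,000-320,000 colonies. The colony count is not the same as the number of distinct insertion sites, because independent colonies can carry the same insertion.</p></div>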
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Tn-seq sample preparation</head><p>DNA samples were prepared directly from 1 mL of primary library or from 10 mL culture that had been grown for an additional seven doublings in TY. To avoid creating a bottleneck, 10 mL TY was inoculated with 2.2 &#215; 10⁷ CFU. There are 502,945 possible TA insertion sites in the R20291 chromosome; thus, cultures were started with a ratio of about 45 CFU per TA site. DNA libraries for Illumina sequencing were prepared based on modifications of Karash et al. <ref type="bibr">(26)</ref>. Briefly, regions adjacent to any transposon insertion were amplified by single primer extension. The resulting products were extended with a cytosine tail, which then allowed further amplification by PCR. The upstream primer recognizes the transposon sequence, incorporates the P5 sequence for Illumina sequencing and a sample-specific barcode; the downstream primer recognizes the C-tail and incorporates the P7 sequence. Genomic DNA was prepared using the Monarch Genomic DNA purification kit from NEB, using the protocol for Gram-positive bacteria. A maximum of 2 &#215; 10⁹ cells were pelleted. Lysis was facilitated through the addition of 0.5 mg hen egg white lysozyme (Boehringer Mannheim) and 20 U mutanolysin (Sigma), and DNA was eluted in 35 &#181;L with a typical yield of 200 ng/&#181;L. Linear extension PCR was performed on 100 ng DNA in 50 &#181;L with Taq polymerase (NEB) and primer Tn-ermB-2 (anneal: 30 s at 55&#176;C, extend 30 s at 68&#176;C, 50 cycles). The resulting product was spin-column purified (Zymo Research Clean &amp; Concentrator kit) and eluted in 12 &#181;L. A C-tail was added by extending with terminal transferase (NEB) in a 20 &#181;L reaction, using 1.25 mM dCTP (NEB) and 50 &#181;M ddCTP (MilliporeSigma/Roche). The product was again spin-column purified and eluted in 10 &#181;L. 
Final PCR amplification used 1 &#181;L of C-tailed DNA in a 35 &#181;L reaction mixture, Taq polymerase, and primers P716G and P5TnPx (x: variable barcode; anneal: 30 s at 62&#176;C, extend 30 s at 68&#176;C, 35 cycles). The resulting product was separated on a 1.5% agarose gel in Tris Acetate EDTA buffer (TAE). Fragments of 300-500 base pair length were excised, purified with the Zymo Research Gel DNA recovery kit, and eluted in 10 &#181;L. DNA concentration was quantified with the Qubit dsDNA assay and was typically around 5 ng/&#181;L. Four samples with distinct barcodes were combined and submitted for sequencing (Illumina HiSeq X, 150 bp PE reads) with Admera Health Biopharma Services (South Plainfield, NJ). Samples were spiked with 5% PhiX DNA to improve data quality.</p></div>
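<div xmlns="http://www.tei-c.org/ns/1.0"><p>The bottleneck check for the outgrowth cultures is a simple ratio of inoculum size to the number of possible insertion sites. The Python sketch below reproduces that ratio from the two numbers given in this section (2.2 &#215; 10⁷ CFU and 502,945 TA sites). The function names are hypothetical, and the helper that inverts the calculation is included only as an illustration of how an inoculum could be sized for a chosen coverage.</p><eg><![CDATA[
```python
# Illustrative coverage arithmetic; function names are hypothetical.
import math

TA_SITES = 502_945      # possible TA insertion sites in the R20291 chromosome
INOCULUM_CFU = 2.2e7    # CFU used to start each 10 mL outgrowth culture


def cfu_per_ta_site(cfu, ta_sites=TA_SITES):
    """Average number of viable cells inoculated per possible TA site."""
    return cfu / ta_sites


def cfu_for_coverage(target_per_site, ta_sites=TA_SITES):
    """Smallest whole number of CFU giving at least the target coverage."""
    return math.ceil(target_per_site * ta_sites)


coverage = cfu_per_ta_site(INOCULUM_CFU)
print(f"{coverage:.1f} CFU per TA site")
```
]]></eg><p>The computed ratio is just under 44 CFU per TA site, consistent with the value of about 45 quoted above.</p></div>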
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Sequencing data processing</head><p>Raw sequencing files were first trimmed with Trimmomatic to eliminate poor-quality reads <ref type="bibr">(147)</ref>. The first four bases before the barcodes were then removed using Trim Sequences, and the resulting files were de-multiplexed using the Barcode splitter, both on Galaxy <ref type="bibr">(148)</ref>. Reads were aligned to the reference genome of R20291 (NC_013316.1 or ASM2710v1) using the Burrows-Wheeler Aligner (BWA) provided in TRANSIT <ref type="bibr">(28)</ref>. Finally, the resulting Wig files were compared in TRANSIT2, which evaluates gene essentiality both by Gumbel analysis and binomial analysis <ref type="bibr">(149)</ref>. The former makes essentiality calls based on insertion gaps, that is, consecutive TA sites lacking transposon insertions, using the Gumbel distribution <ref type="bibr">(150)</ref>. The latter calls essentiality for small genes lacking insertions, which can be difficult to detect by the more conservative Gumbel algorithm <ref type="bibr">(29)</ref>. Essentiality calls are either "E" when identified by Gumbel or "EB" when identified by the Binomial analysis. Table <ref type="table">S3</ref> lists genes that were called essential in primary insertion libraries using cells scraped from plates, or after an additional 7 generations of growth. The library data set was generated from three independently constructed transposon libraries. The outgrowth data set was generated from two independent growth cultures from each of the three independent libraries. We present both the separate data output as well as a combined essentiality call (Table <ref type="table">S3</ref>). The latter was further hand-edited by including 11 genes (indicated as "Ei" for "essential by inspection") that appeared to have been mistakenly called non-essential by TRANSIT2. 
Ten of these genes had very few insertions despite numerous possible TA sites, while the eleventh had a large number of insertions but mostly at the 3&#8242; end of the gene. </p></div></body>
		</text>
</TEI>
