<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Functional Connectivity: Continuous-Time Latent Factor Models for Neural Spike Trains</title></titleStmt>
			<publicationStmt>
				<publisher>Cognitive Computational Neuroscience</publisher>
				<date>01/01/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10575777</idno>
					<idno type="doi">10.32470/CCN.2023.1683-0</idno>
					
					<author>Meixi Chen</author><author>Martin Lysy</author><author>David E Moorman</author><author>Reza Ramezan</author><author>Cognitive_Computational_Neuroscience</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Modelling the dynamics of interactions in a neuronal ensemble is an important problem in functional connectivity research. One popular framework is latent factor models (LFMs), which have achieved notable success in decoding neuronal population dynamics. However, most LFMs are specified in discrete time, where the choice of bin size significantly impacts inference results. In this work, we present what is, to the best of our knowledge, the first continuous-time multivariate spike train LFM for studying neuronal interactions and functional connectivity. We present an efficient parameter inference algorithm for our biologically justifiable model which (1) scales linearly in the number of simultaneously recorded neurons and (2) bypasses time binning and related issues. Simulation studies show that parameter estimation using the proposed model is highly accurate. Applying our LFM to experimental data from a classical conditioning study on the prefrontal cortex in rats, we found that coordinated neuronal activities are affected by (1) the onset of the cue for reward delivery, and (2) the sub-region within the frontal cortex (OFC/mPFC). These findings shed new light on our understanding of cue and outcome value encoding.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Introduction</head><p>An important question in neuroscience research is understanding the functional connectivity between neurons in different parts of the brain. Spike trains based on simultaneously recorded neurons provide information about population coding and neuronal interaction. Both model-free and modelbased spike-train analysis tools have been developed to answer this question. While model-free methods <ref type="bibr">(Perkel et al., 1967;</ref><ref type="bibr">Ventura et al., 2005;</ref><ref type="bibr">Fujisawa et al., 2008;</ref><ref type="bibr">Humphries, 2011;</ref><ref type="bibr">Lopes-dos-Santos et al., 2013)</ref> are typically more efficient and convenient to implement, they often fail to uncover more complex underlying neuronal relationships beyond correlation at the level of observed data. In contrast, latent factor models (LFMs) are able to discover patterns which modelfree algorithms cannot, thanks to their ability to specify different structures in the latent layers. The remarkable success of spike-train LFMs in predictive tasks <ref type="bibr">(Yu et al., 2008;</ref><ref type="bibr">Gao et al., 2016;</ref><ref type="bibr">Wu et al., 2017;</ref><ref type="bibr">Pandarinath et al., 2018)</ref> and neuronal clustering <ref type="bibr">(Buesing et al., 2014;</ref><ref type="bibr">Wei et al., 2022)</ref> motivates our work in this paper to study the functional connectivity between neurons. As all the aforementioned LFMs are specified in discrete time, they are commonly applied upon binning the experimental time to obtain spike counts, which are then modelled using, e.g., a Poisson likelihood. However, the bin size is often chosen arbitrarily-despite this having a significant impact on parameter estimation <ref type="bibr">(Nelson, 2002;</ref><ref type="bibr">Kass &amp; Ventura, 2006;</ref><ref type="bibr">Ramezan et al., 2014)</ref>. Although there are Poisson process models set in continuous time without binning (e.g., <ref type="bibr">Duncker &amp; Sahani, 2018;</ref><ref type="bibr">Williams et al., 2020)</ref>, these models cannot be readily applied to the functional connectivity analysis of multiple neurons.</p><p>In this work we present a continuous-time multivariate point process LFM to study neuronal interactions based on simultaneously recorded spike trains in a neural population. To the best of our knowledge, this is the first continuous-time LFM proposed for the analysis of neural spike trains. In our model, the activities of a neuronal population are described by correlated Wiener processes with resetting. Each of these processes is viewed as a proxy of the evolving membrane potential of a neuron which resets after reaching a threshold. Crucially, we assume that the high-dimensional multivariate latent process can be summarized by a small number of dynamic factors. Not only does this factor analysis framework provide an interpretable low-dimensional representation of neural activities, it also serves as a means of dimension reduction for studying large neuronal populations. Our model generalizes the limiting case of the multivariate Skellam point process with resetting of <ref type="bibr">Ramezan et al. (2022)</ref>. However, by passing to the Brownian limit, we are able to develop an efficient algorithm for parameter inference, which both circumvents the choice of bin size (it can be made so small as to approximate the continuous time process arbitrarily well) and scales linearly in the number of neurons in the analysis. The applicability of the proposed model and inference procedure is demonstrated in simulated and experimental data analyses.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Model</head><p>A graphical summary of the proposed model is presented in Figure <ref type="figure">1</ref>. Let Y t = (y 1,t , . . . , y q,t ) &#8712; {0, 1} q denote the observed binary spike trains of a population of q neurons at time t, and X t = (x 1,t , . . . , x q,t ) &#8712; R q be the unobserved latent dynamic processes of the neuronal population. We assume that the neuronal activities are governed by X t , which are modelled as correlated Wiener processes with resetting. Each process i is reset to its initial value whenever it crosses some neuron-specific threshold, and then a spike occurs, i.e., y i,t = 1 is observed. We let &#952; &#952; &#952; denote the threshold parameters and the Wiener process drift vector. The threshold-crossing and resetting processes mimic the spike-generating mechanism and the refractoriness of the neuron, respectively. We further assume that the q dynamic processes can be represented as X t = &#923; &#923; &#923;F t + &#949; &#949; &#949; t , where F t = ( f 1,t , . . . , f d,t ) are d (d &#8810; q) dynamic factors, and &#949; &#949; &#949; t = (&#949; 1,t , . . . , &#949; q,t ) are neuron-specific idiosyncrasies. The loading matrix &#923; &#923; &#923; can be interpreted within the traditional factor analysis framework-that is, it identifies a small number of factors driving the neuronal dynamic processes. It is also used to model the correlation matrix between latent neuron processes, i.e., cor(X t ) = &#931; &#931; &#931; = &#923; &#923; &#923;&#923; &#923; &#923; &#8242; + &#936; &#936; &#936;, where &#936; &#936; &#936; is a diagonal matrix with elements determined by &#923; &#923; &#923; and the fact that &#931; &#931; &#931; has unit diagonal.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Inference</head><p>Model inference is carried out in two steps. First, the drift of the Wiener process and the threshold parameters, sum- marized by &#952; &#952; &#952;, can be estimated analytically since the first passage time of a Wiener process follows an Inverse Gaussian distribution <ref type="bibr">(Brown, 2005)</ref>. However, closed-form estimation of &#923; &#923; &#923; is not available. One common solution is to employ MCMC sampling on p(X 1:T , &#923; &#923; &#923; | Y 1:T ), where time has been discretized to an arbitrarily fine grid. However, this can be prohibitively slow in high dimensions. Instead, the latent processes X 1:T are integrated out via the Laplace approximation <ref type="bibr">(Tierney &amp; Kadane, 1986;</ref><ref type="bibr">Skaug &amp; Fournier, 2006;</ref><ref type="bibr">Koyama et al., 2010)</ref></p><p>so that an estimate of &#923; &#923; &#923; can be obtained by maximizing the Laplace-approximated marginal likelihood p LA (Y 1:T | &#923; &#923; &#923;).</p><p>An efficient gradient-based nested optimization algorithm for this is implemented in our R/C++ library fastr <ref type="bibr">(Chen et al., 2023)</ref>. The algorithm involves repeatedly solving linear systems of the form &#931; &#931; &#931;x = b, which due to the factor structure &#931; &#931; &#931; = &#923; &#923; &#923;&#923; &#923; &#923; + &#936; &#936; &#936;, scales linearly in the number of neurons q for fixed number of factors d.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Data Analysis</head><p>To assess the performance of the proposed model, we first applied it to a biologically plausible simulated dataset generated in NEURON <ref type="bibr">(Hines &amp; Carnevale, 1997)</ref>. We found that the K-means clustering results based on &#923; &#923; &#923; are far more ac- curate than those based on (convolved) multiple neuron spike trains Y 1:T (see Figure <ref type="figure">2</ref>(a)-(c)). Therefore, the estimate &#923; &#923; &#923;</p><p>uncovers the underlying associations between neurons beyond what is observed at the data level. Next, we applied our model on neuronal ensembles recorded from a rat's medial prefrontal cortex (mPFC) and orbitofrontal cortex (OFC) during a classical conditioning experiment. The rat was trained to recognize a cue predictive of an appetitive outcome (sucrose).</p><p>We modelled neuronal population dynamics in relation to encoding appetitive outcomes and functional connectivity, and found that coordinated activities are modulated by brain areas </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Discussion</head><p>We have proposed a novel continuous-time multivariate point process latent factor model for simultaneously recorded spike</p><p>&#8226; Figure 2: Clustering performance. The (i, j)-th entry of the matrix is black if neuron i and j are assigned to the same cluster, and white otherwise.</p><p>&#8226; &#8226; Figure 3: Estimated correlation between neurons. "O" stands for OFC neurons and "M" stands for mPFC neurons. The numbers are the neuron indices. For visual presentation, nonsignificant values and the diagonal elements are set to zero. trains. Downstream analyses using the proposed model can reveal neuronal clustering and estimate correlations between neurons. Computational challenges in model inference are addressed by carefully designing and implementing an efficient model-fitting procedure based on the Laplace approximation. We have confirmed, via simulation studies, that our algorithm achieves satisfactory accuracy and precision in parameter estimation. Finally, compared to black-box deep learning models for neural spike trains, our proposed model is able to provide more reliable statistical inference results with uncertainty quantification, which is vital for making scientifically sound conclusions. One immediate future direction is to apply our model to appetitive and aversive classical conditioning outcomes (available within the same experimental data) to investigate reward-value coding, and to identify value signals for the most relevant contextual features.</p></div></body>
		</text>
</TEI>
