<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Learning brain connectivity in social cognition with dynamic network regression</title></titleStmt>
			<publicationStmt>
				<publisher>Institute of Mathematical Statistics</publisher>
				<date>12/01/2024</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10609633</idno>
					<idno type="doi">10.1214/24-AOAS1942</idno>
					<title level='j'>The Annals of Applied Statistics</title>
<idno>1932-6157</idno>
<biblScope unit="volume">18</biblScope>
<biblScope unit="issue">4</biblScope>					

					<author>Maoyu Zhang</author><author>Biao Cai</author><author>Wenlin Dai</author><author>Dehan Kong</author><author>Hongyu Zhao</author><author>Jingfei Zhang</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Dynamic networks have been increasingly used to characterize brain connectivity that varies during resting and task states. In such characterizations a connectivity network is typically measured at each time point for a subject over a common set of nodes representing brain regions, together with rich subject-level information. A common approach to analyzing such data is an edge-based method that models the connectivity between each pair of nodes separately. However, such approach may have limited performance when the noise level is high and the number of subjects is limited, as it does not take advantage of the inherent network structure. To better understand if and how the subject-level covariates affect the dynamic brain connectivity, we introduce a semiparametric dynamic network response regression that relates a dynamic brain connectivity network to a vector of subject-level covariates.A key advantage of our method is to exploit the structure of dynamic imaging coefficients in the form of high-order tensors. We develop an efficient estimation algorithm and evaluate the efficacy of our approach through simulation studies. Finally, we present our results on the analysis of a task-related study on social cognition in the Human Connectome Project, where we identify known sex-specific effects on brain connectivity that cannot be inferred using alternative methods.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>1. Introduction. Social cognition, which refers to how individuals process, memorize, and use information in social contexts to explain and predict their own behavior and that of others <ref type="bibr">(Fiske and Taylor (1991)</ref>), is a crucial aspect of human functioning and has been extensively studied in the field of psychology and neuroscience <ref type="bibr">(Lieberman (2007)</ref>, <ref type="bibr">Saxe and Kanwisher (2013)</ref>). The use of neuroimaging techniques, particularly functional magnetic resonance imaging (fMRI), has enabled a better understanding of the neural mechanisms underlying social cognition <ref type="bibr">(Saxe and Kanwisher (2013)</ref>). Previous studies using fMRI have shown that specific brain regions, such as the medial prefrontal cortex, the temporoparietal junction, and the superior temporal sulcus, are consistently activated during tasks related to social cognition <ref type="bibr">(Castelli et al. (2000)</ref>, <ref type="bibr">Gallagher and Frith (2003)</ref>). While significant progress has been made in uncovering the neural mechanisms underlying social cognition, our understandings of the coordination between brain regions during social cognition and how it relates to individual differences in social behavior remain limited <ref type="bibr">(Adolphs (2009)</ref>).</p><p>The social cognition study in the Human Connectome Project (HCP) 1 provided a unique opportunity for advancing our understandings of the brain connectivity underlying social cognition. In this study, imaging scans are collected using fMRI from a set of subjects as each subject goes through a sequence of cognitive tasks and rest states. In addition, it also collects subject features such as sex and social covariates (e.g., social distress); see more details in Section 1.1. Based on the imaging scans, a dynamic connectivity network, characterizing activation and deactivation of connections between brain regions during task and rest states, can be constructed for each subject, with nodes corresponding to a common set of brain regions and the edges encoding dynamic functional associations between the regions. From this study it is of fundamental scientific interest to understand which brain regions are co-activated during the cognitive tasks. In addition, it is important to understand whether there are sex differences in brain connectivity during cognitive tasks, and if so, how social covariates influence these differences.</p><p>There is some recent literature on modeling a collection of networks, including dynamic networks. However, these methods may not flexibly associate dynamic network connectivity with external covariates while taking into account the structure of the network and smoothness in the dynamic brain connectivity. Specifically, <ref type="bibr">Xu and Hero (2014)</ref>, <ref type="bibr">Pensky (2019)</ref>, <ref type="bibr">Zhang and Cao (2017)</ref>, <ref type="bibr">Zhang, Sun and Li (2020)</ref> proposed several approaches based on stochastic block models. These methods cannot associate network connectivity with external covariates. <ref type="bibr">Wang et al. (2017)</ref> proposed a Bayesian network model with covariates, which is flexible but can be computationally intensive, especially for large networks or a large number of covariates. <ref type="bibr">Kong et al. (2020)</ref>, <ref type="bibr">Hu et al. (2021)</ref>, <ref type="bibr">Zhang, Sun and Li (2023)</ref> studied matrix or network response regressions, but they focused on nontime-varying networks. <ref type="bibr">Sun and Li (2017)</ref>, <ref type="bibr">Hao et al. (2021)</ref>, <ref type="bibr">Zhou et al. (2021)</ref>, <ref type="bibr">Tang, Bi and Qu (2020)</ref> considered tensor regressions that can be formulated to tackle our problem by stacking the dynamic networks observed at different time points into a tensor, but these approaches could not account for the temporal smoothness in the dynamic brain connectivity.</p><p>To model the dynamic brain connectivity in the social cognition study, we propose a new semiparametric dynamic network model for a collection of dynamic networks with subjectlevel covariates. We adopt the form of generalized linear model (GLM) and assume the connectivity between a pair of regions, after a proper transformation, is the sum of two functional components. The first component is the baseline time-varying connectivity shared by all subjects, and the second component involves time-varying slopes and models the effects of subject-level covariates on the time-varying brain connectivity. To estimate the unknown functional coefficients, we consider a nonparameteric estimation via B-spline approximations. Under such approximations we can then write our model in the form of a dynamic network regression, where the response is the dynamic connectivity matrix and the predictors are subject covariates. With the B-spline basis, the baseline connectivity can be characterized using an intercept tensor and the covariate effect using a slope tensor. We assume the intercept tensor is low-rank and the slope tensors are structurally sparse. We discuss the benefit of placing different assumptions on these two tensor coefficients in Section 2.1. These structural hypotheses significantly reduce the number of free parameters, facilitate model interpretability and estimability, and are commonly considered in scientific applications <ref type="bibr">(Bi, Qu and Shen (2018)</ref>, <ref type="bibr">Zhang, Sun and Li (2023)</ref>).</p><p>For estimation we propose an efficient alternating gradient descent algorithm with a fast iterative shrinkage-thresholding method to estimate the sparse slope tensor. In Section 3 we demonstrate in simulation studies that our method can accurately estimate the model coefficients and identify nonzero covariate effects, whereas other methods fail to offer accurate estimates. In Section 4 we apply our proposed method to the social cognition study and identify sex differences both in the baseline connectivity and social covariate effects. The majority of our results agree with the existing findings in the neuroscience literature. We also implement an elementwise (i.e., edge-based) method, where the results are highly noisy and lack interpretability, and a method designed for nontime-varying networks <ref type="bibr">(Zhang, Sun and Li (2023)</ref>), where the results are highly sparse and cannot identify areas that are known to be engaged in social cognition. Finally, we consider a permutation based procedure to evaluate the identified sex-specific differences from our analysis.</p><p>Taken together, our work proposes a new dynamic network regression for analyzing taskevoked brain connectivity with subject-level covariates that exploits the structure in the brain network and the temporal smoothness in the time-varying connectivity. We demonstrate in simulations and real data analysis that the proposed method usually performs better than elementwise methods that model the connectivity between each pair of nodes separately. Next, we discuss in detail the motivating scientific problem and the research questions to be addressed.</p><p>1.1. The HCP social cognition study and research questions. The social cognition study in the HCP data collected behavioral and task-related fMRI data from 850 healthy adult subjects. In each session a participate was presented with several short videos of objects (squares, circles, triangles) interacting <ref type="bibr">(Castelli et al. (2000)</ref>), and the fMRI data were collected on 274 evenly spaced time points. These videos were developed by either Castelli and colleagues <ref type="bibr">(Castelli et al. (2000)</ref>) or Martin and colleagues <ref type="bibr">(Wheatley, Milleville and Martin (2007)</ref>). Specifically, two types of video clips were shown to the subjects including mental (objects interact in some way: see Figure <ref type="figure">S13</ref> in the Supplementary Material, <ref type="bibr">Zhang et al. (2024)</ref>) and random (objects move randomly). For each participant, there were five video blocks (three mental and two random), with each video task and rest duration taking up 23 seconds and 15 seconds, respectively. We focus our analysis on the N = 843 subjects who were shown videos in the sequence of mental, mental, random, mental and random. Additionally, social related traits, such as social distress, social support, and companionship, were measured for each subject via self-reported questionnaires; see more details in Section 4.</p><p>In our analysis the fMRI data are preprocessed and summarized as a 68 &#215; 274 spatialtemporal matrix for each subject using the Desikan-Killiany Atlas <ref type="bibr">(Desikan et al. (2006)</ref>) with n = 68 regions of interest (ROIs, see Table <ref type="table">S4</ref>). As each subject goes through various tasks and rest states during the scanning session and activation/deactivation of brain regions measured via fMRI are typically lagged <ref type="bibr">(Sch&#246;lvinck et al. (2010)</ref>), it is more appropriate to study the brain connectivity as a dynamic network. Specifically, for each subject the dynamic network is constructed by calculating a sequence of connectivity matrices over T sliding windows, each summarizing the connectivity between 68 brain regions in a given window. While there are many choices of connectivity measures <ref type="bibr">(Smith et al. (2013)</ref>), the most commonly used one is perhaps the marginal Pearson correlation coefficient. We follow the vast majority of the neuroscience literature and measure connectivity in each individual by calculating Pearson correlations using samples from a pair of regions. The correlation matrix is then converted into a binary network to represent networks amongst ROIs; see more details in Section 4. In our analysis we have also considered partial correlation matrices <ref type="bibr">(Meinshausen and B&#252;hlmann (2006)</ref>) and found that our main results and qualitative findings remain similar.</p><p>A number of scientifically important questions are to be addressed for this study. First, which brain regions are activated during these cognitive social tasks and how do these regions function together? Second, if and how subject's social covariates, such as social distress, affect the task-evoked brain connectivity. Third, whether sex differences in brain connectivity during cognitive tasks exist, and if so, how do social covariates influence these differences?</p><p>We organize our paper as follows. Section 2 introduces the dynamic network response model and the estimation algorithm. Section 3 presents the simulations, and Section 4 analyzes the task-related study on social cognition and discusses our findings in answering the aforementioned research questions. Section 5 concludes the paper with a short discussion.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Model.</head><p>Throughout this paper we employ the following notation. Let &#8226; denote the outer product and [k] = {1, 2 . . . , k}. For a vector b &#8712; R d 1 , let b 2 denote its Euclidean norm. For a matrix B &#8712; R d 1 &#215;d 2 , let B i&#8226; and B &#8226;j denote its ith row and j th column, B ij denote its (i, j )th entry, and B F denote its Frobenius Norm, respectively. For a tensor B &#8712; R d 1 &#215;d 2 &#215;d 3 , let B ij k denotes its (i, j, k)th entry, B ij &#8226; denote the (i, j )th tube fiber, and B &#8226;&#8226;k denote the kth frontal slice. For b &#8712; R d 3 and B &#8712; R d 1 &#215;d 2 &#215;d 3 , we define the tensor vector multiplication as</p><p>2.1. The dynamic network response model. Consider dynamic networks denoted by G i (V, E i (t)), i &#8712; [N], observed from N subjects, where V represents the common set of n nodes and E i (t) represents the set of edges at time point t for subject i. For each subject we also observe a p-vector of covariates, denoted by x i = (x i1 , . . . , x ip ) . At each time point t, the network G i (V, E i (t)) can be uniquely represented by its n &#215; n adjacency matrix A (i) (t), where A (i) jj (t) denotes the edge between nodes j and j at time point t in subject i. The edges can be continuous, binary or nonnegative integers. Without loss of generality, we assume t &#8712; [0, 1], and</p><p>, where the expectation E(&#8226;) is applied elementwise to entries in A (i) (t). We assume that, conditioning on x i , the entries in A (i) (t) are independent and follow an exponential distribution with a canonical link function that</p><p>where B 0 (t) &#8712; R n&#215;n characterizes the population-level time-varying network connectivity and B l (t) &#8712; R n&#215;n characterizes the time-varying effects of the lth covariate on the network connectivity. The function g(&#8226;) is an invertible link function, as commonly used in GLMs <ref type="bibr">(McCullagh and Nelder (1989)</ref>), and is applied elementwise to entries in &#956; (i) (t).</p><p>Let B ljj (t) denote the (j, j )th element of B l (t). To estimate the unknown functions B ljj (t)'s, we consider a nonparametric estimation using B-spline approximations. Specifically, we approximate B 0jj (t)'s using a K 0 -dimensional basis, denoted by &#966; 0 (t) = (&#966; 01 (t), . . . , &#966; 0K (t)) such that B 0jj (t) &#8776; &#966; 0 (t) &#215; b 0jj , and approximate B ljj (t)'s using a K 1 -dimensional basis, denoted by &#966; 1 (t) = (&#966; 11 (t), . . . , &#966; 1K (t)) , such that B ljj (t) &#8776; &#966; 1 (t) &#215; b ljj . Defining B 0 &#8712; R n&#215;n&#215;K 0 and B l &#8712; R n&#215;n&#215;K 1 such that B ljj . = b ljj for all j , j and l, model (2) can be rewritten as</p><p>where &#215; 3 is defined as in (1), and B 0 &#8712; R n&#215;n&#215;K 0 , B 1 . . . , B p &#8712; R n&#215;n&#215;K 1 are unknown tensor coefficients. A graphical illustration of model ( <ref type="formula">3</ref>) is given in Figure <ref type="figure">1</ref>.</p><p>One challenge in estimating model ( <ref type="formula">3</ref>) is the inherent high-dimensionality of the tensor coefficients. In our analysis of the HCP social cognition study, if we set K 0 = K 1 = 10, then each coefficient tensor B l is of dimension 68 &#215; 68 &#215; 10 = 46,240, far exceeding the number of subjects in the study. Thus, it is imperative to employ effective dimension reduction assumptions that can facilitate estimability and interpretability. Next, we move to discuss the dimension reduction assumptions placed on the baseline effect coefficient tensor B 0 and the covariate effect coefficient tensors B 1 , . . . , B p . We also discuss the need for considering different assumptions for these two types of effects: Low-rankness on B 0 . The component B 0 is the baseline coefficient tensor, and we assume that it possesses a low-rank structure. This specification assumes that there is a lowdimensional structure in the baseline time-varying network connectivity such that both the nodes and the basis coefficients have lower dimensional representations. This is similar to, but more general than, for example, the stochastic blockmodel <ref type="bibr">(Holland, Laskey and Leinhardt (1983)</ref>), a well-studied network model that assumes the nodes form a number of groups, and after reorganizing by group membership, the connecting probability matrix is a block matrix.</p><p>In our data problem, the low-rank assumption effectively reduces the number of parameters and increases computational efficiency. Specifically, we assume that B 0 admits the following rank-R CP decomposition <ref type="bibr">(Kolda and Bader (2009)</ref>):</p><p>where w r &#8712; R + , u 1r &#8712; R n , and u 3r &#8712; R K 0 . For identifiability, we assume u 1r 's and u 3r 's are unit length vectors. We note that the above formulation is for undirected networks. When the networks are directed, we can write B 0 = R r=1 w r u 1r &#8226; u 2r &#8226; u 3r , where u 2r &#8712; R n is a unit length vector. Notably, CP low-rank is a special case of the Tucker low-rank <ref type="bibr">(Kolda and Bader (2009)</ref>). The definition of the Tucker decomposition for B is expressed as</p><p>and U 3 &#8712; R r 3 &#215;d 3 are matrices with orthonormal columns. It is seen that the CP low-rank is a special case of Tucker low-rank with G l 1 l 2 l 3 = 0, unless l 1 = l 2 = l 3 (i.e., superdiagonal). Although the Tucker decomposition is more flexible, it also brings computational challenges, due to the orthonormal identifiability constraint on U 1 , U 2 , U 3 and an increased number of parameters. As a result, the CP low-rank structure is commonly considered in tensor methods for neuroimaging data, such as tensor regressions <ref type="bibr">(Zhou, Li and Zhu (2013)</ref>, <ref type="bibr">Sun et al. (2017)</ref>, <ref type="bibr">Zhang and Li (2017)</ref>, <ref type="bibr">Zhou et al. (2021)</ref>), tensor clustering <ref type="bibr">(Cai, Zhang and Sun (2021)</ref>) and covariance decomposition <ref type="bibr">(Deng, Tang and Qu (2023)</ref>), and has been found to give a good performance.</p><p>Structured sparsity in B 1 , . . . , B p . We assume that the subject covariates have sparse effects on the dynamic network connectivity; that is, the effects concentrate on a small number of edges. This is scientifically plausible, as brain connections are energy consuming and biological units tend to minimize energy-consuming activities <ref type="bibr">(Bullmore and Sporns (2009)</ref>). Sparsity also greatly reduces the number of free parameters and improves interpretation of the resulting model. Specifically, we assume that B l is structurally sparse in that most tube fibers in B l are zero, corresponding to sparse covariate effects. In particular, tube fiber B ljj &#8226; = 0 indicates covariate l has no effect on edge (j, j ), that is, B ljj (t) = 0 for all t. To encourage structural sparsity, we consider the group lasso <ref type="bibr">(Yuan and Lin (2006)</ref>) penalty, defined as ( <ref type="formula">4</ref>)</p><p>Different assumptions on B 0 and B 1 , . . . , B p . We briefly discuss the benefits and necessity of imposing separate structures on B 0 and B 1 , . . . , B p . It is natural to think that one could stack B 0 , B 1 , . . . , B p into one higher-order coefficient tensor of size n &#215; n &#215; K &#215; (p + 1) (assume K 0 = K 1 = K) and specify it to be both low-rank and sparse. However, assuming B 0 to be sparse may not be plausible in the GLM setting. For instance, when the network edges are binary and g(&#8226;) is the logit link, g(0) yields a connecting probability of 0.5; when the network edges are counts and g(&#8226;) is the log link, g(0) is not well defined. Correspondingly, a sparse B 0 does not necessarily imply sparsity in the baseline connectivity and may not even be well defined. This issue is unique in using sparse GLM to model edges in a network. Finally, more complex structures on B 1 , . . . , B p can be incorporated (e.g., B 1 , . . . , B p are low-rank and sparse), which can further reduce the number of effective parameters. However, such assumptions are expected to incur a much higher computational cost and also involve more tuning parameters on, for example, the rank of each coefficient. To balance model complexity and feasibility, we focus on the current assumption that assumes B 1 , . . . , B p have structured sparsity. <ref type="formula">3</ref>) the negative log-likelihood function, up to a constant, can be written as</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Estimation. Recall that</head><p>where <ref type="bibr">McCullagh and Nelder (1989)</ref>). We estimate the parameters w, U 1 , U 3 , and by solving the following optimization problem:</p><p>where P(&#8226;) is as defined in (4) and &#955; is a tuning parameter. The optimization problem in ( <ref type="formula">6</ref>) is computationally challenging, as the size of the networks, the dimension of the covariates, and the number of basis functions can be large in practice. The GLM loss function further increases the computation burden due to its nonlinearity. While (6) is nonconvex, the conditional optimization with respect to u 1r , while fixing all other parameters, is convex, and the same holds for w, u r3 's, and B j 's. This observation permits an alternating minimization algorithm. One potential issue in such an approach is that solving for , conditional on all other parameters, is a regularized optimization problem of dimension n &#215; n &#215; K 1 &#215; p. This can be computationally expensive when the network size n, the number of splines K 0 , K 1 , and the dimension of the covariates p are large. To tackle this challenge, we consider a proximal gradient descent algorithm that is easy to implement and computationally efficient. Our estimation procedure is summarized in Algorithm 1.</p><p>In Step 2, &#361;jr 's are solved using a Newton-type algorithm <ref type="bibr">(Schnabel, Koonatz and Weiss (1985)</ref>), and the gradients are given in Section S7.1 in the Supplementary Material. In Step 3</p><p>Algorithm 1 Optimization procedure of ( <ref type="formula">6</ref>)</p><p>Input: rank R, tuning parameter &#955;, bases dimensions {K 0 ,K 1 } and step size &#951;.</p><p>Step 1: initialize w (0) , U (0)</p><p>p . Repeat Steps 2-5 for t = 0, 1, . . . until convergence.</p><p>Step 2: repeat the following steps for r = 1, 2, . . . R. &#361;(t+1)</p><p>3R , (t) ).</p><p>Step 3:</p><p>3</p><p>),</p><p>Step 5: repeat the following steps for s = 0, 1, . . . until convergence.</p><p>(t,s+1) = S &#955;&#951; ( (t,s) -&#951;&#8711; &#8467;(w (t+1) , U</p><p>we define two matrix operators for U . Norm(U ) calculates the &#8467; 2 norms of columns in a matrix U , and Unit(U ) rescales the columns of a matrix into unit vectors. That is,</p><p>In Step 5 we employ the fast iterative shrinkage-thresholding method (FISTA, <ref type="bibr">Beck and Teboulle, 2009)</ref> under group lasso penalty. Specifically, we define the shrinkage operator by S &#955;&#951; ( ) = (T &#955;&#951; (B 1 ), . . . , T &#955;&#951; (B p )) &#8712; R n&#215;n&#215;K 1 &#215;p , where</p><p>and (x) + = max(0, x). In the FISTA algorithm and at step s + 1, the iterative shrinkage operator S &#955;&#951; (&#8226;) is not directly applied to the previous point (t,s) but rather at the point (t,s) , which uses a specific linear combination of the previous two points (t,s) and (t,s-1) . The FISTA algorithm has been shown to enjoy a fast global rate of convergence <ref type="bibr">(Beck and Teboulle (2009)</ref>) and is easy to implement. The stepsize &#951; is typically chosen as the Lipschitz constant of &#8711; &#8467;(w, U 1 , U 3 , ), which can be approximately calculated given the initial values.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Initialization.</head><p>In Algorithm 1 we need to determine the initial values for the alternating minimization procedure. To obtain a good initial estimate, we first estimate B (0) 0 , B (0) 1 , . . . , B (0) p via an elementwise generalized spline regression; see (8). We then estimate w (0) , U</p><p>1 , U (0) 3 via a CP decomposition of the estimated B (0) 0 . In our experiments this initialization procedure leads to a good numerical performance of Algorithm 1. The accuracy of this initialization procedure is evaluated in Section 3.</p><p>Parameter tuning. The B-spline dimensions K 0 and K 1 , rank R, and regularization parameter &#955; are tuning parameters in our algorithm. We choose these parameters using the eBIC criterion that was first developed for variable selection in the diverging dimension regime in <ref type="bibr">Chen and Chen (2012)</ref>. It has been demonstrated that the eBIC function is effective as a heuristic criterion to balance model fitting and complexity when used in low-rank estimation problems <ref type="bibr">(Srivastava, Engelhardt and Dunson (2017)</ref>, <ref type="bibr">Cai, Zhang and Sun (2021)</ref>, <ref type="bibr">Zhang, Sun and Li (2023)</ref>). Specifically, we choose the combination of (K 0 , K 1 , R, &#955;) that minimizes</p><p>where &#8467; is the loss function in (5), and &#373;, &#219; 1 , &#219; 3 , &#710; are the estimates of w, U 1 , U 3 , under the working rank and regularization parameter. In our numerical experiments, the above eBIC is found to be minimized at the true rank and sparsity level under the selected &#955; (see Section S3). To reduce computational cost in tuning, we recommend using cubic B-splines in practice. If desired, the order of the B-splines can be another tuning parameter selected using the eBIC function.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Simulation.</head><p>We conduct simulations to investigate the performance of our proposed method. We focus on symmetric networks and compare our proposed dynamic network response regression method, referred as DNetReg, with two alternative elementwise approaches. To reduce computational cost, we set K 0 = K 1 = K, and the spline dimension K is set to the true value in simulations; see Section S3 of more details of tuning K, R, and s 0 using the eBIC.</p><p>The first elementwise approach, referred as EdgeReg, fits elementwise GLMs at each time point t k . That is, for any j, j &#8712;</p><p>This elementwise approach ignores both the network structure and the temporal smoothness in the dynamic brain connectivity. The second elementwise approach, referred as DEdgeReg, fits a generalized spline regression to each entry in A jj (t). Specifically, for any j, j</p><p>A Newton-type algorithm is employed to estimate the parameters in the above model. The method DEdgeReg is used to find the initial values in Algorithm 1. We simulate N binary dynamic networks of size n &#215; n in [0, 1] from model (3), where A jj (t), t &#8712; [0, 1], follows a Bernoulli distribution and g(&#8226;) is taken to be the logit link function. The covariates x i 's are generated independently from N (0, 1), and we standardize the columns of the design matrix to have zero mean and unit standard deviation. For B 0 = R r=1 w r u 1r &#8226; u 1r &#8226; u 3r , we first generate the entries of u 1r and u 3r from N (0, 1), set w r = u 1r 2 2 u 3r , and then we standardize u 1r and u 3r as unit length vectors. For B 1 we randomly set s 0 proportion of its entries to be 1 and the rest to zero such that s 0 = B 1 0 /(n 2 K 1 ). The basis functions in &#966; 0 (t) and &#966; 1 (t) are set to B-spline bases with K 0 = K 1 = 8, and the knots are equally spaced in [0, 1].</p><p>To evaluate the estimation accuracy, we report estimation errors B 0 -B0 F , B 1 -B1 F , and N i=1 &#956; (i) -&#956;(i) F /N , where &#956;(i) (t) = g -1 ( B0 &#215; 3 &#966; 0 (t) + x i ( B1 &#215; 3 &#966; 1 (t))). Furthermore, to evaluate the edge selection accuracy from our method, we report the true positive rate (TPR) and false positive rate (FPR) in identifying the nonzero entries in B 1 . The first elementwise approach EdgeReg does not estimate spline coefficients B 0 and B 1 , and thus their estimation errors are not reported. While estimates from EdgeReg are not sparse, the p-values for B ljj (t h )'s are directly available from standard GLM model fitting. In our evaluations we apply Bonferroni correction to these p-values and then calculate the TPR and FPR in identifying the edges modulated by x 1 , that is, entries (j, j )'s with nonzero time-varying covariate effects B 1jj (t)'s. Specifically, we define P BC &#8712; R n&#215;n&#215;T , where P BC jj h is the pvalue in evaluating the significance of B 1jj (t h ) from ( <ref type="formula">7</ref>), after the Bonferroni correction of n &#215; n &#215; T tests. Defining H &#8712; R n&#215;n with H jj = 1{min(P BC jj . ) &#8804; 0.05}, and</p><p>The FPR and TPR are calculated as</p><p>where * denotes the elementwise product. The second elementwise approach DEdgeReg does not give sparse estimates, and there are no readily available inference results to calculate pvalues; hence, their TPRs and FPRs are not reported.</p><p>We set the number of subjects N = 50, 100, the number of equally spaced time points T = 100, and consider the number of nodes n = 50, 100, rank R = 2, 5, and the sparsity proportion s 0 = 0.05, 0.1, respectively. Tables <ref type="table">1</ref> and <ref type="table">2</ref> report the average accuracy measures over 50 replications with sample size N = 50, 100, respectively, with the standard deviations shown in parentheses. It is seen that our proposed method achieves the best performance among all competing methods, in terms of both estimation accuracy and selection accuracy, and this holds for different sample sizes N , numbers of nodes n, ranks R, and sparsity levels s 0 . Moreover, the estimation error of our method DNetReg decreases as network size n, rank R, and sparsity proportion s 0 decrease and as sample size N increases. Estimation errors from EdgeReg and DEdgeReg are not sensitive to R or s 0 , as they are elementwise approaches and do not consider the low-rank and sparsity structure in the tensor coefficients. In terms of edge selection accuracy, EdgeReg is overly conservative after the Bonferroni correction, and its TPRs are close to zero. In our analysis we also considered FDR (or BH) correction <ref type="bibr">(Benjamini and Hochberg (1995)</ref>) for p-value corrections, and the results are similar. We further report the eBIC over varying R, K 0 , K 1 and s 0 values in Section S3 of the Supplementary Material, consider sensitivity analysis of our method under model misspecifications in Section S4 of the Supplementary Material, and discuss the computational cost of DNetReg in Section S5 of the Supplementary Material.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Application to the social cognition study in the human connectome project.</head><p>We consider the social cognition study in the HCP study, a publicly available HCP study with preprocessed cortical-surface fMRI data (see <ref type="url">https://www.humanconnectome.org/study/hcp-</ref>young-adult/document/900-subjects-data-release). <ref type="foot">2</ref> In our analysis we mapped the preprocessed data to the Desikan-Killiany Atlas <ref type="bibr">(Desikan et al. (2006)</ref>). The social cognition study collects task-related fMRI data from N = 843 healthy adult subjects. Specifically, the fMRI data are collected on 274 evenly spaced time points covering an initiation countdown (five seconds) followed by five video blocks (23 seconds each) with fixation blocks in between (15 seconds each). The first 11 scans in the initiation countdown period are removed in our analysis. The fMRI data are then preprocessed and summarized as a 68&#215;263 spatial-temporal matrix for each subject using the Desikan-Killiany Atlas <ref type="bibr">(Desikan et al. (2006)</ref>) with n = 68 ROIs (see Table <ref type="table">S4</ref>). For each subject the dynamic network is constructed by calculating a sequence of connectivity matrices of dimension 68 &#215; 68 over T sliding windows, each summarizing the connectivity between the 68 brain regions in a given window. We let the number of samples in each window and the overlap between adjacent windows be 30 and 5, respectively, giving a total of T = 47 networks per subject. To check whether the results are sensitive to the parameters used to construct the dynamic connectivity matrices, we also consider setting the window size to 25 and 35 and step size 3 and 7. The result are shown in Section S6.1 of the Supplementary Material. We determine connectivity in each individual by computing Pearson correlations between samples from a pair of regions and create binary networks by setting A jj (t h ) = 1 if the computed correlation value is greater than 0.5 and A jj (t h ) = 0 otherwise, and this gives an average network density about 15%. This procedure can eliminate weak functional connectivity and is commonly employed in existing neuroscience literature <ref type="bibr">(Power et al. (2011)</ref>). In our analysis we have also considered partial correlation matrices <ref type="bibr">(Meinshausen and B&#252;hlmann (2006)</ref>) and applied other thresholding values, such as 0.6, to the Pearson correlation matrix and found that our main results and qualitative findings remain similar. Notably, we have also considered continuous edge weights, calcu-  <ref type="formula">3</ref>) with Gaussian distribution and g(&#8226;) as the identity link, and the results are shown in Section S6.2 of the Supplementary Material. Our analysis is carried out in R, and the code is available online. <ref type="foot">3</ref>In the social cognition study, there are 374 males and 469 females, aged between 22 and 36 years old. In addition, the HCP data contain social covariates on three categories, including friendship and loneliness in the Companionship category, perceived hostility and perceived rejection in the Social Distress category, and emotional support and instrumental support in the Social Support category. Our preliminary analysis finds that these covariates are highly correlated with correlations ranging from 0.4 to 0.6 and a median of 0.5. Due to this consideration and to reduce computational cost, we focus our analysis on perceived hostility (e.g., how often people argue with me, yell at me, or criticize me) representing the Social Distress category. A higher perceived hostility shows increased social distress, which is the extent to which an individual perceives his/her daily social interactions as negative or distressing <ref type="bibr">(Lieberman (2007)</ref>). Moreover, we have also considered the friendship covariate from the Companionship category and the emotional support from the Social Support category. The model fitting results are shown in Section S6.3 of the Supplementary Material. The goal of our analysis is to characterize the baseline brain connectivity in tasks, to ascertain how social covariates modulate the subject-level connectivity changes, and to examine whether there are any sex-specific differences. We apply our proposed model to the dynamic connectivity networks from males and females, respectively (see more discussions in Section S6.3). The social covariate is standardized to have mean zero and variance one, and we consider cubic B-spline basis with different equally spaced knots of B 0 and B 1 . Using the eBIC function, the basis dimensions were selected as K 0 = 8, K 1 = 10, the rank as R = 9, and the sparsity proportion as s 0 = 0.11 for males and K 0 = 12, K 1 = 10, R = 11, and s 0 = 0.17 for females. The results of fitting the model with cubic natural spline are shown in Section S6.4 of the Supplementary Material.</p><p>Baseline brain connectivity. We start by examining the estimated baseline connectivity coefficient B0 . Figure <ref type="figure">2</ref> plots the baseline connectivity averaged over time, that is, g -1 ( T h=1 B0 &#215; 3 &#966; 0 (t h )), where g(&#8226;) is the logit link function and nodes are organized by results from a K-means clustering. Specifically, we apply K-means clustering, based on SVD of the average connectivity matrix T h=1 B0 &#215; 3 &#966; 0 (t h ) for male, and identify five clusters among the 68 ROIs. The members of each cluster are given in Table <ref type="table">3</ref>. While clustering results using B 0 estimated for females are similar, we use the same clustering labels to facilitate comparisons. Anatomically, the first community contains mostly nodes in the cingulate gyrus, the second and fifth communities contain nodes from the frontal lobe, the third community contains nodes from the temporal lobe, and the fourth community contains nodes from the frontal, parietal, occipital, and temporal lobes (see Tables <ref type="table">3</ref> and <ref type="table">S4</ref>). Many of the 68 anatomic ROIs in the Desikan Atlas overlap with the resting-state functional modules. We find that community 1 is associated with emotion formation and processing, commu-</p><p>TABLE 3 The anatomic regions of interest in the identified communities 1 Caudalanteriorcingulate, isthmuscingulate, paracentral, posteriorcingulate, transversetemporal, insula 2 Cuneus, lingual, pericalcarine, postcentral, precentral, precuneus, rostralmiddlefrontal, superiorfrontal, supramarginal 3 Entorhinal, parahippocampal, temporalpole 4 Bankssts, caudalmiddlefrontal, fusiform, inferiorparietal, inferiortemporal, lateraloccipital, middletemporal, parsopercularis, parstriangularis, superiorparietal, superiortemporal 5 Lateralorbitofrontal, medialorbitofrontal, parsorbitalis, rostralanteriorcingulate, frontalpole nity 2 is related to visual, attention, and emotion regulation modules, and community 4 is enriched with visual and object identification. The lateral occipital gyrus in community 4, lingual gyrus in community 2, and pericalcarine gyrus in community 2 are from the occipital lobe, a region responsible for interpreting the visual world <ref type="bibr">(Goldenberg et al. (1991)</ref>), and is seen to be active for both males and females. For both males and females, we find that connectivity between communities 2 and 4 is more active both within and between the two hemispheres, especially the temporal parietal junction, superior temporal cortex regions, and occipital gyrus, which are all relevant in social cognition. This is in line with previous research which showed that mental animations stimulate these regions <ref type="bibr">(Castelli et al. (2000)</ref>, <ref type="bibr">Barch et al. (2013)</ref>). Within each hemisphere, males have higher connectivity within communities 2 and 4, and this is consistent with the existing findings that males have increased intrahemispheric connectivity <ref type="bibr">(Ingalhalikar et al. (2014)</ref>).</p><p>Social effects on brain connectivity and sex differences. We next examine the estimated covariate effect coefficient B1 . Figure <ref type="figure">3</ref> plots the heatmap of estimates for males and females, where we show ( T t=1 B1 &#215; 3 &#966; 1 (t))/T , the mean coefficient of B1 , representing the covariate effect on brain connectivity averaged over time.</p><p>It is seen that the social effects on connectivity show different patterns in males and females. Specifically, the estimated B1 has sparsity portions equal to 0.17 and 0.11 for females and males, respectively. Hence, the social effect on connectivity is more sparse in males, and such differences are observed in within-and between-community connectivity within and across hemispheres. Compared to males, the social covariate is seen to more notably decrease the connectivity between communities 2 and 4 within the right hemisphere and also across hemispheres in females, suggesting that the task-related brain connectivity in females is more sensitive to social stress. This supports existing findings that social stress influences brain connectivity and emotional perception differently for males and females <ref type="bibr">(Mather et al. (2010)</ref>). In general, the perceived hostile social distress covariate has a negative impact on the connection response for females both within and between communities, particularly for community 4, while it tends to have a positive impact on the connection response for males. The above findings on sex-specific difference are interesting, and they may be linked to existing research on sex differences in neural response to psychological stress <ref type="bibr">(Wang et al. (2007)</ref>).</p><p>Figure <ref type="figure">4</ref> shows the social effects on brain connectivity in males and females during different periods of the experiments including watching a mental video, resting, and watching a random video. It is seen that, during a mental video, the connectivity between temporal and occipital lobes in females is more affected by social stress. The temporal lobe plays an important role in visual perception and processing emotions, and the occipital lobe is related to visual processing, containing most of the anatomical region of the visual cortex <ref type="bibr">(Goldenberg et al. (1991)</ref>). Moreover, We discover that females have greater activity in the across-lobe connectivity, particularly among the temporal, parietal, and occipital lobes (Ingalhalikar et al. ( <ref type="formula">2014</ref>)) in all periods. This finding suggests some interesting patterns that warrant further investigation and validation.</p><p>Finally, we have visualized elementwise plots of estimated baseline effect B 0 (t) and covariate effect B 1 (t) in Figure <ref type="figure">5</ref> below. Our presentation focuses on region 22 in baseline effect B 0 (t) and regions 10 and 28 in covariate effect B 1 (t), as they show interesting differences between females and males. Specifically, plots (a)-(b) show the time-varying baseline effects in B 0 (t) between the left posterior cingulate (region 22) and other regions. For females, there are five notable valleys near time points t = 5, 11, 20, 32, and 42, corresponding precisely to three mental tasks and two random video clips. In contrast, these trends are much less noticeable in males, suggesting the posterior cingulate is more responsive in females during video viewing. This is supported by existing findings that the posterior cingulate gyrus is more active in females during emotion-related tasks, given its role in emotion regulation and processing <ref type="bibr">(Proverbio et al. (2009)</ref>). Plots and females, the connectivity between Region 62 and left superior parietal (28) is the most affected by the social covariate, reflecting the important role of the superior parietal lobule in cognitive tasks <ref type="bibr">(Alahmadi (2021)</ref>). It is seen that, for females, the connectivity between Region 62 and others are more affected by the social covariate.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>4.1.</head><p>A permutation based procedure to examine sex differences. Developing the asymptotic distribution of the estimated B 1 under the CP low-rank and sparsity constraints in our model is challenging. In this section we conduct an ad hoc permutation based procedure to examine whether the previously identified sex-specific differences are meaningful.</p><p>Specifically, we randomly permute the sex labels across subjects 100 times. In each permutation i, we divide the N = 843 samples into two groups, based on the permuted sex labels, and apply the proposed model to the male and female groups, respectively. We denote the coefficient tensors as B male,i 0 (or B female,i 0 ) and B male,i 1 (or B female,i 1 ) in permutation i, i &#8712; [100]. To quantify the difference in B 1 between males and females, we calculate the absolute distance between the coefficient vectors for each (j, j ). Specifically, we write We define a binary matrix S &#8712; R n&#215;n S jj = 1 100 i=1 1 D obs jj &gt; D per,i jj &#8805; 95 , where 1(&#8226;) is the indicator function. Correspondingly, S jj = 1 if the observed sex difference is the same as or greater than the 95th percentile of permuted sex difference. Figure 6(b) plots S, which further illustrates that the sex differences within community 4 and between communities 2 and 4 are likely significant (regions in the blue and black dashed lines), affirming the findings in Figure 3. We also consider comparing results based on subgraphs of interests, shown in Figure S5, where sex-specific differences from observed data are consistently greater than those from permuted data. FIG. 7. Heatmaps of ( T t=1 B1 &#215; 3 &#966; 1 (t))/T estimated by DEdgeReg, with rows and columns ordered the same as Figure 4. The top and bottom panels show the results without and with thresholding, respectively. 4.2. Results using existing methods. We evaluate the performance of two alternative methods including an elementwise method DEdgeReg, evaluated in Section 3, and GLSNet <ref type="bibr">(Zhang, Sun and Li (2023)</ref>), a nontime-varying matrix response regression model. Since GLSNet is not designed to model dynamic networks, we directly calculate the connectivity matrix based on all 263 scans using the same procedure that binarizes the Pearson correlation matrix. Using GLSNet and the recommended eBIC function in <ref type="bibr">Zhang, Sun and Li (2023)</ref>, the rank is selected as R = 5 and the sparsity proportion as s 0 = 0.025 for males, and R = 13 and s 0 = 0.0359 for females.</p><p>Figure <ref type="figure">7</ref> shows ( T t=1 B1 &#215; 3 &#966; 1 (t))/T (representing the effect averaged over time) estimated by DEdgeReg with or without threshlding at &#177;0.1. It is seen that the estimates from the elementwise method DEdgeReg are very noisy, and they identify a large number of regions with relatively small signals. The estimated social score effect coefficients B1 from GLSNet are shown in Figure <ref type="figure">8</ref>. For both males and females, the estimates are highly sparse. In males, several areas associated with social cognition, such as the temporal parietal junction, superior temporal cortical regions, and occipital gyrus, do not appear to be engaged. This can potentially due to the fact that GLNet ignores the dynamic changes of brain connectivity during the experiments.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Discussion.</head><p>In this paper we study the task-evoked brain connectivity by introducing a new semiparametric dynamic network response regression that relates a dynamic brain connectivity network to a vector of subject-level covariates. A key advantage of our method is to exploit the structure of dynamic imaging coefficients in the form of high-order tensors. We briefly comment on potential future research. In our model setup, we assume that the tensor coefficients B 1 , . . . , B p are sparse. More complex structures, such as the low-rank or fused structures, can be considered as well, though they will increase the computation time and complexity in tuning. We have included two extended models with time-varying covariates and low-rank covariate effects. The corresponding simulation results and real data analyses are presented in Sections S1 and S2 of the Supplementary Material, respectively. In Section 4.1 we consider an ad hoc permutation procedure to evaluate the identified sexspecific differences. A more rigorous approach would be to derive the asymptotic distribution of B 1 and carry out hypothesis testing. This is not a trivial task due to the involvement of both low-rank and sparse constraints on the model parameters. We leave this investigation to future research.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0"><p>See https://wiki.humanconnectome.org/docs/How%20To%20Connect%20to%20Connectome%20Data%20 via%20AWS.html for download instructions.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_1"><p>https://github.com/maoyuzhang09/DNetReg.</p></note>
		</body>
		</text>
</TEI>
