We present a novel self-supervised approach for hierarchical representation learning and segmentation of perceptual inputs in a streaming fashion. Our research addresses how to semantically group streaming inputs into chunks at various levels of a hierarchy while simultaneously learning, for each chunk, robust global representations throughout the domain. To achieve this, we propose STREAMER, an architecture that is trained layer-by-layer, adapting to the complexity of the input domain. In our approach, each layer is trained with two primary objectives: making accurate predictions into the future and providing necessary information to other levels for achieving the same objective. The event hierarchy is constructed by detecting prediction error peaks at different levels, where a detected boundary triggers a bottom-up information flow. At an event boundary, the encoded representation of inputs at one layer becomes the input to a higher-level layer. Additionally, we design a communication module that facilitates top-down and bottom-up exchange of information during the prediction process. Notably, our model is fully self-supervised and trained in a streaming manner, enabling a single pass on the training data. This means that the model encounters each input only once and does not store the data. We evaluate the performance of our model on the egocentric EPIC-KITCHENS dataset, specifically focusing on temporal event segmentation. Furthermore, we conduct event retrieval experiments using the learned representations to demonstrate the high quality of our video event representations.
Complementary networks of cortical somatostatin interneurons enforce layer specific control
The neocortex is functionally organized into layers. Layer 4 receives the densest bottom-up sensory inputs, while layers 2/3 and 5 receive top-down inputs that may convey predictive information. A subset of cortical somatostatin (SST) neurons, the Martinotti cells, gate top-down input by inhibiting the apical dendrites of pyramidal cells in layers 2/3 and 5, but it is unknown whether an analogous inhibitory mechanism controls activity in layer 4. Using high-precision circuit mapping, in vivo optogenetic perturbations, and single-cell transcriptional profiling, we reveal complementary circuits in the mouse barrel cortex involving genetically distinct SST subtypes that specifically and reciprocally interconnect with excitatory cells in different layers: Martinotti cells connect with layers 2/3 and 5, whereas non-Martinotti cells connect with layer 4. By enforcing layer-specific inhibition, these parallel SST subnetworks could independently regulate the balance between bottom-up and top-down input.
- Award ID(s): 1707398
- PAR ID: 10099435
- Date Published:
- Journal Name: eLife
- Volume: 8
- ISSN: 2050-084X
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
-
Deep neural networks (DNNs) have achieved near-human level accuracy on many datasets across different domains. But they are known to produce incorrect predictions with high confidence on inputs far from the training distribution. This challenge of lack of calibration of DNNs has limited the adoption of deep learning models in high-assurance systems such as autonomous driving, air traffic management, cybersecurity, and medical diagnosis. The problem of detecting when an input is outside the training distribution of a machine learning model, and hence, its prediction on this input cannot be trusted, has received significant attention recently. Several techniques based on statistical, geometric, topological, or relational signatures have been developed to detect the out-of-distribution (OOD) or novel inputs. In this paper, we present a runtime monitor based on predictive processing and dual process theory. We posit that the bottom-up deep neural networks can be monitored using top-down context models comprising two layers. The first layer is a feature density model that learns the joint distribution of the original DNN’s inputs, outputs, and the model’s explanation for its decisions. The second layer is a graph Markov neural network that captures an even broader context. We demonstrate the efficacy of our monitoring architecture in recognizing out-of-distribution and out-of-context inputs on the image classification and object detection tasks.
-
Salient objects grab attention because they stand out from their surroundings. Whether this phenomenon is accomplished by bottom-up sensory processing or requires top-down guidance is debated. We tested these alternative hypotheses by measuring how early and in which cortical layer(s) neural spiking distinguished a target from a distractor. We measured synaptic and spiking activity across cortical columns in mid-level area V4 of male macaque monkeys performing visual search for a color singleton. A neural signature of attentional capture was observed in the earliest response in the input layer 4. The magnitude of this response predicted response time and accuracy. Errant behavior followed errant selection. Because this response preceded top-down influences and arose in the cortical layer not targeted by top-down connections, these findings demonstrate that feedforward activation of sensory cortex can underlie attentional priority.
-
Large stocks of soil carbon (C) and nitrogen (N) in northern permafrost soils are vulnerable to remobilization under climate change. However, there are large uncertainties in present‐day greenhouse gas (GHG) budgets. We compare bottom‐up (data‐driven upscaling and process‐based models) and top‐down (atmospheric inversion models) budgets of carbon dioxide (CO2), methane (CH4) and nitrous oxide (N2O) as well as lateral fluxes of C and N across the region over 2000–2020. Bottom‐up approaches estimate higher land‐to‐atmosphere fluxes for all GHGs. Both bottom‐up and top‐down approaches show a sink of CO2 in natural ecosystems (bottom‐up: −29 (−709, 455), top‐down: −587 (−862, −312) Tg CO2‐C yr−1) and sources of CH4 (bottom‐up: 38 (22, 53), top‐down: 15 (11, 18) Tg CH4‐C yr−1) and N2O (bottom‐up: 0.7 (0.1, 1.3), top‐down: 0.09 (−0.19, 0.37) Tg N2O‐N yr−1). The combined global warming potential of all three gases (GWP‐100) cannot be distinguished from neutral. Over shorter timescales (GWP‐20), the region is a net GHG source because CH4 dominates the total forcing. The net CO2 sink in Boreal forests and wetlands is largely offset by fires and inland water CO2 emissions as well as CH4 emissions from wetlands and inland waters, with a smaller contribution from N2O emissions. Priorities for future research include the representation of inland waters in process‐based models and the compilation of process‐model ensembles for CH4 and N2O. Discrepancies between bottom‐up and top‐down methods call for analyses of how prior flux ensembles impact inversion budgets, more and well‐distributed in situ GHG measurements and improved resolution in upscaling techniques.
-
Two new alkali vanadate carbonates with divalent transition metals have been synthesized as large single crystals via a high-temperature (600 °C) hydrothermal technique. Compound I, Rb2Mn3(VO4)2CO3, crystallizes in the trigonal crystal system in the space group P3̄1c, and compound II, K2Co3(VO4)2CO3, crystallizes in the hexagonal space group P6₃/m. Both structures contain honeycomb layers and triangular lattices made from edge-sharing MO6 octahedra and MO5 trigonal bipyramids, respectively. The honeycomb and triangular layers are connected along the c-axis through tetrahedral [VO4] groups. The MO5 units are connected with each other by carbonate groups in the ab-plane by forming a triangular magnetic lattice. The difference in space groups between I and II was also investigated with Density Functional Theory (DFT) calculations. Single crystal magnetic characterization of I indicates three magnetic transitions at 77 K, 2.3 K, and 1.5 K. The corresponding magnetic structures for each magnetic transition of I were determined using single crystal neutron diffraction. At 77 K the compound orders in the MnO6-honeycomb layer in a Néel-type antiferromagnetic orientation while the MnO5 triangular lattice ordered below 2.3 K in a colinear ‘up–up–down’ fashion, followed by a planar ‘Y’ type magnetic structure. K2Co3(VO4)2CO3 (II) exhibits a canted antiferromagnetic ordering below T_N = 8 K. The Curie–Weiss fit (200–350 K) gives a Curie–Weiss temperature of −42 K suggesting a dominant antiferromagnetic coupling in the Co2+ magnetic sublattices.

