Living organisms learn on multiple time scales: evolutionary as well as individual-lifetime learning. These two learning modes are complementary: the innate phenotypes developed through evolution significantly influence lifetime learning. However, it is still unclear how these two learning methods interact and whether there is a benefit to part of the system being optimized on a different time scale using a population-based approach while the rest of it is trained on a different time-scale using an individualistic learning algorithm. In this work, we study the benefits of such a hybrid approach using an actor-critic framework where the critic part of an agent is optimized over evolutionary time based on its ability to train the actor part of an agent during its lifetime. Typically, critics are optimized on the same time-scale as the actor using the Bellman equation to represent long-term expected reward. We show that evolution can find a variety of different solutions that can still enable an actor to learn to perform a behavior during its lifetime. We also show that although the solutions found by evolution represent different functions, they all provide similar training signals during the lifetime. This suggests that learning on multiple time-scales can effectively simplify the overall optimization process in the actor-critic framework by finding one of many solutions that can still train an actor just as well. Furthermore, analysis of the evolved critics can yield additional possibilities for reinforcement learning beyond the Bellman equation. 
                        more » 
                        « less   
                    
                            
                            Interaction Between Evolution and Learning in NK Fitness Landscapes
                        
                    
    
            Artificial Life has a long tradition of studying the interaction between learning and evolution. And, thanks to the increase in the use of individual learning techniques in Artificial Intelligence, there has been a recent revival of work combining individual and evolutionary learning. Despite the breadth of work in this area, the exact trade-offs between these two forms of learning remain unclear. In this work, we systematically examine the effect of task difficulty, the individual learning approach, and the form of inheritance on the performance of the population across different combinations of learning and evolution. We analyze in depth the conditions in which hybrid strategies that combine lifetime and evolutionary learning outperform either lifetime or evolutionary learning in isolation. We also discuss the importance of these results in both a biological and algorithmic context. 
        more » 
        « less   
        
    
                            - Award ID(s):
- 1845322
- PAR ID:
- 10174175
- Date Published:
- Journal Name:
- ALIFE 2020: The 2020 Conference on Artificial Life
- Issue:
- 32
- Page Range / eLocation ID:
- 761 - 767
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Lifetime learning, or the change (or acquisition) of behaviors during a lifetime, based on experience, is a hallmark of living organisms. Multiple mechanisms may be involved, but biological neural circuits have repeatedly demonstrated a vital role in the learning process. These neural circuits are recurrent, dynamic, and non-linear and models of neural circuits employed in neuroscience and neuroethology tend to involve, accordingly, continuous-time, non-linear, and recurrently interconnected components. Currently, the main approach for finding configurations of dynamical recurrent neural networks that demonstrate behaviors of interest is using stochastic search techniques, such as evolutionary algorithms. In an evolutionary algorithm, these dynamic recurrent neural networks are evolved to perform the behavior over multiple generations, through selection, inheritance, and mutation, across a population of solutions. Although, these systems can be evolved to exhibit lifetime learning behavior, there are no explicit rules built into these dynamic recurrent neural networks that facilitate learning during their lifetime (e.g., reward signals). In this work, we examine a biologically plausible lifetime learning mechanism for dynamical recurrent neural networks. We focus on a recently proposed reinforcement learning mechanism inspired by neuromodulatory reward signals and ongoing fluctuations in synaptic strengths. Specifically, we extend one of the best-studied and most-commonly used dynamic recurrent neural networks to incorporate the reinforcement learning mechanism. First, we demonstrate that this extended dynamical system (model and learning mechanism) can autonomously learn to perform a central pattern generation task. Second, we compare the robustness and efficiency of the reinforcement learning rules in relation to two baseline models, a random walk and a hill-climbing walk through parameter space. Third, we systematically study the effect of the different meta-parameters of the learning mechanism on the behavioral learning performance. Finally, we report on preliminary results exploring the generality and scalability of this learning mechanism for dynamical neural networks as well as directions for future work.more » « less
- 
            Humans have long known how to co-opt evolutionary processes for their own benefit. Carefully choosing which individuals to breed so that beneficial traits would take hold, they have domesticated dogs, wheat, cows and many other species to fulfil their needs. Biologists have recently refined these ‘artificial selection’ approaches to focus on microorganisms. The hope is to obtain microbes equipped with desirable features, such as the ability to degrade plastic or to produce valuable molecules. However, existing ways of using artificial selection on microbes are limited and sometimes not effective. Computer scientists have also harnessed evolutionary principles for their own purposes, developing highly effective artificial selection protocols that are used to find solutions to challenging computational problems. Yet because of limited communication between the two fields, sophisticated selection protocols honed over decades in evolutionary computing have yet to be evaluated for use in biological populations. In their work, Lalejini et al. compared popular artificial selection protocols developed for either evolutionary computing or work with microorganisms. Two computing selection methods showed promise for improving directed evolution in the laboratory. Crucially, these selection protocols differed from conventionally used methods by selecting for both diversity and performance, rather than performance alone. These promising approaches are now being tested in the laboratory, with potentially far-reaching benefits for medical, biotech, and agricultural applications. While evolutionary computing owes its origins to our understanding of biological processes, it has much to offer in return to help us harness those same mechanisms. The results by Lalejini et al. help to bridge the gap between computational and biological communities who could both benefit from increased collaboration.more » « less
- 
            null (Ed.)The stability of supported metal nanoparticles determines the activity and lifetime of heterogeneous catalysts. Catalysts can destabilize through several thermodynamic and kinetic pathways, and the competition between these mechanisms complicates efforts to quantify and predict the overall evolution of supported nanoparticles in reactive environments. Pairing in situ transmission electron microscopy with unsupervised machine learning, we quantify the destabilization of hundreds of supported Au nanoparticles in real-time to develop a model describing the observed particle evolution as a competition between evaporation and surface diffusion. Data mining of particle evolution statistics allows us to determine physically reasonable values for the model parameters, quantify the particle size at which the Gibbs–Thomson pressure accelerates the evaporation process, and explore how individual particle interactions deviate from the mean-field model. This approach can be applied to a wide range of supported nanoparticle systems, allowing quantitative insight into the mechanisms that control their evolution in reactive environments.more » « less
- 
            The study of individual behavioral variation, sometimes called animal personalities or behavioral types, is now a well-established area of research in behavioral ecology and evolution. Considerable theoretical work has developed predictions about its ecological and evolutionary causes and consequences, and studies testing these theories continue to grow. Here, we synthesize the current empirical work to shed light on which theories are well supported and which need further refinement. We find that the major frameworks explaining the existence of individual behavioral variation, the pace-of-life syndrome hypothesis and state-dependent feedbacks models, have mixed support. The consequences of individual behavioral variation are well studied at the individual level but less is known about consequences at higher levels such as among species and communities. The focus of this review is to reevaluate and reestablish the foundation of individual behavioral variation research: What do we know? What questions remain? And where are we going next?more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    