Abstract
The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Until now neural networks have not been capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks that they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on a hand-written digit dataset and by learning several Atari 2600 games sequentially.
Keywords: artificial intelligence; continual learning; deep learning; stability plasticity; synaptic consolidation.
Conflict of interest statement
The authors declare no conflict of interest.
Figures
Comment in
-
Avoiding Catastrophic Forgetting.Hasselmo ME. Hasselmo ME. Trends Cogn Sci. 2017 Jun;21(6):407-408. doi: 10.1016/j.tics.2017.04.001. Epub 2017 Apr 23. Trends Cogn Sci. 2017. PMID: 28442279
-
Reply to Huszár: The elastic weight consolidation penalty is empirically valid.Kirkpatrick J, Pascanu R, Rabinowitz N, Veness J, Desjardins G, Rusu AA, Milan K, Quan J, Ramalho T, Grabska-Barwinska A, Hassabis D, Clopath C, Kumaran D, Hadsell R. Kirkpatrick J, et al. Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):E2498. doi: 10.1073/pnas.1800157115. Epub 2018 Feb 20. Proc Natl Acad Sci U S A. 2018. PMID: 29463734 Free PMC article. No abstract available.
-
Note on the quadratic penalties in elastic weight consolidation.Huszár F. Huszár F. Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):E2496-E2497. doi: 10.1073/pnas.1717042115. Epub 2018 Feb 20. Proc Natl Acad Sci U S A. 2018. PMID: 29463735 Free PMC article. No abstract available.
References
-
- Legg S, Hutter M. Universal intelligence: A definition of machine intelligence. Minds Mach. 2007;17(4):391–444.
-
- French RM. Catastrophic forgetting in connectionist networks. Trends Cognit Sci. 1999;3(4):128–135. - PubMed
-
- McCloskey M, Cohen NJ. Catastrophic interference in connectionist networks: The sequential learning problem. In: Bower GH, editor. The Psychology of Learning and Motivation. Vol 24. Academic; New York: 1989. pp. 109–165.
-
- McClelland JL, McNaughton BL, O’Reilly RC. Why there are complementary learning systems in the hippocampus and neocortex: Insights from the successes and failures of connectionist models of learning and memory. Psychol Rev. 1995;102(3):419–457. - PubMed
-
- Kumaran D, Hassabis D, McClelland JL. What learning systems do intelligent agents need? Complementary learning systems theory updated. Trends Cogn Sci. 2016;20(7):512–534. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
