Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.

Andrea Ferrario, Burcu Demiray, Kristina Yordanova, Minxia Luo, Mike Martin
Author Information
  1. Andrea Ferrario: Department of Management, Technology, and Economics, ETH Zurich, Zurich, Switzerland.
  2. Burcu Demiray: Department of Psychology, University of Zurich, Zurich, Switzerland.
  3. Kristina Yordanova: Institute of Computer Science, University of Rostock, Rostock, Germany.
  4. Minxia Luo: Department of Psychology, University of Zurich, Zurich, Switzerland.
  5. Mike Martin: Department of Psychology, University of Zurich, Zurich, Switzerland.

Abstract

BACKGROUND: Reminiscence is the act of thinking or talking about personal experiences that occurred in the past. It is a central task of old age that is essential for healthy aging, and it serves multiple functions, such as decision-making and introspection, transmitting life lessons, and bonding with others. The study of social reminiscence behavior in everyday life can be used to generate data and detect reminiscence from general conversations.
OBJECTIVE: The aims of this original paper are to (1) preprocess coded transcripts of older adults' conversations in German with natural language processing (NLP), and (2) implement and evaluate learning strategies using different NLP features and machine learning algorithms to detect reminiscence in a corpus of transcripts.
METHODS: The methods in this study comprise (1) collecting and coding transcripts of older adults' conversations in German, (2) preprocessing transcripts to generate NLP features (bag-of-words models, part-of-speech tags, pretrained German word embeddings), and (3) training machine learning models to detect reminiscence using random forests, support vector machines, and adaptive and extreme gradient boosting algorithms. The data set comprises 2214 transcripts, including 109 transcripts with reminiscence. Due to class imbalance in the data, we introduced three learning strategies: (1) class-weighted learning, (2) a meta-classifier consisting of a voting ensemble, and (3) data augmentation with the Synthetic Minority Oversampling Technique (SMOTE) algorithm. For each learning strategy, we performed cross-validation on a random sample of the training data set of transcripts. We computed the area under the curve (AUC), the average precision (AP), precision, recall, F1 score, and specificity on the test data for all combinations of NLP features, algorithms, and learning strategies.
RESULTS: Class-weighted support vector machines on bag-of-words features outperformed all other classifiers (AUC=0.91, AP=0.56, precision=0.5, recall=0.45, F1=0.48, specificity=0.98), followed by support vector machines on SMOTE-augmented data and word embedding features (AUC=0.89, AP=0.54, precision=0.35, recall=0.59, F1=0.44, specificity=0.94). For the meta-classifier strategy, adaptive and extreme gradient boosting algorithms trained on word embeddings and bag-of-words outperformed all other classifiers and NLP features; however, the performance of the meta-classifier learning strategy was lower than that of the other strategies, with highly imbalanced precision-recall trade-offs.
CONCLUSIONS: This study provides evidence of the applicability of NLP and machine learning pipelines for the automated detection of reminiscence in older adults' everyday conversations in German. The methods and findings of this study could be relevant for designing unobtrusive computer systems for the real-time detection of social reminiscence in the everyday life of older adults and for classifying its functions. With further improvements, these systems could be deployed in health interventions aimed at improving older adults' well-being by promoting self-reflection and suggesting coping strategies for cases of dysfunctional reminiscence, which can undermine physical and mental health.
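
The class imbalance and the evaluation measures named in the abstract can be illustrated with a minimal Python sketch. It makes two assumptions that are not stated in the abstract: (a) class weights follow the common "balanced" heuristic, n_samples / (n_classes * class_count), which may differ from the weighting the authors used, and (b) the confusion counts passed to `binary_metrics` are hypothetical, chosen only so that the rounded values resemble those reported for the class-weighted support vector machine; the study's actual confusion matrix is not given here. The function names `balanced_class_weights` and `binary_metrics` are illustrative.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count).

    Assumption: this is the widely used "balanced" heuristic, not
    necessarily the weighting scheme applied in the study.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * n) for cls, n in counts.items()}


def safe_ratio(numerator, denominator):
    """Return numerator / denominator, or 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def binary_metrics(tp, fp, fn, tn):
    """Compute precision, recall, F1, and specificity from confusion counts."""
    precision = safe_ratio(tp, tp + fp)
    recall = safe_ratio(tp, tp + fn)
    f1 = safe_ratio(2 * precision * recall, precision + recall)
    specificity = safe_ratio(tn, tn + fp)
    return {
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "specificity": specificity,
    }


# Corpus sizes from the abstract: 2214 transcripts, 109 with reminiscence.
labels = [1] * 109 + [0] * (2214 - 109)
weights = balanced_class_weights(labels)

# Hypothetical confusion counts (NOT taken from the study).
metrics = binary_metrics(tp=10, fp=10, fn=12, tn=411)

print({cls: round(w, 2) for cls, w in weights.items()})
print({name: round(value, 2) for name, value in metrics.items()})
```

Under this heuristic a reminiscence transcript counts roughly 19 times as much as a non-reminiscence one during training, which is one way to keep a classifier from ignoring the minority class. The metric functions also show why specificity can stay high while precision and recall remain moderate when positives are rare.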

MeSH Terms

Aged
Algorithms
Communication
Humans
Machine Learning
Memory, Long-Term
Natural Language Processing
