Big Data Analysis of Sports and Physical Activities among Korean Adolescents.

Sung-Un Park, Hyunkyun Ahn, Dong-Kyu Kim, Wi-Young So
Author Information
  1. Sung-Un Park: Department of Sport & Leisure Studies, College of Arts & Physical Education, Shingyeong University, Hwaseong-si 18274, Korea. ORCID
  2. Hyunkyun Ahn: Department of Sport & Leisure Studies, Division of Arts & Health, Myongji College, Seoul 03656, Korea.
  3. Dong-Kyu Kim: Department of Sport Management, Graduate School of Technology Management, Kyung Hee University, Yongin-si 17104, Korea.
  4. Wi-Young So: Sports and Health Care Major, College of Humanities and Arts, Korea National University of Transportation, Chungju-si 27469, Korea. ORCID

Abstract

The Korean government (Ministry of Culture, Sports and Tourism, Ministry of Health and Welfare, and Ministry of Education) has framed policies and conducted many projects to encourage adolescents to be more physically active. Despite these efforts, the participation rate of physical activity in Korean adolescents keeps decreasing. Thus, the purpose of this study was to analyze the perception of sports and physical activity in Korean adolescents through big data analysis of the last 10 years and to provide research data and statistical direction with regard to sports and physical activity participation in Korean adolescents. For data collection, data from 1 January 2010 to 31 December 2019 were collected from Naver (NAVER Corp., Seongnam, Korea), Daum (Kakao Corp., Jeju, Korea), and Google (Alphabet Inc., Mountain View, CA, USA), which are the most widely used search engines in Korea, using TEXTOM 4.0 (The Imc Inc., Daegu, Korea), a big data collection and analysis solution. Keywords such as "adolescent + sports + physical activity" were used. TEXTOM 4.0 can generate various collection lists at once using keywords. Collected data were processed through text mining (frequency analysis, term frequency-inverse document frequency analysis) and social network analysis (SNA) (degree centrality, convergence of iterated correlations analysis) by using TEXTOM 4.0 and UCINET 6 social network analysis software (Analytic Technologies Corp., Lexington, KY, USA). A total of 9278 big data (10.36 MB) were analyzed. Frequency analysis of the top 50 terms through text mining showed exercise (872), mind (851), health (824), program (782), and burden (744) in a descending order. Term frequency-inverse document frequency analysis revealed exercise (2108.070), health (1961.843), program (1928.765), mind (1861.837), and burden (1722.687) in a descending order. SNA showed that the terms with the greatest degree of centrality were exercise (0.02857), program (0.02406), mind (0.02079), health (0.02062), and activity (0.01872) in a descending order. Convergence of the iterated correlations analysis indicated five clusters: exercise and health, child to adult, sociocultural development, therapy, and program. However, female gender, sports for all, stress, and wholesome did not have a high enough correlation to form one cluster. Thus, this study provides basic data and statistical direction to increase the rate of physical activity participation in Korean adolescents by drawing significant implications based on terms and clusters through bid data analysis.

Keywords

References

  1. Ageing Res Rev. 2012 Jul;11(3):404-20 [PMID: 22465452]
  2. Am J Prev Med. 2003 Jan;24(1):22-8 [PMID: 12554020]
  3. Appl Physiol Nutr Metab. 2016 Jun;41(6 Suppl 3):S197-239 [PMID: 27306431]
  4. J Phys Act Health. 2018 Nov 1;15(S2):S251-S273 [PMID: 30475137]
  5. Acta Paediatr. 2005 Dec;94(12):1791-7 [PMID: 16421041]
  6. Ann Dermatol. 2016 Feb;28(1):1-5 [PMID: 26848212]
  7. Br J Sports Med. 2011 Sep;45(11):886-95 [PMID: 21807669]
  8. Cochrane Database Syst Rev. 2015 Jan 05;1:CD008366 [PMID: 25556970]
  9. BMC Public Health. 2012 Apr 02;12:258 [PMID: 22471711]
  10. Health Aff (Millwood). 2014 Jul;33(7):1115-22 [PMID: 25006136]
  11. Prev Med. 1999 Oct;29(4):305-12 [PMID: 10547056]
  12. Circulation. 2005 Apr 19;111(15):1999-2012 [PMID: 15837955]
  13. Gene Regul Syst Bio. 2008 May 15;2:193-201 [PMID: 19787083]
  14. Prog Cardiovasc Dis. 2011 May-Jun;53(6):419-28 [PMID: 21545928]
  15. J Exerc Sci Fit. 2019 Jan;17(1):26-33 [PMID: 30662511]
  16. Arch Pediatr Adolesc Med. 2003 Dec;157(12):1212-8 [PMID: 14662578]

MeSH Term

Adolescent
Big Data
Child
Data Analysis
Exercise
Female
Humans
Male
Republic of Korea
Sports

Word Cloud

Created with Highcharts 10.0.0analysisdata0KoreanadolescentsphysicalactivitysportsKoreaexercisehealthprogramMinistryparticipationbigcollectionCorpusingTEXTOM4frequencytermsminddescendingorderSportsrateThusstudy10statisticaldirectionIncUSAused+textminingfrequency-inversedocumentsocialnetworkSNAdegreecentralityiteratedcorrelationsshowedburdengovernmentCultureTourismHealthWelfareEducationframedpoliciesconductedmanyprojectsencouragephysicallyactiveDespiteeffortskeepsdecreasingpurposeanalyzeperceptionlastyearsprovideresearchregard1January201031December2019collectedNaverNAVERSeongnamDaumKakaoJejuGoogleAlphabetMountainViewCAwidelysearchenginesImcDaegusolutionKeywords"adolescentactivity"cangeneratevariouslistskeywordsCollectedprocessedtermconvergenceUCINET6softwareAnalyticTechnologiesLexingtonKYtotal927836MBanalyzedFrequencytop50872851824782744Termrevealed21080701961843192876518618371722687greatest0285702406020790206201872Convergenceindicatedfiveclusters:childadultsocioculturaldevelopmenttherapyHoweverfemalegenderstresswholesomehighenoughcorrelationformoneclusterprovidesbasicincreasedrawingsignificantimplicationsbasedclustersbidBigDataAnalysisPhysicalActivitiesamongAdolescentsactivities

Similar Articles

Cited By