ESG2PreEM: Automated ESG grade assessment framework using pre-trained ensemble models.

Haein Lee, Seon Hong Lee, Heungju Park, Jang Hyun Kim, Hae Sun Jung
Author Information
  1. Haein Lee: Department of Applied Artificial Intelligence/ Department of Human Artificial Intelligence Interaction, Sungkyunkwan University, 03063, Seoul, South Korea.
  2. Seon Hong Lee: Department of Applied Artificial Intelligence/ Department of Human Artificial Intelligence Interaction, Sungkyunkwan University, 03063, Seoul, South Korea.
  3. Heungju Park: SKK Business School, Sungkyunkwan University, 03063, Seoul, South Korea.
  4. Jang Hyun Kim: Department of Interaction Science/ Department of Human Artificial Intelligence Interaction, Sungkyunkwan University, 03063, Seoul, South Korea.
  5. Hae Sun Jung: Department of Applied Artificial Intelligence, Sungkyunkwan University, 03063, Seoul, South Korea.

Abstract

Incorporating environmental, social, and governance (ESG) criteria is essential for promoting sustainability in business and is considered a set of principles that can increase a firm's value. This research proposes a strategy using text-based automated techniques to rate ESG. For autonomous classification, data were collected from the news archive LexisNexis and classified as E, S, or G based on the ESG materials provided by the Refinitiv-Sustainable Leadership Monitor, which has over 450 metrics. In addition, Bidirectional Encoder Representations from Transformers (BERT), Robustly optimized BERT approach (RoBERTa), and A Lite BERT (ALBERT) models were trained to accurately categorize preprocessed ESG documents using a voting ensemble model, and their performances were measured. The accuracy of the ensemble model utilizing BERT and ALBERT was found to be 80.79% with batch size 20. Additionally, this research validated the performance of the framework for companies included in the Dow Jones Industrial Average (DJIA) and compared it with the grade provided by Morgan Stanley Capital International (MSCI), a globally renowned ESG rating agency known for having the highest creditworthiness. This study supports the use of sophisticated natural language processing (NLP) techniques to attain important knowledge from large amounts of text-based data to improve ESG assessment criteria established by different rating agencies.

Keywords

References

  1. Financ Res Lett. 2021 Jan;38:101870 [PMID: 36569646]
  2. Educ Psychol Meas. 2023 Aug;83(4):831-854 [PMID: 37398846]
  3. Math Biosci Eng. 2023 Aug 29;20(9):17018-17036 [PMID: 37920045]

Word Cloud

Created with Highcharts 10.0.0ESGBERTusingensemblemodellanguagecriteriaresearchtext-basedtechniquesdataprovidedALBERTmodelsframeworkgraderatingprocessingNLPassessmentIncorporatingenvironmentalsocialgovernanceessentialpromotingsustainabilitybusinessconsideredsetprinciplescanincreasefirm'svalueproposesstrategyautomatedrateautonomousclassificationcollectednewsarchiveLexisNexisclassifiedESGbasedmaterialsRefinitiv-SustainableLeadershipMonitor450metricsadditionBidirectionalEncoderRepresentationsTransformersRobustlyoptimizedapproachRoBERTaLitetrainedaccuratelycategorizepreprocesseddocumentsvotingperformancesmeasuredaccuracyutilizingfound8079%batchsize20AdditionallyvalidatedperformancecompaniesincludedDowJonesIndustrialAverageDJIAcomparedMorganStanleyCapitalInternationalMSCIgloballyrenownedagencyknownhighestcreditworthinessstudysupportsusesophisticatednaturalattainimportantknowledgelargeamountsimproveestablisheddifferentagenciesESG2PreEM:Automatedpre-trainedEnsembleNaturalPretrained

Similar Articles

Cited By (4)