File size: 2,740 Bytes
7848e31
 
d3768a5
 
57a0175
 
 
7848e31
2e3e316
d3768a5
2e3e316
d3768a5
2e3e316
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
a9dcc9f
1c75972
 
2e3e316
 
 
d3768a5
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
---
license: apache-2.0
language:
- en
pipeline_tag: text-classification
tags:
- ESG
---
# SASB ESG Sentence Classifier (Stage 1)

The SASB ESG sentence classifier is a BERT-based model fine-tuned to separate ESG from non-ESG sentences. It was trained using data extracted from documents conforming to the Sustainability Accounting Standards Board (SASB) standards. For a full description of our training data, please refer to https://www.kaggle.com/datasets/edwardjunprung/sasb-aligned-esg-sentences.

Our classifier consists of a two-stage pipeline:
1. **[Stage 1](https://huggingface.co/ejunprung/SASB-ESG-Sentence-Classifier)** - Classify sentences as ESG or not.
2. **[Stage 2](https://huggingface.co/ejunprung/SASB-ESG-Classification-26Categories)** - Subsequently, bucket ESG sentences into one of [26 SASB categories](https://sasb.org/standards/materiality-finder/).

## Goal

The objective is to categorize sentences within ESG documents in order to evaluate corporate ESG alignment. As an illustration, upon analyzing all sentences in Activision's annual ESG report, the SASB ESG model determined that more than 40% of sentences correspond with the Diversity & Inclusion and Human Rights SASB categories. Consequently, we can infer that Activision places a significant emphasis on these initiatives, which positions it as a potential candidate for investment funds with social impact mandates.

## Model Output

SASB ESG sentence classifier outputs either 0 (i.e. Not ESG) or 1 (i.e. ESG).

## Results

Below, we present a comparison between our two-stage approach and a baseline heuristic method. The baseline method categorizes ESG sentences based solely on the presence of specific keywords. For instance, any sentence containing the phrase "human rights" would be automatically labeled under that category.

| Model          | Parent ESG Category | Child ESG Category | 
|----------------|:-------------------:|:------------------:|
| Heuristic      |         31%         |        34%         |
| SASB ESG Model |         71%         |        61%         |

**Parent Category** = Environment, Social Capital, Human Capital, Business Model & Innovation, Leadership & Governance<br>
**Child Category** = GHG Emissions, Air Quality, etc. Please visit https://sasb.org/standards/materiality-finder to see full list.

## Misc
- Developed by: [Victor Chen](https://www.linkedin.com/in/victorzitianchen), [Jude Zhu](https://www.linkedin.com/in/judewzhu), [Michael Liston](https://www.linkedin.com/in/michael-c-liston/), [Edward Junprung](https://www.linkedin.com/in/ejunprung/)
- Parent Model: [bert_en_cased_L-12_H-768_A-12](https://huggingface.co/google/bert_uncased_L-12_H-768_A-12)
- Blog Post: https://www.gopeaks.org/esg-mapper