bhaskartripathi
commited on
Commit
•
376b347
1
Parent(s):
cbba859
Update README.md
Browse files
README.md
CHANGED
@@ -31,15 +31,15 @@ The primary objective of this model is to **serve the unique needs of Indian sto
|
|
31 |
|
32 |
<p align="center">
|
33 |
<img src="https://huggingface.co/bhaskartripathi/GPT_Neo_Market_Analysis/resolve/main/indicBull.JPG" alt="IndicFinGPT Logo" width="400" height="300">
|
34 |
-
<strong
|
35 |
</p>
|
36 |
|
37 |
|
38 |
-
## Training Data
|
39 |
|
40 |
**IndicFinGPT 125M** utilizes the **Pile dataset** created by EleutherAI and includes the **top 100 tickers** (by volume and liquidity) from Indian stock markets, covering data from **January 1, 2018, to October 30, 2024**. This dataset encompasses diverse market periods, including **pre-COVID-19 (stable), COVID-19 (volatile), and post-COVID-19 (recovery phase)**. Such comprehensive data exposure allows the model to recognize **problem-solution patterns across various bull and bear runs**.
|
41 |
The training data also incorporates **local influences** such as cultural factors and **market-specific volatility**, enhancing its ability to perform **automated technical analysis** for chartless trading. Key capabilities include identifying **classical chart patterns** using technical analysis, conducting **earnings analysis**, interpreting **market sentiment** from multiple sources, and **assessing risks**, all aimed at **improving decision-making for Indian investors**.
|
42 |
-
|
43 |
|
44 |
## Key Highlights
|
45 |
|
|
|
31 |
|
32 |
<p align="center">
|
33 |
<img src="https://huggingface.co/bhaskartripathi/GPT_Neo_Market_Analysis/resolve/main/indicBull.JPG" alt="IndicFinGPT Logo" width="400" height="300">
|
34 |
+
<strong>भारतीय बाजार की शीर्ष 100 कंपनियों का वित्तीय विश्लेषण करने वाला पहला Small Language Model</strong>
|
35 |
</p>
|
36 |
|
37 |
|
38 |
+
## Training Data and Procedure
|
39 |
|
40 |
**IndicFinGPT 125M** utilizes the **Pile dataset** created by EleutherAI and includes the **top 100 tickers** (by volume and liquidity) from Indian stock markets, covering data from **January 1, 2018, to October 30, 2024**. This dataset encompasses diverse market periods, including **pre-COVID-19 (stable), COVID-19 (volatile), and post-COVID-19 (recovery phase)**. Such comprehensive data exposure allows the model to recognize **problem-solution patterns across various bull and bear runs**.
|
41 |
The training data also incorporates **local influences** such as cultural factors and **market-specific volatility**, enhancing its ability to perform **automated technical analysis** for chartless trading. Key capabilities include identifying **classical chart patterns** using technical analysis, conducting **earnings analysis**, interpreting **market sentiment** from multiple sources, and **assessing risks**, all aimed at **improving decision-making for Indian investors**.
|
42 |
+
This model was trained on 310 billion tokens over 692,380 steps. It was trained as a masked autoregressive language model, using cross-entropy loss, F1, Accuracy, Precision, recall,Pattern Detection Rate, and Cross-Entropy Loss.
|
43 |
|
44 |
## Key Highlights
|
45 |
|