bhaskartripathi commited on
Commit
87f7b68
·
verified ·
1 Parent(s): 376b347

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -6
README.md CHANGED
@@ -26,6 +26,12 @@ library_name: peft
26
  **IndicFinGPT** is a specialized transformer model, re-engineered from **EleutherAI's GPT-Neo-125M** architecture, which is a GPT-3 class architecture, designed specifically for the **Indian financial market**. The model has undergone **retraining on its top layers** to enhance its performance in providing insights into the **top 100 companies listed in the NIFTY50 Index, BSE, and NSE exchanges**.
27
  The primary objective of this model is to **serve the unique needs of Indian stock markets** and **investors engaged in chartless trading**. IndicFinGPT aims to provide insights that could **minimize capital loss and drawdowns** while **maximizing financial ratios** such as the **Sharpe, Sortino, Calmar, Omega, and Treynor Ratios**. Additionally, the model is designed to help in **reducing maximum drawdowns** in financial portfolios, offering a robust AI solution tailored to **India’s dynamic financial landscape**.
28
 
 
 
 
 
 
 
29
 
30
  ## First Indic-Stock Small Language Model Focused Top 100 Companies Listed in NSE and BSE Stock Exchanges
31
 
@@ -35,11 +41,6 @@ The primary objective of this model is to **serve the unique needs of Indian sto
35
  </p>
36
 
37
 
38
- ## Training Data and Procedure
39
-
40
- **IndicFinGPT 125M** utilizes the **Pile dataset** created by EleutherAI and includes the **top 100 tickers** (by volume and liquidity) from Indian stock markets, covering data from **January 1, 2018, to October 30, 2024**. This dataset encompasses diverse market periods, including **pre-COVID-19 (stable), COVID-19 (volatile), and post-COVID-19 (recovery phase)**. Such comprehensive data exposure allows the model to recognize **problem-solution patterns across various bull and bear runs**.
41
- The training data also incorporates **local influences** such as cultural factors and **market-specific volatility**, enhancing its ability to perform **automated technical analysis** for chartless trading. Key capabilities include identifying **classical chart patterns** using technical analysis, conducting **earnings analysis**, interpreting **market sentiment** from multiple sources, and **assessing risks**, all aimed at **improving decision-making for Indian investors**.
42
- This model was trained on 310 billion tokens over 692,380 steps. It was trained as a masked autoregressive language model, using cross-entropy loss, F1, Accuracy, Precision, recall,Pattern Detection Rate, and Cross-Entropy Loss.
43
 
44
  ## Key Highlights
45
 
@@ -47,7 +48,7 @@ This model was trained on 310 billion tokens over 692,380 steps. It was trained
47
  2. Market Sentiment: Built-in understanding of Indian market sentiment and cultural influences
48
  3. Macro-Economic Indicators: Adapted to domestic economic and financial metrics
49
  4. Indian Economic Influences: Awareness of timing, festival impacts, and market-specific volatility
50
- 5. 10+ Technical Indicators
51
 
52
  ## Implementation
53
 
 
26
  **IndicFinGPT** is a specialized transformer model, re-engineered from **EleutherAI's GPT-Neo-125M** architecture, which is a GPT-3 class architecture, designed specifically for the **Indian financial market**. The model has undergone **retraining on its top layers** to enhance its performance in providing insights into the **top 100 companies listed in the NIFTY50 Index, BSE, and NSE exchanges**.
27
  The primary objective of this model is to **serve the unique needs of Indian stock markets** and **investors engaged in chartless trading**. IndicFinGPT aims to provide insights that could **minimize capital loss and drawdowns** while **maximizing financial ratios** such as the **Sharpe, Sortino, Calmar, Omega, and Treynor Ratios**. Additionally, the model is designed to help in **reducing maximum drawdowns** in financial portfolios, offering a robust AI solution tailored to **India’s dynamic financial landscape**.
28
 
29
+ ## Training Data and Procedure
30
+
31
+ **IndicFinGPT 125M** utilizes the **Pile dataset** created by EleutherAI and includes the **top 100 tickers** (by volume and liquidity) from Indian stock markets, covering data from **January 1, 2018, to October 30, 2024**. This dataset encompasses diverse market periods, including **pre-COVID-19 (stable), COVID-19 (volatile), and post-COVID-19 (recovery phase)**. Such comprehensive data exposure allows the model to recognize **problem-solution patterns across various bull and bear runs**.
32
+ The training data also incorporates **local influences** such as cultural factors and **market-specific volatility**, enhancing its ability to perform **automated technical analysis** for chartless trading. Key capabilities include identifying **classical chart patterns** using technical analysis, conducting **earnings analysis**, interpreting **market sentiment** from multiple sources, and **assessing risks**, all aimed at **improving decision-making for Indian investors**.
33
+ This model was trained on 310 billion tokens over 692,380 steps. It utilized Quantized Low-Rank Adoption as a masked autoregressive language model, utilizing cross-entropy loss, F1, Accuracy, Precision, recall,Pattern Detection Rate, and Cross-Entropy Loss as performance metrics.
34
+
35
 
36
  ## First Indic-Stock Small Language Model Focused Top 100 Companies Listed in NSE and BSE Stock Exchanges
37
 
 
41
  </p>
42
 
43
 
 
 
 
 
 
44
 
45
  ## Key Highlights
46
 
 
48
  2. Market Sentiment: Built-in understanding of Indian market sentiment and cultural influences
49
  3. Macro-Economic Indicators: Adapted to domestic economic and financial metrics
50
  4. Indian Economic Influences: Awareness of timing, festival impacts, and market-specific volatility
51
+
52
 
53
  ## Implementation
54