Vamsi002 commited on
Commit
37703b9
·
verified ·
1 Parent(s): 96aa1db

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -13
README.md CHANGED
@@ -1,13 +1,17 @@
1
- ---
2
- title: Albanian
3
- emoji: 🐠
4
- colorFrom: purple
5
- colorTo: pink
6
- sdk: streamlit
7
- sdk_version: 1.39.0
8
- app_file: app.py
9
- pinned: false
10
- short_description: LSTM English and Albanian
11
- ---
12
-
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
1
+ Language Model for Next Token Prediction (Albanian & English) Description:
2
+
3
+ This project builds a language model using LSTM to predict the next word based on combined Albanian and English sentences. The model generates coherent text in both languages and evaluates its performance using perplexity. Files:
4
+
5
+ Albanian.ipynb: Code for training the LSTM model on Albanian and English.
6
+ model_checkpoint.keras: Best model during training.
7
+ final_model.keras: Final trained model.
8
+ loss_history.csv: Training loss over time.
9
+ Key Outputs:
10
+
11
+ Generated Text: Generates text in English and French.
12
+ Perplexity Score: Final perplexity score is 24.58.
13
+ How to Run:
14
+
15
+ Clone/download the notebook and files.
16
+ Install tensorflow, numpy, matplotlib, pandas, keras.
17
+ Run the notebook to train and generate text.