---
license: apache-2.0
language:
- en
base_model: openai-community/gpt2
pipeline_tag: text-generation
tags:
- art
---

## Model Card: GPT2_Shakespeare

### Model Description
This model is a fine-tuned version of the GPT-2 base model, trained on works by William Shakespeare to generate text in his tone and style. It is designed to produce coherent, contextually relevant text that mimics the distinctive style and phrasing of the source material.

### Model Details
- **Model Type:** GPT-2 ([Base](https://huggingface.co/openai-community/gpt2))
- **Training Dataset:** Works by William Shakespeare ([GitHub](https://gist.githubusercontent.com/blakesanie/dde3a2b7e698f52f389532b4b52bc254/raw/76fe1b5e9efcf0d2afdfd78b0bfaa737ad0a67d3/shakespeare.txt)); a sketch for fetching it follows this list
- **Intended Use Cases:**
  - Creative writing assistance
  - Educational purposes for studying literary styles
  - Text generation in the style of William Shakespeare
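
The dataset link above points to a single plain-text file. As an illustration only (this is not the card author's training script, and `requests` is an assumed dependency), one way to fetch it:

```python
import requests

# URL of the plain-text Shakespeare corpus linked above
DATASET_URL = (
    "https://gist.githubusercontent.com/blakesanie/"
    "dde3a2b7e698f52f389532b4b52bc254/raw/"
    "76fe1b5e9efcf0d2afdfd78b0bfaa737ad0a67d3/shakespeare.txt"
)

# Download the corpus; illustrative only, not the original training code
text = requests.get(DATASET_URL, timeout=30).text
print(f"Corpus size: {len(text):,} characters")
```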

### Usage
You can use this model to generate text in Python with the Hugging Face `transformers` library.

#### Installation
Ensure you have the `transformers` library installed, along with a backend such as PyTorch:

```bash
pip install transformers torch
```

#### Inference

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Load the fine-tuned model and tokenizer
model_name = "sartajbhuvaji/gpt2_B_Shakespeare"
model = GPT2LMHeadModel.from_pretrained(model_name)
tokenizer = GPT2Tokenizer.from_pretrained(model_name)

# Prepare input text
input_text = "To be, or not to be, that is the question:"
input_ids = tokenizer.encode(input_text, return_tensors="pt")

# Generate text with sampling
output = model.generate(
    input_ids,
    max_length=200,                       # total length cap, prompt included
    num_return_sequences=1,
    no_repeat_ngram_size=2,               # block repeated 2-grams
    do_sample=True,
    top_k=50,                             # sample from the 50 most likely tokens
    top_p=0.95,                           # nucleus sampling threshold
    pad_token_id=tokenizer.eos_token_id,  # GPT-2 has no pad token; avoids a warning
)

# Decode the generated text
generated_text = tokenizer.decode(output[0], skip_special_tokens=True)
print(generated_text)
```
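
You can also use the high-level `pipeline` API. This is a minimal sketch; the sampling parameters mirror the example above and are illustrative, not values prescribed by this card:

```python
from transformers import pipeline

# Build a text-generation pipeline backed by the fine-tuned model
generator = pipeline("text-generation", model="sartajbhuvaji/gpt2_B_Shakespeare")

# Sample one continuation (parameters are illustrative)
result = generator(
    "To be, or not to be, that is the question:",
    max_length=200,
    do_sample=True,
    top_k=50,
    top_p=0.95,
)
print(result[0]["generated_text"])
```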

#### Limitations and Biases
This model has been trained on a specific dataset, and its responses will reflect the content and style of that dataset. The model may generate text that reflects the biases present in the original data. It is not suitable for generating factual information or for use cases requiring highly accurate and unbiased outputs.

#### Ethical Considerations
Use this model responsibly. Text generated by the model should not be used for misleading or harmful purposes. Note that the model may reflect historical biases inherent in the original text sources.

#### Acknowledgments
This model is based on the GPT-2 architecture by OpenAI and was fine-tuned using the Hugging Face `transformers` library.