Gachomba
/

multichoice-question-generator

Text2Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

multichoice-question-generator / README.md

Gachomba's picture

Update README.md

150e88a verified 3 months ago

|

2.34 kB

	---
	license: apache-2.0
	base_model: facebook/bart-large
	tags:
	- generated_from_trainer
	model-index:
	- name: multichoice-question-generator
	results: []
	---



	# multichoice-question-generator

	This model is a fine-tuned version of [facebook/bart-large](https://huggingface.co/facebook/bart-large) on an unknown dataset.
	It achieves the following results on the evaluation set:
	- Loss: 0.1787

	## Model description

	More information needed

	## Intended uses & limitations
	This is an early version of a model meant to generate multichoice questions from text
	To load the model:
	from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

	tokenizer = AutoTokenizer.from_pretrained("Gachomba/multichoice-question-generator")
	model = AutoModelForSeq2SeqLM.from_pretrained("Gachomba/multichoice-question-generator")
	# tokenize input text
	import torch
	device = "cuda" if torch.cuda.is_available() else "cpu"

	def tokenize_input(input_text):
	inputs = tokenizer(input_text, return_tensors="pt", truncation=True, padding='max_length', max_length=1024)
	return inputs.input_ids.to(device), inputs.attention_mask.to(device)

	# generate output from the model
	def generate_output(input_text):
	input_ids, attention_mask = tokenize_input(input_text)
	outputs = model.generate(input_ids=input_ids, attention_mask=attention_mask, max_length=512)
	return tokenizer.decode(outputs[0], skip_special_tokens=True)

	# get user input and generate a response
	def get_response():
	user_input = input("Enter your text: ")
	response = generate_output(user_input)
	print("Generated Output:", response)

	get_response()



	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 2e-05
	- train_batch_size: 8
	- eval_batch_size: 8
	- seed: 42
	- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
	- lr_scheduler_type: linear
	- num_epochs: 3

	### Training results

	\| Training Loss \| Epoch \| Step \| Validation Loss \|
	\|:-------------:\|:-----:\|:----:\|:---------------:\|
	\| 0.2218 \| 1.0 \| 1000 \| 0.1910 \|
	\| 0.1913 \| 2.0 \| 2000 \| 0.1811 \|
	\| 0.1727 \| 3.0 \| 3000 \| 0.1787 \|


	### Framework versions

	- Transformers 4.42.4
	- Pytorch 2.3.1+cu121
	- Tokenizers 0.19.1