This model was created through the following steps:

  1. Use a web crawler to collect papers via the arXiv API.
  2. The search keyword was "qubit AND (IBM OR IQM OR Rigetti)", restricted to the time range 2018 - 2024.
  3. The data was collected into JSON files with the columns Title, Abstract, Authors, arXiv_id, Date, and Author_company.
  4. Feed the JSON files to llama-3-8b-bnb-4bit and fine-tune the model with Unsloth on Google Colab, using an A100 GPU.
  5. That's it! :)
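The crawl in steps 1-3 can be sketched as below. This is a minimal illustration, not the author's actual crawler: the arXiv API endpoint and Atom namespace are real, but the query parameters (`max_results`, sorting, the `submittedDate` range filter) are assumptions, and a tiny inline Atom sample stands in for a live response so the JSON-record shaping is visible without a network call. The `Author_company` column from step 3 is omitted here, since the card does not say how affiliations were derived.

```python
import json
import urllib.parse
import xml.etree.ElementTree as ET

# Build the arXiv API query for step 2. The keyword string comes from the
# card; the date-range filter and other parameters are assumptions.
BASE = "http://export.arxiv.org/api/query"
params = {
    "search_query": (
        "all:qubit AND (all:IBM OR all:IQM OR all:Rigetti)"
        " AND submittedDate:[201801010000 TO 202412312359]"
    ),
    "start": 0,
    "max_results": 100,
}
query_url = BASE + "?" + urllib.parse.urlencode(params)
# A real crawler would now fetch: urllib.request.urlopen(query_url).read()

ATOM = "{http://www.w3.org/2005/Atom}"  # Atom XML namespace used by arXiv

def entry_to_record(entry):
    """Convert one Atom <entry> into the JSON schema from step 3."""
    return {
        "Title": entry.findtext(ATOM + "title", "").strip(),
        "Abstract": entry.findtext(ATOM + "summary", "").strip(),
        "Authors": [a.findtext(ATOM + "name")
                    for a in entry.findall(ATOM + "author")],
        # The <id> field is a URL like http://arxiv.org/abs/2301.00001v1;
        # keep only the trailing identifier.
        "arXiv_id": entry.findtext(ATOM + "id", "").rsplit("/", 1)[-1],
        "Date": entry.findtext(ATOM + "published", "")[:10],
    }

# Inline sample standing in for an API response (illustrative data only).
sample = """<feed xmlns="http://www.w3.org/2005/Atom">
  <entry>
    <id>http://arxiv.org/abs/2301.00001v1</id>
    <title>Example qubit paper</title>
    <summary>An abstract mentioning IBM qubits.</summary>
    <published>2023-01-02T00:00:00Z</published>
    <author><name>A. Author</name></author>
  </entry>
</feed>"""

records = [entry_to_record(e)
           for e in ET.fromstring(sample).findall(ATOM + "entry")]
print(json.dumps(records, indent=2))
```

Each record then maps directly onto one training example for the fine-tuning step.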

Uploaded model

  • Developed by: Ken4070TiS
  • License: apache-2.0
  • Finetuned from model: unsloth/llama-3-8b-bnb-4bit

This llama model was trained 2x faster with Unsloth and Hugging Face's TRL library.

GGUF

  • Model: Ken4070TiS/qubit_arXiv_LoRA_llama3
  • Model size: 8.03B params
  • Architecture: llama
  • Quantizations: 4-bit, 5-bit, 8-bit