File size: 4,146 Bytes

fb4bfd8
 
 
 
 
c05f5a3
 
 
 
 
 
edd762c
 
 
dc1122f
 
 
 
 
 
 
 
 
 
b00cf22
 
 
 
dc1122f
865c0cb
1bf945b
 
dc1122f
 
 
 
b00cf22
dc1122f
 
 
 
b00cf22
 
dc1122f
 
 
 
 
 
 
 
 
b00cf22
 
 
 
 
 
b651ec8
dc1122f
 
 
 
 
b00cf22
 
dc1122f
 
 
 
 
 
 
b00cf22
dc1122f
b00cf22
 
dc1122f
b00cf22
 
dc1122f
b00cf22
 
 
 
dc1122f
b00cf22
 
 
 
dc1122f
b00cf22
 
dc1122f
b00cf22
 
 
 
 
 
 
 
 
dc1122f
 
 
 
 
 
 
 
 
b00cf22
 
 
 
dc1122f
 
 
 
 
b00cf22
dc1122f
 
 
 
b00cf22
dc1122f
a1d0ce3

---
datasets:
- brucewayne0459/Skin_diseases_and_care
language:
- en
license: mit
tags:
- medical
- dermatology
- skin_disease
- skin_care
- unsloth
- trl
- sft
---

## Model Details

### Model Description

<!-- Provide a longer summary of what this model is. -->



- **Developed by:** Bruce_Wayne(The Batman)
- **Funded by [optional]:** Wayne Industies
- **Model type:** Text Generation
- **Finetuned from model [optional]:** OpenBioLLM(llama-3)(aaditya/Llama3-OpenBioLLM-8B)

## You can find the gguf versions here --> https://huggingface.co/brucewayne0459/OpenBioLLm-Derm-gguf
### please let me know how the model works -->https://forms.gle/N14zZTkLpUr6Hf4BA
### Thank you!
## Uses

### Direct Use

This model is fine-tuned on skin diseases and dermatology data and is used for a dermatology chatbot to provide clear, accurate, and helpful information about various skin diseases, skin care routines, treatments, and related dermatological advice.


## Bias, Risks, and Limitations

This model is trained on dermatology data, which might contain inherent biases. It is important to note that the model's responses should not be considered a substitute for professional medical advice. There may be limitations in understanding rare skin conditions or those not well-represented in the training data.
The model still need to be fine-tuned further to get accurate answers.

### Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

## How to Get Started with the Model

Use the code below to get started with the model.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "brucewayne0459/OpenBioLLm-Derm"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
```

## Training Details

### Training Data

The model is fine-tuned on a dataset containing information about various skin diseases and dermatology care.
brucewayne0459/Skin_diseases_and_care

### Training Procedure

<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->

#### Preprocessing [optional]

"""Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
You are a highly knowledgeable and empathetic dermatologist. Provide clear, accurate, and helpful information about various skin diseases, skin care routines, treatments, and related dermatological advice.

### Input:
{}

### Response:
{}
"""
EOS_TOKEN = tokenizer.eos_token  # Must add EOS_TOKEN

def formatting_prompts_func(examples):
    inputs = examples["Topic"]
    outputs = examples["Information"]
    texts = []

Prompt passed while fine tuning the model
#### Training Hyperparameters

Training regime: The model was trained using the following hyperparameters:
Per device train batch size: 2
Gradient accumulation steps: 4
Warmup steps: 5
Max steps: 120
Learning rate: 2e-4
Optimizer: AdamW (8-bit)
Weight decay: 0.01
LR scheduler type: Linear



## Environmental Impact

<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->

Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).

- **Hardware Type:** Tesls T4 gpu
- **Hours used:** 1hr
- **Cloud Provider:** Google Colab


## Technical Specifications [optional]

### Model Architecture and Objective

This model is based on the LLaMA (Large Language Model Meta AI) architecture and fine-tuned to provide dermatological advice.


#### Hardware

The training was performed on Tesla T4 gpu with 4-bit quantization and gradient checkpointing to optimize memory usage.


### Feel free to provide any missing details or correct the assumptions made, and I'll update the model card accordingly.