segestic
/

phi2_medical_seg

Text Generation

Model card Files Files and versions Community

segestic commited on Sep 14

Commit

e0d60ee

•

1 Parent(s): c0f806e

first commit read_me

Files changed (1) hide show

README.md +73 -3

README.md CHANGED Viewed

@@ -1,3 +1,73 @@
----
-license: mit
----

+---
+license: mit
+license_link: https://huggingface.co/microsoft/phi-2/resolve/main/LICENSE
+language:
+- en
+pipeline_tag: text-generation
+tags:
+- nlp
+- Medicine
+datasets:
+- medalpaca/medical_meadow_health_advice
+- medalpaca/medical_meadow_mediqa
+- medalpaca/medical_meadow_mmmlu
+- medalpaca/medical_meadow_medical_flashcards
+- medalpaca/medical_meadow_wikidoc_patient_information
+- medalpaca/medical_meadow_wikidoc
+- medalpaca/medical_meadow_pubmed_causal
+- medalpaca/medical_meadow_medqa
+- medalpaca/medical_meadow_cord19
+base_model: microsoft/phi-2
+---
+## Model Summary
+Phi2_med_seg is a fine-tuned version of the Phi-2 model, specifically optimized for medical applications. This model has been trained using the Trainer framework on several different datasets from the MedAlpaca collection, which focuses on medical question answering and conversational AI.
+This model can answer information about different excplicit ideas in medicine
+## How to Get Started with the Model
+## Sample Code
+```python
+import torch
+from transformers import AutoModelForCausalLM, AutoTokenizer
+torch.set_default_device("cuda")
+model = AutoModelForCausalLM.from_pretrained("segestic/phi2_medical_seg", torch_dtype="auto", trust_remote_code=True)
+tokenizer = AutoTokenizer.from_pretrained("segestic/phi2_medical_seg", trust_remote_code=True)
+inputs = tokenizer('''def print_prime(n):
+   """
+   What is Medcine?
+   """''', return_tensors="pt", return_attention_mask=False)
+outputs = model.generate(**inputs, max_length=200)
+text = tokenizer.batch_decode(outputs)[0]
+print(text)
+```
+## Training
+The fine-tuning process involved leveraging various medical datasets to enhance the model's ability to understand and generate relevant medical information. This approach aims to improve the model's performance in medical contexts, making it a valuable tool for healthcare professionals and researchers alike. By utilizing the Trainer framework, Phi2_med_seg benefits from advanced training techniques that help refine its responses and accuracy in medical scenarios.
+### Model
+* Architecture: a Transformer-based model with next-word prediction objective
+* Context length: 2048 tokens
+### Software
+* [PyTorch](https://github.com/pytorch/pytorch)
+* [DeepSpeed](https://github.com/microsoft/DeepSpeed)
+* [Flash-Attention](https://github.com/HazyResearch/flash-attention)