openvino-ci
committed on
Commit
•
3bdd55e
1
Parent(s):
68d603b
Upload folder using huggingface_hub
Browse files- README.md +19 -55
- config.json +5 -3
- generation_config.json +6 -0
- openvino_detokenizer.bin +2 -2
- openvino_detokenizer.xml +2 -2
- openvino_model.bin +2 -2
- openvino_model.xml +2 -2
- openvino_tokenizer.bin +2 -2
- openvino_tokenizer.xml +2 -2
- tokenizer.json +0 -0
- tokenizer_config.json +1 -1
README.md
CHANGED
@@ -1,90 +1,54 @@
|
|
1 |
---
|
2 |
license: mit
|
3 |
-
|
4 |
-
- en
|
5 |
---
|
6 |
-
|
7 |
# dolly-v2-3b-fp16-ov
|
8 |
-
|
9 |
-
* Model creator: [Databricks](https://huggingface.co/databricks)
|
10 |
* Original model: [dolly-v2-3b](https://huggingface.co/databricks/dolly-v2-3b)
|
11 |
|
12 |
## Description
|
13 |
|
14 |
-
This is [dolly-v2-3b](https://huggingface.co/databricks/dolly-v2-3b) model converted to the [OpenVINO™ IR](https://docs.openvino.ai/2024/documentation/openvino-ir-format.html) (Intermediate Representation) format.
|
15 |
-
|
16 |
## Compatibility
|
17 |
|
18 |
The provided OpenVINO™ IR model is compatible with:
|
19 |
|
20 |
-
* OpenVINO version 2024.
|
21 |
-
* Optimum Intel 1.
|
22 |
|
23 |
-
## Running Model Inference
|
24 |
|
25 |
1. Install packages required for using [Optimum Intel](https://huggingface.co/docs/optimum/intel/index) integration with the OpenVINO backend:
|
26 |
|
27 |
-
```
|
28 |
-
pip install optimum[openvino]
|
29 |
-
```
|
30 |
-
|
31 |
-
2. Run model inference:
|
32 |
-
|
33 |
-
```
|
34 |
-
from transformers import AutoTokenizer
|
35 |
-
from optimum.intel.openvino import OVModelForCausalLM
|
36 |
-
|
37 |
-
model_id = "OpenVINO/dolly-v2-3b-fp16-ov"
|
38 |
-
tokenizer = AutoTokenizer.from_pretrained(model_id)
|
39 |
-
model = OVModelForCausalLM.from_pretrained(model_id)
|
40 |
-
|
41 |
-
inputs = tokenizer("What is OpenVINO?", return_tensors="pt")
|
42 |
-
|
43 |
-
outputs = model.generate(**inputs, max_length=200)
|
44 |
-
text = tokenizer.batch_decode(outputs)[0]
|
45 |
-
print(text)
|
46 |
-
```
|
47 |
-
|
48 |
-
For more examples and possible optimizations, refer to the [OpenVINO Large Language Model Inference Guide](https://docs.openvino.ai/2024/learn-openvino/llm_inference_guide.html).
|
49 |
-
|
50 |
-
## Running Model Inference with [OpenVINO GenAI](https://github.com/openvinotoolkit/openvino.genai)
|
51 |
-
|
52 |
-
1. Install packages required for using OpenVINO GenAI.
|
53 |
```
|
54 |
-
pip install openvino-genai
|
55 |
-
```
|
56 |
-
|
57 |
-
2. Download model from HuggingFace Hub
|
58 |
-
|
59 |
```
|
60 |
-
import huggingface_hub as hf_hub
|
61 |
|
62 |
-
|
63 |
-
model_path = "dolly-v2-3b-fp16-ov"
|
64 |
-
|
65 |
-
hf_hub.snapshot_download(model_id, local_dir=model_path)
|
66 |
|
67 |
```
|
|
|
|
|
68 |
|
69 |
-
|
|
|
|
|
70 |
|
71 |
-
|
72 |
-
import openvino_genai as ov_genai
|
73 |
|
74 |
-
|
75 |
-
|
76 |
-
print(
|
77 |
```
|
78 |
|
79 |
-
|
80 |
|
81 |
## Limitations
|
82 |
|
83 |
-
Check the original model card for [
|
84 |
|
85 |
## Legal information
|
86 |
|
87 |
-
The original model is distributed under [
|
88 |
|
89 |
## Disclaimer
|
90 |
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
+
license_link: https://choosealicense.com/licenses/mit/
|
|
|
4 |
---
|
|
|
5 |
# dolly-v2-3b-fp16-ov
|
6 |
+
* Model creator: [Databricks](https://huggingface.co/databricks)
|
|
|
7 |
* Original model: [dolly-v2-3b](https://huggingface.co/databricks/dolly-v2-3b)
|
8 |
|
9 |
## Description
|
10 |
|
|
|
|
|
11 |
## Compatibility
|
12 |
|
13 |
The provided OpenVINO™ IR model is compatible with:
|
14 |
|
15 |
+
* OpenVINO version 2024.4.0 and higher
|
16 |
+
* Optimum Intel 1.20.0 and higher
|
17 |
|
18 |
+
## Running Model Inference
|
19 |
|
20 |
1. Install packages required for using [Optimum Intel](https://huggingface.co/docs/optimum/intel/index) integration with the OpenVINO backend:
|
21 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
22 |
```
|
23 |
+
pip install optimum[openvino]
|
|
|
|
|
|
|
|
|
24 |
```
|
|
|
25 |
|
26 |
+
2. Run model inference:
|
|
|
|
|
|
|
27 |
|
28 |
```
|
29 |
+
from transformers import AutoTokenizer
|
30 |
+
from optimum.intel.openvino import OVModelForCausalLM
|
31 |
|
32 |
+
model_id = "OpenVINO/dolly-v2-3b-fp16-ov"
|
33 |
+
tokenizer = AutoTokenizer.from_pretrained(model_id)
|
34 |
+
model = OVModelForCausalLM.from_pretrained(model_id)
|
35 |
|
36 |
+
inputs = tokenizer("What is OpenVINO?", return_tensors="pt")
|
|
|
37 |
|
38 |
+
outputs = model.generate(**inputs, max_length=200)
|
39 |
+
text = tokenizer.batch_decode(outputs)[0]
|
40 |
+
print(text)
|
41 |
```
|
42 |
|
43 |
+
For more examples and possible optimizations, refer to the [OpenVINO Large Language Model Inference Guide](https://docs.openvino.ai/2024/learn-openvino/llm_inference_guide.html).
|
44 |
|
45 |
## Limitations
|
46 |
|
47 |
+
Check the [original model card](https://huggingface.co/databricks/dolly-v2-3b) for limitations.
|
48 |
|
49 |
## Legal information
|
50 |
|
51 |
+
The original model is distributed under the [MIT](https://choosealicense.com/licenses/mit/) license. More details can be found in the [original model card](https://huggingface.co/databricks/dolly-v2-3b).
|
52 |
|
53 |
## Disclaimer
|
54 |
|
config.json
CHANGED
@@ -25,13 +25,15 @@
|
|
25 |
"model_type": "gpt_neox",
|
26 |
"num_attention_heads": 32,
|
27 |
"num_hidden_layers": 32,
|
|
|
28 |
"rope_scaling": null,
|
|
|
29 |
"rotary_emb_base": 10000,
|
30 |
"rotary_pct": 0.25,
|
31 |
"tie_word_embeddings": false,
|
32 |
-
"torch_dtype": "
|
33 |
-
"transformers_version": "4.
|
34 |
"use_cache": true,
|
35 |
"use_parallel_residual": true,
|
36 |
"vocab_size": 50280
|
37 |
-
}
|
|
|
25 |
"model_type": "gpt_neox",
|
26 |
"num_attention_heads": 32,
|
27 |
"num_hidden_layers": 32,
|
28 |
+
"partial_rotary_factor": 0.25,
|
29 |
"rope_scaling": null,
|
30 |
+
"rope_theta": 10000,
|
31 |
"rotary_emb_base": 10000,
|
32 |
"rotary_pct": 0.25,
|
33 |
"tie_word_embeddings": false,
|
34 |
+
"torch_dtype": "float16",
|
35 |
+
"transformers_version": "4.45.2",
|
36 |
"use_cache": true,
|
37 |
"use_parallel_residual": true,
|
38 |
"vocab_size": 50280
|
39 |
+
}
|
generation_config.json
ADDED
@@ -0,0 +1,6 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"_from_model_config": true,
|
3 |
+
"bos_token_id": 0,
|
4 |
+
"eos_token_id": 0,
|
5 |
+
"transformers_version": "4.45.2"
|
6 |
+
}
|
openvino_detokenizer.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f1e43770f23d5b9dbfc8bf99bbea4fe501870adf36235dff20156f6c0a129a47
|
3 |
+
size 514078
|
openvino_detokenizer.xml
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:978a0d7f70294bcc62041496c85f3988aa2198f81697b338896a3a12b933b1a4
|
3 |
+
size 4498
|
openvino_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ea2579c5c7a6ae1e1bf0afa54c363aa231816b6ca4b88859a004a00ea9e6c604
|
3 |
+
size 5550172420
|
openvino_model.xml
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:051b8420c5a95133352fc66b1db055d33ce5f6695feed62255572221bfc0f075
|
3 |
+
size 2383298
|
openvino_tokenizer.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c378d88077ae7c7e13ef61745e1ceef76412338e9a7398445c09632413e52abe
|
3 |
+
size 1227935
|
openvino_tokenizer.xml
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3a72a76f00a28f33541aa4d9bda2a9559ace3f5fa60cfa6d7556e072c4cc0a4c
|
3 |
+
size 22293
|
tokenizer.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
tokenizer_config.json
CHANGED
@@ -234,7 +234,7 @@
|
|
234 |
"### Response:"
|
235 |
],
|
236 |
"bos_token": "<|endoftext|>",
|
237 |
-
"clean_up_tokenization_spaces":
|
238 |
"eos_token": "<|endoftext|>",
|
239 |
"model_max_length": 1000000000000000019884624838656,
|
240 |
"pad_token": "<|endoftext|>",
|
|
|
234 |
"### Response:"
|
235 |
],
|
236 |
"bos_token": "<|endoftext|>",
|
237 |
+
"clean_up_tokenization_spaces": false,
|
238 |
"eos_token": "<|endoftext|>",
|
239 |
"model_max_length": 1000000000000000019884624838656,
|
240 |
"pad_token": "<|endoftext|>",
|