</details>

## Grounded Generation with Jamba

A common use case for LLMs is grounded generation and RAG, where the model is required to answer a question or follow an instruction based on a given set of documents or document snippets. To standardize this process, Jamba was trained with a specific "documents" section in its chat template. The model was trained to attend to this section, and grounded generation tasks show improved performance when the task is formatted this way.

Documents are provided much like tools: as an external argument to the model, alongside the conversation. To support document-level metadata, a document is defined as a dictionary with key-value pairs of your choosing. These are formatted within the chat template. Two keys receive special treatment: "title", which is formatted at the top of the document if present, and "text", a required field that contains the actual text of the document.

<details><summary><strong>Attaching documents to Jamba Mini 1.6 prompt</strong></summary>

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("ai21labs/AI21-Jamba-Mini-1.6")

messages = [
    {
        "role": "user",
        "content": "Who wrote Harry Potter?"
    }
]

documents = [
    {
        "text": "Harry Potter is a series of seven fantasy novels written by British author J. K. Rowling.",
        "title": "Harry Potter"
    },
    {
        "text": "The Great Gatsby is a novel by American writer F. Scott Fitzgerald.",
        "title": "The Great Gatsby",
        "country": "United States",
        "genre": "Novel"
    }
]

prompt = tokenizer.apply_chat_template(
    messages,
    documents=documents,
    tokenize=False,
)

# Expected model output: J. K. Rowling
```

</details>
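
With `tokenize=False` the template is rendered to a string; you can also let `apply_chat_template` tokenize directly and feed the result straight into generation. Below is a minimal sketch of that path, assuming a recent `transformers` version with native Jamba support, `accelerate` installed for `device_map="auto"`, and enough memory for the model; the generation settings are illustrative, not prescriptive:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("ai21labs/AI21-Jamba-Mini-1.6")
model = AutoModelForCausalLM.from_pretrained("ai21labs/AI21-Jamba-Mini-1.6",
                                             device_map="auto")

messages = [{"role": "user", "content": "Who wrote Harry Potter?"}]
documents = [
    {"text": "Harry Potter is a series of seven fantasy novels "
             "written by British author J. K. Rowling.",
     "title": "Harry Potter"},
]

# Render and tokenize the documents-aware prompt in one step.
input_ids = tokenizer.apply_chat_template(messages,
                                          documents=documents,
                                          add_generation_prompt=True,
                                          return_tensors="pt").to(model.device)

outputs = model.generate(input_ids, max_new_tokens=100)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][input_ids.shape[1]:], skip_special_tokens=True))
```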

## JSON mode

Jamba 1.6 was trained with specific "knobs", which help steer the model toward commonly requested behaviors. Each behavior is enabled by including specific pre-defined text in the system message. For ease of use, these are exposed as flags in Jamba 1.6's chat template, so they can be toggled by passing the appropriate arguments to the chat template.

Jamba 1.6 was trained to produce valid JSON when requested to. It does so naturally, but when the JSON-mode knob is activated, the likelihood of valid JSON increases considerably. In JSON mode, Jamba 1.6 attempts to output valid JSON regardless of the user request. However, for best results, it is highly recommended to describe the expected JSON schema in the user request or system message, as shown in the example below.

<details><summary><strong>Usage of JSON knob in Jamba 1.6</strong></summary>

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("ai21labs/AI21-Jamba-Mini-1.6")
messages = [
    {'role': 'user',
     'content': 'Describe the first American president. Include year of birth (number) and name (string).'}
]
prompt = tokenizer.apply_chat_template(messages,
                                       tokenize=False,
                                       add_generation_prompt=False,
                                       # Activate the JSON-mode knob defined in the chat template.
                                       knobs={"response_format": "json_object", "is_set": True})

# Expected model output: { "year of birth": 1732, "name": "George Washington" }
```

</details>
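
Because JSON mode raises the likelihood of valid JSON rather than guaranteeing it, it is still worth validating the model's output before using it downstream. Here is a minimal sketch using Python's standard `json` module; `generated_text` is a hypothetical stand-in for whatever your generation call returns:

```python
import json

# Hypothetical model output; in practice this comes from your generation call.
generated_text = '{ "year of birth": 1732, "name": "George Washington" }'

try:
    record = json.loads(generated_text)
    print(record["name"], record["year of birth"])
except json.JSONDecodeError:
    # Even in JSON mode, validity is only more likely, not guaranteed,
    # so handle parse failures (e.g. by retrying or re-prompting).
    print("Model did not return valid JSON.")
```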

## Fine-tuning examples