
metadata

license: cc-by-nc-sa-4.0
language:
  - 'no'
extra_gated_prompt: >-
  You agree to not use the model to conduct experiments that cause harm to human
  subjects.
extra_gated_fields:
  Company: text
  Country: country
  Specific date: date_picker
  I want to use this model for:
    type: select
    options:
      - Research
      - Education
      - label: Other
        value: other
  I agree to use this model for non-commercial use ONLY: checkbox
extra_gated_heading: Acknowledge license to accept the repository
extra_gated_description: Our team may take 2-3 days to process your request
extra_gated_button_content: Acknowledge license

Generative Pretrained Transformer with 3 billion parameters for Norwegian. The model is obtained by continued training of the NorGPT-3B model on a selection of documents from the pretraining dataset, which includes news articles, parliamentary speeches, books, and governmental reports.

It belongs to NorGLM, a suite of pretrained Norwegian Generative Language Models. NorGLM can be used for non-commercial purposes.

Run the Model

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "NorGLM/NorGPT-3B-continue"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map='auto',
    torch_dtype=torch.bfloat16
)

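text = "Tom ønsket å gå på barene med venner"
# Move the input tensors to the device the model was placed on
inputs = tokenizer(text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=20)

# Decode the generated token ids back into text
print(tokenizer.decode(outputs[0], skip_special_tokens=True))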
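
For a decoder-only model such as this one, `generate` returns the prompt tokens followed by the newly generated tokens. If only the continuation is needed, the prompt can be sliced off before decoding. The following is a minimal sketch of that step; the helper name `strip_prompt` and the toy token ids are illustrative assumptions, not part of the NorGLM release.

```python
from typing import List, Sequence


def strip_prompt(output_ids: Sequence[int], prompt_length: int) -> List[int]:
    """Return only the newly generated token ids.

    Assumes the first `prompt_length` ids in `output_ids` are the prompt,
    which is how decoder-only models lay out the result of `generate`.
    """
    if prompt_length < 0 or prompt_length > len(output_ids):
        raise ValueError("prompt_length must be between 0 and len(output_ids)")
    return list(output_ids[prompt_length:])


# Hypothetical usage with the objects from the snippet above (not executed here):
#   prompt_length = inputs["input_ids"].shape[1]
#   new_ids = strip_prompt(outputs[0].tolist(), prompt_length)
#   print(tokenizer.decode(new_ids, skip_special_tokens=True))

# Toy demonstration with made-up token ids: the first three ids stand in for the prompt.
demo_output = [101, 7, 42, 9, 13, 5]
demo_new = strip_prompt(demo_output, prompt_length=3)
print(demo_new)
```

Slicing by the prompt length in tokens is generally more reliable than removing the prompt string from the decoded text, since decoding may not reproduce the prompt character for character.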

Note

Further training and evaluation details, along with the accompanying papers, will be released soon.