Unable to run with default instructions on Colab
Hi, has anyone been able to run the models yet? I'm facing issues.
@zechunliu
@reach-vb
Would appreciate any help!
You missed
!pip install --upgrade transformers
Even though the configuration states
"transformers_version": "4.41.2"
https://huggingface.co/facebook/MobileLLM-125M/blob/main/config.json
and Colab already ships 4.42.2 (check with):
import transformers
transformers.__version__
you still have to update it to the newest release.
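One Colab gotcha: after the upgrade, restart the runtime before re-checking, otherwise the preinstalled 4.42.2 stays imported. A quick sanity check (a sketch; the 4.42.2 threshold is just the version Colab shipped at the time):
import transformers
from packaging import version

# After `!pip install --upgrade transformers` and a runtime restart,
# this should report something newer than the preinstalled 4.42.2.
assert version.parse(transformers.__version__) > version.parse("4.42.2")
print(transformers.__version__)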
I went down this same path; however, running with the newest transformers, the tokenizer gets returned as a bool object:
---------------------------------------------------------------------------
AttributeError Traceback (most recent call last)
Cell In[2], line 1
----> 1 tokenizer.add_special_tokens(
2 {
3 "eos_token": "</s>",
4 "bos_token": "<s>",
5 "unk_token": "<unk>",
6 }
7 )
AttributeError: 'bool' object has no attribute 'add_special_tokens'
There's a typo on the model card. Please use this command instead:
AutoTokenizer.from_pretrained("facebook/MobileLLM-125M", use_fast=False)
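Putting it together, a minimal loading sketch with the slow tokenizer (arguments follow the model card; trust_remote_code is needed because the repo ships its own modeling_mobilellm.py):
from transformers import AutoModelForCausalLM, AutoTokenizer

# use_fast=False loads the slow tokenizer and avoids the bool return seen above.
tokenizer = AutoTokenizer.from_pretrained("facebook/MobileLLM-125M", use_fast=False)
tokenizer.add_special_tokens(
    {
        "eos_token": "</s>",
        "bos_token": "<s>",
        "unk_token": "<unk>",
    }
)

# The repo provides custom modeling code, hence trust_remote_code=True.
model = AutoModelForCausalLM.from_pretrained(
    "facebook/MobileLLM-125M", trust_remote_code=True
)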
Some weights of the model checkpoint at facebook/MobileLLM-125M were not used when initializing MobileLLMForCausalLM: ['lm_head.weight']
- This IS expected if you are initializing MobileLLMForCausalLM from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing MobileLLMForCausalLM from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
Some weights of MobileLLMForCausalLM were not initialized from the model checkpoint at facebook/MobileLLM-125M and are newly initialized: ['model.embed_tokens.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
Hi, I loaded this model using the latest transformers (4.47.0) but got this message. What can I do to load the model successfully?
You can ignore the warning. ['lm_head.weight'] is not used because MobileLLM uses embedding sharing, so lm_head.weight = embed_tokens.weight.clone().
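If you want to double-check that nothing is actually missing, you can compare the output head with the input embedding table after loading. A sketch (the attribute paths model.lm_head and model.model.embed_tokens assume the usual Llama-style layout of the custom modeling code):
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "facebook/MobileLLM-125M", trust_remote_code=True
)

# With embedding sharing, the output head should hold the same values as the
# input embedding table; this should print True if the weights are tied.
print(torch.equal(model.lm_head.weight, model.model.embed_tokens.weight))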
Hello! Running this with the current version of Transformers gives me the following error:
---------------------------------------------------------------------------
AttributeError Traceback (most recent call last)
<ipython-input-5-5a5008a62dec> in <cell line: 0>()
1 model_inputs = tokenizer(["A list of colors: red, blue"], return_tensors="pt")
----> 2 generated_ids = model.generate(**model_inputs)
3 tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
~/.cache/huggingface/modules/transformers_modules/facebook/MobileLLM-125M/e09efb976ae4a767cf27a4bf277b7aade91dbd3a/modeling_mobilellm.py in prepare_inputs_for_generation(self, input_ids, past_key_values, attention_mask, inputs_embeds, cache_position, use_cache, **kwargs)
1277 max_cache_length = (
1278 torch.tensor(past_key_values.get_max_length(), device=input_ids.device)
-> 1279 if past_key_values.get_max_length() is not None
1280 else None
1281 )
AttributeError: 'DynamicCache' object has no attribute 'get_max_length'
It seems get_max_length was removed in the latest version of transformers and replaced with cache.get_max_cache_shape().
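Until the repo's custom code is updated, two hedged workarounds: pin transformers to the version in config.json (pip install transformers==4.41.2), or alias the removed method so the custom prepare_inputs_for_generation keeps working. A sketch of the latter (a monkey-patch, not an official fix):
from transformers.cache_utils import DynamicCache

# Newer transformers replaced get_max_length() with get_max_cache_shape();
# aliasing it lets the repo's modeling_mobilellm.py run unchanged.
if not hasattr(DynamicCache, "get_max_length"):
    DynamicCache.get_max_length = DynamicCache.get_max_cache_shape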