Warning: The attention mask and the pad token id were not set
Hi, when I run inference with llama3-8b-instruct, there is a warning:

The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results. Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.
Although it doesn't seem to impact the results, I would still like to fix the warning. Any ideas?
Thanks for answering.
When calling model.generate, setting pad_token_id=tokenizer.eos_token_id seems to remove the warning.
It still seems to work fine with that, although some explanation and reasoning wouldn't hurt. Maybe I saw one while searching for the solution, but I forget. :)
Just change the call to model.generate(**encoded_input, pad_token_id=tokenizer.eos_token_id) and the warning will go away.
Hey there, I was also facing the same warning, and after tinkering with the tokenizer for a bit I found out that inputs = tokenizer.encode(prompt)
returns just the input_ids but not the attention mask, whereas inputs = tokenizer(prompt)
returns both the input_ids and the attention mask.
So if you replace this code
inputs = tokenizer.encode(prompt, return_tensors="pt")
output = model.generate(inputs)
with this
inputs = tokenizer(prompt, return_tensors="pt")
output = model.generate(**inputs)
the warning goes away, because generate now receives the attention_mask alongside the input_ids. (Note that return_tensors="pt" is needed in both versions so generate gets PyTorch tensors rather than plain Python lists.) Hope that helps 😊.
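To make the attention mask concrete: for a padded batch it is just a 0/1 array marking which positions hold real tokens versus padding. Here is a minimal pure-Python sketch of what the tokenizer builds for you (the token ids are made up for illustration; only the pad id, 128001, is llama3's real eos id from the warning above):

```python
# Build attention masks for a left-padded batch by hand.
# pad_token_id is set to the eos id, mirroring what generate()
# falls back to when no dedicated pad token is configured.
pad_token_id = 128001  # llama3's eos_token_id, reused as pad

# Two "tokenized" prompts of different lengths (ids are illustrative).
batch = [
    [10, 20, 30, 40],
    [50, 60],
]

# Left-pad every sequence to the longest length in the batch.
max_len = max(len(seq) for seq in batch)
input_ids = [[pad_token_id] * (max_len - len(seq)) + seq for seq in batch]

# The attention mask is 1 for real tokens and 0 for padding,
# telling the model which positions it should attend to.
attention_mask = [[0] * (max_len - len(seq)) + [1] * len(seq) for seq in batch]

print(input_ids)       # [[10, 20, 30, 40], [128001, 128001, 50, 60]]
print(attention_mask)  # [[1, 1, 1, 1], [0, 0, 1, 1]]
```

Note that the mask is derived from the sequence lengths, not by comparing ids against the pad id: since the pad id doubles as eos here, the same id can also appear as a real token, so the model cannot reliably reconstruct the mask from the ids alone. That ambiguity is exactly what the warning is about.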