Instructions to use google/flan-t5-large with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use google/flan-t5-large with Transformers:
```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-large")
model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-large")
```
- Notebooks
- Google Colab
- Kaggle
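The loading snippet above can be exercised end to end with a short generation call. This is a sketch, assuming `transformers` and a PyTorch backend are installed; the prompt text is an illustrative example, and the model weights (roughly 3 GB) are downloaded on first use.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-large")
model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-large")

# Tokenize an instruction-style prompt and generate a response.
inputs = tokenizer("Translate English to German: How old are you?", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

FLAN-T5 is instruction-tuned, so plain natural-language prompts like the one above work without task-specific prefixes beyond what the instruction itself states.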
Training with T5-large: outputs.loss is NaN
#19
by Swithun - opened
Why, when T5-large is loaded for training, e.g. outputs = model(input_ids, labels=target_ids), is outputs.loss NaN? On the same dataset, a loaded T5-small model produces a correct outputs.loss.
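One common cause of this symptom (not confirmed by this thread, so treat it as a hypothesis) is running the larger model in float16: activations inside T5-large can exceed float16's maximum of about 65504, overflowing to inf and then propagating NaN into the loss, while the smaller T5-small stays in range. Training in float32 or bfloat16 typically avoids it. A minimal sketch of the overflow mechanism and a loss guard, using a hypothetical `loss_is_finite` helper:

```python
import torch

def loss_is_finite(loss: torch.Tensor) -> bool:
    """Return True if the loss contains no NaN or inf values."""
    return bool(torch.isfinite(loss).all())

# float16 tops out near 65504; larger magnitudes overflow to inf,
# and operations on inf (e.g. inf - inf) yield NaN downstream.
overflow = torch.tensor(70000.0).to(torch.float16)
print(overflow)                 # tensor(inf, dtype=torch.float16)
print(loss_is_finite(overflow)) # False

# In a training loop (model/target names as in the question), guard each step:
# outputs = model(input_ids, labels=target_ids)
# if not loss_is_finite(outputs.loss):
#     ...  # e.g. skip the step, or train in float32/bfloat16 instead of float16
```

Checking `outputs.loss` this way distinguishes a genuine data/label problem from a dtype overflow: if the NaN disappears when the same batch is run in float32, precision is the likely culprit.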
