Which loss function was used?

#5
by npotts - opened

Which loss function was used to fine-tune this model? Euclidean distance or cosine similarity?

LinqAlpha org
•
edited Aug 8, 2024

We used cosine similarity, computed on the normalized output of the model (last-token pooling).
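For anyone who wants to reproduce the similarity computation, here is a minimal sketch in plain Python of what last-token pooling followed by normalization and cosine similarity could look like. The function names (`last_token_pool`, `l2_normalize`, `cosine_similarity`) and the nested-list input layout are assumptions made for illustration, not the actual training code, and the reply above does not say which loss was built on top of the similarity scores, so that part is left out.

```python
import math


def last_token_pool(hidden_states, attention_mask):
    """Return the hidden state of the last non-padding token of each sequence.

    hidden_states: nested lists shaped [batch][seq_len][dim]
    attention_mask: nested lists shaped [batch][seq_len], 1 = real token, 0 = padding
    (Hypothetical helper; the input layout is an assumption.)
    """
    pooled = []
    for states, mask in zip(hidden_states, attention_mask):
        # Last position whose mask value is 1; works for left or right padding.
        last = max(i for i, m in enumerate(mask) if m == 1)
        pooled.append(states[last])
    return pooled


def l2_normalize(vec):
    """Scale a vector to unit length (assumes the vector is not all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity, computed as the dot product of the normalized vectors."""
    a, b = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a, b))


if __name__ == "__main__":
    # Toy batch: two sequences of three tokens with 2-dimensional hidden states.
    hidden = [
        [[1.0, 0.0], [3.0, 4.0], [9.0, 9.0]],  # last position is padding
        [[0.0, 2.0], [0.0, 5.0], [6.0, 8.0]],  # no padding
    ]
    mask = [[1, 1, 0], [1, 1, 1]]

    query_emb, doc_emb = last_token_pool(hidden, mask)
    print(cosine_similarity(query_emb, doc_emb))
```

Because the embeddings are normalized first, the cosine similarity reduces to a plain dot product, which is why the two steps are usually mentioned together.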

npotts changed discussion status to closed
