I found out about this paper thanks to @gsarti 's post from last week; I got curious, so I want to post my take on it. 🤗
The paper proposes a new metric called EigenScore to detect LLM hallucinations. 📄
Their idea is that given an input question, they generate K different answers, take the internal embedding states of those answers, build a covariance matrix from them, and use it to compute an EigenScore.
We can think of the EigenScore as the mean of the log-eigenvalues of the covariance matrix of the embedding space of the K generated answers (equivalently, the log-determinant of that matrix, normalized by K).
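For reference, here's the rough form as I understand it (the α term is a small regularizer I'm including so the determinant doesn't collapse to zero; the exact constants in the paper may differ):

```latex
\mathrm{EigenScore} \;\approx\; \frac{1}{K}\,\log\det\!\big(\Sigma + \alpha I_K\big)
\;=\; \frac{1}{K}\sum_{i=1}^{K} \log \lambda_i
```

where Σ is the K×K covariance matrix of the K sentence embeddings and λ_i are its (regularized) eigenvalues.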
❓But why eigenvalues?
Well, if the K generations have similar semantics, the sentence embeddings will be highly correlated, most eigenvalues will be close to 0, and their logs will drag the EigenScore way down.
On the other hand, if the LLM hallucinates, the K generations will have diverse semantics, the eigenvalues will be significantly different from 0, and the EigenScore will be high.
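To make the intuition concrete, here's a minimal NumPy sketch of an EigenScore-style computation. Assumptions on my part: `embeddings` holds one sentence embedding per generated answer (e.g., a middle-layer hidden state), and the centering, normalization, and regularization details are simplified, so treat it as an illustration rather than the paper's exact recipe.

```python
import numpy as np

def eigenscore(embeddings: np.ndarray, alpha: float = 1e-3) -> float:
    """EigenScore-style metric: mean log-eigenvalue of the K x K covariance
    matrix built from K sentence embeddings (one per generated answer)."""
    K, d = embeddings.shape
    # Center the embeddings across the K generations
    centered = embeddings - embeddings.mean(axis=0, keepdims=True)
    # K x K covariance (Gram) matrix of the centered embeddings
    cov = centered @ centered.T / d
    # Small regularizer so near-zero eigenvalues don't blow up the log
    cov += alpha * np.eye(K)
    eigvals = np.linalg.eigvalsh(cov)
    return float(np.mean(np.log(eigvals)))

# Toy check: 5 answers with 768-dim embeddings
K, d = 5, 768
consistent = np.random.randn(1, d) + 0.01 * np.random.randn(K, d)  # near-identical answers
diverse = np.random.randn(K, d)                                    # semantically scattered answers
print(eigenscore(consistent))  # very negative -> consistent generations
print(eigenscore(diverse))     # noticeably higher -> potential hallucination
```

On a real model you'd take the embeddings from the hidden states of each generated answer (e.g., via `output_hidden_states=True` in transformers) rather than random vectors.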
The idea is pretty neat, and the paper reports better results than other methods like sequence probabilities, length-normalized entropy, and other uncertainty-quantification-based approaches.
💭 What I'm personally missing from the paper is a comparison of their results with other methods like LLM-Eval and SelfCheckGPT. They do mention that EigenScore is much cheaper to implement than SelfCheckGPT, but that's about all they say on the topic.