arXiv:2310.18348

Meaning Representations from Trajectories in Autoregressive Models

Published on Oct 23, 2023

Abstract

We propose to extract meaning representations from autoregressive language models by considering the distribution of all possible trajectories extending an input text. This strategy is prompt-free, does not require fine-tuning, and is applicable to any pre-trained autoregressive model. Moreover, unlike vector-based representations, distribution-based representations can also model asymmetric relations (e.g., direction of logical entailment, hypernym/hyponym relations) by using algebraic operations between likelihood functions. These ideas are grounded in distributional perspectives on semantics and are connected to standard constructions in automata theory, but to our knowledge they have not been applied to modern language models. We empirically show that the representations obtained from large models align well with human annotations, outperform other zero-shot and prompt-free methods on semantic similarity tasks, and can be used to solve more complex entailment and containment tasks that standard embeddings cannot handle. Finally, we extend our method to represent data from different modalities (e.g., image and text) using multimodal autoregressive models. Our code is available at: https://github.com/tianyu139/meaning-as-trajectories
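To make the idea concrete, here is a minimal sketch of trajectory-based comparison, assuming GPT-2 via the Hugging Face `transformers` library. The sampling budget, trajectory length, and the final cosine comparison of per-trajectory log-likelihoods are illustrative choices, not necessarily the paper's exact construction; see the linked repository for the authors' implementation.

```python
# Sketch: represent each text by the likelihoods it assigns to a shared
# pool of sampled continuations ("trajectories"), then compare those
# likelihood profiles. Model choice and scoring are illustrative.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").to(device).eval()

def sample_trajectories(text, n=8, max_new_tokens=20):
    """Sample n continuations (trajectories) of `text` from the model."""
    ids = tokenizer(text, return_tensors="pt").input_ids.to(device)
    out = model.generate(
        ids,
        do_sample=True,
        max_new_tokens=max_new_tokens,
        num_return_sequences=n,
        pad_token_id=tokenizer.eos_token_id,
    )
    # Keep only the newly generated tokens of each sequence.
    return [seq[ids.shape[1]:] for seq in out]

@torch.no_grad()
def log_likelihood(prefix_text, trajectory):
    """log p(trajectory | prefix_text) under the autoregressive model."""
    prefix = tokenizer(prefix_text, return_tensors="pt").input_ids.to(device)
    full = torch.cat([prefix, trajectory.unsqueeze(0)], dim=1)
    logits = model(full).logits
    # Log-prob of each token given everything before it.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    targets = full[0, 1:]
    token_lp = log_probs[torch.arange(targets.shape[0]), targets]
    # Sum only over the trajectory tokens, not the prefix.
    return token_lp[prefix.shape[1] - 1:].sum().item()

def trajectory_similarity(text_a, text_b, n=8):
    """Compare texts by how they score a shared pool of trajectories."""
    pool = sample_trajectories(text_a, n) + sample_trajectories(text_b, n)
    scores_a = torch.tensor([log_likelihood(text_a, t) for t in pool])
    scores_b = torch.tensor([log_likelihood(text_b, t) for t in pool])
    return torch.cosine_similarity(scores_a, scores_b, dim=0).item()

print(trajectory_similarity("A cat sits on the mat.",
                            "A feline rests on the rug."))
```

Because the representation is a likelihood function over trajectories rather than a single vector, asymmetric relations can in principle be probed by comparing the two score profiles directionally, e.g. checking whether continuations likely under one text remain likely under the other, which is the kind of algebraic manipulation of likelihoods the abstract alludes to.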
