template format

by prudant - opened Jan 17, 2024

Jan 17, 2024

the template for the model is:

Instruction:

Your task is to perform retrieval augmented generation (RAG) over the given query and search results. Return your answer in a json format that includes a summary of the search results and a list of related queries.

Query:
{prompt}
\n\n
Search Results:
{context}
\n\n
Query:
{prompt}

Response:

{"summary":

Does that mean the query/prompt has to be repeated, or am I missing something?

thanks, great work!

davidgortega

Jan 17, 2024

Funny, I opened a similar issue in the repo.

emrgnt-cmplxty

SciPhi-AI org Jan 22, 2024

The query is repeated because the attention mechanism appears to benefit from the repetition; see Google's recent FreshLLMs paper.
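For anyone filling the template by hand, here is a minimal sketch of the repeated-query layout in Python. The helper name `build_prompt` is hypothetical, and the exact whitespace is an assumption: the `\n\n` markers in the first post are treated as blank lines, and the section labels are copied as they appear there. Check the model card before relying on it.

```python
# Hypothetical helper, not part of any SciPhi library.
# Assumption: the "\n\n" markers in the quoted template stand for blank lines.
TEMPLATE = (
    "Instruction:\n"
    "Your task is to perform retrieval augmented generation (RAG) over the given "
    "query and search results. Return your answer in a json format that includes "
    "a summary of the search results and a list of related queries.\n\n"
    "Query:\n{prompt}\n\n"
    "Search Results:\n{context}\n\n"
    "Query:\n{prompt}\n\n"      # the query is repeated after the search results
    "Response:\n"
    '{{"summary":'              # doubled braces produce a literal "{"
)


def build_prompt(prompt: str, context: str) -> str:
    """Fill the template; the same query goes in before and after the context."""
    return TEMPLATE.format(prompt=prompt, context=context)


if __name__ == "__main__":
    print(build_prompt("What is RWKV?", "1. RWKV is an RNN-based language model."))
```

Using a named `{prompt}` field twice means the query only has to be passed once, so the two copies cannot drift apart.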

cahya

Jan 25, 2024

I think this is not limited to the attention mechanism. I use an RWKV model (RNN-based), and the community puts the question both at the beginning and at the end of the prompt for this kind of context-based QA task; otherwise the model sometimes forgets what the question was.

davidgortega

Jan 25, 2024

@cahya RWKV is a bit different: it is a strict RNN and does not have attention, so it can't "look" backwards.

@emrgnt-cmplxty Not sure what you are referring to, but I guess you have measured it. Do you apply the same strategy in the training dataset?
