200K Version

by brucethemoose - opened Dec 6, 2023

Dec 6, 2023

•

edited Dec 6, 2023

Separate from my previous request, have you considered training on Yi 200K instead? It doesn't need to be trained at 200K to maintain some of the long context performance, I believe.

Might be a good candidate for a LongLora if y'all are doing full finetuning now?

inNexus

Dec 11, 2023

I think it is nice to support 200k version

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment