Higher context support?
#4
by
aayushg159
- opened
Does this support higher contexts with rope scaling?
I've used alpha_value
= 2.5 and it seems to run pretty well on a 5bit exl2 version with 16k context. But I haven't run a ton of tests, just my own primitive needle in the haystack kind of test and some logic and json questions.