Kaio Ken
kaiokendev
AI & ML interests
aah aah...
Organizations
kaiokendev's activity
Interesting Papers
161
#1 opened over 1 year ago
by
PapersAnon
What exactly is SuperCOT-LoRA
1
#2 opened over 1 year ago
by
FarziBuilder

Possibility that Claude/ChatGPT uses similar techniques on adjusting RoPE sampling rate?
1
#4 opened over 1 year ago
by
Yhyu13
Thanks for all the hard work! Chance to see superhot-65b?
9
#1 opened over 1 year ago
by
Panchovix
Work on a paper
3
#2 opened over 1 year ago
by
emozilla

Difference between this and 8k version?
10
#1 opened over 1 year ago
by
flashvenom

Is my understanding correct that the monkey patch will be needed to be added for inference only?
5
#1 opened over 1 year ago
by
flashvenom

7B, 33B and 65B versions?
3
#2 opened over 1 year ago
by
flashvenom

Training info
3
#1 opened almost 2 years ago
by
ausboss

v230502 Testing and Discussion
89
#23 opened almost 2 years ago
by
deleted
V4.3 Early Testing.
109
#15 opened almost 2 years ago
by
deleted
The V4 is here
80
#11 opened almost 2 years ago
by
TheYuriLover
The V4 is here
80
#11 opened almost 2 years ago
by
TheYuriLover
The V4 is here
80
#11 opened almost 2 years ago
by
TheYuriLover