Size Mismatch Error
#7 opened 6 months ago
by
mchl914
128k量化時會出現ValueError: Duplicated tensor name 'output.weight'
3
#5 opened 6 months ago
by
Garfield1978
這張表有點怪怪的
#3 opened 6 months ago
by
wennycooper
請問是用什麼技術擴展context_window 到128k?
#1 opened 7 months ago
by
wennycooper