Why does a 1B parameter model have a 29GB weights file?

#20
by narai - opened

This weights file is approx 10 times larger than normal for this parameter count. What is the explanation?

narai changed discussion status to closed

Sign up or log in to comment