sleepylx commited on
Commit
71c7db5
·
verified ·
1 Parent(s): e2a1867

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -67,6 +67,7 @@ We adopt the architecture of FLM-101B as the backbone for Tele-FLM, with several
67
  - SwiGLU for activation function
68
  - Linear bias disabled
69
  - Embedding and language model head untied
 
70
 
71
  Consequently, Tele-FLM is largely compatible with Llama architecturally.
72
  To maximize convenience for the community, we made minimal adjustments to Llama's code to adapt it to Tele-FLM and released it as open source.
 
67
  - SwiGLU for activation function
68
  - Linear bias disabled
69
  - Embedding and language model head untied
70
+ - Input and output multiplication
71
 
72
  Consequently, Tele-FLM is largely compatible with Llama architecturally.
73
  To maximize convenience for the community, we made minimal adjustments to Llama's code to adapt it to Tele-FLM and released it as open source.