TMElyralab
/

lyraLLaMA

Model card Files Files and versions Community

benleader commited on Jun 30, 2023

Commit

ebc08c7

·

1 Parent(s): f7dcb3b

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -11,7 +11,7 @@ Due to the license regulation of LLaMA, we are not allowed to release the accele
 ## Model Card for lyraLLaMA
-lyraLLaMA is currently the **fastest LLaMA-13b** available. The inference speed of lyraLLaMA has achieved **4x** acceleration upon the torch version.
 Among its main features are:
 - device: Nvidia GPU with Amperer architecture or Volta architecture (A100 or higher, V100).

 ## Model Card for lyraLLaMA
+lyraLLaMA is currently the **fastest LLaMA-13b** available. The inference speed of lyraLLaMA has achieved **3000+ tokens/s** on A100, **4x** acceleration upon the torch version.
 Among its main features are:
 - device: Nvidia GPU with Amperer architecture or Volta architecture (A100 or higher, V100).