datalama
/

EXAONE-3.5-2.4B-Instruct-Llamafied

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

datalama commited on Dec 9, 2024

Commit

85f5a58

·

verified ·

1 Parent(s): e26fcb6

Create README.md

Files changed (1) hide show

README.md +24 -0

README.md ADDED Viewed

	@@ -0,0 +1,24 @@

+---
+language:
+- ko
+- en
+---
+# Updates in EXAONE-3.5
+## Key Changes
+- **RoPE Scaling Parameter**: Added to support longer `context_length`.
+- **Memory Optimization**: For the 2.4B model, `tie_word_embeddings` is set to `True` for improved memory efficiency.
+⚠️ Using the original [Llamafy script](https://huggingface.co/maywell/EXAONE-3.0-7.8B-Instruct-Llamafied) as-is may lead to performance degradation.
+To address this, I have updated the script and uploaded the Llamafied version of the model.
+## Special Thanks
+- **[@maywell](https://huggingface.co/maywell)**
+  For updating the code and uploading the model.
+- **LG AI Research**
+  For releasing the original model.
+  Check out the [original release here](https://huggingface.co/collections/LGAI-EXAONE/exaone-35-674d0e1bb3dcd2ab6f39dbb4).