d-matrix
/

bert-large

bmah-dmx commited on 28 days ago

Commit

873c930

verified ·

1 Parent(s): 035e605

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md CHANGED Viewed

@@ -6,7 +6,7 @@ The reference provides the following functional *configurations*:
   Configuration | Explanation
   :-- | :--
   **`BASELINE`** | a reference functionally equivalent to the original model
-  **`BASIC`** | all linear algebraic operands quantized to `MXINT8-64`, and all other operations transformed to approximated kernel simulations
 ### Usage
@@ -28,6 +28,7 @@ pip install -e .
 from dmx.compressor.modeling import DmxModel
 import lm_eval
 model_args = "pretrained=d-matrix/bert-large,trust_remote_code=True"
 lm = lm_eval.api.registry.get_model("hf").create_from_arg_string(model_args, {"batch_size": 1})

   Configuration | Explanation
   :-- | :--
   **`BASELINE`** | a reference functionally equivalent to the original model
+  **`BASIC`** | all linear algebraic operands quantized to `MXINT8-64`
 ### Usage
 from dmx.compressor.modeling import DmxModel
 import lm_eval
+lm_eval.api.registry.register_model("hf", HFLM)
 model_args = "pretrained=d-matrix/bert-large,trust_remote_code=True"
 lm = lm_eval.api.registry.get_model("hf").create_from_arg_string(model_args, {"batch_size": 1})