Update README.md
README.md CHANGED
@@ -7,11 +7,11 @@ license: mit
 pipeline_tag: text-generation
 ---
 
-# DeBERTa
+# DeBERTa (1.4B) fixed version
 
 This is [**deberta-v2-xxlarge**](https://huggingface.co/microsoft/deberta-v2-xxlarge) updated to implement the `AutoModelForCausalLM` class, enabling it to generate text. This implementation is based on our paper "BERTs are Generative In-Context Learners".
 
-This repository also fixes three bugs in the original HF implementation of DeBERTa:
+This repository also fixes three bugs in [the original HF implementation of DeBERTa](https://huggingface.co/microsoft/deberta-v2-xxlarge):
 1. We fixed the incorrect name of the output embedding weights in the checkpoint file;
 2. We fixed the implementation of the enhanced mask decoder (EMD), based on [the original GitHub repository](https://github.com/microsoft/DeBERTa);
 3. We clamp the positional embeddings so that they work with long sequence lengths.
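Since the updated card states that the model can be loaded through `AutoModelForCausalLM` for text generation, a minimal usage sketch follows. The repository id placeholder, the `trust_remote_code=True` flag, and the example prompt are assumptions for illustration, not part of this commit.

```python
# Minimal generation sketch. Assumptions (not part of this commit): the repo id
# below is a placeholder, and trust_remote_code=True is only required if the
# AutoModelForCausalLM support ships as custom modeling code in this repository.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "<this-repository-id>"  # placeholder; replace with the actual model id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)

# Encode a prompt and let the causal-LM head continue it left to right.
inputs = tokenizer("The capital of Norway is", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

Sampling strategies (temperature, top-p, and so on) can be passed as additional keyword arguments to `generate()` if greedy decoding is not wanted.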