xlnet-base-cased / README.md

julien-c HF staff

Migrate model card from transformers-repo

553f39a about 4 years ago

233 Bytes

This model is pre-trained XLNET with 12 layers.

It comes with paper: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models

Project Page: SBERT-WK