Model Logo

ARBERTv2_ArLAMA is a transformer-based Arabic language model fine-tuned on Masked Language Modeling (MLM) tasks. The model uses Knowledge Graphs (KGs) to enhance its understanding of semantic relations and improve its performance in various Arabic NLP tasks.

Uses

Direct Use

Filling masked tokens in Arabic text, particularly in contexts enriched with knowledge from KGs.

Downstream Use

Can be further fine-tuned for Arabic NLP tasks that require semantic understanding, such as text classification or question answering.

How to Get Started with the Model

from transformers import pipeline
fill_mask = pipeline("fill-mask", model="AfnanTS/ARBERTv2_ArLAMA")
fill_mask("اللغة [MASK] مهمة جدا."

Training Details

Training Data

Trained on the ArLAMA dataset, which is designed to represent Knowledge Graphs in natural language.

Training Procedure

Continued pre-training of ArBERTv2 using Masked Language Modeling (MLM) tasks, integrating structured knowledge from Knowledge Graphs.

Downloads last month
16
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for AfnanTS/ARBERTv2_ArLAMA

Base model

UBC-NLP/ARBERTv2
Finetuned
(5)
this model

Dataset used to train AfnanTS/ARBERTv2_ArLAMA