--- license: cc-by-nc-sa-4.0 datasets: - QCRI/LlamaLens-English - QCRI/LlamaLens-Arabic - QCRI/LlamaLens-Hindi language: - ar - en - hi base_model: - meta-llama/Llama-3.1-8B-Instruct pipeline_tag: text-generation tags: - Social-Media - Hate-Speech - Summarization - offensive-language - News-Genre --- # LlamaLens: Specialized Multilingual LLM forAnalyzing News and Social Media Content ## Overview LlamaLens is a specialized multilingual LLM designed for analyzing news and social media content. It focuses on 19 NLP tasks, leveraging 52 datasets across Arabic, English, and Hindi.

capablities_tasks_datasets

## Dataset The model was trained on the [LlamaLens dataset](https://huggingface.co/collections/QCRI/llamalens-672f7e0604a0498c6a2f0fe9). ## To Replicate the Experiments The code to replicate the experiments is available on [GitHub](https://github.com/firojalam/LlamaLens). ## Model Inference TBA # License This model is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0). # Citation Please cite [our paper](https://arxiv.org/pdf/2410.15308) when using this model: ``` @article{kmainasi2024llamalensspecializedmultilingualllm, title={LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content}, author={Mohamed Bayan Kmainasi and Ali Ezzat Shahroor and Maram Hasanain and Sahinur Rahman Laskar and Naeemul Hassan and Firoj Alam}, year={2024}, journal={arXiv preprint arXiv:2410.15308}, volume={}, number={}, pages={}, url={https://arxiv.org/abs/2410.15308}, eprint={2410.15308}, archivePrefix={arXiv}, primaryClass={cs.CL} } ```