Primus
Paper β’ 2502.11191 β’ Published β’ 3Note Start by reading the πPrimus Paper! To the best of our knowledge, we are the π₯ first to release datasets covering cybersecurity pretraining, IFT, and reasoning distillation. Of course, we are also the first to pretrain an LLM on a large-scale cybersecurity corpus.
trendmicro-ailab/Llama-Primus-Base
Text Generation β’ Updated β’ 8 β’ 4Note Based on Llama-3.1-8B-Instruct, continually pretrained on 2.77B tokens of cybersecurity text, achieving a π15.88% improvement in the aggregated score across multiple cybersecurity benchmarks.
trendmicro-ailab/Llama-Primus-Merged
Text Generation β’ Updated β’ 121 β’ 7Note Instruct Model! While maintaining nearly the same instruction-following capability as Llama-3.1-8B-Instruct, achieving a π14.84% improvement across multiple cybersecurity benchmarks.
trendmicro-ailab/Llama-Primus-Reasoning
Text Generation β’ Updated β’ 8 β’ 1Note Distilled on reasoning and reflection data from o1-preview for cybersecurity tasks, achieving a π10% improvement on CISSP.
trendmicro-ailab/Primus-Seed
Viewer β’ Updated β’ 174kNote Includes high-quality cybersecurity texts manually collected from reputable sources such as wikipedia, MITRE, cybersecurity company websites, CTI, and more.
trendmicro-ailab/Primus-FineWeb
Viewer β’ Updated β’ 3.39M β’ 21 β’ 4Note Includes 2.57B tokens of cybersecurity texts filtered from FineWeb.
trendmicro-ailab/Primus-Instruct
Viewer β’ Updated β’ 835 β’ 7Note Includes approximately 1K QA pairs covering common cybersecurity business scenarios.
trendmicro-ailab/Primus-Reasoning
Viewer β’ Updated β’ 2.4k β’ 12 β’ 4Note Includes reasoning and reflection data generated by o1-preview on cybersecurity tasks for distillation.