---
library_name: peft
datasets:
  - InstaDeepAI/nucleotide_transformer_downstream_tasks_revised
metrics:
  - f1
base_model:
  - tattabio/gLM2_150M
model-index:
  - name: alejandralopezsosa/gLM2_150M-promoter_tata-lora
    results:
      - task:
          type: sequence-classification
        dataset:
          type: InstaDeepAI/nucleotide_transformer_downstream_tasks_revised
          name: nucleotide_transformer_downstream_tasks_revised
          config: promoter_tata
          split: test
          revision: c8c94743d3d2838b943398ee676247ac2f774122
        metrics:
          - type: f1
            value: 0.9811
---

# gLM2 LoRA adapter for TATA promoter recognition

This adapter demonstrates the use of [gLM2_150M](https://huggingface.co/tattabio/gLM2_150M) embeddings for downstream sequence classification. The base model is fine-tuned with LoRA and reaches an F1 score of 98.11% on the `promoter_tata` task from the revised Nucleotide Transformer downstream benchmarks.
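
For reference, below is a minimal sketch of how such an adapter can be set up with `peft`. The rank, scaling factor, dropout, and target modules are illustrative assumptions, not the exact configuration used to train this checkpoint; `base_model` refers to the gLM2-backed classification model constructed in the next section.

```python
# Illustrative LoRA setup; r, lora_alpha, lora_dropout, and target_modules
# are assumptions, not the exact values used to train this checkpoint.
from peft import LoraConfig, get_peft_model

lora_config = LoraConfig(
    task_type="SEQ_CLS",
    r=16,                                 # assumed rank
    lora_alpha=32,                        # assumed scaling factor
    lora_dropout=0.05,                    # assumed dropout
    target_modules=["q_proj", "v_proj"],  # assumed attention projections
)

# `base_model` is the gLM2-backed sequence-classification model
# loaded as shown in the next section.
peft_model = get_peft_model(base_model, lora_config)
peft_model.print_trainable_parameters()
```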

## How to Get Started with the Model

Use the code below to load the model for inference:

```python
import torch
from peft import PeftModel
from transformers import AutoConfig, AutoModel, AutoModelForSequenceClassification

glm2 = "tattabio/gLM2_150M"
adapter = "alejandralopezsosa/gLM2_150M-promoter_tata-lora"

load_kwargs = {
    'trust_remote_code': True,
    'torch_dtype': torch.bfloat16,
}

# Build the classification model from the adapter's config, then load the
# pretrained gLM2 backbone into it.
config = AutoConfig.from_pretrained(adapter, **load_kwargs)
base_model = AutoModelForSequenceClassification.from_config(config, **load_kwargs)
base_model.glm2 = AutoModel.from_pretrained(glm2, **load_kwargs)

# Attach the trained LoRA adapter weights.
model = PeftModel.from_pretrained(base_model, adapter)
```
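
With the model loaded, a minimal inference sketch follows. The input format (lowercase nucleotides prefixed with a `<+>` strand token) follows the gLM2 convention described on the base model card; the placeholder sequence and the 0/1 label mapping are assumptions, so verify both against the base model card and the adapter's config:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained(glm2, trust_remote_code=True)

# gLM2 expects lowercase nucleotide sequences prefixed with a strand token;
# replace the placeholder below with a real promoter region.
sequence = "<+>" + "tataaaagcgcgcgc"

inputs = tokenizer(sequence, return_tensors="pt")

model.eval()
with torch.no_grad():
    logits = model(**inputs).logits

# Assumed label mapping: 1 = TATA promoter, 0 = non-promoter.
predicted = logits.argmax(dim=-1).item()
print(predicted)
```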