---
license: cc-by-nc-nd-4.0
extra_gated_fields:
Name: text
Company: text
Country: country
Specific date: date_picker
I want to use this model for:
type: select
options:
- Research
- Education
- label: Other
value: other
I agree to share generated sequences and associated data with authors before publishing: checkbox
I agree not to file patents on any sequences generated by this model: checkbox
I agree to use this model for non-commercial use ONLY: checkbox
base_model:
- facebook/esm2_t30_150M_UR50D
pipeline_tag: fill-mask
---
# MeMDLM: De Novo Membrane Protein Design with Masked Discrete Diffusion Language Models
![image/png](https://cdn-uploads.huggingface.co/production/uploads/65bbea9a26c639b000501321/QQoEGgvJddR2JfE7tzMSR.png)
[Masked Diffusion Language Models (MDLMs)](arxiv.org/pdf/2406.07524), introduced by Sahoo et al, provide strong generative capabilities to BERT-style models. In this work, we pre-train and fine-tune ESM-2-150M protein language model (pLM) on the MDLM objective to scaffold functional motifs and unconditionally generate realistic, high-quality membrane protein sequences.
## Repository Authors
[Shrey Goel](mailto:shrey.goel@duke.edu), Undergraduate Student at Duke University
[Vishrut Thoutam](mailto:vishrut.thoutam64@gmail.com), Student at High Technology High School
[Pranam Chatterjee](mailto:pranam.chatterjee@duke.edu), Assistant Professor at Duke University
Reach out to us with any questions!