File size: 1,502 Bytes
cf80a81
 
 
 
 
 
 
 
 
f566f99
 
 
 
 
cf80a81
 
 
f566f99
 
 
cf80a81
ed920f9
70a84f8
ed920f9
42a5a6f
 
c9c5888
c3c0264
f566f99
01ec89b
 
 
78c9270
01ec89b
 
1e5428b
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
---
license: cc-by-nc-nd-4.0
extra_gated_fields:
  Name: text
  Company: text
  Country: country
  Specific date: date_picker
  I want to use this model for:
    type: select
    options:
    - Research
    - Education
    - label: Other
      value: other
  I agree to share generated sequences and associated data with authors before publishing: checkbox
  I agree not to file patents on any sequences generated by this model: checkbox
  I agree to use this model for non-commercial use ONLY: checkbox
base_model:
- facebook/esm2_t30_150M_UR50D
pipeline_tag: fill-mask
---

# MeMDLM: De Novo Membrane Protein Design with Masked Discrete Diffusion Language Models

![image/png](https://cdn-uploads.huggingface.co/production/uploads/65bbea9a26c639b000501321/QQoEGgvJddR2JfE7tzMSR.png)


[Masked Diffusion Language Models (MDLMs)](arxiv.org/pdf/2406.07524), introduced by Sahoo et al, provide strong generative capabilities to BERT-style models. In this work, we pre-train and fine-tune ESM-2-150M protein language model (pLM) on the MDLM objective to scaffold functional motifs and unconditionally generate realistic, high-quality membrane protein sequences.


## Repository Authors
[Shrey Goel](mailto:[email protected]), Undergraduate Student at Duke University <br>
[Vishrut Thoutam](mailto:[email protected]), Student at High Technology High School <br>
[Pranam Chatterjee](mailto:[email protected]), Assistant Professor at Duke University 

Reach out to us with any questions!