Commit 0c7d959 (verified) by avelezarce · 1 parent: 6a0b3ad

Update README.md

Files changed (1): README.md (+13 -1)
@@ -16,12 +16,24 @@ tags:
16
  library_name: tdc
17
  license: bsd-2-clause
18
  ---
19
- The TDC Transformers API is still under development. You may download PINNACLE pre-trained weights and hyperparameters from the files included.
20
 
21
  ## Model description
22
  We introduce PINNACLE, a flexible geometric deep-learning approach that is trained on contextualized protein interaction networks to generate context-aware protein representations. Leveraging a human multi-organ single-cell transcriptomic atlas, PINNACLE provides 394,760 protein representations split across 156 cell type contexts from 24 tissues and organs.
23
  To load the pre-trained model, download the files listed under the Files and Versions tab.
24
 
25
  ## References
26
  * Dataset entry in Therapeutics Data Commons, https://tdcommons.ai/multi_pred_tasks/scdti/
27
  * Li, Michelle, et al. “Contextual AI models for single-cell protein biology.” Nature Methods (2024).
 
16
  library_name: tdc
17
  license: bsd-2-clause
18
  ---
 
19
 
20
  ## Model description
21
  We introduce PINNACLE, a flexible geometric deep-learning approach that is trained on contextualized protein interaction networks to generate context-aware protein representations. Leveraging a human multi-organ single-cell transcriptomic atlas, PINNACLE provides 394,760 protein representations split across 156 cell type contexts from 24 tissues and organs.
22
  To load the pre-trained model, download the files listed under the Files and Versions tab.
23
 
24
+ ## Abstract
25
+ Understanding protein function and developing molecular therapies require deciphering the cell types in which proteins act as well as the interactions between proteins. However, modeling protein interactions across biological contexts remains challenging for existing algorithms. Here we introduce PINNACLE, a geometric deep learning approach that generates context-aware protein representations. Leveraging a multiorgan single-cell atlas, PINNACLE learns on contextualized protein interaction networks to produce 394,760 protein representations from 156 cell type contexts across 24 tissues. PINNACLE’s embedding space reflects cellular and tissue organization, enabling zero-shot retrieval of the tissue hierarchy. Pretrained protein representations can be adapted for downstream tasks: enhancing 3D structure-based representations for resolving immuno-oncological protein interactions, and investigating drugs’ effects across cell types. PINNACLE outperforms state-of-the-art models in nominating therapeutic targets for rheumatoid arthritis and inflammatory bowel diseases and pinpoints cell type contexts with higher predictive capability than context-free models. PINNACLE’s ability to adjust its outputs on the basis of the context in which it operates paves the way for large-scale context-specific predictions in biology.
26
+
27
+ ## Code
28
+ We support retrieval of PINNACLE embeddings.
29
+ ```python
30
+ from tdc.resource.pinnacle import PINNACLE
31
+ pinnacle = PINNACLE()
32
+ embeds = pinnacle.get_embeds()
33
+ ```
34
+ We also share the code used to benchmark PINNACLE on the TDC.scDTI task:
35
+ https://colab.research.google.com/drive/14WeNbDNyDi_45hFayBMtt2MBOvhyRRvy?usp=sharing
36
+
37
  ## References
38
  * Dataset entry in Therapeutics Data Commons, https://tdcommons.ai/multi_pred_tasks/scdti/
39
  * Li, Michelle, et al. “Contextual AI models for single-cell protein biology.” Nature Methods (2024).
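
As a rough illustration of how embeddings such as those retrieved in the Code section might be compared across cell type contexts, the sketch below computes cosine similarity between toy vectors. The `toy_embeds` dictionary, its `(protein, context)` keys, and its values are invented for illustration; they do not reflect the format returned by `get_embeds()`, which should be inspected before adapting this.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine similarity between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Hypothetical toy data: names and values are made up, not PINNACLE output.
toy_embeds = {
    ("PROT_A", "context_1"): [1.0, 0.0, 1.0],
    ("PROT_A", "context_2"): [1.0, 0.0, 1.0],
    ("PROT_B", "context_1"): [0.0, 1.0, 0.0],
}

# Same protein in two contexts versus two different proteins in one context.
same_protein = cosine_similarity(
    toy_embeds[("PROT_A", "context_1")], toy_embeds[("PROT_A", "context_2")]
)
different_protein = cosine_similarity(
    toy_embeds[("PROT_A", "context_1")], toy_embeds[("PROT_B", "context_1")]
)
print(same_protein, different_protein)
```

With real embeddings, the same function could be applied to any two rows once their layout is known; cosine similarity is used here only as a common, scale-invariant choice, not as the metric used by the authors.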