|
--- |
|
license: mit |
|
--- |
|
|
|
# SRTK Scorer |
|
|
|
This model is a trained scorer for [SRTK](https://github.com/happen2me/subgraph-retrieval-toolkit). It is used to compare the similarity between a query and the expansion path at the time of subgraph retrieval. |
|
|
|
## Training Information |
|
|
|
It is initialized with `roberta-base`. It is trained jointly on the following datasets: |
|
|
|
- [WebQSP for Freebase](https://www.microsoft.com/en-us/download/details.aspx?id=52763) |
|
- [SimpleQuestionsWikidata for Wikidata](https://github.com/askplatypus/wikidata-simplequestions) |
|
- [SimpleDBpediaQA](https://github.com/castorini/SimpleDBpediaQA) |
|
|
|
It achieves an answer coverage rate of 0.9728 on SimpleQuestionsWikidata (depth 1) 0.8501 on WebQSP test set (depth 2) with a beam width of only 2! |
|
|
|
## Usage Example |
|
|
|
First install the package: |
|
|
|
```bash |
|
pip install srtk |
|
``` |
|
|
|
Then you can retrieve subgraphs with the help of this scorer: |
|
|
|
```bash |
|
srtk retrieve -i data/wikidata-simplequestions/intermediate/scores_test.jsonl \ |
|
-o artifacts/subgraphs/wikidata-simple-contrast \ |
|
-e http://localhost:1234/api/endpoint/sparql \ |
|
--scorer-model-path drt/srtk-scorer \ |
|
--scorer --beam-width 2 --max-depth 1 --evaluate |
|
``` |
|
|
|
## Limitations |
|
|
|
As both SimpleQuestionsWikidata and SimpleDBpediaQA contain only one-hop relations, the model tends to stop at one-hop when you retrieve subgraphs on Wikidata and DBpedia. We will release a updated version of the model that is trained on a more diverse dataset in the future. |
|
|
|
## License |
|
|
|
MIT |
|
|