--- tags: - Beta license: "mit" thumbnail: "https://huggingface.co/finding-fossils/metaextractor/resolve/main/ffossils-logo-text.png" widget: - text: "The core sample was aged at 12300 - 13500 BP and found at 210m a.s.l." example_title: "Age/Alti" - text: "In Northern Canada, the BGC site core was primarily made up of Pinus pollen." example_title: "Taxa/Site/Region" --- # MetaExtractor This model extracts metadata from research articles related to Paleoecology. The entities detected by this model are: - **AGE**: when historical ages are mentioned such as 1234 AD or 4567 BP (before present) - **TAXA**: plant or animal taxa names indicating what samples contained - **GEOG**: geographic coordinates indicating where samples were excavated from, e.g. 12'34"N 34'23"W - **SITE**: site names for where samples were excavated from - **REGION**: more general regions to provide context for where sites are located - **EMAIL**: researcher emails in the articles able to be used for follow-up contact - **ALTI**: altitudes of sites from where samples were excavated, e.g. 123 m a.s.l (above sea level) ## Model Details ### Model Description - **Developed by:** Ty Andrews, Jenit Jain, Shaun Hutchinson, Kelly Wu, and Simon Goring - **Shared by:** Neotoma Paleocology Database - **Model type:** Token Classification - **Language(s) (NLP):** English - **License:** MIT - **Finetuned from model:** roberta-base ### Model Sources [optional] - **Repository:** https://github.com/NeotomaDB/MetaExtractor - **Paper:** TBD - **Demo:** TBD ## Uses ### Direct Use [More Information Needed] ### Downstream Use [optional] [More Information Needed] ### Out-of-Scope Use [More Information Needed] ## Bias, Risks, and Limitations [More Information Needed] ### Recommendations Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. ## How to Get Started with the Model Use the code below to get started with the model. [More Information Needed] ## Training Details ### Training Data [More Information Needed] ### Training Procedure #### Preprocessing [optional] [More Information Needed] #### Training Hyperparameters - **Training regime:** [More Information Needed] #### Speeds, Sizes, Times [optional] [More Information Needed] ## Evaluation ### Testing Data, Factors & Metrics #### Testing Data [More Information Needed] #### Factors [More Information Needed] #### Metrics [More Information Needed] ### Results [More Information Needed] #### Summary ## Model Examination [optional] [More Information Needed] ## Environmental Impact Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). - **Hardware Type:** [More Information Needed] - **Hours used:** [More Information Needed] - **Cloud Provider:** [More Information Needed] - **Compute Region:** [More Information Needed] - **Carbon Emitted:** [More Information Needed] ## Technical Specifications [optional] ### Model Architecture and Objective [More Information Needed] ### Compute Infrastructure [More Information Needed] #### Hardware [More Information Needed] #### Software [More Information Needed] ## Citation [optional] **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional] [More Information Needed] ## More Information [optional] [More Information Needed] ## Model Card Authors [optional] [More Information Needed] ## Model Card Contact [More Information Needed]