Spaces: Sleeping
John Graham Reynolds committed · Commit 4abddf8 · 1 Parent(s): f45b463
try to change path to css file and add newer, non-experimental decorator for caching
app.py CHANGED
@@ -31,12 +31,13 @@ EXAMPLE_PROMPTS = [
 ]
 
 TITLE = "VUMC Chatbot"
-DESCRIPTION="""Welcome to the first generation Vanderbilt AI assistant! This AI assistant is built atop the Databricks DBRX large language model
+DESCRIPTION="""Welcome to the first generation Vanderbilt AI assistant! \n This AI assistant is built atop the Databricks DBRX large language model
 and is augmented with additional organization-specific knowledge. Specifically, it has been preliminarily augmented with knowledge of Vanderbilt University Medical Center
 terms like **Data Lake**, **EDW** (Enterprise Data Warehouse), **HCERA** (Health Care and Education Reconciliation Act), and **thousands more!** The model has **no access to PHI**.
 Try querying the model with any of the examples prompts below for a simple introduction to both Vanderbilt-specific and general knowledge queries. The purpose of this
-model is to allow VUMC employees access to an intelligent assistant that improves and expedites VUMC work.
-Feedback and ideas are very welcome!
+model is to allow VUMC employees access to an intelligent assistant that improves and expedites VUMC work. \n
+Feedback and ideas are very welcome! Please provide any feedback, ideas, or issues to the email: **[email protected]**.
+We hope to gradually improve this AI assistant to create a large-scale, all-inclusive tool to compliment the work of all VUMC staff."""
 
 GENERAL_ERROR_MSG = "An error occurred. Please refresh the page to start a new conversation."
 
@@ -58,7 +59,7 @@ st.markdown(DESCRIPTION)
 st.markdown("\n")
 
 # use this to format later
-with open("style.css") as css:
+with open("./style.css") as css:
     st.markdown( f'<style>{css.read()}</style>' , unsafe_allow_html= True)
 
 if "messages" not in st.session_state:
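A note on the stylesheet hunk above: `open("./style.css")` resolves against the process working directory exactly like `open("style.css")`, so the new prefix only changes behavior if the Space already runs the app from its own folder. A minimal sketch of a location-independent alternative that anchors the path to the script file itself (the `pathlib` approach is an assumption, not part of this commit):

```python
from pathlib import Path

import streamlit as st

# Resolve style.css relative to this script rather than the current working directory.
css_path = Path(__file__).parent / "style.css"
st.markdown(f"<style>{css_path.read_text()}</style>", unsafe_allow_html=True)
```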
@@ -80,7 +81,7 @@ def get_system_prompt():
 # make sure we cache this so that it doesnt redownload each time, hindering Space start time if sleeping
 # try adding this st caching decorator to ensure the embeddings class gets cached after downloading the entirety of the model
 # does this cache to the given folder though? It does appear to populate the folder as expected after being run
-@st.experimental_memo
+@st.cache_data # will this work here?
 def load_embedding_model():
     embeddings = HuggingFaceEmbeddings(model_name="BAAI/bge-large-en", cache_folder="./langchain_cache/")
     return embeddings
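On the `# will this work here?` question in the hunk above: `st.cache_data` is indeed the non-experimental successor to `st.experimental_memo`, but it serializes and copies the return value on every hit, which suits dataframes and plain data. For a model wrapper object like `HuggingFaceEmbeddings`, `st.cache_resource` (the successor to `st.experimental_singleton`) keeps one shared instance and is usually the better fit. A minimal sketch under that assumption; the import path shown is a guess and the app's existing import would be reused:

```python
import streamlit as st
from langchain_community.embeddings import HuggingFaceEmbeddings  # import path assumed

# cache_resource keeps a single shared embeddings object across reruns and sessions,
# so the model is neither re-instantiated nor re-serialized on every script run.
@st.cache_resource
def load_embedding_model():
    return HuggingFaceEmbeddings(
        model_name="BAAI/bge-large-en",
        cache_folder="./langchain_cache/",  # where the downloaded model weights are stored on disk
    )

embeddings = load_embedding_model()
```

Separately, `cache_folder` only controls where the weights land on disk; the Streamlit decorator decides whether the Python object is rebuilt on a rerun, so the "does this cache to the given folder though?" comment is really about two different layers of caching.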
@@ -89,6 +90,7 @@ embeddings = load_embedding_model()
 # instantiate the vector store for similarity search in our chain
 # need to make this a function and decorate it with @st.experimental_memo as above?
 # We are only calling this initially when the Space starts. Can we expedite this process for users when opening up this Space?
+# @st.cache_data # TODO add this in
 vector_store = DatabricksVectorSearch(
     endpoint=VS_ENDPOINT_NAME,
     index_name=VS_INDEX_NAME,
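For the `# @st.cache_data # TODO add this in` note in the last hunk, the same distinction applies, and, as the in-code comment already observes, a decorator cannot sit on a plain assignment: the construction would need to move into a function first. A rough sketch of that refactor using `st.cache_resource`, with an assumed import path and placeholder values for the constants app.py already defines:

```python
import streamlit as st
from databricks_langchain import DatabricksVectorSearch  # import path assumed; keep app.py's existing import

VS_ENDPOINT_NAME = "vs_endpoint"        # placeholder; defined earlier in app.py
VS_INDEX_NAME = "catalog.schema.index"  # placeholder; defined earlier in app.py

# Wrap the client construction so Streamlit builds it once and reuses it
# across reruns instead of reconnecting on every script run.
@st.cache_resource
def get_vector_store():
    return DatabricksVectorSearch(
        endpoint=VS_ENDPOINT_NAME,
        index_name=VS_INDEX_NAME,
        # ...plus the remaining arguments from the existing call in app.py
    )

vector_store = get_vector_store()
```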
|