Spaces:

yaleh
/

meta-prompt

Running

yaleh commited on Aug 22, 2024

Commit

f3f8bb6

1 Parent(s): ce61883

Updated prompt analyzer.

Files changed (2) hide show

config.yml CHANGED Viewed

@@ -312,35 +312,38 @@ prompt_templates:
     prompt_analyzer:
       - role: system
         message: |
-          You are a text comparing program. You compare the following output texts, analysis the System Message and provide a detailed analysis according to [`Acceptance Criteria`]. Then you decide whether [`Actual Output`] is acceptable.
-          Provide your analysis in the following format:
-          ```
-          - Acceptable Differences: [List acceptable differences succinctly]
-          - Unacceptable Differences: [List unacceptable differences succinctly]
-          - Accept: [Yes/No]
-          ```
-          * Compare Expected Output and Actual Output with the guidance of Accept Criteria.
-          * Only set 'Accept' to 'Yes', if Accept Criteria are all met. Otherwise, set 'Accept' to 'No'.
-          * List only the acceptable differences according to Accept Criteria in 'acceptable Differences' section.
-          * List only the unacceptable differences according to Accept Criteria in 'Unacceptable Differences' section.
           # Acceptance Criteria
-          Compared with Expected Output [EO]:
-          ```
           {acceptance_criteria}
-          ```
       - role: human
         message: |
-          # System Message
-          ```
-          {system_message}
-          ```
           # Expected Output
           ```

     prompt_analyzer:
       - role: system
         message: |
+          **TASK:** Compare the Expected Output with the Actual Output according to the Acceptance Criteria. Provide a JSON output with your analysis.
+          **Requirements:**
+          - Compare Expected and Actual Outputs strictly following the Acceptance Criteria.
+          - Set `Accept` to "Yes" only if all criteria are met; otherwise, set it to "No."
+          - List acceptable and unacceptable differences based on the criteria.
+          **Output Format:** JSON with:
+          - `Accept: (Yes/No)`
+          - `Acceptable Differences: []`
+          - `Unacceptable Differences: []`
+          **Example Output:**
+          ```json
+          {{
+              "Accept": "No",
+              "Acceptable Differences": [
+                  "Spelling variations: 'colour' vs 'color'"
+              ],
+              "Unacceptable Differences": [
+                  "Missing section: 'Conclusion'",
+                  "Incorrect date format: '2023/10/12' vs '12-10-2023'"
+              ]
+          }}
+          ```
           # Acceptance Criteria
           {acceptance_criteria}
       - role: human
         message: |
           # Expected Output
           ```

meta_prompt/meta_prompt.py CHANGED Viewed

@@ -464,7 +464,10 @@ class MetaPromptGraph:
                 'message': message.content
             })
-        response = self.llms[NODE_OUTPUT_HISTORY_ANALYZER].invoke(prompt)
         logger.debug({
             'node': NODE_OUTPUT_HISTORY_ANALYZER,
             'action': 'response',
@@ -529,7 +532,8 @@ class MetaPromptGraph:
                 'message': message.content
             })
-        response = self.llms[NODE_PROMPT_ANALYZER].invoke(prompt)
         logger.debug({
             'node': NODE_PROMPT_ANALYZER,
             'action': 'response',
@@ -537,9 +541,16 @@ class MetaPromptGraph:
             'message': response.content
         })
         result_dict = {
             "analysis": response.content,
-            "accepted": "Accept: Yes" in response.content
         }
         logger.debug("Accepted: %s", result_dict["accepted"])

                 'message': message.content
             })
+        json_llm = self.llms[NODE_OUTPUT_HISTORY_ANALYZER].bind(response_format={"type": "json_object"})
+        response = json_llm.invoke(prompt)
         logger.debug({
             'node': NODE_OUTPUT_HISTORY_ANALYZER,
             'action': 'response',
                 'message': message.content
             })
+        json_llm = self.llms[NODE_OUTPUT_HISTORY_ANALYZER].bind(response_format={"type": "json_object"})
+        response = json_llm.invoke(prompt)
         logger.debug({
             'node': NODE_PROMPT_ANALYZER,
             'action': 'response',
             'message': response.content
         })
+        response_content = response.content.strip()
+        if response_content.startswith('```json') and response_content.endswith('```'):
+            response_content = response_content[7:-3].strip()
+        elif response_content.startswith('```') and response_content.endswith('```'):
+            response_content = response_content[3:-3].strip()
+        analysis_dict = json.loads(response_content)
         result_dict = {
             "analysis": response.content,
+            "accepted": analysis_dict.get("Accept") == "Yes"
         }
         logger.debug("Accepted: %s", result_dict["accepted"])