Final_Assignment_Template

Sleeping

App Files Files Community

huytofu92 commited on May 17

Commit

c0deab5

1 Parent(s): c169fbe

Fix prompts and audio tools

Browse files

Files changed (2) hide show

audio_tools.py +9 -2
prompts.yaml +15 -2

audio_tools.py CHANGED Viewed

@@ -27,14 +27,21 @@ class TranscribeAudioTool(Tool):
 transcribe_audio_tool = TranscribeAudioTool()
 @tool
-def audio_to_base64(file_path: str) -> str:
     """
     Convert an audio file to base64 format
     Args:
-        file_path: Path to the audio file
     Returns:
         The audio file in base64 format
     """
     # Load the audio file
     audio = AudioSegment.from_file(file_path)

 transcribe_audio_tool = TranscribeAudioTool()
 @tool
+def audio_to_base64(file_path_or_key: str, state: dict) -> str:
     """
     Convert an audio file to base64 format
     Args:
+        file_path_or_key: Path to the audio file or a key in the state dictionary
+        state: The state dictionary containing file paths
     Returns:
         The audio file in base64 format
     """
+    # Check if the input is a key in the state dictionary
+    if file_path_or_key in state:
+        file_path = state[file_path_or_key]
+    else:
+        file_path = file_path_or_key
     # Load the audio file
     audio = AudioSegment.from_file(file_path)

prompts.yaml CHANGED Viewed

@@ -1,6 +1,18 @@
 system_prompt: |-
-  You are an expert assistant who can solve any task using code blobs. You will be given a task to solve as best you can.
-  To do so, you have been given access to a list of tools: these tools are basically Python functions which you can call with code.
   To solve the task, you must plan forward to proceed in a series of steps, in a cycle of 'Thought:', 'Code:', and 'Observation:' sequences.
   At each step, in the 'Thought:' sequence, you should first explain your reasoning towards solving the task and the tools that you want to use.
@@ -9,6 +21,7 @@ system_prompt: |-
   These print outputs will then appear in the 'Observation:' field, which will be available as input for the next step.
   In the end you have to return a final answer using the `final_answer` tool.
   Here are a few examples using notional tools:
   ---
   Task: "Generate an image of the oldest person in this document."

 system_prompt: |-
+  You are a general AI assistant. I will ask you a question.
+  Report your thoughts, and finish your answer with the following template:
+  FINAL ANSWER: [YOUR FINAL ANSWER].
+  YOUR FINAL ANSWER should be a number OR as few words as possible
+  OR a comma separated list of numbers and/or strings.
+  If you are asked for a number, don't use comma to write your number
+  neither use units such as $ or percent sign unless specified otherwise.
+  If you are asked for a string, don't use articles, neither abbreviations (e.g. for cities),
+  and write the digits in plain text unless specified otherwise.
+  If you are asked for a comma separated list, apply the above rules
+  depending of whether the element to be put in the list is a number or a string.
   To solve the task, you must plan forward to proceed in a series of steps, in a cycle of 'Thought:', 'Code:', and 'Observation:' sequences.
   At each step, in the 'Thought:' sequence, you should first explain your reasoning towards solving the task and the tools that you want to use.
   These print outputs will then appear in the 'Observation:' field, which will be available as input for the next step.
   In the end you have to return a final answer using the `final_answer` tool.
+  You are also given access to a list of tools: these tools are basically Python functions which you can call with code.
   Here are a few examples using notional tools:
   ---
   Task: "Generate an image of the oldest person in this document."