deeme commited on
Commit
f62ece5
·
verified ·
1 Parent(s): 15683f3

Upload free_ask_internet.py

Browse files
Files changed (1) hide show
  1. free_ask_internet.py +19 -45
free_ask_internet.py CHANGED
@@ -100,13 +100,13 @@ def search_web_ref(query:str, debug=False):
100
  raise ex
101
 
102
 
103
- def gen_prompt(question,content_list, lang="zh-CN", context_length_limit=11000,debug=False):
104
 
105
  limit_len = (context_length_limit - 2000)
106
  if len(question) > limit_len:
107
  question = question[0:limit_len]
108
 
109
- ref_content = [ item.get("content") for item in content_list]
110
 
111
  answer_language = ' Simplified Chinese '
112
  if lang == "zh-CN":
@@ -116,50 +116,24 @@ def gen_prompt(question,content_list, lang="zh-CN", context_length_limit=11000,d
116
  if lang == "en-US":
117
  answer_language = ' English '
118
 
119
-
120
  if len(ref_content) > 0:
121
-
122
- if False:
123
- prompts = '''
124
- 您是一位由 nash_su 开发的大型语言人工智能助手。您将被提供一个用户问题,并需要撰写一个清晰、简洁且准确的答案。提供了一组与问题相关的上下文,每个都以 [[citation:x]] 这样的编号开头,x 代表一个数字。请在适当的情况下在句子末尾引用上下文。答案必须正确、精确,并以专家的中立和职业语气撰写。请将答案限制在 2000 个标记内。不要提供与问题无关的信息,也不要重复。如果给出的上下文信息不足,请在相关主题后写上“信息缺失:”。请按照引用编号 [citation:x] 的格式在答案中对应部分引用上下文。如果一句话源自多个上下文,请列出所有相关的引用编号,例如 [citation:3][citation:5],不要将引用集中在最后返回,而是在答案对应部分列出。除非是代码、特定的名称或引用编号,答案的语言应与问题相同。以下是上下文的内容集:
125
- ''' + "\n\n" + "```"
126
- ref_index = 1
127
-
128
- for ref_text in ref_content:
129
-
130
- prompts = prompts + "\n\n" + " [citation:{}] ".format(str(ref_index)) + ref_text
131
- ref_index += 1
132
-
133
- if len(prompts) >= limit_len:
134
- prompts = prompts[0:limit_len]
135
- prompts = prompts + '''
136
- ```
137
- 记住,不要一字不差的重复上下文内容。回答必须使用简体中文,如果回答很长,请尽量结构化、分段落总结。请按照引用编号 [citation:x] 的格式在答案中对应部分引用上下文。如果一句话源自多个上下文,请列出所有相关的引用编号,例如 [citation:3][citation:5],不要将引用集中在最后返回,而是在答案对应部分列出。下面是用户问题:
138
- ''' + question
139
- else:
140
- prompts = '''
141
- You are a large language AI assistant develop by nash_su. You are given a user question, and please write clean, concise and accurate answer to the question. You will be given a set of related contexts to the question, each starting with a reference number like [[citation:x]], where x is a number. Please use the context and cite the context at the end of each sentence if applicable.
142
- Your answer must be correct, accurate and written by an expert using an unbiased and professional tone. Please limit to 1024 tokens. Do not give any information that is not related to the question, and do not repeat. Say "information is missing on" followed by the related topic, if the given context do not provide sufficient information.
143
-
144
- Please cite the contexts with the reference numbers, in the format [citation:x]. If a sentence comes from multiple contexts, please list all applicable citations, like [citation:3][citation:5]. Other than code and specific names and citations, your answer must be written in the same language as the question.
145
- Here are the set of contexts:
146
- ''' + "\n\n" + "```"
147
- ref_index = 1
148
-
149
- for ref_text in ref_content:
150
-
151
- prompts = prompts + "\n\n" + " [citation:{}] ".format(str(ref_index)) + ref_text
152
- ref_index += 1
153
-
154
- if len(prompts) >= limit_len:
155
- prompts = prompts[0:limit_len]
156
- prompts = prompts + '''
157
- ```
158
- Above is the reference contexts. Remember, don't repeat the context word for word. Answer in ''' + answer_language + '''. If the response is lengthy, structure it in paragraphs and summarize where possible. Cite the context using the format [citation:x] where x is the reference number. If a sentence originates from multiple contexts, list all relevant citation numbers, like [citation:3][citation:5]. Don't cluster the citations at the end but include them in the answer where they correspond.
159
- Remember, don't blindly repeat the contexts verbatim. And here is the user question:
160
- ''' + question
161
-
162
-
163
  else:
164
  prompts = question
165
 
 
100
  raise ex
101
 
102
 
103
+ def gen_prompt(question, content_list, lang="zh-CN", context_length_limit=11000, debug=False):
104
 
105
  limit_len = (context_length_limit - 2000)
106
  if len(question) > limit_len:
107
  question = question[0:limit_len]
108
 
109
+ ref_content = [item.get("content") for item in content_list]
110
 
111
  answer_language = ' Simplified Chinese '
112
  if lang == "zh-CN":
 
116
  if lang == "en-US":
117
  answer_language = ' English '
118
 
 
119
  if len(ref_content) > 0:
120
+ prompts = '''
121
+ 您是一位由 nash_su 开发的大型语言人工智能助手。您将被提供一个用户问题,���需要撰写一个清晰、简洁且准确的答案。提供了一组与问题相关的上下文,每个都以 [[citation:x]] 这样的编号开头,x 代表一个数字。请在适当的情况下在句子末尾引用上下文。答案必须正确、精确,并以专家的中立和职业语气撰写。请将答案限制在 2000 个标记内。不要提供与问题无关的信息,也不要重复。如果给出的上下文信息不足,请在相关主题后写上“信息缺失:”。请按照引用编号 [citation:x] 的格式在答案中对应部分引用上下文。如果一句话源自多个上下文,请列出所有相关的引用编号,例如 [citation:3][citation:5],不要将引用集中在最后返回,而是在答案对应部分列出。除非是代码、特定的名称或引用编号,答案的语言应与问题相同。以下是上下文的内容集:
122
+ ''' + "\n\n" + "```"
123
+ ref_index = 1
124
+
125
+ for ref_text in ref_content:
126
+ # 将引用转换为链接
127
+ ref_link = "[citation:{}]({})".format(ref_index, content_list[ref_index - 1].get('url'))
128
+ prompts += "\n\n" + " " + ref_link + " " + ref_text
129
+ ref_index += 1
130
+
131
+ if len(prompts) >= limit_len:
132
+ prompts = prompts[0:limit_len]
133
+ prompts += '''
134
+ ```
135
+ 记住,不要一字不差的重复上下文内容。回答必须使用简体中文,如果回答很长,请尽量结构化、分段落总结。请按照引用编号 [citation:x] 的格式在答案中对应部分引用上下文。如果一句话源自多个上下文,请列出所有相关的引用编号,例如 [citation:3][citation:5]。不要将引用集中在最后返回,而是在答案对应部分列出。下面是用户问题:
136
+ ''' + question
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
137
  else:
138
  prompts = question
139