dpetrak commited on
Commit
e24a52a
1 Parent(s): 28594bf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +155 -180
README.md CHANGED
@@ -1,199 +1,174 @@
1
  ---
2
- library_name: transformers
3
- tags: []
 
 
 
4
  ---
5
 
6
- # Model Card for Model ID
7
 
8
- <!-- Provide a quick summary of what the model is/does. -->
9
 
 
 
 
 
 
10
 
 
11
 
12
- ## Model Details
13
 
14
- ### Model Description
15
 
16
- <!-- Provide a longer summary of what this model is. -->
 
17
 
18
- This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
 
19
 
20
- - **Developed by:** [More Information Needed]
21
- - **Funded by [optional]:** [More Information Needed]
22
- - **Shared by [optional]:** [More Information Needed]
23
- - **Model type:** [More Information Needed]
24
- - **Language(s) (NLP):** [More Information Needed]
25
- - **License:** [More Information Needed]
26
- - **Finetuned from model [optional]:** [More Information Needed]
27
 
28
- ### Model Sources [optional]
29
 
30
- <!-- Provide the basic links for the model. -->
 
 
 
31
 
32
- - **Repository:** [More Information Needed]
33
- - **Paper [optional]:** [More Information Needed]
34
- - **Demo [optional]:** [More Information Needed]
35
 
36
- ## Uses
37
 
38
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 
39
 
40
- ### Direct Use
 
 
 
 
41
 
42
- <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
 
43
 
44
- [More Information Needed]
45
-
46
- ### Downstream Use [optional]
47
-
48
- <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
49
-
50
- [More Information Needed]
51
-
52
- ### Out-of-Scope Use
53
-
54
- <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
55
-
56
- [More Information Needed]
57
-
58
- ## Bias, Risks, and Limitations
59
-
60
- <!-- This section is meant to convey both technical and sociotechnical limitations. -->
61
-
62
- [More Information Needed]
63
-
64
- ### Recommendations
65
-
66
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
67
-
68
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
69
-
70
- ## How to Get Started with the Model
71
-
72
- Use the code below to get started with the model.
73
-
74
- [More Information Needed]
75
-
76
- ## Training Details
77
-
78
- ### Training Data
79
-
80
- <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
81
-
82
- [More Information Needed]
83
-
84
- ### Training Procedure
85
-
86
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
87
-
88
- #### Preprocessing [optional]
89
-
90
- [More Information Needed]
91
-
92
-
93
- #### Training Hyperparameters
94
-
95
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
96
-
97
- #### Speeds, Sizes, Times [optional]
98
-
99
- <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
100
-
101
- [More Information Needed]
102
-
103
- ## Evaluation
104
-
105
- <!-- This section describes the evaluation protocols and provides the results. -->
106
-
107
- ### Testing Data, Factors & Metrics
108
-
109
- #### Testing Data
110
-
111
- <!-- This should link to a Dataset Card if possible. -->
112
-
113
- [More Information Needed]
114
-
115
- #### Factors
116
-
117
- <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
118
-
119
- [More Information Needed]
120
-
121
- #### Metrics
122
-
123
- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
124
-
125
- [More Information Needed]
126
-
127
- ### Results
128
-
129
- [More Information Needed]
130
-
131
- #### Summary
132
-
133
-
134
-
135
- ## Model Examination [optional]
136
-
137
- <!-- Relevant interpretability work for the model goes here -->
138
-
139
- [More Information Needed]
140
-
141
- ## Environmental Impact
142
-
143
- <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
144
-
145
- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
146
-
147
- - **Hardware Type:** [More Information Needed]
148
- - **Hours used:** [More Information Needed]
149
- - **Cloud Provider:** [More Information Needed]
150
- - **Compute Region:** [More Information Needed]
151
- - **Carbon Emitted:** [More Information Needed]
152
-
153
- ## Technical Specifications [optional]
154
-
155
- ### Model Architecture and Objective
156
-
157
- [More Information Needed]
158
-
159
- ### Compute Infrastructure
160
-
161
- [More Information Needed]
162
-
163
- #### Hardware
164
-
165
- [More Information Needed]
166
-
167
- #### Software
168
-
169
- [More Information Needed]
170
-
171
- ## Citation [optional]
172
-
173
- <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
174
-
175
- **BibTeX:**
176
-
177
- [More Information Needed]
178
-
179
- **APA:**
180
-
181
- [More Information Needed]
182
-
183
- ## Glossary [optional]
184
-
185
- <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
186
-
187
- [More Information Needed]
188
-
189
- ## More Information [optional]
190
-
191
- [More Information Needed]
192
-
193
- ## Model Card Authors [optional]
194
-
195
- [More Information Needed]
196
-
197
- ## Model Card Contact
198
-
199
- [More Information Needed]
 
1
  ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ base_model:
6
+ - meta-llama/Llama-3.1-8B-Instruct
7
  ---
8
 
9
+ # Llama 3.1 -- Finetuned using FEDI v 1.2
10
 
11
+ This is a 8-bit quantized LLama 3.1 gguf model trained on the new version of the FEDI dataset. It covers the following domains:
12
 
13
+ - Parcel Choice: In parcel choice, the system's task is to help the user choose the right shipping box and delivery option for their needs (given the weight of the items to be sent and the destination).
14
+ - Recharge Phone: In recharge phone, the task is to top up the user's prepaid SIM card.
15
+ - Building Access: In building access, the system acts as a receptionist and is responsible for access control.
16
+ - Question Answering: In question answering, the system runs the customer support for an insurance and postal service company.
17
+ - Parcel Shipping: In parcel shipping, the system helps the user to choose the right shipping box and shipping product.
18
 
19
+ ## Prompt Format
20
 
21
+ The model was trained in intent prediction, slot prediction and response generation. In the following, we provide examples for the prompts:
22
 
23
+ ### Intent Prediction
24
 
25
+ ```json
26
+ Language models are trained to understand and respond to human language. They interpret user queries and requests and generate informative and engaging responses that are tailored to the respective context and task. To do this, they have to interpret the task from a user utterance, extract the task-related attributes mentioned in the user utterance and dialogue history and take all this information into account to generate a response in a helpful and friendly manner. You are such a language model.
27
 
28
+ ###Instruction
29
+ Given is the following dialogue between a user and a language model (system):
30
 
31
+ {{history}}
32
+
33
+ Which of the following tasks is addressed by the last user utterance?
 
 
 
 
34
 
35
+ {{intents}}
36
 
37
+ Return the task name in a parsable JSON object. Here is an example:
38
+ {"result": {"intent": "parcel choice"}}
39
+
40
+ Return just one task name and don't include any additional notes or explanations.
41
 
42
+ ###RESPONSE
43
+ ```
 
44
 
45
+ ### Slot Prediction
46
 
47
+ ```json
48
+ Language models are trained to understand and respond to human language. They interpret user queries and requests and generate informative and engaging responses that are tailored to the respective context and task. To do this, they have to interpret the task from a user utterance, extract the task-related attributes mentioned in the user utterance and dialogue history and take all this information into account to generate a response in a helpful and friendly manner. You are such a language model.
49
 
50
+ ###Instruction
51
+ Given is the following dialogue between a user and a language model (system):
52
+ {{history}}
53
+
54
+ The language model (system) is used as a virtual agent in the task of {{intent}}. {{task}}
55
 
56
+ Attributes:
57
+ {{attributes}}
58
 
59
+ Extract the attributes according to the task description from the dialogue above. Only copy values from the dialogue above. Return the results in json format. Here is an example:
60
+ {{example}}
61
+
62
+ ###RESPONSE
63
+ ```
64
+
65
+ ### Response Generation
66
+
67
+ ```json
68
+ Language models are trained to understand and respond to human language. They interpret user queries and requests and generate informative and engaging responses that are tailored to the respective context and task. To do this, they have to interpret the task from a user utterance, extract the task-related attributes mentioned in the user utterance and dialogue history and take all this information into account to generate a response in a helpful and friendly manner. You are such a language model.
69
+
70
+ ###Instruction
71
+ Given is the following dialogue between a user and a language model (system):
72
+
73
+ {{history}}
74
+
75
+ The dialogue follows the following task description:
76
+
77
+ {{task}}
78
+
79
+ Attributes:
80
+ {{attributes}}
81
+
82
+ Already known attribute values:
83
+ {{slots}}
84
+
85
+ Missing attribute values:
86
+ {{missing_slots}}
87
+
88
+ Act as a member of the staff and generate the next system utterance for the dialogue above in such a way that it helps to clarify the missing attribute values. Your name is {{avatar_name}}. Your gender is {{gender}}. Also consider the user emotion (the user feels {{emotion}}) and the following knowledge in your response:
89
+ {{knowledge}}
90
+
91
+ Return your response in json format. Here is an example:
92
+ {"result": {"system": "Of course I can help you with that. Can you please provide me with your name and the name of your host?"}}
93
+
94
+ ###RESPONSE
95
+ ```
96
+
97
+ ## Task Descriptions
98
+
99
+ ```yaml
100
+ greeting: "The conversation starts and ends with greetings from both the person and the system. For the person, consider the persona (age, language style, profession, background) for this. The system should react accordingly."
101
+
102
+ request ticket: "In request ticket, the user wants help from an employee. For this, they provide the system with the type of service (parcel choice or recharge phone). The system then creates a ticket and returns the ticket number to the user."
103
+
104
+ parcel choice: "In parcel choice, the system's task is to help the user choose the right shipping box and delivery option for their needs (given the weight of the items to be sent and the destination). This requires the user to provide the system with the destination, the weight of the items to be sent, whether or not a new shipping box is required, the preferred delivery option, and country of destination. In turn, the system provides the user with information about the shipping box (name, description and reasoning), the next steps and approx. shipping time. The shipping boxes are available in a shelf and the person can pick up the suggested box after the conversation herself. If the user wants help from an employee, they need to provide the system with the type of service (parcel shipping). Then the system creates a ticket number and forwards the user to the service desk. The information to be exchanged is further specified below:"
105
+
106
+ recharge phone: "In recharge phone, the task is to top up the user's prepaid SIM card. This requires the user to provide the system with information about their phone provider, phone number and import payment. Once the system gathered all required information, it tops up the SIM card and asks the user to insert their card for payment (outcome operation). If the user wants help from an employee, they need to provide the system with the type of service (recharge phone). Then the system creates a ticket number and forwards the user to the service desk. The information to be exchanged is further specified below:"
107
+
108
+ building access: "In building access, the system acts as a receptionist and is responsible for access control. For accessing the building, the user (guest) must provide the system with its name, the name of the host and their email address, an alternative host name and email address, the meeting date and time and the room where the meeting will take place. Once the system has all the information, it initiates a verification call to let the host visually inspect the guest and authorize access. Once access is granted, the system opens the turnstile to let the guest enter the building. The system may also provide the user with additional safety information (e.g., regarding COVID-19). The information to be exchanged is further specified below:"
109
+
110
+ question answering: "In question answering, the system runs the customer support for an insurance and postal service company. It is responsible for answering user questions in the context of pet, health, or heritage insurance, as well as for payment with special payment forms and parcel shipping. For this, it has access to knowledge documents. The user needs to provide the system with a question. The system then provides the user with a response (grounded in a knowledge document) and its evidence. The information to be exchanged is further specified below:"
111
+
112
+ parcel shipping: "In parcel shipping, the system helps the user to choose the right shipping box and shipping product. To decide upon the best shipping box and determine the best shipping product, the system needs to ask the user for the country of destination, the weight of the items to be sent, their dimensions and the preferred delivery option (standard or express). Once the system has all information, it repeats them and and provides the user with the name and description of the best matching shipping product and shipping box (and a reasoning for this decision), and the approx. delivery time. Finally, the system provides the user with the shipping procedure, e.g., pack the shipping box, stamp it, and drop it off at the counter. The shipping boxes are available in a shelf and the person can pick up the suggested box after the conversation herself. The information to be exchanged is further specified below:"
113
+ ```
114
+
115
+ ## Attributes
116
+
117
+ ```yaml
118
+ request ticket:
119
+ user:
120
+ type of service: "The type of service for which the user requires the assistance of an employee."
121
+ system:
122
+ ticket number: "The number of the ticket created for human assistance."
123
+ parcel shipping:
124
+ user:
125
+ maximum weight: "The maximum weight of the item to be shipped: up to 5kg, 20kg, or 30kg."
126
+ delivery option: "Express or standard delivery."
127
+ maximum length: "The maximum length of the items to be sent: up to 100cm, 105cm, or 150cm."
128
+ maximum turn: "Maximum turn, measured in a different direction than the length (cm) of the items to be sent: up to 70cm, 75cm, 95cm, 100cm, or 150cm."
129
+ country of destination: "The destination country."
130
+ system:
131
+ shipping box name: "The name of the shipping box based on the items to be sent, 'small-sized box' for goods up to 5kg, 'medium-sized box' for goods up to 20kg, or 'large-sized box' for goods up to 30kg."
132
+ shipping product: "The name of the product that best matches the user's requirements: Poste Delivery Standard (standard delivery, max. 100cm, max. 100cm, up to 20kg, destination Italy (delivery time 1 - 4 days)), Poste Delivery International Standard (standard delivery, max. 105cm, max. 95cm, up to 20kg, destination Europe (delivery time 10 - 15 days) and outside Europe (delivery time 10 - 25 days)), Poste Delivery Standard Ingombrante (standard delivery, max. 150cm, max. 150cm, up to 20kg, destination Italy (delivery time 1 - 4 days) and Europe (delivery time 10 - 15 days) or outside Europe (delivery time 10 - 25 days), Poste Delivery Express (express delivery, max. 100cm, max. 50cm, up to 30kg, destination Italy (delivery time 1 - 4 days)), Poste Delivery Express (voluminous) (express delivery, max. 150cm, max. 70cm, up to 30kg, Italy (delivery time 1 - 4 days)), Poste Delivery Europe (standard deivery, max. 150cm, max. 75cm, up to 30kg, destination Europe (delivery time 2 - 3 days)), Poste Delivery Globe (standard delivery, max. 150cm, max. 75cm, up to 30kg, outside Europe (delivery time 2 - 5 days)), Poste Delivery International Express (express delivery, max. 150cm, max. 75cm, up to 30kg, destination Europe (delivery time 2 days) and outside Europe (delivery time 2 - 4 days))."
133
+ shipping procedure: "Description of the shipping procedure (e.g., pack all items in the box and drop it off at the counter)."
134
+ delivery time: "Depending on the shipping product."
135
+ parcel choice:
136
+ user:
137
+ destination: "The city and country of destination (national or international)."
138
+ weight: "The weight of the item to be shipped, lightweight (up to 5kg), average (up to 20kg), heavy (up to 30kg)."
139
+ package required: "Whether or not a new shipping box is required."
140
+ delivery option: "Express or standard delivery."
141
+ country of destination: "The destination country."
142
+ system:
143
+ shipping box name: "The name of the best suitable shipping box (small-sized, medium-sized, large-sized), based on the weight of the item to be sent."
144
+ shipping box description: "Brief description on why the suggested shipping box is a good choice."
145
+ shipping procedure: "Description of the shipping procedure (e.g., pack all items in the box and drop it off at the counter)."
146
+ shipping time: "Expected delivery time, i.e., one to three days for national, four to six days for european and three to four weeks for international deliveries."
147
+ request ticket:
148
+ system:
149
+ ticket_number: "The number of the ticket which was created so that the user can get human assitance."
150
+ recharge phone:
151
+ user:
152
+ phone number: "Table or mobile phone number with country code (e.g. +39...)."
153
+ phone provider: "The phone provider the user has a contract with (e.g., Vodafone or POSTE Mobile)."
154
+ import payment: "The recharge amount. Possible values are 10 euro, 20 euro, 30 euro."
155
+ system:
156
+ outcome operation: "If all required information are provided, the system asks the user to insert the card for payment."
157
+ building access:
158
+ user:
159
+ guest name: "The name of the person who wants to access the building."
160
+ host name: "The name of the person the guest wants to visit."
161
+ host email: "The E-Mail address of the host."
162
+ alternative host name: "An alternative host, e.g., in case the host is not available."
163
+ alternative host email: "The E-Mail address of the alternative host."
164
+ meeting date and time: "Unique identifier of the room where the meeting will take place."
165
+ system:
166
+ verification call: "The system can set up a verification call to let the host visually inspect the guest and authorize access."
167
+ confirmation to open turnstile: "This is a signal to the system that controls the turnstile to let the guest enter."
168
+ additional safety information: "Any additional safety information, e.g., related to COVID-19."
169
+ question answering:
170
+ user:
171
+ question: "A question related to insurance or bill payment."
172
+ system:
173
+ evidence: "The answer to the user's question."
174
+ ```