Introducing the SURE Dataset for Shopping Dialogues
A dataset designed to improve interactions between customers and salespeople in stores.
― 6 min read
Table of Contents
- SURE Dataset Overview
- Understanding Subjective Preferences
- Challenges in Multimodal Shopping Dialogs
- The SURE Dataset Structure
- Key Components of SURE
- Evaluating Multimodal Recommendation Agents
- Importance of Data Annotation
- Analyzing the SURE Dataset
- Future Directions
- Conclusion
- Original Source
- Reference Links
Shopping in physical stores involves various interactions between customers and salespeople. Customers often have specific preferences but may not express them clearly. This creates a need for systems that can assist in understanding these subjective preferences during shopping interactions. To address this challenge, a new dataset called SURE was created. It contains Dialogues that show how salespeople can recommend items based on customers’ preferences in a shopping setting.
SURE Dataset Overview
The SURE dataset focuses on dialogues that occur in complex store environments, like clothing and furniture stores. It is designed to capture the various ways customers express their desires and how salespeople can respond appropriately. The dataset comprises 12 shopping dialogues with annotations that highlight subjective preferences along with recommended actions taken by salespeople.
Dataset Creation
Creating the SURE dataset involved a systematic process divided into phases. Initially, simulated dialogues were generated using specially designed customer and salesperson models. This allowed for the creation of realistic interactions. The next phase involved human annotators who worked to ensure the quality and diversity of the dialogues by rewriting them to reflect various subjective preferences expressed by customers.
Understanding Subjective Preferences
Subjective preferences refer to the personal feelings and tastes that customers have when shopping. Unlike clear descriptions, like “the red dress on the rack,” customers often use phrases that reflect their feelings, such as “I want something cheerful” or “I like calming colors.” These expressions can refer to multiple item attributes, which creates a challenge for salespeople.
Importance of Communication
For salespeople, understanding these subjective preferences is key to making suitable recommendations. They need to engage with customers, ask the right questions, and clarify what those preferences mean. This requires a level of understanding that goes beyond simply matching a description to an item on a rack.
Challenges in Multimodal Shopping Dialogs
Many existing datasets do not capture the complexity of real-life shopping dialogues. Previous datasets focused on straightforward customer queries and sales responses. However, they often failed to represent the rich and diverse ways customers express their preferences, which can lead to difficulty in making accurate recommendations.
Role of Salespersons
Salespeople play a critical role in narrowing down choices for customers. They use different strategies to obtain more information about what customers really want. This might include asking about colors, styles, or specific details related to the items. Successfully navigating these conversations is essential for a positive shopping experience.
The SURE Dataset Structure
The SURE dataset consists of 12,000 dialogues collected across various shopping scenarios. The dialogues are structured to provide insight into how subjective preferences are expressed and handled. They feature two key roles: the customer and the salesperson.
Dialog Flow Simulation
The dialog flows are crafted to simulate real shopping interactions. Initially, sales agents were trained based on surveys filled out by experienced salespeople regarding how they interact with customers. These simulations then yielded dialogues that were further refined by human annotators to ensure natural language use and clarity.
Key Components of SURE
Subjective Preferences
One of the standout features of the SURE dataset is its focus on subjective preferences. Here, salespersons must learn how to decode the customer’s needs based on their subjective expressions. This requires knowledge of how these preferences can map to standard categories of items.
Salesperson Actions
The dataset includes a variety of actions that salespeople can take during the conversation. These include asking for preferences, providing recommendations, and iterating on customer feedback to refine suggestions. Each action is designed to clarify the customer's needs and aid in the decision-making process.
Evaluating Multimodal Recommendation Agents
The dataset introduces three Evaluation Tasks aimed at testing the capabilities of recommendation agents in real-world shopping scenarios.
Task 1: Subjective Preference Disambiguation
The first task focuses on interpreting subjective preferences expressed by the customers and connecting them to specific product attributes. This is crucial for ensuring the salespeople can recommend the most suitable items based on what the customer actually desires.
Task 2: Referred Region Understanding
The second task involves understanding specific regions referred to by customers, such as “over there on the left shelf.” This requires the agent to locate items in the context of the physical store layout while filtering out irrelevant options.
Task 3: Multimodal Recommendation
The final task assesses how well an agent can make recommendations based on the input from dialogues and visual scene context. Effective recommendations depend on analyzing conversations and leveraging the relevant visual information available in the store.
Importance of Data Annotation
Data annotation in the SURE dataset is a crucial step that ensures rich and diverse representations of customer preferences. Experienced sales professionals contributed to the categorization of subjective preferences and the development of dialogue acts.
Feedback Loop
The interactions captured in the dataset create feedback loops, allowing salespeople to adjust their strategies based on customer responses. This helps in refining the recommendations and enhances the overall shopping experience.
Analyzing the SURE Dataset
Subjective Preferences in Detail
The dataset contains a wide range of subjective preferences that reflect the varied ways customers express their needs. On average, each dialogue includes several subjective preferences, showcasing the richness of customer language in shopping contexts.
Dialogue Length and Structure
Investigation into the average length of dialogues suggests that they resemble real-world conversations, where multiple rounds of interaction often occur before a recommendation is made. This feature adds realism and depth to the dataset.
Future Directions
The SURE dataset represents a significant step forward in the understanding of multimodal shopping dialogues. However, there are still areas for improvement and expansion.
Expansion to Multilingual Contexts
One proposed future direction is to expand the dataset to include different languages. This would widen its applicability and create opportunities for cross-cultural research in shopping dialogues.
Additional Scenarios
Future versions of the dataset could include more diverse scenarios, such as those found in different retail sectors beyond fashion and furniture. This would help to broaden the range of dialogues and preferences represented.
Conclusion
The SURE dataset is an important resource for those studying multimodal dialogues in shopping contexts. By focusing on how subjective preferences influence interactions, it provides valuable insights into the dynamic between customers and salespeople. As technology continues to evolve, datasets like SURE will be essential in training effective conversational agents that enhance the shopping experience and provide relevant recommendations based on individual customer needs.
Title: Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark
Abstract: Existing multimodal task-oriented dialog data fails to demonstrate the diverse expressions of user subjective preferences and recommendation acts in the real-life shopping scenario. This paper introduces a new dataset SURE (Multimodal Recommendation Dialog with SUbjective PREference), which contains 12K shopping dialogs in complex store scenes. The data is built in two phases with human annotations to ensure quality and diversity. SURE is well-annotated with subjective preferences and recommendation acts proposed by sales experts. A comprehensive analysis is given to reveal the distinguishing features of SURE. Three benchmark tasks are then proposed on the data to evaluate the capability of multimodal recommendation agents. Based on the SURE, we propose a baseline model, powered by a state-of-the-art multimodal model, for these tasks.
Authors: Yuxing Long, Binyuan Hui, Caixia Yuan1, Fei Huang, Yongbin Li, Xiaojie Wang
Last Update: 2023-05-26 00:00:00
Language: English
Source URL: https://arxiv.org/abs/2305.18212
Source PDF: https://arxiv.org/pdf/2305.18212
Licence: https://creativecommons.org/licenses/by-nc-sa/4.0/
Changes: This summary was created with assistance from AI and may have inaccuracies. For accurate information, please refer to the original source documents linked here.
Thank you to arxiv for use of its open access interoperability.