Machines Learning Social Norms
A study on how machines can adapt to human social norms.
Table of Contents
- The Problem of Norm Learning
- Our Approach to Norm Learning
- Understanding Norms in Multiagent Systems
- How Machines Learn Norms
- Framework for Learning Norms
- Representing Social Norms
- Planning to Comply with Norms
- Learning Through Observation
- The Role of Social Norms in Cooperation
- Transmission of Norms Across Generations
- The Emergence of Norms
- Discussion
- Future Directions
- Original Source
- Reference Links
In human societies, people follow rules and norms to work together. This helps them live and cooperate effectively. The big question is: how can we create machines that learn to follow these rules and work with people? The idea is that if machines assume there are common rules that most people follow, even without knowing the specifics, they can learn what those rules are through observation.
When new machines enter a group, they can figure out the existing rules by watching how others act. Even if everyone starts off with different ideas about the rules, they can still find common ground. This is crucial because it keeps order and helps everyone in the group to act in a cooperative way. When machines learn these rules and norms, they can become better partners for humans.
The Problem of Norm Learning
To make machines behave well in society, we need to think about how they learn social norms. These norms are often unspoken rules that guide behavior. People learn these norms over time and through experience. This presents a challenge for machines, which need a systematic way to learn these rules without a central authority.
A solid model of norm learning should take into account that norms are shared by everyone but learned individually. This allows machines to quickly adapt to new situations without needing direct orders.
Our Approach to Norm Learning
Our approach uses a method called Bayesian learning, which lets machines infer social norms from the behavior they observe. We build on ideas about how humans work together and have each machine assume that shared rules exist whenever it interacts with others.
By observing when others break or follow rules, machines can gather data and infer what the rules might be. They can do this by looking at the actions of other machines and updating their beliefs about what is considered acceptable or unacceptable behavior.
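A minimal sketch of this kind of belief update, assuming a single candidate rule and two made-up violation rates (how often an agent breaks the rule when it is in force, and how often the same action occurs when no such rule exists). The function name and the numbers are illustrative assumptions, not the paper's implementation.

```python
def update_belief(prior, violated, p_violate_if_norm=0.05, p_violate_if_no_norm=0.5):
    """Return P(rule is in force) after one observed action, via Bayes' rule.

    Both violation rates are illustrative assumptions, not values from the paper.
    """
    if violated:
        like_norm, like_no_norm = p_violate_if_norm, p_violate_if_no_norm
    else:
        like_norm, like_no_norm = 1 - p_violate_if_norm, 1 - p_violate_if_no_norm
    evidence = like_norm * prior + like_no_norm * (1 - prior)
    return like_norm * prior / evidence


# Start undecided, then watch five actions taken in situations where the rule would apply.
belief = 0.5
for violated in [False, False, False, True, False]:
    belief = update_belief(belief, violated)
print(f"P(rule is in force) = {belief:.2f}")
```

Each compliant action nudges the belief up, while a single violation pulls it down sharply, because violations are assumed to be rare when the rule really holds.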
Once a machine understands the rules, it can choose to follow them either because it wants to or because it makes sense for its goals. When many machines start to follow the same rules, it creates a system where those rules are reinforced and more likely to stick around. This process means that new machines can quickly learn what the existing rules are.
Understanding Norms in Multiagent Systems
We introduce a new way to study how machines can learn norms by using multiagent systems. In these systems, many machines interact over time and have to make decisions based on their surroundings. We focus on how they can learn to follow rules.
Each machine acts based on its goals while also considering the norms it thinks others are following. This model helps machines learn how to work together, maintain their understanding of the rules, and converge around shared norms even if they start off with different beliefs.
How Machines Learn Norms
The learning process for machines is broken down into a series of steps. First, machines observe the actions of others. This helps them see what is considered a norm. Machines also need a way to represent these norms and decide if they comply based on what they learned.
Machines have two ways of planning their actions: focusing on rewards or focusing on obligations. In the reward-focused mode, machines aim to get the most reward for their actions while respecting any prohibitions that apply. In the obligation-focused mode, machines plan to fulfill responsibilities that may extend over time.
By switching between these planning modes, machines can effectively navigate situations in which they need to balance their personal goals with the rules they have learned.
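The switch between the two modes could be sketched roughly as follows. This assumes, purely for illustration, that an agent keeps a queue of pending obligation actions and otherwise picks the highest-reward action that is not prohibited; `AgentState`, `choose_mode`, and `select_action` are hypothetical names.

```python
from dataclasses import dataclass, field


@dataclass
class AgentState:
    # Hypothetical: actions the agent still owes under obligations it believes in.
    pending_obligations: list = field(default_factory=list)


def choose_mode(state):
    """Obligations take priority; otherwise the agent plans for reward."""
    return "obligation" if state.pending_obligations else "reward"


def select_action(state, action_rewards, prohibited):
    """Pick the next action under the current planning mode."""
    if choose_mode(state) == "obligation":
        return state.pending_obligations[0]
    # Reward mode: drop prohibited actions, then take the best remaining one.
    allowed = {a: r for a, r in action_rewards.items() if a not in prohibited}
    if not allowed:
        return None
    return max(allowed, key=allowed.get)


rewards = {"eat_apple": 1.0, "wait": 0.0}
print(select_action(AgentState(), rewards, prohibited={"eat_apple"}))
print(select_action(AgentState(["clean_river"]), rewards, prohibited=set()))
```

Giving obligations strict priority is a simplifying choice for the sketch; a real planner might weigh the cost of delaying an obligation against the reward on offer.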
Framework for Learning Norms
We introduce a framework called Norm-Augmented Markov Games (NMGs) to formalize how machines can learn norms in a systematic way. In this framework, machines take actions in specific environments where their actions can change the state of that environment over time.
In NMGs, each machine has its own goals but must also consider the norms that govern their interactions. This means each machine uses its beliefs about the norms to make decisions. By paying attention to how often they violate these norms, machines can adjust their behavior to align better with the expectations of others.
The NMG framework allows us to model how norms can guide machines' learning and decision-making processes over time. Machines can use this framework to understand how their actions are influenced by shared norms and individual desires.
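To make the moving parts concrete, here is a toy data structure for a game with norms attached. It is a sketch under simplifying assumptions (a dictionary state, string actions, norms as predicates that flag a violation) and does not reproduce the paper's formal definition; all class and function names are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Agent:
    name: str
    norm_beliefs: Dict[str, float] = field(default_factory=dict)  # P(norm is in force)
    violations: int = 0


@dataclass
class NormAugmentedGame:
    state: dict
    agents: List[Agent]
    norms: Dict[str, Callable[[dict, str], bool]]  # True if the action violates the norm
    transition: Callable[[dict, Dict[str, str]], dict]

    def step(self, joint_action):
        """Record violations against the current state, then advance the environment."""
        for agent in self.agents:
            action = joint_action[agent.name]
            for violates in self.norms.values():
                if violates(self.state, action):
                    agent.violations += 1
        self.state = self.transition(self.state, joint_action)
        return self.state


def take_apples(state, joint_action):
    """Toy transition: every 'take' action removes one apple."""
    taken = sum(1 for a in joint_action.values() if a == "take")
    return {"apples": max(0, state["apples"] - taken)}


game = NormAugmentedGame(
    state={"apples": 2},
    agents=[Agent("a"), Agent("b")],
    # Hypothetical norm: taking is a violation when fewer than 3 apples remain.
    norms={"no_scarce_taking": lambda s, act: act == "take" and s["apples"] < 3},
    transition=take_apples,
)
game.step({"a": "take", "b": "wait"})
print(game.state, [(ag.name, ag.violations) for ag in game.agents])
```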
Representing Social Norms
To ensure machines can learn norms effectively, we define how these norms should be represented. Norms can be thought of as rules that either prohibit certain actions or require certain behaviors. For example, a prohibition might say, "Don't take apples if there are too few to share," while an obligation might say, "If you see a dirty river, you are required to clean it."
These norms can be expressed in a structured way, allowing machines to understand and implement them. By representing norms in a logical format, machines can learn when to apply them in various situations.
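One possible structured encoding of the two example rules is sketched below. The split into a `Prohibition` and an `Obligation` class follows the two kinds of norm described above, but the field names, the state keys, and the threshold of 3 apples are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Prohibition:
    condition: Callable[[dict], bool]  # when the rule applies
    action: str                        # the action that is forbidden

    def violated_by(self, state, action):
        return self.condition(state) and action == self.action


@dataclass(frozen=True)
class Obligation:
    trigger: Callable[[dict], bool]    # when the duty arises
    required_action: str               # what the agent must then do

    def is_active(self, state):
        return self.trigger(state)


# "Don't take apples if there are too few to share" (threshold is hypothetical).
no_scarce_apples = Prohibition(lambda s: s["apples"] < 3, "take_apple")
# "If you see a dirty river, you are required to clean it."
clean_river = Obligation(lambda s: s["river_dirty"], "clean_river")

state = {"apples": 2, "river_dirty": True}
print(no_scarce_apples.violated_by(state, "take_apple"))
print(clean_river.is_active(state))
```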
Planning to Comply with Norms
Once machines understand the norms, they need a way to plan their actions accordingly. We describe how machines can plan their actions based on the norms they have learned. This involves deciding when to maximize rewards and when to fulfill obligations, ensuring they stay in line with the rules.
Alternating between reward-oriented and obligation-oriented planning allows machines to act effectively while adhering to the social norms they have learned.
Learning Through Observation
One critical method for machines to learn norms is through observing the behavior of others. By passively watching the actions of experienced machines, they can start to infer what is considered appropriate behavior. This process is called passive norm learning.
We conducted experiments to study how quickly and effectively machines can learn norms simply by observing others. The results showed that new machines pick up norms efficiently from the behavior of experienced machines, catching on within a short time.
The Role of Social Norms in Cooperation
Norms play a vital role in creating a cooperative environment. Our experiments demonstrated that having social norms in place allows machines to work together successfully. By following the shared norms, machines can achieve better overall results and enhance collective welfare.
However, not all norms necessarily lead to positive outcomes. Some norms might hinder cooperation, while others significantly benefit it. This requires careful consideration of which norms are beneficial and which may need adjustment.
Transmission of Norms Across Generations
As machines interact over time, it is essential that they maintain a stable set of norms, especially when new machines join the group. This concept is known as intergenerational transmission of norms.
Our studies indicated that when machines learn from their predecessors, norms can be passed along successfully. A sufficient number of experienced machines is needed to help new machines understand these norms. If there are not enough experienced machines to guide the newcomers, norms may drift or be lost altogether.
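A back-of-the-envelope calculation can show why the share of experienced machines matters. The sketch below assumes a toy model that is not taken from the paper: experienced machines comply 95% of the time, the rest comply only by chance (50%), and the newcomer updates as if the whole population either follows the rule or does not. The function computes the average change in the newcomer's log-odds per observation.

```python
import math


def expected_log_odds_drift(frac_experienced, comply_if_norm=0.95, comply_if_no_norm=0.5):
    """Average per-observation change in log-odds that the rule is in force.

    All rates are illustrative assumptions, not values from the paper.
    """
    # Chance that a randomly observed agent complies in this mixed population.
    p_comply = frac_experienced * comply_if_norm + (1 - frac_experienced) * comply_if_no_norm
    gain_on_comply = math.log(comply_if_norm / comply_if_no_norm)
    loss_on_violate = math.log((1 - comply_if_norm) / (1 - comply_if_no_norm))
    return p_comply * gain_on_comply + (1 - p_comply) * loss_on_violate


for frac in (0.3, 0.6, 0.9):
    print(frac, round(expected_log_odds_drift(frac), 3))
```

A positive value means the newcomer's belief in the rule tends to grow, and a negative value means it tends to fade. Under these made-up rates the drift is negative at 0.3 and positive at 0.9, which mirrors the qualitative point that too few experienced machines can let a norm slip away.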
The Emergence of Norms
The process by which norms emerge is complex yet vital for cooperative behavior among machines. Unlike previous approaches that relied on punishment for compliance, our model focuses on how machines can adopt shared norms without explicit communication.
Through repeated interactions and observational learning, machines can begin to converge around shared beliefs about what norms exist. This shared understanding can foster better cooperation and increase the stability of the normative system.
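As a small illustration of convergence from different starting points, the sketch below gives a skeptic and a believer the same stream of compliant observations and tracks how far apart their beliefs remain. It assumes the same kind of toy likelihoods as a simple Bayesian update (compliance is 95% likely if the rule holds and 50% likely otherwise); the function name and numbers are hypothetical.

```python
def posterior_after(prior, n_compliant, comply_if_norm=0.95, comply_if_no_norm=0.5):
    """P(rule is in force) after n compliant observations, computed in odds form."""
    likelihood_ratio = comply_if_norm / comply_if_no_norm
    odds = prior / (1 - prior) * likelihood_ratio ** n_compliant
    return odds / (1 + odds)


skeptic, believer = 0.1, 0.9
for n in (0, 5, 10):
    gap = posterior_after(believer, n) - posterior_after(skeptic, n)
    print(f"after {n} observations the beliefs differ by {gap:.3f}")
```

The two agents never exchange messages here; they only see the same behavior, and the shared evidence gradually outweighs their different priors.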
Discussion
Building machines that learn and comply with social norms presents various challenges and opportunities. Our work shows how machines can quickly learn norms through observation and maintain them through shared understanding. The framework we created not only explains significant aspects of human behavior but also provides a way for autonomous systems to function better in society.
Future Directions
There are still many questions to explore within norm learning and compliance. Some future areas of investigation include the role of punishments in enforcing norms and how machines can adapt norms when circumstances change.
Combining different learning approaches, such as model-based and model-free methods, may provide new insights into how machines can learn and apply norms effectively.
Moreover, understanding how to represent norms in ways that encourage flexibility and adaptation can lead to more capable and normative agents. This could include allowing machines to reason about norms and when to apply or adapt them, similar to how humans do.
In conclusion, our research emphasizes the importance of creating machines that not only adhere to norms but also understand them as part of a social fabric. This could ultimately lead to more responsible, cooperative, and effective autonomous systems.
Original Source
Title: Learning and Sustaining Shared Normative Systems via Bayesian Rule Induction in Markov Games
Abstract: A universal feature of human societies is the adoption of systems of rules and norms in the service of cooperative ends. How can we build learning agents that do the same, so that they may flexibly cooperate with the human institutions they are embedded in? We hypothesize that agents can achieve this by assuming there exists a shared set of norms that most others comply with while pursuing their individual desires, even if they do not know the exact content of those norms. By assuming shared norms, a newly introduced agent can infer the norms of an existing population from observations of compliance and violation. Furthermore, groups of agents can converge to a shared set of norms, even if they initially diverge in their beliefs about what the norms are. This in turn enables the stability of the normative system: since agents can bootstrap common knowledge of the norms, this leads the norms to be widely adhered to, enabling new entrants to rapidly learn those norms. We formalize this framework in the context of Markov games and demonstrate its operation in a multi-agent environment via approximately Bayesian rule induction of obligative and prohibitive norms. Using our approach, agents are able to rapidly learn and sustain a variety of cooperative institutions, including resource management norms and compensation for pro-social labor, promoting collective welfare while still allowing agents to act in their own interests.
Authors: Ninell Oldenburg, Tan Zhi-Xuan
Last Update: 2024-02-22 00:00:00
Language: English
Source URL: https://arxiv.org/abs/2402.13399
Source PDF: https://arxiv.org/pdf/2402.13399
Licence: https://creativecommons.org/licenses/by/4.0/
Changes: This summary was created with assistance from AI and may have inaccuracies. For accurate information, please refer to the original source documents linked here.
Thank you to arXiv for use of its open access interoperability.