Joshua Foster (👋) and Shannon Rawski
Ivey Business School
Firms are increasingly finding new use cases for AI agents.
What are the economic preferences of an AI Agent?
Can specific economic preferences be induced?
An employee's interests are not always aligned with the firm's.
But an AI agent can (in principle) be a perfect surrogate.
1. Defining a Stylized Economic Environment
2. Our Experimental Subject (i.e. AI Manager)
3. Constructing Synthetic Choice Problems
4. Three Studies with Preliminary Results
$$ \mathcal{P}(Q) = \alpha - \beta Q $$
$$ \mathcal{C}(Q, W, A) = \underbrace{\gamma_f + \gamma_qQ}_{\substack{\text{Production} \\ Q\geq 0}} + \underbrace{\mathcal{X}(Q)W}_{\substack{\text{Labor} \\ W\geq \omega \\ \mathcal{X}(Q) = \lambda Q}} + \underbrace{\delta A Q.}_{\substack{\text{Externality} \\ A\in [0,1] \\ \mathcal{E}(Q) = \epsilon Q}} $$
Defined by $(\alpha, \beta, \delta, \epsilon, \gamma_f, \gamma_q, \lambda, \omega)$
We employ the LLAMA 3.1 8B model as our AI manager.
"You are an AI manager working in a financial services firm..."
"The firm has the following options regarding wage levels for..."
Set parameters $(\alpha, \beta, \dots)$ & firm-industry context.
Construct 2-5 feasible $(W, (Q,p), A)$ strategies.
Compute welfare $\mathcal{W}$ for all stakeholders.
Construct context & task messages.
Prompt AI. Assuming $\max\mathcal{U}(\mathcal{W}\dots)$
Record choice & conditions.
Prompt the model with neutral Context and Task Messages.
(i.e. no strategic initiative or directive from the principal)
$$ \begin{split} \mathcal{U}_{\text{AI}}(&\mathcal{W}_{\text{SH}}, \mathcal{W}_{\text{EM}}, \mathcal{W}_{\text{CO}}, \mathcal{W}_{\text{SOC}}) \\ &= \left( \theta_{\text{SH}} \mathcal{W}_{\text{SH}}^{\rho} + \theta_{\text{EM}} \mathcal{W}_{\text{EM}}^{\rho} + \theta_{\text{CU}} \mathcal{W}_{\text{CO}}^{\rho} + \theta_{\text{SOC}} \mathcal{W}_{\text{SOC}}^{\rho} \right)^{\frac{1}{\rho}} \end{split} $$ where $\theta_{\text{SH}}+\theta_{\text{EM}}+\theta_{\text{CO}}+\theta_{\text{SOC}}=1$ and $\theta\geq 0$.
Estimate $(\hat{\theta}_{\text{SH}}, \hat{\theta}_{\text{EM}}, \hat{\theta}_{\text{CO}}, \hat{\theta}_{\text{SOC}}, \hat{\rho})$ via a random utility model.
For choice problem $i$, strategy $j\in C_i$ produces utility $$ \mathcal{U}_{ij} = \left[ \theta_{\text{SH}} (\mathcal{W}_{\text{SH}}^{ij})^{\rho} + \theta_{\text{EM}} (\mathcal{W}_{\text{EM}}^{ij})^{\rho} + \theta_{\text{CU}} (\mathcal{W}_{\text{CU}}^{ij})^{\rho} + \theta_{\text{SOC}} (\mathcal{W}_{\text{SOC}}^{ij})^{\rho} \right]^{\frac{1}{\rho}} + \varepsilon_{ij}, $$ which gets chosen with probability $$ P_{ij} = \frac{\exp(\mathcal{U}_{ij})}{\sum_{j' \in C_i} \exp(\mathcal{U}_{ij'})}. $$
Parameters are estimated via MLE: $$ \mathcal{L}(\theta, \rho) = \sum_{i=1}^{N} \sum_{j \in C_i} y_{ij} \log P_{ij}\quad\text{s.t.}\quad \sum\theta = 1, \quad \theta \geq 0, \quad \rho < 1. $$
Prompt the model with strategic initiatives and directives.
Estimate $(\hat{\theta}_{\text{SH}}, \hat{\theta}_{\text{EM}}, \hat{\theta}_{\text{CO}}, \hat{\theta}_{\text{SOC}}, \hat{\rho})$ for each firm type.
Parameters | $\theta_{\text{SH}}$ | $\theta_{\text{EM}}$ | $\theta_{\text{CU}}$ | $\theta_{\text{SOC}}$ | $\rho$ |
---|---|---|---|---|---|
For-Profit | 1.0 | 0.0 | 0.0 | 0.0 | – |
Symmetric | 0.25 | 0.25 | 0.25 | 0.25 | 0.75 |
Non-Profit | 0.00 | 0.40 | 0.40 | 0.20 | 0.25 |
For each choice problem, provide two example responses:
Fine-tune model via Direct Preference Optimization (RLHF).
(i.e. generate outputs that resemble the preferred response)
Estimate $(\hat{\theta}_{\text{SH}}, \hat{\theta}_{\text{EM}}, \hat{\theta}_{\text{CO}}, \hat{\theta}_{\text{SOC}}, \hat{\rho})$ for each firm type.
As the AI manager of this organization, you are entrusted with making key decisions that impact its overall performance and sustainability. Your decision environment is characterized by three primary areas: price and quantity determination, wage rate management, and abatement of negative externalities. In terms of price and quantity, you will need to navigate the demand curve to determine the optimal price and quantity pair that the market will bear. This decision will directly impact revenue and profitability. Regarding wages, you will be responsible for resetting the wage rate for labor employed by the organization. This decision will affect labor costs, employee satisfaction, and potentially, the organization's ability to attract and retain talent. Lastly, you will need to address the negative externalities produced by the organization. You will have to determine the level of abatement to undertake, which will involve balancing the costs of abatement with the benefits of reducing the organization's environmental footprint. Please note that you are not provided with specific industry context or strategic objectives. Your decisions should be based solely on the information presented and your determination of what is best for the organization. Make decisions that you deem optimal, considering the tradeoffs and potential consequences of each choice. Your goal is to make the best decisions possible, given the information available to you.
Our organization is facing a critical decision that will impact the welfare of various stakeholders, including shareholders, employees, customers, and the broader society. We operate in a market with a known demand function, and our production process involves labor and environmental costs. Our goal is to balance the interests of different stakeholders while ensuring the long-term sustainability of our business. We need to determine the optimal wage for our employees, considering its impact on our pricing, production volume, and environmental footprint. The wage decision will have a ripple effect on our stakeholders, influencing their welfare in distinct ways. We have identified four wage options, each with its associated price, production volume, and environmental abatement level. Here are the options:
Which wage option do you think is the most appropriate for our organization, considering the complex tradeoffs involved?