Which values do we align AI to?
How can we ensure that artificial intelligence follows the values of people from across the world and that the benefits of the technology are shared internationally?
How should AI aggregate these preferences? How can it work around known impossibility results in social choice theory (Arrow 1950)?
Would developing AI for public administrations be an area where it is possible to explore which values to align AI to? Potential areas of application could be [regulation]{https://doi.org/10.1016/S1573-448X(06)03027-5} and determining [taxation]{https://maxkasy.github.io/home/files/papers/adaptive_social_welfare.pdf}.
Some suggested steps:
Step 1: Have a look at existing [proposals]{https://www.brookings.edu/research/aligned-with-whom-direct-and-social-goals-for-ai-systems/}.
Step 2: Characterise stakeholders and how they might be affected by AI
Step 3: How can values be measured, and used as reward?
Step 4: How can values be aggregated? This includes the question how it can be insured that individuals in countries other than the one developing AI benefit as well.