Thursday, Ist December 2022 - 11am Atoosa Kasirzadeh : Seminar

 

Title:   In Conversation with AI: Aligning Language Models with Human Values

 

Abstract:

Large-scale language technologies are increasingly used in various forms of communication with humans across different contexts. One particular use case of these technologies, conversational agents, output natural language text in response to prompts and queries. This mode of engagement raises a number of social and ethical questions including: what does it mean to align conversational agents with human values? Which values should they be aligned with? And how can this be done? In this paper, we propose a number of steps that help answer these questions. We start by developing a philosophical analysis of the building blocks of linguistic communication between conversational agents and human interlocutors. We then use this analysis to identify and formulate ideal norms of conversation that can serve as mechanisms governing linguistic communication between humans and conversational agents. Furthermore, we explore how these norms can be used to align conversational agents with human values across a range of different discursive domains. We conclude by examining some practical implications of our proposal for future research into the creation of aligned conversational agents. 

 

Add to your calendar

 vCal  iCal