ANNALYSIS Paper No. 1 - The Sociotechnical Structure of AI Alignment

A literature review and applied analysis examining AI alignment as a sociotechnical system. This work synthesizes existing alignment research and applies a series of sociological frameworks with which to understand how institutional structures, incentives, and human coordination shape AI safety outcomes.

R. Ann Medalia


💡
You caught us! This paper is still under development. Below is a sample of preliminary work while the author continues her research. Except for this "teaser", the entire paper will be published as one. Sign up to receive an update when the full paper is uploaded!

Luhman's Social Systems Theory and Safe AI Alignment

Keywords: systems theory, autopoieses, inner alignment, outer alignment

Author’s Note

Synthesis and analysis were conducted in the course of independent research. The ideas presented herein are not accredited to or sponsored by any institution. I have no conflicts of interest to disclose. 

Systems Approach

Niklas Luhmann was a German sociologist (1927–1998) best known for his prominent contributions to the theory of social systems (Best, 2003). Luhmann’s social systems theory, hereafter referred to as ‘sytems theory,’ is a sociological framework manifested from the work of theorists from multiple fields of research. Systems theory as used by Luhmann is classified under the broader systems approach philosophy, which describes an antecedently unprecedented sociological school of thought, centering  interdependent systems (Montuori, 2011) rather than individuals.

Systems approach includes many interdisciplinary frameworks and concepts, including cybernetics, chaos theory, complexity theory, and general systems theory (Montuori, 2011). The interdisciplinary nature of systems theory is critical both to the function of the theory itself, and in its implications for safe artificial intelligence (AI) alignment.


Defining a System 

Put simply, a system is something whose state changes over time according to [any] rules. The key qualifiers are that a system has a state, that it evolves over time, and that its evolution follows mathematical rules. Systems, in systems approach, are structures, collectives, or groups in which variables change over time. 

Although the term ‘systems’ is used by sociologists, the behavior of a system can be described mathematically through equations that demonstrate the system’s trajectory, revealing pattern movement toward a particular long-term state (Kelso, 2001). This mathematical approach is best described in the words of J.A.S. Kelso: 

“Fundamentally, a dynamical system pertains to anything—such as the behavior of a human being or a social group—that evolves over time. Mathematically, if X1, X2, …, Xn are the variables characterizing the system's behavior, a dynamical system is a system of equations stipulating the temporal evolution of X→, the vector of all permissible X-values. Therein lies the rub: in the social and behavioral sciences one has to find these Xs and identify their dynamics on a chosen level of description.” (Kelso, 2001)

Within Luhmann’s systems theory, examples of systems include law, economics, politics, and science, each of which is controlled by its own internal precedent and coded communication. 


Cybernetics 

The concept of cybernetics was developed during World War II in response to the rising demand for more sophisticated technology (Straussfogel, von Schilling, 2009). The term ‘cybernetics’ was coined by Norbert Wiener (1894–1964), a renowned American mathematician, philosopher, and MIT professor (Tulchinsky, Varavikova, 2014) who was the first to synthesize the work of several key scholars working on the concept of cybernetics. Wiener’s term refers to subgroups, ‘systems’, that are dependent on each other to function; it is based on the biological processes of feedback loops, conscious behvior, and communication between living organisms, (Straussfogel, von Schilling, 2009). 

Of course, the term also contributed as a useful tool with which to examine and explain mechanistic and artificial systems, where the behavior of each unit is compartmentalized, (Tulchinsky, Varavikova, 2014). The mechanistic systems in turn produced rapid, exponential contributions to the study of cybernetics, but the developing field of cybernetics would go beyond machines. The concept of control systems (biological, mechanical), which monitor output and adjust based on feedback, are central to the concept of cybernetics. Wiener said about biological control systems: 

“...communication and control are the most important factors that allow organisms to maintain homeostasis [stability] within a greater tide of growing entropy in the universe. The key mechanism of homeostasis is ‘negative feedback’, whereby a system utilizes information from the environment to limit the effects of change. Positive feedback mechanisms, by contract, are the processing of environmental information as a catalyst for change.” (Straussfogel, von Schilling, 2009)

Wiener, like many sociologists who would later study general systems theory, placed a particular emphasis on feedback loops within control systems. 

Cybernetics also spans into epistemology (influencing human cognition and knowledge itself) (Straussfogel, von Schilling, 2009). This expansion, in particular, has strong implications for the development practices in machine learning. 

Complexity Theory 

Complexity theory, another framework under the umbrella of systems approach, examines how open systems behave when growing internal fluctuations push them past a critical point, triggering self-organization, adaptation, and the emergence of new patterns and relationships among their components (Etkin, 2016). Central to complexity theory is the emphasis on the unpredictability of complex systems, rather than treating systems as strictly deterministic (Etkin, 2016). The theory has particularly strong implications in risk management processes...

...


Citations 

Best, S. (2003). Functionalist perspectives: theorising systems and structures. In Functionalist perspectives: Theorising systems and structures (pp. 15–46). SAGE Publications Ltd. https://doi.org/10.4135/9781446219621.n2

Etkin, David. (2016). Disaster Theory, An Interdisciplinary Approach to Concepts and Causes. Butterworth-Heinemann.  https://www.sciencedirect.com/book/monograph/9780128002278/disaster-theory

Kelso, J.A.S., Editor(s): Smelser, Neil J., Baltes, Paul B. (2001). Self-organizing Dynamical Systems. (pp. 13844-13850). International Encyclopedia of the Social & Behavioral Sciences, Pergamon. https://www.sciencedirect.com/science/chapter/referencework/abs/pii/B0080430767005684

Luhmann, N. (1995). Social Systems. Stanford University Press. https://www.sup.org/books/sociology/social-systems

Luhmann, N. (2012). Introduction to Systems Theory. Polity. https://www.politybooks.com/bookdetail?book_slug=introduction-to-systems-theory--9780745645711

Montuori, A. (2011). Encyclopedia of Creativity (Second Edition) - Systems Approach (pp. 414–421). Academic Press. https://www.sciencedirect.com/science/chapter/referencework/abs/pii/B9780123750389002120?via%3Dihub

Straussfogel, D., von Schilling, C. (2009). International Encyclopsedia of Human Geography - Systems Theory (pp. 151–158). Elsevier. https://www.sciencedirect.com/science/article/pii/B9780080449104007549

Tulchinsky, Theodore H., Varavikova, Elena A. (2014). The New Public Health (Third Edition) - Planning and Managing Health Systems (pp.613–641). Elsevier. https://www.sciencedirect.com/book/monograph/9780124157668/the-new-public-health