The conceptual structure of human relationships across modern and historical cultures
TL;DR Summary
Using surveys, cognitive tasks, and NLP across modern and ancient cultures, the study identifies five key dimensions and three core categories forming a universal conceptual space of human relationships, advancing understanding of social cognition and cultural variability.
Abstract
Nature Human Behaviour | Volume 9 | June 2025 | 1162–1175 1162 nature human behaviour Article https://doi.org/10.1038/s41562-025-02122-8 The conceptual structure of human relationships across modern and historical cultures Xi Cheng 1,9 , Haroon Popal 1,2,9 , Huanqing Wang 1,9 , Renfen Hu 3 , Yinyin Zang 4 , Mingzhe Zhang 1 , Mark A. Thornton 5 , Yina Ma 1 , Huajian Cai 6 , Yanchao Bi 1 , Jamie Reilly 7 , Ingrid R. Olson 8 & Yin Wang 1 A defining characteristic of social complexity in Homo sapiens is the diversity of our relationships. We build connections of various types in our families, workplaces, neighbourhoods and online communities. How do we make sense of such complex systems of human relationships? The basic organization of relationships has long been studied in the social sciences, but no consensus has been reached. Here, by using online surveys, laboratory cognitive tasks and natural language processing in diverse modern cultures across the world ( n = 20,427) and ancient cultures spanning 3,000 years of history, we examined universality and cultural variability in the ways that people conceptualize relationships. We discover
Mind Map
In-depth Reading
English Analysis
1. Bibliographic Information
1.1. Title
The conceptual structure of human relationships across modern and historical cultures
1.2. Authors
Xi Cheng, Haroon Popal, Huanqing Wang, Renfen Hu, Yinyin Zang, Mingzhe Zhang, Mark A. Thornton, Yina Ma, Huajian Cai, Yanchao Bi, Jamie Reilly, Ingrid R. Olson & Yin Wang
1.3. Journal/Conference
Nature Human Behaviour. This is a highly reputable peer-reviewed scientific journal publishing research in psychology, economics, sociology, and neuroscience. Its high impact factor and selective publication process indicate the significant influence and quality of the research published within this venue, making it a leading platform for interdisciplinary studies on human behavior.
1.4. Publication Year
2025
1.5. Abstract
A defining characteristic of social complexity in Homo sapiens is the diversity of our relationships. We build connections of various types in our families, workplaces, neighbourhoods and online communities. How do we make sense of such complex systems of human relationships? The basic organization of relationships has long been studied in the social sciences, but no consensus has been reached. Here, by using online surveys, laboratory cognitive tasks and natural language processing in diverse modern cultures across the world (n = 20,427) and ancient cultures spanning 3,000 years of history, we examined universality and cultural variability in the ways that people conceptualize relationships. We discovered a universal representational space for relationship concepts, comprising five principal dimensions (formality, activeness, valence, exchange and equality) and three core categories (hostile, public and private relationships). Our work reveals the fundamental cognitive constructs and cultural principles of human relationship knowledge and advances our understanding of human sociality.
1.6. Original Source Link
/files/papers/68ffa09ebd968e29d463b620/paper.pdf (This appears to be a direct PDF link, indicating it's an officially published paper or at least the final accepted manuscript).
2. Executive Summary
2.1. Background & Motivation
The core problem the paper aims to solve is the lack of a comprehensive, unified understanding of how humans mentally conceptualize and organize their diverse relationships. Despite extensive study across various social science disciplines (sociology, anthropology, cognitive psychology, communication studies) over the past 50 years, no consensus has been reached on the fundamental structure and organization of human relationships.
This problem is critically important because relationships are a defining characteristic of social complexity in Homo sapiens. Humans form myriad connections in different contexts (family, work, online), and the quality and quantity of these relationships are integral to individual survival, thriving, cognition, behavior, development, and well-being. Understanding this conceptual structure—the common-sense way ordinary people make sense of their social worlds—is crucial for advancing human sociality.
The challenges or gaps in prior research include:
-
Diversity and Complexity: Human relationships are incredibly diverse, context-dependent, and multifaceted, making it difficult to isolate and study individual components or establish objective measures. Non-human primate societies, for example, are largely defined by hierarchy and affiliation, whereas human societies involve far more complex types of relationships (e.g.,
frenemies,godparents). -
Subjectivity: Relationships are subjective beliefs, experiences, and practices shaped by individual perspectives and dynamic, unwritten rules, making objective comparison across individuals and cultures challenging. The degree of shared cognitive, behavioral, and cultural principles of relationships across time and cultures is underexplored.
-
Disciplinary Silos: Different social science disciplines have explored relationships using their own theoretical perspectives and methodological approaches, leading to disparate findings and a lack of a unified understanding. For instance, sociologists found a three-factor model for role-based relationships, anthropologists proposed four elementary forms of social bonds, and cognitive psychologists suggested a four-dimensional framework. These approaches often tap into distinct
feature spacesandrelationship types.The paper's innovative idea or entry point is to focus on the
common sense of human relationships—how ordinary people mentally conceptualize and understand relationship concepts. By leveraging diverse methodologies (online surveys, laboratory cognitive tasks, and advanced natural language processing) across a large global sample of modern cultures and historical texts spanning 3,000 years, the authors aim to build a unified framework that clarifies the underlying elements and organizational structures of the relationship concept system, while also revealing similarities and differences across cultures and time.
2.2. Main Contributions / Findings
The paper makes several primary contributions and key findings:
-
Discovery of a Universal Representational Space (FAVEE-HPP Framework):
- The study identified a
universal representational spacefor human relationship concepts, structured by five principal dimensions and three core categories. - The five principal dimensions, termed the
FAVEE model, are:Formality,Activeness,Valence,Exchange, andEquality. These dimensions account for a significant portion of the variance in how relationships are conceptualized. - The three core categories, termed the
HPP model, are:Hostile,Public, andPrivaterelationships. The paper demonstrates that these categories emerge from theFAVEEdimensions. - This framework unifies existing theories from various disciplines and provides a parsimonious model for understanding relationship concepts.
- The study identified a
-
Cross-Cultural Universality and Variability:
- The
FAVEE-HPPframework was found to beuniversally sharedacross diverse modern cultures (19 global regions, 10 languages, n = 17,686 participants), confirming its generalizability worldwide. - The
FAVEEmodeloutperformed 15 other existing theoriesin data fitting and explained variance across these global regions. - While the basic structure is universal, the study also revealed
rich cultural variabilityin how specific relationship concepts are understood. This variability was quantitatively linked toreligionandmodernizationlevels. For instance,public relationshipsshowed more cultural variability thanfamilialorromantic relationships. Specific comparisons between the USA and China highlighted differences in conceptualizingcloseness(physical vs. psychological distance),powerwithin families, andsocial exchangein private relationships. - The model was further validated in a
non-industrial society(the Chinese Mosuo tribe), demonstrating its robustness beyond industrialized contexts.
- The
-
Historical Endurance:
- By employing advanced
Natural Language Processing (NLP)techniques andPre-trained Language Models (PLMs)on large-scale historical text corpora (ancient China spanning 3,000 years), the paper demonstrated that theFAVEE-HPPstructures arepersistent through time. - The
FAVEE-HPPmodel alsooutperformed other theoretical modelsin predicting relationship representations in both ancient and modernPLM embeddings. - Analysis of ancient vs. modern Chinese
PLM embeddingsrevealed shifts in the relative importance offormalityandequalitydimensions over time, suggestingequalitywas more salient in ancient conceptualizations of relationships, andformalitymore so in modern times.
- By employing advanced
-
Generalizability to Non-Dyadic Relationships:
-
The
FAVEEframework was extended and confirmed to be generalizable tonon-dyadic relationships, includingtriadic relations(e.g.,love triangle) andgroup relations(e.g.,rich-poor,Democrats-Republicans).In essence, the paper provides a fundamental, robust, and empirically validated framework for understanding the cognitive and cultural principles underlying human relationship knowledge, thereby advancing the understanding of
human sociality. It offers a computational framework for objective and quantitative measurement of human relationships, akin to the "Big Five" framework for personality.
-
3. Prerequisite Knowledge & Related Work
3.1. Foundational Concepts
To fully understand this paper, a beginner should be familiar with several key concepts from social science, cognitive science, and computational methods.
- Social Complexity: This refers to the intricate and diverse ways individuals interact and organize themselves within a society. In
Homo sapiens, it's characterized by varied relationship types, social structures, and cultural norms, contrasting with simpler social hierarchies in many animal species. - Conceptual Structure: This is how mental concepts are organized in the human mind. For relationships, it refers to the underlying cognitive map or framework people use to understand, categorize, and differentiate between various types of social connections. It represents the "common sense" or "lay theory" people hold about relationships.
- Universality vs. Cultural Variability:
Universalityimplies that certain aspects of relationship conceptualization are common across all human cultures, suggesting a fundamental, shared cognitive basis.Cultural variabilityhighlights how these conceptualizations can differ significantly due to specific cultural norms, beliefs, and societal structures. - Natural Language Processing (NLP): This is a field of artificial intelligence that enables computers to understand, interpret, and generate human language. In this paper, NLP is used to:
- Generate comprehensive lists of relationship terms from text.
- Analyze large text corpora (collections of written texts) to infer how relationships are conceptualized in different cultures and historical periods.
- Pre-trained Language Models (PLMs): These are advanced NLP models (e.g., BERT, RoBERTa) that have been trained on vast amounts of text data (e.g., the entire internet or massive book collections) to learn general linguistic patterns, grammar, and semantic relationships between words.
Pre-traininginvolves tasks likeMasked Language Modeling (MLM), where the model learns to predict missing words in a sentence. Once pre-trained, these models can generateword embeddings. - Large Language Models (LLMs): A subset of PLMs, LLMs (e.g., GPT-4) are particularly large neural networks with billions of parameters, excelling at understanding and generating human-like text, performing tasks like summarization, translation, and answering complex questions. In this paper, LLMs like GPT-4 are used to generate
descriptions([DESC]) to provide context forPLMswhen probing relationship understanding. - Word Embeddings: These are numerical representations (vectors) of words or concepts in a multi-dimensional space. Words with similar meanings or contexts are mapped closer together in this space. PLMs generate
embeddingsthat capture semantic and contextual information. By analyzing theembedding spaceof relationship terms, researchers can infer their conceptual similarities and differences. - Principal Component Analysis (PCA): This is a statistical procedure that transforms a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called
principal components. It's adimensionality reductiontechnique.- Goal: To reduce the number of dimensions (features) in a dataset while retaining as much variance (information) as possible.
- How it works (simplified): PCA finds new axes (principal components) along which the data varies the most. The first principal component accounts for the largest possible variance, and each succeeding component accounts for the next highest variance possible and is orthogonal to the preceding components.
- In this paper,
PCAis used to reduce a large set ofevaluative features(e.g.,intimacy,formality) into a smaller, more manageable set oflatent dimensions(e.g.,formality,activeness,valence,exchange,equality).
- Clustering Algorithms (e.g., k-means, Hierarchical Clustering): These are unsupervised machine learning techniques used to group a set of objects in such a way that objects in the same group (called a
cluster) are more similar to each other than to those in other groups.- k-means: A popular algorithm that aims to partition observations into clusters. It iteratively assigns data points to the nearest cluster centroid and then recomputes the centroids.
- Hierarchical Clustering: Builds a hierarchy of clusters, either by merging smaller clusters into larger ones (agglomerative) or by splitting larger clusters into smaller ones (divisive).
- In this paper, clustering is used to identify
categoriesof relationships (e.g.,hostile,public,private) based on their conceptual similarities in therepresentational space.
- Representational Dissimilarity Matrices (RDMs): An
RDMis a square matrix where each entry(i, j)represents thedissimilarity(e.g., conceptual distance or difference) between two items(i)and(j). It's symmetric, with zeros on the diagonal (an item is perfectly similar to itself).RDMsare a key tool inRepresentational Similarity Analysis (RSA)to compare different representations (e.g., human judgments, brain activity, computational models).
- Representational Similarity Analysis (RSA): A powerful framework for comparing the
representational geometries(the patterns of similarities and dissimilarities among a set of items) between different sources of data.- Goal: To understand how closely different models or datasets reflect each other's underlying conceptual organization.
- How it works (simplified): It involves computing
RDMsfor different data sources (e.g., human ratings, PLM embeddings, cultural variables) and then comparing theseRDMs(typically using correlation coefficients like Spearman's ). A high correlation between twoRDMssuggests that their underlying representational spaces are structurally similar. - In this paper,
RSAis used to:- Assess cross-cultural concordance of relationship concepts.
- Model cross-region variability using cultural/sociocultural variables.
- Compare
PLM embeddingswith human ratings.
- Multi-arrangement Task: A cognitive task where participants arrange items (e.g., relationship terms) on a 2D screen such that the spatial distance between any two items reflects their
conceptual dissimilarity. Closer items are more similar. - Free Sorting Task: A cognitive task where participants categorize a set of items (e.g., relationship terms) into groups of their own choosing and provide labels for these categories.
- Uniform Manifold Approximation and Projection (UMAP): A non-linear
dimensionality reductiontechnique often used for visualization and as a preprocessing step for clustering. It aims to preserve both local and global data structure, projecting high-dimensional data into a lower-dimensional space (e.g., 2D) while maintaining meaningful distances.
3.2. Previous Works
The paper highlights a historical lack of consensus in understanding the basic organization of relationships, despite efforts across various disciplines. It references several foundational works and theories, each contributing a piece to the puzzle but none offering a unified framework:
- Marwell & Hage (1970) - Sociological Perspective:
- Focus: Formation and organization of
social relationships. - Model: Discovered a
three-factor modelforrole-based relationships:intimacy,visibility, andregulation. - Context: Sociologists were interested in how social roles shape interactions.
- Focus: Formation and organization of
- Fiske (1992) - Anthropological Perspective (Relational Models Theory):
- Focus: Foundations of
social coordination across cultures. - Model: Proposed
four elementary forms of social bonds:communal sharing,authority ranking,expected reciprocity, andmarket pricing. - Context: Anthropologists sought universal principles governing social interactions across diverse societies.
- Explanation: Fiske's theory suggests that people organize their social lives using a limited number of fundamental relational models.
Communal Sharing: People feel that they are all the same, undifferentiated, and belong to a single group. Needs are met without regard for proportionality (e.g., family, close friends).Authority Ranking: People are ordered hierarchically, with superiors having authority and protection, and subordinates having deference and loyalty (e.g., military, traditional family structures).Equality Matching: People relate as peers who are distinct but equal, engaging in turn-taking, balanced reciprocity, and equal distributions (e.g., friends taking turns buying rounds, splitting chores).Market Pricing: People relate in terms of proportional ratios or rates, often using a common metric like money, valuing efficiency and cost-benefit analysis (e.g., buying and selling, wages).
- Focus: Foundations of
- Wish, Deutsch, & Kaplan (1976) - Cognitive Psychological Perspective:
- Focus: Perception of
interpersonal relations. - Model: Revealed a
four-dimensional frameworkfor describing relationships:valence(affective tone),equality(power balance),activeness(intensity/engagement), andformality(rules/structure). - Context: Cognitive psychologists investigated how individuals mentally represent and differentiate relationships.
- Focus: Perception of
- Montgomery (1988) - Communication Studies Perspective:
-
Focus:
Communication qualityinpersonal relationships. -
Model: Proposed
three factorsfor effective relational dialogues:positiveness,intimacy, andcontrol. -
Context: Communication scholars studied the dynamics of interaction within relationships.
The paper notes that all these theories have been insightful and endured in their respective fields. However, a major limitation is that researchers "in different disciplines have approached the problem of human relationships using their unique features of interest and thus have tapped into distinct feature spaces and relationship types." This disciplinary fragmentation is precisely what the current paper aims to overcome by building a
unified frameworkacross these perspectives. The present work seeks to synthesize these diverse feature sets into a higher-order, more comprehensive model.
-
3.3. Technological Evolution
The study's ability to address the long-standing problem of relationship conceptualization is significantly enabled by recent technological advancements, particularly in Natural Language Processing (NLP) and computational modeling.
- From Manual Surveys to Large-Scale Online Data Collection: Traditionally, social science research relied on smaller, localized studies. The advent of
online survey platforms(like MTurk, CloudResearch, Credamo, NaoDao) has revolutionized data collection, allowing for:- Massive Sample Sizes: Reaching thousands of participants (n = 20,427 in this study) across diverse geographical and cultural regions efficiently.
- Cross-Cultural Reach: Accessing participants from 19 global regions, enabling robust cross-cultural comparisons.
- Diverse Demographics: Capturing a broader range of human experiences and perspectives on relationships.
- Emergence of Advanced NLP and Language Models:
- Early NLP: Focused on rule-based systems or simpler statistical models for text analysis.
- Modern NLP with Deep Learning: The rise of deep learning, particularly
transformer architectures, led to the development of powerfulPre-trained Language Models (PLMs)like BERT, RoBERTa, andLarge Language Models (LLMs)like GPT-4. These models:- Capture Semantic Nuances: Can learn highly sophisticated representations of words (
word embeddings) that capture subtle semantic and contextual relationships, far beyond what traditional methods could achieve. - Proxy for Human Cognition: The paper demonstrates that
PLM embeddingscan reflecthuman-like relationship understandingand evenscholarly knowledgein historical contexts, serving as a proxy for the collective human mind at different periods. - Historical Access: Enabled the study of
ancient culturesby analyzing historical text corpora, making populations otherwise inaccessible to modern researchers available for cognitive analysis.
- Capture Semantic Nuances: Can learn highly sophisticated representations of words (
- Sophisticated Computational Modeling Techniques:
-
Dimensionality Reduction: Techniques like
Principal Component Analysis (PCA)andUMAPallow researchers to simplify complex datasets (e.g., 30 theoretical features) into interpretable lower-dimensional spaces (e.g., 5 dimensions), revealing underlying structures. -
Clustering: Algorithms like
k-meansandhierarchical clusteringenable the data-driven discovery of natural categories within relationship concepts. -
Representational Similarity Analysis (RSA): Provides a robust framework for quantitatively comparing
representational geometriesfrom various sources (human judgments, computational models, cultural variables), allowing for precise measurement of agreement and disagreement across cultures and time.This paper's work fits within the technological timeline as a cutting-edge example of
computational social scienceanddigital humanities, leveraging big data and AI (specifically NLP) to tackle fundamental questions in psychology and anthropology that were previously limited by methodological constraints. It represents a shift from purely theoretical or small-scale empirical studies tolarge-scale, data-driven, and interdisciplinary investigationsinto human social cognition.
-
3.4. Differentiation Analysis
This paper's approach differentiates itself from previous works by addressing several limitations inherent in prior discipline-specific studies:
-
Unified, Interdisciplinary Framework vs. Fragmented Perspectives:
- Previous Work: As noted, prior research typically focused within single disciplines (e.g., sociology, anthropology, cognitive psychology), each developing models based on their specific interests and methodological traditions. This led to different
feature spaces(e.g.,intimacyfor sociologists,market pricingfor anthropologists) and limited integration across fields. For instance, Wish et al. found a 4-dimensional framework for perception, while Fiske proposed 4 forms of social bonds. - This Paper's Innovation: The authors explicitly set out to "synthesize this cross-field literature and build a unified representational space across disciplines." They achieved this by:
- Comprehensive Feature Collection: Systematically collecting and summarizing 30
conceptual featuresfrom 15 prominent existing theories. This ensures a broad, compositefeature space. - Higher-Order Reduction: Instead of starting anew, they treated these existing theoretical features as inputs for further
dimensionality reduction(PCA) to derivehigher-order components(theFAVEEdimensions). This meta-analytic approach allows for the emergence of a more encompassing structure.
- Comprehensive Feature Collection: Systematically collecting and summarizing 30
- Previous Work: As noted, prior research typically focused within single disciplines (e.g., sociology, anthropology, cognitive psychology), each developing models based on their specific interests and methodological traditions. This led to different
-
Broad Empirical Scope (Cross-Cultural and Historical) vs. Limited Contexts:
- Previous Work: Many foundational studies were often conducted within specific cultural contexts (e e.g., Western, industrialized societies) or relied on smaller samples, limiting the generalizability of their findings. Studies on ancient cultures or non-industrial societies were rare or qualitative.
- This Paper's Innovation: The study provides unprecedented breadth and depth in its empirical validation:
- Global Modern Cultures: Tested the model across 19 diverse global regions and 10 languages (n = 17,686), far exceeding previous sample sizes and cultural diversity.
- Non-Industrial Societies: Validated the model in the Chinese Mosuo tribe, a small-scale matrilineal society, demonstrating applicability beyond industrialized contexts.
- Ancient Cultures: Pioneered the use of
Pre-trained Language Models (PLMs)on historical text corpora (ancient China spanning 3,000 years) to infer relationship conceptualizations from the past, an otherwise inaccessible population for direct study. This provides a uniquediachronic(across time) perspective.
-
Data-Driven and Computational Rigor vs. Predominantly Theoretical/Qualitative:
- Previous Work: While some theories involved statistical methods (e.g., dimensionality reduction in Wish et al.), the overall landscape often involved theoretical models or qualitative anthropological observations.
- This Paper's Innovation: Emphasizes a
data-driven approachat every stage:NLPto generate extensive relationship lists.PCAfor robustdimensionality reduction.Clustering algorithmsforcategoryidentification.Representational Similarity Analysis (RSA)to quantitatively compare different representations and link cultural variables to conceptual variations.- Rigorous
model comparison analysis(FAVEE vs. 15 other theories) demonstrating superior performance and consistency. - Quantification of
universalityandcultural variabilitythrough statistical modeling.
-
Integrated Dimensional and Categorical View vs. Separate Models:
-
Previous Work: Some theories focused on dimensions (e.g., Wish et al.), others on categories/forms (e.g., Fiske).
-
This Paper's Innovation: The
FAVEE-HPPframework integrates both:dimensions(FAVEE) define a continuousrepresentational space, andcategories(HPP) emerge as clusters within this space. This provides a more complete picture of how people mentally organize relationships, moving from continuous features to discrete types.In essence, this paper moves beyond fragmented, context-limited, and often discipline-bound understandings to construct a truly
universal,cross-cultural, andhistorically enduringframework for human relationship conceptualization, backed by massive empirical data and advanced computational methods.
-
4. Methodology
The paper outlines its methodology across three main studies, each building upon the previous one to establish the FAVEE-HPP framework and test its universality, cultural variability, and historical endurance.
4.1. Principles
The core idea behind the methodology is to identify the fundamental dimensions and categories that humans use to mentally organize their relationships. The theoretical basis or intuition is that despite the apparent diversity of relationships, there must be a limited set of underlying cognitive principles or constructs that allow humans to make sense of this complexity. The paper employs a multi-method approach, combining traditional social science surveys and cognitive tasks with cutting-edge Natural Language Processing (NLP) and Pre-trained Language Models (PLMs), to converge on these principles from various angles (self-report, behavioral, and linguistic data).
The methodology can be understood through three main phases, corresponding to the three studies:
- Discovery (Study 1): Identify the
latent dimensionsandcategoriesthat structure human relationship concepts in a modern, Western culture (USA), by synthesizing existing theories and using empirical data. - Generalization and Variability (Study 2): Test the
universalityof the discovered structure across diverse modern global cultures and a non-industrial society, and quantifycultural variabilityby linking it to macro-level societal factors. - Historical Endurance (Study 3): Investigate the persistence of this structure over long historical periods using
NLPtools to analyze ancient texts, thereby extending the framework's validity across time.
4.2. Core Methodology In-depth (Layer by Layer)
4.2.1. Participants
Across all studies, participants were native speakers who grew up or lived for the longest period of their life in the targeted regions. All studies were approved by the Institutional Review Board of Beijing Normal University (IRB_A_0024_2021002), and informed consent was obtained.
- Study 1:
- 1,065 online US participants via MTurk for online surveys.
- 60 offline US participants for laboratory cognitive tasks (within-subject design).
- Study 2:
- 17,686 online participants from 19 global regions (e.g., USA, China, Israel, Portugal) via MTurk, CloudResearch, Credamo, and NaoDao platform.
- 229 native Mosuo people (a non-industrial, matrilineal society in China) through face-to-face interviews and door-to-door paper surveys.
- Study 3:
- 44 scholars specialized in ancient Chinese culture for expert evaluation of the NLP method.
- Generalizability Test (Non-dyadic relationships):
- 380 online US participants (MTurk).
- 242 online Chinese participants (NaoDao platform).
4.2.2. Sampling of Human Relationships
A data-driven Natural Language Processing (NLP) approach was used to generate comprehensive lists of human relationships to ensure ecological validity and cover a wide range of types.
- Seed Words Generation: Brainstorming and social media searches by a set of participants (n=15 for USA, n=27 for China) were used to create initial
seed wordsrelated to relationships. - Text Embedding & Co-occurrence:
Text embedding(numerical representation of words) was used to find words that frequentlyco-occurredwith these seed words. This was done by calculating thecosine distancebetween word vectors. Words with smaller cosine distance are more similar in meaning or context. - Filtering: The list was filtered to keep only
nounsand then further filtered byfrequency. Manual checks ensured only words related to human relationships were retained. - Pairing and Literature Integration: Words were paired based on their relationship meanings. Additionally, relationships pulled from existing literature were added.
- Result: 159 typical relationships for English (USA) and 258 relationships for Chinese. These included common (e.g.,
siblings,friends) and uncommon (e.g.,master-servant,friends with benefits) English terms, and Chinese-unique relationships (some untranslatable or culture-specific).
- Result: 159 typical relationships for English (USA) and 258 relationships for Chinese. These included common (e.g.,
4.2.3. Evaluative Features
To capture a comprehensive understanding of relationship dimensions, the researchers synthesized features from prior work.
- Literature Search: A comprehensive literature search was performed across 15 prominent theories in relationship science.
- Feature Collection: 30
conceptual featureswere summarized and extracted (e.g.,activeness,communality,concreteness,equality,endurance,formality,intensity,intimacy,reciprocity,valence). Redundant features were combined. - Cross-Cultural Additions: For Study 2, three additional theoretical features (
morality,trust,generation gap) fromcross-cultural literaturewere included.
4.2.4. Dimensional Survey
This was a primary data collection method across all studies.
- Online Survey Design: Participants rated human relationships on
bipolar Likert scales. Each page cued a specificevaluative feature(e.g.,activeness) with two opposite phrases (e.g.,passivevs.active) at the ends of a slider bar. - Detailed Definitions: Since some features were obscure (e.g.,
communality), each feature was presented with a detailed definition and an exemplary relationship. - Instructions: Participants were asked to consider all aspects of relationships (thinking, feeling, acting, talking, characteristics), focusing on
general knowledgeorstereotypical understandingrather than personal experiences. - Attention Checks: Questions were used to ensure active engagement and prevent random or patterned responses.
- Efficiency:
- Between-subject design (online): To avoid fatigue, each participant was randomly assigned a subset of relationships (5-8) and a subset of features (10-11).
- Within-subject design (offline for Study 1): In contrast, offline participants rated all 159 relationships on all features (3 hours task).
- Data Quality Control: Additional questions on common object knowledge (size/color of animals, fruits) were included to rule out cross-cultural variations in online data quality or general semantic knowledge. Low variation in object knowledge suggested cultural variability was unique to relationship concepts.
4.2.5. Cognitive Tasks (Study 1)
Two lab-based cognitive tasks were used to measure the categorization of relationship concepts.
- Multi-arrangement Task:
- Goal: Collect intuitive
similarity judgments. - Procedure: Participants arranged the 159 relationships on a 2D computer screen by dragging and dropping. The rule was that
conceptually similarrelationships should be placed closer together, anddissimilarones further apart. - Output: A
dissimilarity matrixwhere values indicated distances in the 2D circle.
- Goal: Collect intuitive
- Free Sorting Task:
- Goal: Understand how people deliberately classify relationships into categories.
- Procedure: Participants classified the 159 relationships into
labelled categoriesof their choosing, allowed up to eight groups.
- Text Analysis on Categorical Labels (from Free Sorting Task):
- 444 labels were initially obtained.
- These were coded by assigning 159 relationships (e.g.,
familylabel assigned towife-husbandbut notdoctor-patient). Hierarchical clustering(Ward method) was performed on the label relationship matrix. This revealed three and six clusters after excluding noisy clusters.
4.2.6. Dimensionality Reduction and Clustering
These techniques were central to extracting the FAVEE dimensions and HPP categories.
- Data Preprocessing:
- Raw data was cleaned; participants failing attention checks were excluded (e.g., 2,441 out of 18,537 in Study 2).
- A matrix of average ratings for each relationship on each evaluative feature across participants was created.
- This matrix was normalized using
scikit-learn.
- Principal Component Analysis (PCA):
- Purpose: The primary technique for
dimensionality reductionto derivedimensional models. - Software:
prcompfunction from R (v.4.3.3). - Rotation:
Varimax rotationwas used to maximize loadings of individual evaluative features onto components, making them more interpretable. - Component Naming: Components were named by examining the top five highest absolute loadings and the distribution of relationship scores.
- Optimal Component Number: Determined by checking four
data-driven metrics(all explained below in Evaluation Metrics) and assessing the interpretability of each component. Solutions with cross-metrics agreement and high interpretability were chosen. - Robustness Check: Other
dimensionality reductiontechniques (Independent Component Analysis, Exploratory Factor Analysis, Multidimensional Scaling, Network Analysis) were also applied, consistently yielding the samefive-factor solution(FAVEE).
- Purpose: The primary technique for
- Clustering Techniques:
- Purpose: To derive
categorical models(e.g., HPP). - Primary Technique:
k-means clustering. Other techniques (e.g., hierarchical clustering, HDBSCAN) were used for validation. - Input: A
dissimilarity matrixof 159 relationships.- For
dimensional survey:Euclidean distance matrixcalculated from relationships' ratings on all evaluative features. - For
multi-arrangement task: Distance matrix directly from participant arrangements. - For
free sorting task: Distance matrix based on the probability that two relationships were classified in the same category.
- For
- Preprocessing:
Uniform Manifold Approximation and Projection (UMAP)was used as a preprocessing step fork-meansto boost performance.- UMAP Parameters:
nearest neighbour(15, for local vs. global structure) andminimum distance(0.01, for cluster tightness).
- UMAP Parameters:
- Optimal Cluster Number: Determined by the
silhouette score(explained below in Evaluation Metrics), and the stability and interpretability of the output clusters.
- Purpose: To derive
4.2.7. Language Models and Embeddings (Study 3)
PLMs and LLMs were harnessed to infer relationship understanding in ancient contexts.
- Modern Chinese PLM:
- Model:
word-based Chinese-RoBERTa-Basefrom UER-py Modelzoo. - Rationale: Chosen for its focus on the
mask language modeling taskduring pre-training and its use of words (not characters) for Chinese. Trained on a large, public modern Chinese text corpus.
- Model:
- Ancient Chinese PLM:
- Model:
BERT-ancient-Chinese(Wang & Ren, 2023). - Rationale: Trained on a large-scale ancient Chinese corpus (historical texts from Zhou Dynasty (-1046 BCE) to Qing Dynasty (1912 CE)).
- Model:
- Generating Human-like PLM Embeddings:
-
Approach: Adapted from Cutler and Condon (2023), combining
PLMswithLLMs. -
Query Formulation: The following query (in Chinese) was used as input for the
PLM:[DESC] The most salient feature of the relationship [TERM] is [MASK].[TERM]: Substituted with one of the 258 Chinese relationship terms.[MASK]: Represents the conceptualization of the target relationship, which the PLM predicts words for based on context.[DESC]: Relationship-specific descriptions generated byGPT-4(anLLM). This provided rich contextual information.- For ancient contexts,
GPT-4descriptions were generated for ancient China, then refined by human experts in ancient Chinese language/history to match linguistic features and relationship characteristics of the era.
- For ancient contexts,
-
Embedding Extraction: The
last layer vector(768 dimensions) was extracted from the[MASK]token's position for each relationship, resulting in a 258 x 768 matrix. -
Validation: Resemblance between human representations (similarity matrix from Chinese populations in Study 2) and
PLM representations(pre-trained on modern Chinese corpus) was 0.553 (), confirmingPLM embeddingscould reflect human-like understanding. -
PCA on PLM Embeddings:
PCAwas applied toPLM representations, and the resulting components corresponded well withFAVEEstructures.The following figure (Figure 5a from the original paper) shows the pipeline for generating PLM embeddings:
该图像是包含三部分的复合图表,展示了不同文化间人际关系的交叉区域可靠性。a部分为不同关系类型(依附、权力、交易、家庭、浪漫、敌对)的可靠性条形图;b部分展示敌对、公有和私人关系的可靠性小提琴图;c部分说明五个核心维度(正式性、活跃性、情感色彩、交换和平等)对应的可靠性分布。
-
4.2.8. Representational Similarity Analysis (RSA) and Model Comparison
These techniques were used for evaluating cross-cultural variability and model performance.
-
RSA Multiple Regression (Study 2):
- Purpose: To identify which cultural variables account for
cross-cultural variancein relationship representations. - Procedure:
- Cultural Variables: Collected from multiple open databases (e.g., World Values Survey, Worldbank) for language, personality, socio-ecology (subsistence, disease, climate), modernization (composite of education, urbanization, wealth), genetics, religion, politics, and Hofstede 6D culture model.
- Cultural Variable RDMs: For each cultural variable, an
RDMwas computed, where each cell represented the dissimilarity between two regions on that variable (e.g., dissimilarity in modernization level between China and Portugal). - Relationship Representational RDMs: For each
representational geometry(full-feature, dimensional, categorical), anRDMwas created to represent the dissimilarity of relationship representations across regions. - Linear Regression: A linear regression model was used, where
cultural variable RDMswerepredictors, and therelationship representational geometry RDMwas theoutcome variable.
- Statistical Significance:
Mantel testwas used. This involves permuting the order of cultural variableRDMs(while holding relationshipRDMsconstant), recalculating regression, and repeating 10,000 times to compute a -value for the statistic. A one-sided test was used as only positive similarities were expected. - Noise Ceiling: Estimated by using the mean relationship
RDMsofn-1regions to predict the remaining region'sRDM, reflecting inherent heterogeneity.
- Purpose: To identify which cultural variables account for
-
RSA Correlations (Study 3):
- Purpose: To compare
PLM embeddingswith human-ratingFAVEE-HPPmodel in ancient and modern contexts. - Procedure:
PLM embeddings(e.g., matrix for modern Chinese) were transformed into acosine similarity matrix(e.g., ).- This
similarity matrixwas then correlated (usingSpearman correlation) with the lower triangle ofRDMsderived from:- The
FAVEE dimensions(distances between pairs of relationships in 5DFAVEEspace). - The
HPP categories.
- The
- Noise Ceiling: Estimated by correlating human-rating
RDMsfromFAVEE-HPPwith human-ratingRDMsfrom 33 dimensional features.
- Purpose: To compare
-
Model Comparison Analysis:
-
Purpose: To compare the performance of the
FAVEEmodel against the 15 other existing theories. -
Procedure: For each model,
linear combinations of its featureswere used asregressorsto predict each of theremaining theoretical features(not included in that model). -
Metrics: (explained variance) and
BIC(Bayesian Information Criterion, for data fitting) were calculated for each region.FAVEEconsistently outperformed others.The following figure (Figure 3b from the original paper) shows the computational modeling of representational geometries using RSA multiple regression:
该图像是论文中的复合图,包括a部分的成分主成分分析碎石图,b部分的关系特征与五个主维度的相关性气泡图,以及c部分展示关系类别(私人、公有、敌意)网络结构的圆形聚类图,揭示了人类关系的概念结构。
-
4.2.9. Robustness Test
To ensure the reliability of the findings against data sparsity.
- Procedure: Relationships were removed one by one, and all analyses (PCA, clustering, cross-cultural RSA) were re-performed.
- Removal Sequence: Pairs of relationships were ranked from most to least similar (based on
multi-arrangement task). The relationship with lowerfamiliarity ratingin each pair was removed first. - Metrics:
Pearson correlationsbetween metrics from the full set and from the subsets determined robustness. A subset of 40 relationships was found sufficient to replicate findings based on 159 relationships.
5. Experimental Setup
5.1. Datasets
The study utilized several datasets, primarily consisting of human judgments on relationships and linguistic data for Natural Language Processing.
- 159 English Relationships:
- Source: Generated using a
data-driven NLP approachbased on seed words and text embeddings, supplemented by relationships from literature. - Characteristics: Included a mix of common (e.g.,
siblings,friends,enemies) and less common (e.g.,master-servant,friends with benefits)dyadic relationships. - Domain: English-speaking cultures (primarily USA).
- Purpose: Used in Study 1 for dimensional surveys and cognitive tasks, and in Study 2 for cross-cultural comparisons.
- Example: A sample data point would be the relationship
wife-husbandbeing rated on features likeformalityoractiveness.
- Source: Generated using a
- 258 Chinese Relationships:
- Source: Generated using Chinese
NLP algorithms, includingChinese-unique relationships(e.g., those related to Confucian ideals that are hard to translate directly). - Characteristics: Broader set of
dyadic relationshipsspecific to Chinese culture. - Domain: Chinese cultures (modern and ancient).
- Purpose: Used for US-China comparisons in Study 2 and for
PLM embeddingsin Study 3. - Example: The relationship
ancestor-descendantwhich holds different conceptual weight in Chinese culture due toancestor veneration.
- Source: Generated using Chinese
- 75 Mosuo Relationships:
- Source: Identified through field work within the Mosuo culture.
- Characteristics:
Typical relationshipsspecific to the Mosuo matrilineal society. - Domain: Mosuo (non-industrial) culture in China.
- Purpose: Used for validation of the
FAVEE-HPPmodel in anon-industrial societyin Study 2.
- Group Relations (40) and Triadic Relations (34):
- Source: Generated to test generalizability beyond
dyadic relationships. - Domain: USA and China.
- Purpose: Used to confirm the
FAVEEmodel's applicability to more complexnon-dyadic relationshipstructures. - Example:
love triangle(triadic),rich-poor(group).
- Source: Generated to test generalizability beyond
- Text Corpora for Pre-trained Language Models (PLMs):
-
Modern Chinese Corpus: A large-scale, publicly available corpus of modern Chinese text.
-
Ancient Chinese Corpus: A comprehensive compilation of historical Chinese texts ranging from the Zhou Dynasty (-1046 BCE) to the Qing Dynasty (1912 CE).
-
Domain: Linguistic data reflecting conceptualizations over time.
-
Purpose: To train and apply
PLMsto infer relationship understanding in modern and ancient China, effectively serving as proxies for the collective human mind in those eras.These datasets were chosen for their effectiveness in validating the method's performance across diverse contexts:
-
- The large number of relationships and global participant pool ensured robust empirical validation of universality and cultural variability.
- The inclusion of unique Chinese relationships and the Mosuo culture allowed for the exploration of culturally specific nuances and generalizability beyond typical industrialized societies.
- The use of ancient text corpora enabled a novel historical validation, addressing the temporal endurance of the proposed framework.
5.2. Evaluation Metrics
The paper employs a variety of statistical and computational metrics to evaluate its models and findings.
-
Pearson's Correlation Coefficient ():
- Conceptual Definition: Measures the linear correlation between two sets of data. It indicates the strength and direction of a linear relationship.
- Mathematical Formula: $ r_{xy} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2 \sum_{i=1}^{n} (y_i - \bar{y})^2}} $
- Symbol Explanation:
- : Pearson correlation coefficient between variables and .
- : Number of data points.
- : Individual data points for variables and .
- : Mean of variable and , respectively.
- Usage in Paper: Used to compare
PCA loadingsandrelationship scoresbetween subsamples and the full dataset in robustness tests, and generally for correlations between different measures.
-
Spearman's Rank Correlation Coefficient ():
- Conceptual Definition: A non-parametric measure of the strength and direction of monotonic association between two ranked variables. It assesses how well the relationship between two variables can be described using a monotonic function.
- Mathematical Formula: $ \rho = 1 - \frac{6 \sum d_i^2}{n(n^2 - 1)} $
- Symbol Explanation:
- : Spearman's rank correlation coefficient.
- : Difference between the ranks of corresponding items in the two variables.
- : Number of data points.
- Usage in Paper: Used for
Representational Similarity Analysis (RSA)correlations between language models and human ratings, and for cultural variable correlations (e.g., modernization level and formality score of neighbors).
-
Explained Variance (e.g., in PCA, , Adjusted ):
- Conceptual Definition: A measure of how much of the variability in a dataset is accounted for by a statistical model. In
PCA, it indicates the proportion of total variance explained by eachprincipal component. In regression, (coefficient of determination) measures the proportion of variance in the dependent variable that is predictable from the independent variable(s). is a modified version that accounts for the number of predictors in the model, providing a more accurate measure for comparison between models with different numbers of predictors. - Mathematical Formula (for in linear regression): $ R^2 = 1 - \frac{SS_{res}}{SS_{tot}} $ Mathematical Formula (for Adjusted ): $ \text{Adjusted } R^2 = 1 - (1 - R^2) \frac{n - 1}{n - k - 1} $
- Symbol Explanation:
- : Sum of squares of residuals (the unexplained variance).
- : Total sum of squares (the total variance in the dependent variable).
- : Number of data points (observations).
- : Number of independent variables (predictors) in the model.
- Usage in Paper:
PCA: Used to determine how much of the total variance in relationship features is captured by the fiveFAVEE dimensions(e.g., 82.14% in Study 1).Model Comparison: was used to assess how well a model (e.g.,FAVEE) can predictremaining theoretical featuresacross regions, providing a measure ofexplained variance.
- Conceptual Definition: A measure of how much of the variability in a dataset is accounted for by a statistical model. In
-
Bayesian Information Criterion (BIC):
- Conceptual Definition: A criterion for model selection among a finite set of models. It's a trade-off between model fit (likelihood) and model complexity (number of parameters). Lower
BICvalues generally indicate a better model. - Mathematical Formula: $ \text{BIC} = k \ln(n) - 2 \ln(\hat{L}) $
- Symbol Explanation:
- : Number of parameters estimated by the model.
- : Number of observations (data points).
- : The maximized value of the likelihood function for the model.
- Usage in Paper: Used in
model comparison analysisas a metric fordata fitting, where a lowerBICindicates a better-fitting model, especially when comparingFAVEEagainst other theories.
- Conceptual Definition: A criterion for model selection among a finite set of models. It's a trade-off between model fit (likelihood) and model complexity (number of parameters). Lower
-
Optimal Number of PCA Components (Metrics):
- Conceptual Definition: Various statistical rules and visual tests are used to determine the ideal number of
principal componentsto retain from aPCAanalysis, balancing variance explained with interpretability and parsimony. - Mathematical Formula / Explanation:
- Parallel Analysis: Compares the eigenvalues from the actual data to eigenvalues from randomly generated data of the same size. Components are retained if their eigenvalue is larger than the corresponding random data eigenvalue.
- Kaiser-Guttman Rule (Eigenvalue > 1 Rule): Retains
principal componentswhose eigenvalues are greater than 1. An eigenvalue of 1 means the component explains as much variance as a single original variable. - Cattell's Scree Test: Involves plotting the eigenvalues in descending order and looking for the "elbow" or "scree" point, where the slope of the line changes dramatically, suggesting components after this point contribute little additional variance.
- Optimal Coordinates: A more formal statistical approach to identify the elbow point in the scree plot.
- Usage in Paper: These metrics were used in Study 1 and Study 2 to determine that
five principal components(theFAVEE dimensions) were optimal.
- Conceptual Definition: Various statistical rules and visual tests are used to determine the ideal number of
-
Silhouette Score (for Clustering):
- Conceptual Definition: A measure of how similar an object is to its own cluster (cohesion) compared to other clusters (separation). It ranges from -1 to +1, where a high value indicates that the object is well matched to its own cluster and poorly matched to neighboring clusters. Scores near +1 indicate dense, well-separated clusters; scores near 0 indicate overlapping clusters; and negative scores suggest an object might be assigned to the wrong cluster.
- Mathematical Formula: $ s(i) = \frac{b(i) - a(i)}{\max(a(i), b(i))} $
- Symbol Explanation:
s(i): Thesilhouette scorefor data point .a(i): The average distance between data point and all other points in the same cluster.b(i): The minimum average distance between data point and all points in a different cluster (i.e., the nearest cluster to that is not part of).
- Usage in Paper: Used to determine the
optimal number of clustersfor theHPP categoriesandcanonical relationship types, where the highestsilhouette scoreindicates the best clustering solution.
-
Mantel Test (for RSA):
- Conceptual Definition: A statistical test used to determine the correlation between two
distance matrices(ordissimilarity matrices). It assesses whether the patterns of relationships among items in one matrix are significantly similar to the patterns in another matrix. Unlike standard correlation, it accounts for the non-independence of entries within a distance matrix. - Mathematical Formula / Explanation: The
Mantel testdoesn't have a single simple formula like Pearson's . Instead, it involves calculating a standard correlation coefficient (e.g., Pearson's ) between the vectorized lower triangles of the twoRDMs. The significance (-value) is then determined bypermutation testing:- One of the
RDMsis randomly permuted many times (e.g., 10,000 times). - The correlation is recalculated for each permutation.
- The observed correlation is compared to the distribution of correlations from the permuted data to determine its statistical significance.
- One of the
- Usage in Paper: Used to assess the statistical significance of
RSAmultiple regression coefficients (i.e., whether cultural variables significantly predict cross-regional variability in relationship representations).
- Conceptual Definition: A statistical test used to determine the correlation between two
-
F-statistic (for ANOVA and Regression):
- Conceptual Definition: A value calculated in
ANOVA(Analysis of Variance) or regression analysis to test the overall significance of the model or to compare means across multiple groups. A largerF-statisticgenerally indicates that the variation among group means (or explained by the model) is larger than the variation within groups (or unexplained by the model). - Mathematical Formula (conceptual for ANOVA): $ F = \frac{\text{Variance between groups}}{\text{Variance within groups}} $
- Symbol Explanation:
Variance between groups: Measures how much the means of different groups vary from the overall mean.Variance within groups: Measures the variability of observations within each group.
- Usage in Paper: Used in
ANOVAto compare cultural variability acrossHPP categoriesorFAVEE dimensions, and inRSA multiple regressionto assess the overall significance of the model predictingRDMs.
- Conceptual Definition: A value calculated in
5.3. Baselines
The paper primarily compared its proposed FAVEE model against 15 other existing theories from the social sciences that have attempted to describe the basic forms or underlying organization of human relationships. These theories served as the main baselines for evaluating the performance and consistency of the FAVEE model.
The specific theories are not explicitly named as a list in the main text, but they are implied by the comprehensive literature review that summarized 30 conceptual features from these 15 prominent existing theories (as mentioned in Study 1 and Extended Data Fig. 1). Each of these theories, with their own set of proposed dimensions or categories, implicitly served as a baseline model.
- Why these baselines are representative:
-
Disciplinary Coverage: They span different social science disciplines (e.g., sociology, anthropology, cognitive psychology, communication studies), reflecting the diverse historical approaches to understanding relationships.
-
Established Paradigms: These are "prominent existing theories" that have "proved insightful, as attested by their endurance in the field," indicating their foundational status and influence in relationship science.
-
Feature-Based Comparison: The paper's methodology allowed for a direct quantitative comparison by seeing how well the features (or dimensions) proposed by each of these 15 theories could explain the variance in the comprehensive set of relationship data, and how consistently they performed across different cultures.
By demonstrating that the
FAVEEmodeloutperformedthese 15 other theories in terms ofdata fittingandexplained varianceacross global regions, the paper establishes the superior predictive power and parsimony of its proposed framework compared to the established, but often fragmented, understanding of relationships.
-
6. Results & Analysis
6.1. Core Results Analysis
The paper presents its findings across three studies, demonstrating the discovery, universality, cultural variability, and historical endurance of the FAVEE-HPP framework.
6.1.1. Study 1: A Unified Representational Space Across Disciplines
Discovery of FAVEE Dimensions:
-
Through an extensive literature review, 30
conceptual featuresfrom 15 existing theories were identified. -
Principal Component Analysis (PCA)on ratings of 159 relationships across these 30 features from 1,065 US participants extractedfive latent dimensions, accounting for 82.14% of the variance. These dimensions were named:Formality: Contrasting formal, occupational, rule-bound relationships (e.g.,co-worker) with informal, socio-emotional, private ones (e.g.,parent-infant).Activeness: Distinguishing close, engaged relationships (e.g.,wife-husband) from distant ones (e.g.,strangers).Valence: Differentiating friendly, harmonious relationships (e.g.,church members) from conflictual, hostile ones (e.g.,bully-victim).Exchange: Separating relationships involving concrete resource exchange (e.g.,dealer-buyer) from those involving symbolic, intangible resources (e.g.,brother-sister).Equality: Distinguishing relationships with equal power (e.g.,sports rivals) from those with unequal power (e.g.,man-god).
-
This
five-dimensional solutionwas robustly replicated using otherdimensionality reduction techniques(e.g.,ICA,EFA,MDS), leading to theFAVEE model.The following figure (Figure 1 from the original paper) shows the PCA loadings for the FAVEE model:

Discovery of HPP Categories:
-
Categorical representationswere explored usingmulti-arrangementandfree sorting taskswith 60 US participants. -
Clustering algorithmsapplied to the dimensional survey data revealedthree core categories:Hostile,Private, andPublicrelationships (theHPP model).Hostile: Antagonistic relationships (e.g.,divorced spouses).Private: Personal and family ties (e.g.,siblings,close friends).Public: Formal, occupational, impersonal ties (e.g.,driver-passenger).
-
Text analysis on
free sorting labelsfurther showed that these threeHPP categoriescould be subdivided intosix canonical relationship types:hostile,familial,romantic,affiliative,transactional, andpower. -
A
dimension-category hybrid modeldemonstrated thatHPP categoriesemerge from theFAVEE dimensions(e.g.,privateandpublicclusters at poles offormality,hostilecluster low onvalence).The following figure (Figure 2, now 6, from the original paper) illustrates the categorical and dimensional representations:
该图像是图表,展示了图5中FAVEE-HPP模型在现代和古代中国中的验证过程,包括生成PLM嵌入的流程、主成分分析与人类评分的相关性及模型泛化性。图b和c中展示了PCA载荷分数及其与人类评分和嵌入的对应关系,图d-f呈现了模型泛化性和古今差异的比较。
6.1.2. Study 2: Universality and Variability Across Modern Cultures
Universality of FAVEE-HPP:
-
The
5D FAVEE spaceandthree HPP categorieswere consistently identified in globally aggregated data (n = 17,686) and regional data across 19 global regions. -
Leave-one-region-out cross-validationshowed high similarities inrepresentational geometriesacross regions, suggestingFAVEEandHPPareuniversal structures. -
Model comparison analysisconfirmed that theFAVEE model outperformed 15 other existing theoriesin data fitting (lower BIC) and explained variance (higher Adjusted ) across global regions, indicating its superior representation of relationship features.The following figure (Figure 3, now 7, from the original paper) shows the universality and cultural variability of relationship representational geometries:

Cultural Variability:
-
Despite universality,
rich cultural variationwas observed.Religionandmodernizationwere the only two factors that significantly predicted cross-region variability inrepresentational geometries(Extended Data Table 1). Regions with similar levels ofreligionandmodernizationhad similar relationship conceptualizations. -
Public relationship conceptsshowed more cultural variability thanhostileorprivate relationships. No singleFAVEE dimensionwas selectively influenced by culture, but their relative importance could shift.The following figure (Extended Data Figure 4, now 12, from the original paper) shows cultural variability across relationships, dimensions, and categories:

USA vs. China Comparisons:
-
Closeness: Americans focused more on
physical distance, while Chinese focused onpsychological distance(e.g.,ancestor-descendantwas less distant for Chinese due toancestor veneration). -
Power: Chinese held stronger stereotypes of
inequalityamong family members (e.g.,uncle-nephew), consistent with Confucianfilial piety. -
Exchange: Americans experienced more
concrete resource exchangesinprivate relationshipsthan Chinese (e.g., gifts for long-distance lovers in USA vs. long phone calls in China), potentially linked to higher modernization/capitalism.The following figure (Figure 4, now 8, from the original paper) compares conceptualization between USA and China:

Non-Industrial Society Validation:
-
The
FAVEE-HPPmodel was validated in theMosuo tribe(a matrilineal, traditional agrarian society in China). Highly similarrepresentational geometrieswere observed between Mosuo, Han Chinese, and other industrial societies, confirming its applicability beyond industrialized contexts.
6.1.3. Study 3: Relationship Representations in Ancient Cultures
PLMs as Proxies for Ancient Minds:
- The study demonstrated that
PLMscan generatehuman-like relationship understandingby comparingPLM embeddingswith human ratings (). PCAonPLM embeddingsrevealed components corresponding well withFAVEEstructures.
Historical Endurance of FAVEE-HPP:
FAVEEstructures were identified from anancient Chinese PLM(trained on texts from -1046 BCE to 1912 CE).RSA correlationsconfirmed significant correlations betweenhuman ratingsandancient PLM embeddingsfor bothFAVEE dimensionsandHPP categories.- The
FAVEE model outperformed 15 other theoretical modelsin predictingancient and modern PLM embeddings.
Ancient vs. Modern Differences:
-
Formalityexplained more variance in modern ChinesePLMs(0.279) than ancient (0.178). -
Equalityaccounted for more variance in ancient ChinesePLMs(0.243) than modern (0.148). This suggests ancient Chinese weightedequality(e.g.,social hierarchy) more andformality(e.g.,occupations) less than modern Chinese. -
Expert validation showed ancient
PLM embeddingsagreed more with expert ratings on ancient Chinese culture, indicating it captured scholarly insights.The following figure (Figure 5, now 9, from the original paper) shows validation of the FAVEE-HPP model in modern and ancient China using language models:
该图像是包含三部分的复合图表,展示了不同文化间人际关系的交叉区域可靠性。a部分为不同关系类型(依附、权力、交易、家庭、浪漫、敌对)的可靠性条形图;b部分展示敌对、公有和私人关系的可靠性小提琴图;c部分说明五个核心维度(正式性、活跃性、情感色彩、交换和平等)对应的可靠性分布。
6.1.4. Generalizability to Non-Dyadic Relationships
- The
FAVEEframework successfully generalized totriadic relations(e.g.,love triangle) andgroup relations(e.g.,rich-poor,Democrats-Republicans) in both the US and China, with high correlations betweendyadicandnon-dyadicFAVEE structures.
6.2. Data Presentation (Tables)
The following are the results from [Extended Data Table 1] of the original paper:
| Representational Geometry Models | ||||||||
|---|---|---|---|---|---|---|---|---|
| Full-feature Model | Dimensional Model | Categorical Model | ||||||
| Main Analysis | ||||||||
| Predictors | β | p | β | p | β | p | ||
| Climates | 0.163 | 0.180 | 0.167 | 0.200 | 0.146 | 0.154 | ||
| Demographics | -0.052 | 0.591 | -0.003 | 0.469 | -0.323 | 0.996 | ||
| Disease | 0.004 | 0.499 | -0.025 | 0.609 | -0.012 | 0.555 | ||
| Gene | -0.056 | 0.568 | -0.157 | 0.740 | -0.052 | 0.574 | ||
| Geography | -0.213 | 0.924 | -0.197 | 0.896 | -0.205 | 0.936 | ||
| Hofstede6D | -0.147 | 0.810 | -0.178 | 0.857 | -0.097 | 0.746 | ||
| Language | 0.005 | 0.498 | -0.107 | 0.667 | 0.189 | 0.208 | ||
| Modernization | 0.347 | 0.022 | 0.274 | 0.048 | 0.245 | 0.047 | ||
| Personality | 0.133 | 0.204 | 0.170 | 0.191 | 0.062 | 0.295 | ||
| Politics | -0.125 | 0.820 | -0.122 | 0.818 | -0.077 | 0.730 | ||
| Religion | 0.561 | 0.014 | 0.601 | 0.011 | 0.299 | 0.109 | ||
| Subsistence | -0.079 | 0.698 | -0.029 | 0.563 | -0.004 | 0.491 | ||
| Adjusted R-squared | 0.561 | 0.532 | 0.267 | |||||
| Follow-up Analysis | ||||||||
| Predictors | β | p | β | p | β | p | ||
| Climates | 0.198 | 0.156 | 0.188 | 0.181 | 0.182 | 0.120 | ||
| Demographics | 0.035 | 0.395 | 0.063 | 0.332 | -0.278 | 0.986 | ||
| Disease | -0.014 | 0.575 | -0.027 | 0.632 | -0.052 | 0.729 | ||
| Education | 0.123 | 0.203 | 0.122 | 0.207 | 0.032 | 0.366 | ||
| Gene | -0.139 | 0.700 | -0.230 | 0.839 | -0.065 | 0.598 | ||
| Geography | -0.122 | 0.776 | -0.127 | 0.790 | -0.164 | 0.891 | ||
| Hofstede6D | -0.087 | 0.700 | -0.125 | 0.773 | -0.090 | 0.728 | ||
| Language | 0.091 | 0.369 | -0.047 | 0.573 | 0.238 | 0.154 | ||
| Personality | 0.146 | 0.188 | 0.188 | 0.177 | 0.050 | 0.321 | ||
| Politics | -0.078 | 0.716 | -0.083 | 0.730 | -0.063 | 0.691 | ||
| Religion | 0.317 | 0.109 | 0.419 | 0.048 | 0.177 | 0.223 | ||
| Subsistence | -0.045 | 0.614 | -0.005 | 0.502 | 0.023 | 0.413 | ||
| Urbanization | 0.445 | 0.018 | 0.326 | 0.050 | 0.307 | 0.044 | ||
| Wealth | -0.072 | 0.693 | -0.077 | 0.708 | 0.080 | 0.241 | ||
| Object knowledge | 0.017 | 0.439 | 0.056 | 0.359 | -0.098 | 0.749 | ||
| Adjusted R-squared | 0.618 | 0.559 | 0.299 | |||||
This table shows the Representational Similarity Analysis (RSA) results in Study 2, detailing how various cultural variables (predictors) relate to representational geometries (outcome models) across different regions.
- Main Analysis: Considers broad cultural variables.
ModernizationandReligionare significant predictors for theFull-feature ModelandDimensional Model, whileModernizationis significant for theCategorical Model. Thebeta() values indicate the standardized regression coefficients, and values indicate statistical significance. - Follow-up Analysis: Breaks down
ModernizationintoEducation,Urbanization, andWealth. Here,Urbanizationconsistently shows significance across all models, andReligionremains significant for theDimensional Model. Adjusted R-squaredvalues indicate the overall variance explained by the set of predictors for eachrepresentational geometry model. TheFull-feature Modelgenerally has the highest explained variance.- Crucially,
Object knowledgeshows no significant predictive power, reinforcing that the observed cultural variability is specific torelationship conceptsand not general semantic knowledge or data quality issues.
6.3. Ablation Studies / Parameter Analysis
The authors conducted several analyses to verify the robustness and generalizability of their findings, which can be seen as forms of ablation studies or parameter analysis.
6.3.1. Robustness of PCA Solution to Different Algorithms
- Purpose: To ensure that the identified
five-dimensional FAVEEstructure was not an artifact of the specificdimensionality reductiontechnique used. - Procedure: In Study 1, after performing
PCA, the authors also evaluated otherdimensionality reduction techniques:Independent Component Analysis (ICA)Exploratory Factor Analysis (EFA)Multidimensional Scaling (MDS)
- Results: All these techniques
yielded the same five-factor solution(Supplementary Fig. 3), strongly confirming the robustness and inherent nature of theFAVEE dimensions. This indicates that theFAVEEmodel captures a fundamental structure independent of the specific mathematical method for its extraction.
6.3.2. Robustness of FAVEE-HPP to Different Numbers of Relationships
- Purpose: To determine if the findings were dependent on the full set of 159 relationships or if a smaller, more manageable subset could still capture the essential structure.
- Procedure: A
robustness testwas quantified by iteratively removing relationships one by one. The removal sequence prioritized less familiar relationships from pairs deemed most similar in themulti-arrangement task. All analyses (PCA, clustering, cross-cultural RSA) were re-performed on these reduced sets. - Results: The study found that
a subset of 40 relationships was good enough to replicate all findings based on 159 relationships(Supplementary Fig. 11). This is a crucial finding for future research, suggesting that simpler experimental designs with fewer relationships can still yield reliable results, making theFAVEE-HPPframework highly practical.
6.3.3. Determining Optimal Sample Size for Dimensional Survey (Monte Carlo Simulation)
- Purpose: To ensure
adequate statistical powerandstabilityofPCA resultswith minimal participant responses. - Procedure: A
Monte Carlo simulation testwas conducted using pilot study data.PCAwas performed onsubsamplesof varying sizes (from 2 to 40 participant responses per relationship-feature rating, with 1,000 iterations for each subsample). Theloading scoresandrelationship scoresfrom these subsamples were then compared to the overall dataset usingPearson's correlation. - Results: Subsamples with
ten responsesper relationship-feature rating were almost identical to the entire dataset (rating correlation ), and were adequate to ensure highly similar derivedPCA components(loading score correlation , relationship score correlation ). This informed the efficient design of the large-scale Study 2.
6.3.4. Model Comparison Performance and Consistency
- Purpose: To quantitatively demonstrate that the
FAVEEmodel is superior to existing theories in representing relationship conceptualization. - Procedure:
Model comparison analysiswas performed (Extended Data Fig. 3) where theFAVEEmodel was compared against15 other existing theories. This involved using linear combinations of features from each model asregressorsto predictremaining theoretical featuresand calculating (explained variance) andBIC(data fitting). - Results: Across global regions, the
FAVEE model outperformed all 15 other theoriesin bothexplained variance(mean Adjusted ) anddata fitting(mean BIC = 364.794), with 100,000 bootstrap resamples confirming significance.FAVEEwas thebest model in 17 out of 19 global regions. This directly validates the effectiveness of the proposed framework against established baselines, highlighting its advantages in comprehensiveness and parsimony.
6.3.5. Validation of Clustering Results
-
Purpose: To confirm the stability and interpretability of the
HPP categoriesandsix canonical types. -
Procedure: While
k-means clusteringwas the primary method, other clustering techniques (e.g.,hierarchical clustering,HDBSCAN) were also employed.UMAPwas used as a preprocessing step to improve clustering performance. Thesilhouette scorewas used to identify theoptimal number of clusters, and the stability and interpretability of the output clusters were examined. -
Results: Solutions were chosen that were insensitive to algorithm/parameter choice, consistent across different clustering algorithms, and had high interpretability and high
silhouette scores. This confirms the robustness of the identifiedcategorical structures.These
ablation studiesandparameter analysescollectively underscore the rigor of the methodology, ensuring that theFAVEE-HPPframework is not only empirically derived but also statistically robust, generalizable, and superior to existing alternatives.
7. Conclusion & Reflections
7.1. Conclusion Summary
This paper makes a groundbreaking contribution to the understanding of human relationships by identifying a universal conceptual structure that transcends diverse cultures and historical periods. The core finding is the FAVEE-HPP framework, comprising five principal dimensions (Formality, Activeness, Valence, Exchange, Equality) and three core categories (Hostile, Public, Private relationships).
The study rigorously established this framework through:
-
Discovery (Study 1): Synthesizing 15 existing theories and using extensive human ratings from US participants to derive the
FAVEE dimensionsandHPP categories, demonstrating how categories emerge from the continuous dimensional space. -
Universality and Cultural Variability (Study 2): Validating the
FAVEE-HPPframework across 19 global regions and a non-industrial society (Mosuo tribe), showing its global generalizability. Crucially, it outperformed 15 other theories. It also quantified cultural variability, linking it toreligionandmodernization, and highlighting nuanced differences (e.g., US vs. China). -
Historical Endurance (Study 3): Employing advanced
NLPandPLMson ancient Chinese texts, demonstrating that theFAVEE-HPPstructures are detectable and persistent over 3,000 years. This study also revealed shifts in the relative importance offormalityandequalityover time. -
Generalizability Beyond Dyads: The framework's applicability was extended to
triadicandgroup relations.The significance of this work lies in providing a
unified, parsimonious, and empirically robust modelfor human relationship knowledge, offering a computational framework for objective measurement and advancing our understanding ofhuman sociality. It serves as a foundational "Big Five" for relationship science, much like its counterpart in personality research.
7.2. Limitations & Future Work
The authors candidly acknowledge several limitations and propose future research directions:
- Focus on Lay Theory: The current work primarily taps into
lay theory(vernacular beliefs/common sense) about relationships. It may differ from the actual, dynamic organization of relationships observed through social acts and interactions.- Future Work: Examine social acts and interactions across relationships to complement
lay theory.
- Future Work: Examine social acts and interactions across relationships to complement
- Universality of FAVEE-HPP Not Conclusive: While extensively validated, the
FAVEE-HPPframework's universality is "far from conclusive." The reliance on online populations anddata-driven approaches(while powerful) necessitates further investigation intoboundary conditionsthat might influence its stability, validity, and generalizability.- Future Work: Explore factors that could influence the
FAVEE-HPPmodel's stability, validity, representativeness, and generalizability. Investigate thedifferent ordering of dimensionsin different regions, as it could reveal further cultural insights.
- Future Work: Explore factors that could influence the
- Need for Scientifically Rigorous Taxonomy: The
FAVEE-HPPmodel was derived from theoretical features originating fromlayperson languages.- Future Work: A more scientifically rigorous approach is needed to create a valid and reliable taxonomy of human relationships.
- Limited Historical Contexts: Due to resource limitations and the availability of high-quality
PLMs, Study 3 focused only onancient China.- Future Work: Validate the
FAVEE-HPPmodel in other historical contexts (e.g., Hebrew, Greek, Tamil, Old English) to further confirm its historical endurance across diverse civilizations.
- Future Work: Validate the
- Individual-Level Variation: The current work focuses on
culturalandpopulation levels. However, relationship cognition is subjective, varied, and dynamic at the individual level.- Future Work: Investigate how relationship representations are constructed during
human development(e.g., from infancy to adulthood) and how individuals form idiosyncratic impressions. - Future Work: Develop
psychometric testsbased on theFAVEE-HPPframework to measure individual differences across the five dimensions (analogous to the Big Five personality test). - Future Work: Examine how individual differences in relationship representations link to
interpersonal difficultiesorabnormalitiesinclinical populations(e.g., autism, sociopathy).
- Future Work: Investigate how relationship representations are constructed during
7.3. Personal Insights & Critique
This paper represents a monumental achievement in social science, offering an unprecedentedly comprehensive and empirically validated framework for understanding human relationships.
Innovations and Strengths:
- Interdisciplinary Synthesis: The approach of synthesizing 15 existing theories into a higher-order model is highly innovative. It moves beyond disciplinary silos to identify genuinely fundamental dimensions.
- Methodological Rigor and Scope: The sheer scale of data collection across diverse modern cultures (n = 20,427), the validation in a non-industrial society, and the pioneering use of
PLMsfor historical analysis are truly impressive. This multi-method, multi-context approach lends exceptional credibility to theFAVEE-HPPframework. - Computational Social Science at its Best: The seamless integration of online surveys, cognitive tasks, and advanced
NLPtechniques showcases the power ofcomputational social sciencefor tackling complex, abstract psychological constructs. The use ofPLMsas "proxies" for the ancient mind is particularly compelling and opens new avenues for historical psychology. - Practical Implications: The analogy to the "Big Five" for personality is apt. A standardized, quantifiable framework for relationships has immense practical value for designing interventions, informing cross-cultural communication, diplomacy, and even developing more socially intelligent AI.
- Clarifying Universality and Cultural Nuance: The paper skillfully balances the demonstration of universal structures with the identification of crucial cultural specificities. This nuanced approach is essential for a true understanding of human behavior globally.
Potential Issues and Areas for Improvement:
- "Lay Theory" Limitation: While acknowledged, the reliance on "lay theory" is a significant point for continued research. How do these conceptual structures actually manifest in real-time social interactions, behavioral choices, and emotional responses? Future work integrating observational or behavioral data with the
FAVEE-HPPframework would be critical. - Causality of Cultural Variables: While
religionandmodernizationwere identified as predictors of cultural variability, the correlational nature ofRSAmeans causality cannot be definitively established. Deeper dives into how these factors shape relationship conceptualization (e.g., through specific religious doctrines or aspects of urban life) would be valuable. - Interpretation of PLM Embeddings: While the paper validates
PLM embeddingsagainst human ratings and expert knowledge, the exact mechanism by which aPLM"understands" relationships and how that correlates with human cognition remains a complex area of research. Further theoretical work on the cognitive interpretability ofPLMswould strengthen this aspect. - The "Why" Behind FAVEE-HPP: The paper beautifully describes what the
FAVEE-HPPstructure is and where/when it applies. Further theoretical exploration of why these particular five dimensions and three categories might be evolutionarily or cognitively fundamental would add depth. Extended Data Fig. 8 offers some initial ideas linking toMaslow's hierarchyandBaumeister's theoryof cultural animals, which is a promising start. - Generalizability of "Formality" and "Equality" Shifts: The finding that
formalityandequalitydiffered in importance between ancient and modern China is fascinating. Repeating this analysis for other cultures and historical periods would confirm if this is a universal pattern of conceptual evolution or culture-specific.
Transferability and Application:
-
AI and Social Robotics: The
FAVEE-HPPframework offers a structured way to imbueAI systemsandsocial robotswith a more human-like understanding of social interactions, leading to more context-aware and appropriate responses. -
Cross-Cultural Communication and Diplomacy: The insights into cultural variability (e.g.,
physical vs. psychological distance,concrete vs. symbolic exchange) can directly informcross-cultural training,diplomacy,marketing, andinternational relations, fostering better understanding and reducing miscommunication. -
Mental Health and Interpersonal Therapy: Developing psychometric tools based on
FAVEE-HPPcould help diagnoseinterpersonal difficultiesand tailor therapeutic interventions for individuals or couples by identifying specific dimensions where their relationship conceptualization might be atypical or problematic. -
Content Creation and Storytelling: Writers, filmmakers, and game designers could use this framework to create more realistic, relatable, and culturally resonant characters and storylines by consciously mapping their relationships onto the
FAVEE-HPPspace.In conclusion, this paper provides a robust and deeply insightful map of the
conceptual structure of human relationships, establishing a new benchmark for interdisciplinary research in social cognition. Its impact will likely resonate across psychology, anthropology, and even the burgeoning field of artificial social intelligence.
Similar papers
Recommended via semantic vector search.