How do men and women rate themselves versus how their dates rate them?
Which attributes had the highest correlation to a participant’s popularity rate?
This dataset was the result of a speed dating experiment conducted by Columbia professors Ray Fisman and Sheena Iyengar. The dataset was lengthy and provided a lot of quantitative insights on how people percieve themselves and others in a dating context. The dataset was originally a 120x8379 data set and cleaning required referencing the accompanying document. The participants were asked many questions, but this project focuses on the attributes, not the hobbies and backgrounds.
1. Perception versus Reality of Self by Gender
The methodology focused on contrasting the self-perception versus others’ perception of speed dating participants, differentiated by gender.
Participants were asked to rate themselves on a scale from 1 - 10 on 5 attributes—attraction, sincerity, intelligence, fun, and ambition. Then their dates rated them on the same 5 attributes. Average self-scores for each gender were computed and rounded for precision. Parallelly, the average scores received from others on these attributes were calculated, thereby establishing a dataset reflective of external perceptions. The same was done for males then the two datasets were merged. To visualize the findings, the data was presented using radar charts, enabling a multidimensional comparison. Custom hex colors were applied to differentiate between self and others’ ratings within the charts for men and women.
Key Observations:
• Men and Women on average rated themselves lower than their dates did for each attribute. Maybe, we should think more highly of ourselves!
• Despite women rating themselves higher for each attribute on average, except for intelligence, the contrast between self and reality demonstrates that each gender may have underrated their own attributes
• The radar chart had to be scaled down from the 1 - 10 to 5 - 10 in order to provide meaningful contrast betwen the self and reality scores visually
Hover over the radars to explore the contrast
Cleaning the dataset
Visualize the different averages
Plot adjustments: radar labels, ticks, colors, legend, and annotation
2. Which attributes show the highest correlation to a participant’s popularity rate?
Answering this question required to parts, the first was quantifying a participant’s popularity rate and the second was creating a correlation matrix to plot.
The process began with grouping the data by participant identifier and aggregating the number of rounds they attended, the total affirmative decisions received (‘dec_o’), the decisions they made (‘dec’), and the matches they achieved (‘match’)—where a match indicated mutual affirmative decisions. Three different ratings were measures; ‘Popularity Rate’ indicated the frequency of participants being chosen by their dates, ‘Matches’ represented the rate of mutual interest per round, and ‘Reciprocated’ reflected the proportion of mutual interest when the participant had also expressed interest.
Unlike the self scored attributes, the scores from dates included a 6th attribute-shared interests-on which participants were evaluated by others. A correlation analysis was then conducted to discern the influence of each attribute on a participant’s popularity rate. The subsequent correlation dataframe was plotted utilizing a heatmap to best demonstrate a visual relationship between each attribute and popularity.
Key Observations:
• The strongest correlation to popularity was the attribute ‘attractive’
• Although all attributes had a postive correlation, sincerity and intelligence presented notably lower correlations to popularity rate
• Including shared interests in the analysis provided a more holistic picture of what contributed to a participant’s popularity due to its above average correlation
Correlation heatmap of 6 attributes
Data Sources: Speed Dating CSV and DOC
Match and questionnaire data from speed dating experiment run by Columbia professors Ray Fisman and Sheena Iyengar