From “Sooo excited!!!” to “So proud”: Using language to study development

By Michal Kosinski, Margaret L. Kern, Johannes C. Eichstaedt, Andrew H. Schwartz, Gregory Park, Lyle H. Ungar, David J. Stillwell, Lukasz Dziurzynski, Martin E.P. Seligman
Developmental Psychology, January 2014, Vol. 50, Issue 1, Pages 178-188

We introduce a new method, differential language analysis (DLA), for studying human development, in which computational linguistics is used to analyze the big data available through online social media in light of psychological theory. Our open-vocabulary DLA approach finds words, phrases, and topics that distinguish groups of people based on one or more characteristics. Using a data set of over 70,000 Facebook users, we identify how word and topic use vary as a function of age and compile cohort-specific words and phrases into visual summaries that are face valid and intuitively meaningful. We demonstrate how this methodology can be used to test developmental hypotheses, using the aging positivity effect (Carstensen & Mikels, 2005) as an example. Although this study focused primarily on common trends across age-related cohorts, the same methodology can be used to explore heterogeneity within developmental stages or to explore other characteristics that differentiate groups of people. Our comprehensive list of words and topics is available on our web site for deeper exploration by the research community.
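
To make the open-vocabulary idea concrete, the following is a minimal sketch of how words that distinguish people by age might be ranked. It is not the authors' implementation: the abstract does not specify the statistic used, so the sketch assumes a plain Pearson correlation between each word's relative frequency and the user's age. The function names (`relative_frequencies`, `pearson`, `words_by_age_correlation`) and the four toy users are invented for illustration only and are not drawn from the study's data.

```python
import math
import re
from collections import Counter


def relative_frequencies(text):
    """Map each word in `text` to its share of that user's total word count."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()} if total else {}


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def words_by_age_correlation(users):
    """Rank every observed word by how strongly its use correlates with age.

    `users` is a list of (age, text) pairs. The vocabulary is not fixed in
    advance: it is whatever words appear in the texts (the "open vocabulary").
    Returns (word, r) pairs sorted from most positive to most negative r.
    """
    ages = [age for age, _ in users]
    freqs = [relative_frequencies(text) for _, text in users]
    vocab = set().union(*freqs)
    scored = [
        (word, pearson([f.get(word, 0.0) for f in freqs], ages))
        for word in vocab
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Invented toy data (assumption): four hypothetical users, not study data.
toy_users = [
    (16, "sooo excited excited for school tomorrow"),
    (19, "excited for the weekend"),
    (45, "so proud of my daughter"),
    (60, "proud of my family"),
]
ranking = words_by_age_correlation(toy_users)
```

In this toy example, words near the top of `ranking` are used relatively more by the older hypothetical users and words near the bottom by the younger ones. A real analysis at the scale described above would also need to handle phrases and topics, rare words, and multiple-comparison correction, none of which this sketch attempts.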