Data Colada

Logo of the blog
Logo of the blog

Data Colada is a blog dedicated to investigative analysis and replication of academic research, focusing in particular on the validity of findings in the social sciences.[1]

It is known for its advocacy against problematic research practices such as p-hacking, and for publishing evidence of data manipulation and research misconduct in several prominent cases, including celebrity professors Dan Ariely and Francesca Gino. Data Colada was established in 2013 by three behavioral science researchers: Uri Simonsohn, a professor at ESADE Business School, Barcelona/Spain (as of 2023), Leif Nelson, a professor at University of California, Berkeley, and Joe Simmons, a professor at University of Pennsylvania.[1]

History

[edit]

Around 2011, Simmons, Nelson and Simonsohn "bonded over the false, ridiculous, and flashy findings that the field [of behavioral sciences] was capable of producing", such as a paper by Cornell psychologist Daryl Bem that had supposedly found evidence for clairvoyance.[2] They reacted by publishing an influential[1] 2011 paper about false positive results in psychology, illustrating the problem with a parody research finding that supposedly showed that listening to the Beatles song "When I’m Sixty-Four" made experimental subjects one and a half years younger.[2]

The "Data Colada" blog was launched two years later, in 2013, carrying the tagline "Thinking about evidence, and vice versa", becoming what the New York Times described as "a hub for nerdy discussions of statistical methods — and, before long, various research crimes and misdemeanors".[1]

In particular, the three researchers objected to the then widespread practice of cherry-picking data and attempts to make insignificant results appear statistically credible, especially an approach for which they coined the term p-hacking in a 2014 paper.[2][3][4]

Notable findings

[edit]

Apart from calling out faulty, but presumably well-intended research practices, Data Colada also published evidence of data manipulations and research misconduct. These include studies about the concept of the moral high ground by psychologist Lawrence Sanna, and research by Flemish psychologist Dirk Smeesters.[2] According to The New Yorker, after Data Colada published their work, the careers of Sanna and Smeesters "came to an unceremonious end".[2]

In 2021, Data Colada discovered fabricated data in a 2012 field study published in PNAS[5] by Lisa L. Shu, Nina Mazar, Francesca Gino, Dan Ariely, and Max H. Bazerman.[6][7] All of the study's authors agreed with their assessment and the paper was retracted.[7] The authors also agreed that Ariely was the only author who had access to the data prior to transmitting it in its fraudulent form to Mazar, the analyst.[6] Ariely denied manipulating the data,[8] but Excel metadata showed that he created the spreadsheet and was the last to edit it. He also admitted to having mislabeled all of the values in an entire column of the data in an e-mail to Mazar shortly after he initially sent her the data.[6][9] Ariely has stated that someone at the insurance agency that provided the data must have fabricated it.[10][11][2]

Reception

[edit]

Data Colada's work is credited with contributing awareness to the replication crisis, the idea that many research results in the social sciences are difficult or impossible to reproduce.[2] Data Colada is also recognized for helping to establish better research practices, such as the sharing of replication data.[3]

The Nobel-prize winning psychologist Daniel Kahneman described Data Colada in 2023 as "heroes of mine" and expressed his regret about previously endorsing research findings that the blog later showed were faulty.[2] Brian Nosek of the Center for Open Science applauded Data Colada for having "done an amazing job of developing new methodologies to interrogate the credibility of research."[3]

On the other hand, as summarized by The New Yorker, "Data Colada's harshest critics saw the young men as jealous upstarts who didn’t understand the soft artistry of the social sciences".[2] Psychologist Norbert Schwarz accused Data Colada and other reformers of engaging in a "witch hunt," while psychologist Daniel Gilbert denounced what he called the "replication police" as "shameless little bullies".[2]

Francesca Gino lawsuit

[edit]

In 2021, researcher Zoé Ziani and another collaborator alerted Data Colada about problems replicating work by Harvard behavioral scientist Francesca Gino. Later that year, the Data Colada team contacted Harvard University about anomalies in four papers by Gino.[2][12] Harvard subsequently conducted its own internal investigation with the help of an outside firm, which discovered additional data alterations besides the cases raised by Data Colada.[2] In June 2023, Harvard Business School placed Gino on unpaid administrative leave after the internal investigation determined she had falsified data in her research.[13][14][15] Around the same time, Data Colada published four blog posts detailing evidence that the four papers (all of which had been retracted or set to be retracted at that point), and possibly others by Gino, "contain fake data."[15] Gino subsequently filed a defamation suit against Harvard, Harvard Business School Dean Srikant Datar, and the three members of Data Colada for $25 million, alleging that they had conspired to damage her reputation with false accusations, and that the penalties against her amounted to gender-based discrimination under Title IX.[15] Gino accused Harvard and the Data Colada team of having "worked together to destroy my career and reputation despite admitting they have no evidence proving their allegations."[12] The lawsuit raised concerns about chilling effects. Open science proponent Simine Vazire raised over $370,000 to help cover the legal fees of Data Colada.[16][17]

On September 11, 2024, the judge dismissed all of Gino's claims against the Data Colada defendants (defamation and other claims), and dismissed Gino's defamation and certain other claims (such as violation of privacy) against the Harvard University defendants, while allowing some breach of contract claims against Harvard to continue.[18][19]

References

[edit]
  1. ^ a b c d Scheiber, Noam (2023-09-30). "The Harvard Professor and the Bloggers". The New York Times. Archived from the original on 2023-09-30. Retrieved 2023-10-01.
  2. ^ a b c d e f g h i j k l Lewis-Kraus, Gideon (2023-09-30). "They Studied Dishonesty. Was Their Work a Lie?". The New Yorker. ISSN 0028-792X. Archived from the original on 2023-10-01. Retrieved 2023-10-01.
  3. ^ a b c Subbaraman, Nidhi (2023-09-24). "The Band of Debunkers Busting Bad Scientists". Wall Street Journal. Archived from the original on 2023-09-24. Retrieved 2023-10-08.
  4. ^ "APA PsycNet". psycnet.apa.org. Archived from the original on 2023-11-02. Retrieved 2023-10-08.
  5. ^ "A study on dishonesty was based on fraudulent data". The Economist. August 20, 2021. ISSN 0013-0613. Archived from the original on August 22, 2021. Retrieved August 23, 2021.
  6. ^ a b c "[98] Evidence of Fraud in an Influential Field Experiment About Dishonesty". Data Colada. August 17, 2021. Archived from the original on June 23, 2023. Retrieved August 18, 2021.
  7. ^ a b Lee, Stephanie M. (August 20, 2021). "A Famous Honesty Researcher Is Retracting A Study Over Fake Data". BuzzFeed News. Archived from the original on August 25, 2021. Retrieved August 23, 2021.
  8. ^ Ariely, Dan (August 16, 2021). "Dan Blog Comment" (PDF). datacolada.org. Retrieved 29 January 2023.
  9. ^ Charlton, Aaron (2021-08-17). "Conflicts between Dan Ariely's statement and Footnote #14 (DataColada #98)". OpenMKT.org. Archived from the original on 2023-01-30. Retrieved 2023-01-30.
  10. ^ Charlton, Aaron (2022-08-21). "Dan Ariely claims authorship order shields him from blame; speculates that a low-level envelope stuffer committed the fraud". OpenMKT.org. Archived from the original on 2023-01-30. Retrieved 2023-01-30.
  11. ^ "דן אריאלי: "אנשים צועקים עליי ברחוב, קוראים לי רוצח ופסיכופת"". הארץ (in Hebrew). Archived from the original on 2023-01-30. Retrieved 2023-01-30.
  12. ^ a b Svrluga, Susan (2023-08-03). "Professor accused of faking data in studies on dishonesty sues Harvard". Washington Post. ISSN 0190-8286. Archived from the original on 2023-09-30. Retrieved 2023-11-04.
  13. ^ "Francesca Gino - Faculty & Research - Harvard Business School". www.hbs.edu. Archived from the original on 2023-08-07. Retrieved 2023-08-07.
  14. ^ Quinn, Ryan. "Harvard Dishonesty Researcher Now on Administrative Leave". Inside Higher Ed. Archived from the original on 2023-07-24. Retrieved 2023-07-24.
  15. ^ a b c Hamid, Rahem D.; Yuan, Claire (2023-08-03). "Embattled by Data Fraud Allegations, Business School Professor Francesca Gino Files Defamation Suit Against Harvard". The Harvard Crimson. Archived from the original on 2023-09-25. Retrieved 2023-10-31.
  16. ^ Piper, Kelsey (2023-08-23). "A disgraced Harvard professor sued them for millions. Their recourse: GoFundMe". Vox. Archived from the original on 2023-10-31. Retrieved 2023-10-31.
  17. ^ O'Grady, Cathleen (2023-10-13). "How the reform-minded new editor of psychology's flagship journal will shake things up". science.org. Archived from the original on 2023-10-29. Retrieved 2023-10-31.
  18. ^ Lee, Stephanie M. (2024-09-11). "She Sued the Sleuths Who Found Fraud in Her Data. A Judge Just Ruled Against Her". The Chronicle of Higher Education. Retrieved 2024-09-26.
  19. ^ "Memorandum of Decision" (PDF). Court Listener. Free Law Project. September 11, 2024. Retrieved 26 September 2024. Full text of decision on defendants' motions to dismiss
[edit]