Who does what: Collaboration patterns in the wikipedia and their impact on data quality

Jun Liu, Sudha Ram

Research output: Contribution to conferencePaperpeer-review

21 Scopus citations


Data quality in the Wikipedia is debatable. On the one hand, existing research indicates that not only are people willing to contribute articles but the quality of those articles is close to that found in conventional encyclopedias. On the other hand, the public has never stopped criticizing the quality of Wikipedia articles, and critics never have trouble finding low quality Wikipedia articles. Why do Wikipedia articles vary widely in quality? We investigate the relationship between collaboration and data quality. We show that the quality of Wikipedia articles is not only dependent on the different types of contributors but also on how they collaborate. Based on an empirical study, we classify contributors based on their roles in editing individual Wikipedia articles. We identify various patterns of collaboration based on the provenance or, more specifically, who does what to Wikipedia articles. Our research helps identify collaboration patterns that are preferable or detrimental for data quality, thus providing insights for improving data quality in Wikipedia.

Original languageEnglish (US)
Number of pages6
StatePublished - 2009
Event19th Workshop on Information Technologies and Systems, WITS 2009 - Phoenix, AZ, United States
Duration: Dec 14 2009Dec 15 2009


Other19th Workshop on Information Technologies and Systems, WITS 2009
Country/TerritoryUnited States
CityPhoenix, AZ


  • Collaboration pattern
  • Data provenance
  • Data quality
  • Wikipedia

ASJC Scopus subject areas

  • Information Systems
  • Control and Systems Engineering

Cite this