Abstract
Data quality in the Wikipedia is debatable. On the one hand, existing research indicates that not only are people willing to contribute articles but the quality of those articles is close to that found in conventional encyclopedias. On the other hand, the public has never stopped criticizing the quality of Wikipedia articles, and critics never have trouble finding low quality Wikipedia articles. Why do Wikipedia articles vary widely in quality? We investigate the relationship between collaboration and data quality. We show that the quality of Wikipedia articles is not only dependent on the different types of contributors but also on how they collaborate. Based on an empirical study, we classify contributors based on their roles in editing individual Wikipedia articles. We identify various patterns of collaboration based on the provenance or, more specifically, who does what to Wikipedia articles. Our research helps identify collaboration patterns that are preferable or detrimental for data quality, thus providing insights for improving data quality in Wikipedia.
Original language | English (US) |
---|---|
Pages | 175-180 |
Number of pages | 6 |
State | Published - 2009 |
Event | 19th Workshop on Information Technologies and Systems, WITS 2009 - Phoenix, AZ, United States Duration: Dec 14 2009 → Dec 15 2009 |
Other
Other | 19th Workshop on Information Technologies and Systems, WITS 2009 |
---|---|
Country/Territory | United States |
City | Phoenix, AZ |
Period | 12/14/09 → 12/15/09 |
Keywords
- Collaboration pattern
- Data provenance
- Data quality
- Wikipedia
ASJC Scopus subject areas
- Information Systems
- Control and Systems Engineering