TY - GEN
T1 - What does the language of foods say about us?
AU - Van, Hoang
AU - Musa, Ahmad
AU - Chen, Hang
AU - Surdeanu, Mihai
AU - Kobourov, Stephen
N1 - Publisher Copyright:
© 2019 Association for Computational Linguistics
PY - 2019
Y1 - 2019
N2 - In this work we investigate the signal contained in the language of food on social media. We experiment with a dataset of 24 million food-related tweets, and make several observations. First, the language of food has predictive power. We are able to predict if states in the United States (US) are above the median rates for type 2 diabetes mellitus (T2DM), income, poverty, and education - outperforming previous work by 4-18%. Second, we investigate the effect of socioeconomic factors (income, poverty, and education) on predicting state-level T2DM rates. Socioeconomic factors do improve T2DM prediction, with the greatest improvement coming from poverty information (6%), but, importantly, the language of food adds distinct information that is not captured by socioeconomics. Third, we analyze how the language of food has changed over a five-year period (2013 - 2017), which is indicative of the shift in eating habits in the US during that period. We find several food trends, and that the language of food is used differently by different groups such as different genders. Last, we provide an online visualization tool for real-time queries and semantic analysis.
AB - In this work we investigate the signal contained in the language of food on social media. We experiment with a dataset of 24 million food-related tweets, and make several observations. First, the language of food has predictive power. We are able to predict if states in the United States (US) are above the median rates for type 2 diabetes mellitus (T2DM), income, poverty, and education - outperforming previous work by 4-18%. Second, we investigate the effect of socioeconomic factors (income, poverty, and education) on predicting state-level T2DM rates. Socioeconomic factors do improve T2DM prediction, with the greatest improvement coming from poverty information (6%), but, importantly, the language of food adds distinct information that is not captured by socioeconomics. Third, we analyze how the language of food has changed over a five-year period (2013 - 2017), which is indicative of the shift in eating habits in the US during that period. We find several food trends, and that the language of food is used differently by different groups such as different genders. Last, we provide an online visualization tool for real-time queries and semantic analysis.
UR - http://www.scopus.com/inward/record.url?scp=85098423789&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85098423789&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85098423789
T3 - LOUHI@EMNLP 2019 - 10th International Workshop on Health Text Mining and Information Analysis, Proceedings
SP - 87
EP - 96
BT - LOUHI@EMNLP 2019 - 10th International Workshop on Health Text Mining and Information Analysis, Proceedings
PB - Association for Computational Linguistics (ACL)
T2 - 10th International Workshop on Health Text Mining and Information Analysis, LOUHI@EMNLP 2019
Y2 - 3 November 2019
ER -