The Role of Surface, Semantic and Grammatical Features on Simplification of Spanish Medical Texts: A User Study

Partha Mukherjee, Gondy Leroy, David Kauchak, Brianda Armenta Navarrete, Damian Y. Diaz, Sonia Colina

Research output: Contribution to journal > Article > peer-review

3 Scopus citations

Abstract

Simplifying medical texts facilitates readability and comprehension. While most simplification work focuses on English, we investigate whether features important for simplifying English text are similarly helpful for simplifying Spanish text. We conducted a user study on 15 Spanish medical texts using Amazon Mechanical Turk and measured perceived and actual difficulty. Using the median of the difficulty scores, we split the texts into easy and difficult groups and extracted 10 surface, 2 semantic and 4 grammatical features. Using t-tests, we identified the features that significantly distinguish easy text from difficult text in Spanish and compared them with prior work in English. We found that easy Spanish texts use more repeated words and adverbs, fewer negations and more familiar words, similar to English. Also like English, difficult Spanish texts use more nouns and adjectives. However, in contrast to English, easier Spanish texts contained longer sentences and used more varied grammatical structures.
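The analysis step described in the abstract (a median split on difficulty scores followed by a t-test per feature) can be sketched roughly as below. This is only an illustration under stated assumptions, not the authors' code: the abstract does not say which t-test variant was used or how texts at the median were assigned, so the sketch assumes Welch's unequal-variance t statistic and places ties in the easy group. The helper names `median_split` and `welch_t` are hypothetical, and all numbers are made-up toy values, not data from the study.

```python
from math import sqrt
from statistics import mean, median, variance


def median_split(scores):
    """Split text ids into (easy, difficult) around the median difficulty score.

    Assumption: texts scoring exactly at the median go to the easy group.
    """
    cut = median(scores.values())
    easy = [text for text, score in scores.items() if score <= cut]
    difficult = [text for text, score in scores.items() if score > cut]
    return easy, difficult


def welch_t(a, b):
    """Return Welch's t statistic and its approximate degrees of freedom."""
    va = variance(a) / len(a)  # squared standard error of sample a
    vb = variance(b) / len(b)  # squared standard error of sample b
    t = (mean(a) - mean(b)) / sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


# Toy values for illustration only (not from the study).
difficulty = {"t1": 1.0, "t2": 2.0, "t3": 3.0, "t4": 4.0, "t5": 5.0, "t6": 6.0}
sentence_length = {"t1": 22.0, "t2": 24.0, "t3": 26.0,
                   "t4": 15.0, "t5": 17.0, "t6": 19.0}

easy, difficult = median_split(difficulty)
t_stat, df = welch_t([sentence_length[t] for t in easy],
                     [sentence_length[t] for t in difficult])
print(f"t = {t_stat:.3f}, df = {df:.1f}")
```

In practice one such test would be run for each of the 16 features, and the t statistic would be converted to a p-value with a t-distribution routine from a statistics library.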

Original language: English (US)
Pages (from-to): 1322-1331
Number of pages: 10
Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
Volume: 2017
State: Published - 2017

ASJC Scopus subject areas

  • General Medicine
