TY - GEN
T1 - Using Shallow Semantic Analysis to Implement Automated Quality Assessment of Web Health Care Information
AU - Zhang, Yanjun
AU - Mercer, Robert E.
AU - Burkell, Jacquelyn
AU - Cui, Hong
N1 - Publisher Copyright:
© 2023, Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - Evidence-based clinical practice guidelines have been widely used as an objective rating instrument for assessing the content quality of health care information on the web. In many previous studies, human raters check the concordance between text content and evidence-based practice guidelines in order to evaluate information accuracy and completeness. However, human rating cannot be a practical solution, particularly when there is an extremely large volume of health care information on the web. This study explores a semantics-based approach to identify health care information content in web documents with reference to evidence-based health care guidelines. With this approach terms and phrases in English are extracted and transformed into semantic concepts and units. Thus, web text is transformed, sentence by sentence, into a semantic representation which computer programs can classify depending on whether the content of a sentence is in concordance with evidence-based guidelines or not. Through aggregating the classification result of all sentences in a web document, computer programs are able to generate for each document a quality score indicating the number of unique evidence-based guidelines that are referred to in the document. In a test using a set of depression treatment web pages and evidence-based clinical guidelines, the quality rating performance of the computer system is shown to be close to human quality rating performance.
AB - Evidence-based clinical practice guidelines have been widely used as an objective rating instrument for assessing the content quality of health care information on the web. In many previous studies, human raters check the concordance between text content and evidence-based practice guidelines in order to evaluate information accuracy and completeness. However, human rating cannot be a practical solution, particularly when there is an extremely large volume of health care information on the web. This study explores a semantics-based approach to identify health care information content in web documents with reference to evidence-based health care guidelines. With this approach terms and phrases in English are extracted and transformed into semantic concepts and units. Thus, web text is transformed, sentence by sentence, into a semantic representation which computer programs can classify depending on whether the content of a sentence is in concordance with evidence-based guidelines or not. Through aggregating the classification result of all sentences in a web document, computer programs are able to generate for each document a quality score indicating the number of unique evidence-based guidelines that are referred to in the document. In a test using a set of depression treatment web pages and evidence-based clinical guidelines, the quality rating performance of the computer system is shown to be close to human quality rating performance.
KW - Evidence-based clinical practice
KW - Natural language processing
KW - Semantics-based classification
KW - Semantics-based quality rating approach
KW - Shallow semantic analysis
KW - Web health care information
UR - http://www.scopus.com/inward/record.url?scp=85149977326&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85149977326&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-23804-8_2
DO - 10.1007/978-3-031-23804-8_2
M3 - Conference contribution
AN - SCOPUS:85149977326
SN - 9783031238031
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 17
EP - 31
BT - Computational Linguistics and Intelligent Text Processing - 19th International Conference, CICLing 2018, Revised Selected Papers
A2 - Gelbukh, Alexander
PB - Springer Science and Business Media Deutschland GmbH
T2 - 19th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2018
Y2 - 18 March 2018 through 24 March 2018
ER -