TY - GEN
T1 - Unsupervised semantic markup of literature for biodiversity digital libraries
AU - Cui, Hong
PY - 2008
Y1 - 2008
N2 - This paper reports the further development of machine learning techniques for semantic markup of biodiversity literature, especially morphological descriptions of living organisms such as those hosted at efloras.org and algaebase.org. Earlier research explored syntactic parsing and supervised machine learning techniques. The limitations of these techniques prompted our investigation of an unsupervised learning approach that combines the strengths of the earlier techniques while avoiding their limitations. Semantic markup at the organ and character levels is discussed. Research on semantic markup of natural heritage literature has a direct impact on the development of semantic-based access in biodiversity digital libraries.
KW - Biodiversity informatics
KW - Morphological description
KW - Natural heritage literature
KW - Semantic annotation
KW - Semantic markup
KW - Tagging
KW - Unsupervised machine learning
KW - XML
UR - http://www.scopus.com/inward/record.url?scp=57749115937&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=57749115937&partnerID=8YFLogxK
U2 - 10.1145/1378889.1378894
DO - 10.1145/1378889.1378894
M3 - Conference contribution
AN - SCOPUS:57749115937
SN - 9781595939982
T3 - Proceedings of the ACM International Conference on Digital Libraries
SP - 25
EP - 28
BT - JCDL'08
T2 - 8th ACM/IEEE-CS Joint Conference on Digital Libraries 2008, JCDL'08
Y2 - 16 June 2008 through 20 June 2008
ER -