DLS@CU: Sentence Similarity from Word Alignment and Semantic Vector Composition

Md Arafat Sultan, Steven Bethard, Tamara Sumner

Research output: Chapter in Book/Report/Conference proceedingConference contribution

100 Scopus citations

Abstract

We describe a set of top-performing systems at the SemEval 2015 English Semantic Textual Similarity (STS) task. Given two English sentences, each system outputs the degree of their semantic similarity. Our unsupervised system, which is based on word alignments across the two input sentences, ranked 5th among 73 submitted system runs with a mean correlation of 79.19% with human annotations. We also submitted two runs of a supervised system which uses word alignments and similarities between compositional sentence vectors as its features. Our best supervised run ranked 1st with a mean correlation of 80.15%.

Original languageEnglish (US)
Title of host publicationSemEval 2015 - 9th International Workshop on Semantic Evaluation, co-located with the 2015 Conference of the North American Chapter of the Association for Computational Linguistics
Subtitle of host publicationHuman Language Technologies, NAACL-HLT 2015 - Proceedings
EditorsPreslav Nakov, Torsten Zesch, Daniel Cer, David Jurgens
PublisherAssociation for Computational Linguistics (ACL)
Pages148-153
Number of pages6
ISBN (Electronic)9781941643402
StatePublished - 2015
Externally publishedYes
Event9th International Workshop on Semantic Evaluation, SemEval 2015 - Denver, United States
Duration: Jun 4 2015Jun 5 2015

Publication series

NameSemEval 2015 - 9th International Workshop on Semantic Evaluation, co-located with the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2015 - Proceedings

Conference

Conference9th International Workshop on Semantic Evaluation, SemEval 2015
Country/TerritoryUnited States
CityDenver
Period6/4/156/5/15

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'DLS@CU: Sentence Similarity from Word Alignment and Semantic Vector Composition'. Together they form a unique fingerprint.

Cite this