TY - GEN
T1 - Clustering main concepts from e-mails
AU - Aguilar-Ruiz, Jesús S.
AU - Rodriguez-Baena, Domingo S.
AU - Cohen, Paul R.
AU - Riquelme, Jose Cristóbal
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2004.
PY - 2004
Y1 - 2004
N2 - E–mail is one of the most common ways to communicate, assuming, in some cases, up to 75% of a company’s communication, in which every employee spends about 90 minutes a day in e–mail tasks such as filing and deleting. This paper deals with the generation of clusters of relevant words from E–mail texts. Our approach consists of the application of text mining techniques and, later, data mining techniques, to obtain related concepts extracted from sent and received messages. We have developed a new clustering algorithm based on neighborhood, which takes into account similarity values among words obtained in the text mining phase. The potential of these applications is enormous and only a few companies, mainly large organizations, have invested in this project so far, taking advantage of employees’s knowledge in future decisions.
AB - E–mail is one of the most common ways to communicate, assuming, in some cases, up to 75% of a company’s communication, in which every employee spends about 90 minutes a day in e–mail tasks such as filing and deleting. This paper deals with the generation of clusters of relevant words from E–mail texts. Our approach consists of the application of text mining techniques and, later, data mining techniques, to obtain related concepts extracted from sent and received messages. We have developed a new clustering algorithm based on neighborhood, which takes into account similarity values among words obtained in the text mining phase. The potential of these applications is enormous and only a few companies, mainly large organizations, have invested in this project so far, taking advantage of employees’s knowledge in future decisions.
UR - http://www.scopus.com/inward/record.url?scp=7444223632&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=7444223632&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-25945-9_23
DO - 10.1007/978-3-540-25945-9_23
M3 - Conference contribution
AN - SCOPUS:7444223632
SN - 3540222189
SN - 9783540222187
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 231
EP - 240
BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
A2 - Conejo, Ricardo
A2 - Perez-de-la-Cruz, Jose-Luis
A2 - Urretavizcaya, Maite
PB - Springer-Verlag
T2 - 10th Conference of the Spanish Association for Artificial Intelligence, CAEPIA 2003 and 5th Conference on Technology Transfer, TTIA 2003
Y2 - 12 November 2003 through 14 November 2003
ER -