Clustering main concepts from e-mails

Jesús S. Aguilar-Ruiz, Domingo S. Rodriguez-Baena, Paul R. Cohen, Jose Cristóbal Riquelme

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    E–mail is one of the most common ways to communicate, assuming, in some cases, up to 75% of a company’s communication, in which every employee spends about 90 minutes a day in e–mail tasks such as filing and deleting. This paper deals with the generation of clusters of relevant words from E–mail texts. Our approach consists of the application of text mining techniques and, later, data mining techniques, to obtain related concepts extracted from sent and received messages. We have developed a new clustering algorithm based on neighborhood, which takes into account similarity values among words obtained in the text mining phase. The potential of these applications is enormous and only a few companies, mainly large organizations, have invested in this project so far, taking advantage of employees’s knowledge in future decisions.

    Original languageEnglish (US)
    Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    EditorsRicardo Conejo, Jose-Luis Perez-de-la-Cruz, Maite Urretavizcaya
    PublisherSpringer-Verlag
    Pages231-240
    Number of pages10
    ISBN (Print)3540222189, 9783540222187
    DOIs
    StatePublished - 2004
    Event10th Conference of the Spanish Association for Artificial Intelligence, CAEPIA 2003 and 5th Conference on Technology Transfer, TTIA 2003 - San Sebastian, Spain
    Duration: Nov 12 2003Nov 14 2003

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume3040
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference10th Conference of the Spanish Association for Artificial Intelligence, CAEPIA 2003 and 5th Conference on Technology Transfer, TTIA 2003
    Country/TerritorySpain
    CitySan Sebastian
    Period11/12/0311/14/03

    ASJC Scopus subject areas

    • Theoretical Computer Science
    • General Computer Science

    Fingerprint

    Dive into the research topics of 'Clustering main concepts from e-mails'. Together they form a unique fingerprint.

    Cite this