Description
This paper proposes a method for discovering characteristic sequential patterns in textual data. Each data item consists of three kinds of information: time information, attributes, and text. The method groups items that share the same attribute values, arranges each group in chronological order, and generates sequences. It also extracts events from each text by using a text mining method. Finally, it discovers characteristic sequential patterns, composed of sets of events, from the sequences by using a sequential mining method. We apply the method to business reports collected by our sales force automation system and verify the validity of the discovered patterns by examining the texts related to them.
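The pipeline described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the sample records, the keyword table standing in for the text mining step, and the function names are all hypothetical, and the mining step is simplified to counting ordered pairs of single events rather than patterns over sets of events.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical records: (time, attribute value, text). Not data from the paper.
records = [
    ("2004-01-05", "customer_A", "Sent a quote for the new model"),
    ("2004-01-12", "customer_A", "Gave a demo on site"),
    ("2004-02-01", "customer_A", "Received an order"),
    ("2004-01-10", "customer_B", "Ran a demo for the team"),
    ("2004-01-03", "customer_B", "Prepared a quote"),
    ("2004-01-25", "customer_B", "Customer placed an order"),
    ("2004-01-07", "customer_C", "Handled a complaint about delivery"),
    ("2004-01-15", "customer_C", "Sent a revised quote"),
]

# Assumption: a keyword lookup stands in for the text mining step.
EVENT_KEYWORDS = {
    "quote": "QUOTE",
    "demo": "DEMO",
    "order": "ORDER",
    "complaint": "COMPLAINT",
}


def extract_events(text):
    """Return the set of events whose keyword appears as a word in the text."""
    words = text.lower().split()
    return frozenset(ev for kw, ev in EVENT_KEYWORDS.items() if kw in words)


def build_sequences(records):
    """Group items by attribute value and order each group by time."""
    groups = defaultdict(list)
    for time, attr, text in records:
        groups[attr].append((time, extract_events(text)))
    return {
        attr: [events for _, events in sorted(items, key=lambda x: x[0]) if events]
        for attr, items in groups.items()
    }


def frequent_pairs(sequences, min_support):
    """Count ordered pairs (a, b) where a occurs in an earlier item than b.

    Support is the number of sequences containing the pair at least once.
    """
    counts = defaultdict(int)
    for seq in sequences.values():
        seen = set()
        for i, j in combinations(range(len(seq)), 2):
            for a in seq[i]:
                for b in seq[j]:
                    seen.add((a, b))
        for pair in seen:
            counts[pair] += 1
    return {pair: c for pair, c in counts.items() if c >= min_support}


patterns = frequent_pairs(build_sequences(records), min_support=2)
```

With these sample records, pairs such as `("QUOTE", "ORDER")` are kept because they occur in two of the three sequences, while `("COMPLAINT", "QUOTE")` is dropped because it occurs in only one.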
Published in
2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583) 3279-3284, 2005-04-12
IEEE