MINING CLOSED
SEQUENTIAL PATTERNS IN LARGE SEQUENCE DATABASES
V. Purushothama Raju1
and G.P. Saradhi Varma2
1Research
Scholar, Dept. of CSE, Acharya Nagarjuna University Guntur, A.P., India
2Department
of Information Technology S.R.K.R. Engineering College, Bhimavaram, A.P., India
ABSTRACT
Sequential pattern mining is
studied widely in the data mining community. Finding sequential patterns is a
basic data mining method with broad applications. Closed sequential pattern
mining is an important technique among the different types of sequential
pattern mining, since it preserves the details of the full pattern set and it
is more compact than sequential pattern mining. In this paper, we propose an
efficient algorithm CSpan for mining closed sequential patterns. CSpan uses a
new pruning method called occurrence checking that allows the early detection
of closed sequential patterns during the mining process. Our extensive performance
study on various real and synthetic datasets shows that the proposed algorithm
CSpan outperforms the CloSpan and a recently proposed algorithm ClaSP by an
order of magnitude.
KEYWORDS
Data mining, sequential pattern
mining, closed sequential pattern mining, sequence database
ORIGINAL SOURCE URL: http://airccse.org/journal/ijdms/papers/7115ijdms03.pdf
ORIGINAL SOURCE URL: http://airccse.org/journal/ijdms/papers/7115ijdms03.pdf
VOLUME LINK: http://airccse.org/journal/ijdms/current2015.html
Comments
Post a Comment