MINING CLOSED SEQUENTIAL PATTERNS IN LARGE SEQUENCE DATABASES

V. Purushothama Raju1 and G.P. Saradhi Varma2

1Research Scholar, Dept. of CSE, Acharya Nagarjuna University Guntur, A.P., India

2Department of Information Technology S.R.K.R. Engineering College, Bhimavaram, A.P., India

ABSTRACT

Sequential pattern mining is studied widely in the data mining community. Finding sequential patterns is a basic data mining method with broad applications. Closed sequential pattern mining is an important technique among the different types of sequential pattern mining, since it preserves the details of the full pattern set and it is more compact than sequential pattern mining. In this paper, we propose an efficient algorithm CSpan for mining closed sequential patterns. CSpan uses a new pruning method called occurrence checking that allows the early detection of closed sequential patterns during the mining process. Our extensive performance study on various real and synthetic datasets shows that the proposed algorithm CSpan outperforms the CloSpan and a recently proposed algorithm ClaSP by an order of magnitude.

KEYWORDS
Data mining, sequential pattern mining, closed sequential pattern mining, sequence database

ORIGINAL SOURCE URL: http://airccse.org/journal/ijdms/papers/7115ijdms03.pdf

Comments

Popular posts from this blog

3rd International Conference on Computer Science, Engineering and Artificial Intelligence (CSEAI 2025)

A REVIEW OF THE USE OF R PPROGRAMMING FOR DATA SCIENCE RESEARCH IN BOTSWANA

HYBRID ENCRYPTION ALGORITHMS FOR MEDICAL DATA STORAGE SECURITY IN CLOUD DATABASE