Sequence Pattern Mining in Data Streams

dc.contributor.advisorSaheb, Mahmoud
dc.contributor.authorHijawi, Hamza
dc.contributor.authorSaheb, Mahmoud
dc.date.accessioned2017-08-10T10:29:25Z
dc.date.accessioned2022-05-22T08:28:38Z
dc.date.available2017-08-10T10:29:25Z
dc.date.available2022-05-22T08:28:38Z
dc.date.issued2015-08-01
dc.description.abstractSequential pattern mining in data streams environment is an interesting data mining problem. The problem of finding sequential patterns in static databases had been studied extensively in the past years, however mining sequential patterns in the data streams still an active field for researches. In this research a new greedy sequence pattern mining algorithm for the data streams is introduced, it will be used to find the strongly supported sequences. The proposed algorithm is built based on the sequence tree which is used to find the sequential patterns in static databases. The proposed algorithm divides the streams into patches or windows and each patch will update the sequence tree which built from the previous windows. An example is introduced to explain how this algorithm works. We also show the efficiency and the effectiveness of the proposed algorithm on a synthetic dataset and prove how it is suited for data streams environment. We showed experimentally that the proposed algorithm is more efficient than the PrefixSpan algorithm for patterns with any support less than 30% for CPU time and with any support less than 60% for memory usageen_US
dc.identifier.citationH. M. Hijawi, M. H. Saheb, Vol 8, No 3, August 2015, (2015),Sequence Pattern Mining in Data Streams, Computer and Information Science ,ISSN 1913-8989 (Print) ISSN 1913-8997 (Online), DOI: 10.5539/cis.v8n3p64, http://ccsenet.org/journal/index.php/cis/article/view/48654en_US
dc.identifier.issn913-8989
dc.identifier.issn1913-8997
dc.identifier.urihttp://ccsenet.org/journal/index.php/cis/article/view/48654
dc.identifier.urihttp://localhost:8080/xmlui/handle/123456789/7940
dc.language.isoenen_US
dc.publisherComputer and Information Scienceen_US
dc.relation.ispartofseriescis.v8n3;64
dc.subjectsequential patterns miningen_US
dc.subjectdata streamsen_US
dc.subjectsequence miningen_US
dc.subjectsequence treeen_US
dc.titleSequence Pattern Mining in Data Streamsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
48654-178324-1-PB.pdf
Size:
533.93 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: