A new compression algorithm for fast text search

dc.authoridMesut, Altan/0000-0002-1477-3093
dc.authorwosidMesut, Altan/AAE-8734-2019
dc.contributor.authorCarus, Aydin
dc.contributor.authorMesut, Altan
dc.date.accessioned2024-06-12T10:58:19Z
dc.date.available2024-06-12T10:58:19Z
dc.date.issued2016
dc.departmentTrakya Üniversitesien_US
dc.description.abstractWe propose a new compression algorithm that compresses plain texts by using a dictionary-based model and a compressed string-matching approach that can be used with the compressed texts produced by this algorithm. The compression algorithm (CAFTS) can reduce the size of the texts to approximately 41% of their original sizes. The presented compressed string matching approach (SoCAFTS), which can be used with any of the known pattern matching algorithms, is compared with a powerful compressed string matching algorithm (ETDC) and a compressed string-matching tool (Lzgrep). Although the search speed of ETDC is very good in short patterns, it can only search for exact words and its compression performance differs from one natural language to another because of its word-based structure. Our experimental results show that SoCAFTS is a good solution when it is necessary to search for long patterns in a compressed document.en_US
dc.identifier.doi10.3906/elk-1407-178
dc.identifier.endpage4367en_US
dc.identifier.issn1300-0632
dc.identifier.issn1303-6203
dc.identifier.issue5en_US
dc.identifier.scopus2-s2.0-84978221338en_US
dc.identifier.scopusqualityQ3en_US
dc.identifier.startpage4355en_US
dc.identifier.trdizinid247519en_US
dc.identifier.urihttps://doi.org/10.3906/elk-1407-178
dc.identifier.urihttps://search.trdizin.gov.tr/yayin/detay/247519
dc.identifier.urihttps://hdl.handle.net/20.500.14551/20008
dc.identifier.volume24en_US
dc.identifier.wosWOS:000378097800076en_US
dc.identifier.wosqualityQ4en_US
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakScopusen_US
dc.indekslendigikaynakTR-Dizinen_US
dc.language.isoenen_US
dc.publisherTubitak Scientific & Technological Research Council Turkeyen_US
dc.relation.ispartofTurkish Journal Of Electrical Engineering And Computer Sciencesen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectCompressed String Matchingen_US
dc.subjectText Compressionen_US
dc.subjectDictionary-Based Compressionen_US
dc.subjectExact Pattern Matchingen_US
dc.subjectCAFTSen_US
dc.subjectOracleen_US
dc.titleA new compression algorithm for fast text searchen_US
dc.typeArticleen_US

Dosyalar