A method to improve full-text search performance of MongoDB MongoDB'nin tam metin arama performans?n? iyile?tirme y?ntemi

dc.authoridÖztürk, Emir/0000-0002-3734-5171
dc.authorwosidÖztürk, Emir/Z-1726-2018
dc.contributor.authorMesut, Altan
dc.contributor.authorOzturk, Emir
dc.date.accessioned2024-06-12T11:09:26Z
dc.date.available2024-06-12T11:09:26Z
dc.date.issued2022
dc.departmentTrakya Üniversitesien_US
dc.description.abstractB-Tree based text indexes used in MongoDB are slow compared to different structures such as inverted indexes. In this study, it has been shown that the full-text search speed can be increased significantly by indexing a structure in which each different word in the text is included only once. The Multi-Stream Word-Based Compression Algorithm (MWCA), developed in our previous work, stores word dictionaries and data in different streams. While adding the documents to a MongoDB collection, they were encoded with MWCA and separated into six different streams. Each stream was stored in a different field, and three of them containing unique words were used when creating a text index. In this way, the index could be created in a shorter time and took up less space. It was also seen that Snappy and Zlib block compression methods used by MongoDB reached higher compression ratios on data encoded with MWCA. Search tests on text indexes created on collections using different compression options shows that our method provides 19 to 146 times speed increase and 34% to 40% less memory usage. Tests on regex searches that do not use the text index also shows that the MWCA model provides 7 to 13 times speed increase and 29% to 34% less memory usage.en_US
dc.identifier.doi10.5505/pajes.2021.89590
dc.identifier.endpage729en_US
dc.identifier.issn1300-7009
dc.identifier.issn2147-5881
dc.identifier.issue5en_US
dc.identifier.startpage720en_US
dc.identifier.trdizinid1134587en_US
dc.identifier.urihttps://doi.org/10.5505/pajes.2021.89590
dc.identifier.urihttps://search.trdizin.gov.tr/yayin/detay/1134587
dc.identifier.urihttps://hdl.handle.net/20.500.14551/22789
dc.identifier.volume28en_US
dc.identifier.wosWOS:000875336200011en_US
dc.identifier.wosqualityN/Aen_US
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakTR-Dizinen_US
dc.language.isoenen_US
dc.publisherPamukkale Univen_US
dc.relation.ispartofPamukkale University Journal Of Engineering Sciences-Pamukkale Universitesi Muhendislik Bilimleri Dergisien_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectNosqlen_US
dc.subjectMongodben_US
dc.subjectText Indexen_US
dc.subjectFull-Text Searchen_US
dc.subjectMWCAen_US
dc.titleA method to improve full-text search performance of MongoDB MongoDB'nin tam metin arama performans?n? iyile?tirme y?ntemien_US
dc.typeArticleen_US

Dosyalar