A Character Based Steganography Using Masked Language Modeling

dc.authoridÖztürk, Emir/0000-0002-3734-5171
dc.authoridAydin, Ozlem/0000-0002-6401-4183
dc.authorwosidÖztürk, Emir/Z-1726-2018
dc.contributor.authorOzturk, Emir
dc.contributor.authorMesut, Andac Sahin
dc.contributor.authorFidan, Ozlem Aydin
dc.date.accessioned2024-06-12T11:15:39Z
dc.date.available2024-06-12T11:15:39Z
dc.date.issued2024
dc.departmentTrakya Üniversitesien_US
dc.description.abstractIn this study, a steganography method based on BERT transformer model is proposed for hiding text data in cover text. The aim is to hide information by replacing specific words within the text using BERT's masked language modeling (MLM) feature. In this study, two models, fine-tuned for English and Turkish, are utilized to perform steganography on texts belonging to these languages. Furthermore, the proposed method can work with any transformer model that supports masked language modeling. While traditionally the hidden information in text is often limited, the proposed method allows for a significant amount of data to be hidden in the text without distorting its meaning. In this study, the proposed method is tested by hiding stego texts of varying lengths in cover text of different lengths in two different language scenarios. The test results are analyzed in terms of perplexity, KL divergence and semantic similarity. Upon examining the results, the proposed method has achieved the best results compared to other methods found in the literature, with KL divergence of 7.93 and semantic similarity of 0.99. It can be observed that the proposed method has low detectability and demonstrates success in the data hiding process.en_US
dc.identifier.doi10.1109/ACCESS.2024.3354710
dc.identifier.endpage14259en_US
dc.identifier.issn2169-3536
dc.identifier.scopus2-s2.0-85182940815en_US
dc.identifier.scopusqualityQ1en_US
dc.identifier.startpage14248en_US
dc.identifier.urihttps://doi.org/10.1109/ACCESS.2024.3354710
dc.identifier.urihttps://hdl.handle.net/20.500.14551/24007
dc.identifier.volume12en_US
dc.identifier.wosWOS:001157958300001en_US
dc.identifier.wosqualityN/Aen_US
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakScopusen_US
dc.language.isoenen_US
dc.publisherIEEE-Inst Electrical Electronics Engineers Incen_US
dc.relation.ispartofIeee Accessen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectSteganographyen_US
dc.subjectTransformersen_US
dc.subjectIndexesen_US
dc.subjectMediaen_US
dc.subjectPredictive Modelsen_US
dc.subjectHash Functionsen_US
dc.subjectData Modelsen_US
dc.subjectBit Error Rateen_US
dc.subjectNatural Language Processingen_US
dc.subjectBERTen_US
dc.subjectMasked Language Modelingen_US
dc.subjectSteganographyen_US
dc.subjectLinguistic Steganographyen_US
dc.subjectText Steganographyen_US
dc.subjectSynonym Substitutionen_US
dc.titleA Character Based Steganography Using Masked Language Modelingen_US
dc.typeArticleen_US

Dosyalar