A Character Based Steganography Using Masked Language Modeling

Date

2024

Journal Title

Journal ISSN

Volume Title

Publisher

IEEE - Institute of Electrical and Electronics Engineers Inc.

Access Rights

info:eu-repo/semantics/openAccess

Abstract

In this study, a steganography method based on the BERT transformer model is proposed for hiding text data in a cover text. The aim is to hide information by replacing specific words within the text using BERT's masked language modeling (MLM) feature. Two models, fine-tuned for English and Turkish respectively, are used to perform steganography on texts in these languages; furthermore, the proposed method can work with any transformer model that supports masked language modeling. While the amount of information that can traditionally be hidden in text is often limited, the proposed method allows a significant amount of data to be hidden without distorting the text's meaning. The method is tested by hiding secret texts of varying lengths in cover texts of different lengths in two language scenarios. The test results are analyzed in terms of perplexity, KL divergence, and semantic similarity. Compared with other methods in the literature, the proposed method achieves the best results, with a KL divergence of 7.93 and a semantic similarity of 0.99, indicating low detectability and success in the data-hiding process.
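The abstract describes hiding bits by substituting words that a masked language model predicts for masked positions. The following is a minimal, hedged sketch of one common way such a scheme can work (rank-based encoding over the model's top-k candidates); it is not the authors' exact algorithm. The `slots` lists below stand in for BERT's top-k MLM predictions, which in a real implementation would come from a fine-tuned model.

```python
def encode_bits(bits, candidates_per_slot, bits_per_slot=2):
    """Pick one candidate per masked slot whose rank encodes the next bit chunk."""
    chosen, i = [], 0
    for cands in candidates_per_slot:
        chunk = bits[i:i + bits_per_slot].ljust(bits_per_slot, "0")
        chosen.append(cands[int(chunk, 2)])  # candidate rank = bit chunk value
        i += bits_per_slot
    return chosen

def decode_bits(chosen, candidates_per_slot, bits_per_slot=2):
    """Recover the bit string from the ranks of the chosen candidates."""
    return "".join(format(cands.index(word), f"0{bits_per_slot}b")
                   for word, cands in zip(chosen, candidates_per_slot))

# Hypothetical example: two masked slots, top-4 candidates each (2 bits/slot).
slots = [["quick", "fast", "swift", "rapid"],
         ["dog", "fox", "cat", "hare"]]
secret = "1001"
stego_words = encode_bits(secret, slots)       # → ["swift", "fox"]
assert decode_bits(stego_words, slots) == secret
```

Both sender and receiver must derive identical candidate lists (same model, same masking positions) for decoding to succeed, which is why the paper's method ties the scheme to a specific fine-tuned BERT model per language.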

Description

Keywords

Steganography, Transformers, Indexes, Media, Predictive Models, Hash Functions, Data Models, Bit Error Rate, Natural Language Processing, BERT, Masked Language Modeling, Linguistic Steganography, Text Steganography, Synonym Substitution

Source

IEEE Access

WoS Q Value

N/A

Scopus Q Value

Q1

Volume

12

Issue

Citation