Devulgarization of Polish Texts Using Pre-trained Language Models

Computational Science – ICCS 2022(2022)

Cited 2|Views7
No score
Abstract
We propose a text style transfer method for replacing vulgar expressions in Polish utterances with their non-vulgar equivalents while preserving the meaning of the text. We fine-tune three pre-trained language models on a newly created parallel corpus of vulgar/non-vulgar sentence pairs, then we evaluate style transfer accuracy, content preservation and language quality. To the best of our knowledge, the proposed solution is the first of its kind for Polish.
More
Translated text
Key words
Text style transfer,Removing obscenities,Transformer
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined