Planning spelling normalization: typology of graphic variation in the Parish Memoirs (1758) corpus

LABORHISTORICO(2023)

引用 0|浏览1
暂无评分
摘要
Digital Humanities are now essential for studies on large-scale textual corpora, where the transformation of text into processable data regarding linguistic phenomena requires a multidisciplinary treatment. In this article we will present an approach in Digital Humanities, which was applied to a Portuguese textual corpus from the 18th-century, gathered from a set of documents known as Memorias Paroquiais ["The Parish Memoirs"], with high historical and heritage value. We will highlight some corpus constitution characteristics, questions concerning the expressive spelling variation perceived in the texts. We propose a typology towards a future automatic normalization of this textual corpus.
更多
查看译文
关键词
Digital Humanities,Disciplinary frontiers,Portuguese,18th-century,Linguistic variation,Memorias Paroquiais
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要