Dictionary Speech, Phraseological Mold and Text Corpus
LANGAGES(2022)
摘要
Starting from the computerized dictionary Le Tresor de la langue francaise informatise as lexicographic corpus, we shall automatically retrieve transparent phraseologisms which have indicators of varying origins, and to extract phraseological molds from them. Then, the dictionary data will be used again to automatically generate phraseologism candidates according to semantic criteria. Finally, the generated phraseologisms will be projected into a large corpus for validation. The objective of our experience is to develop a method and a tool to, on the one hand, simulate the human process in linking dictionary data, and on the other hand, to model the different types of dictionary content in automatic recognition and generation of phraseologisms. This would highlight the role that phraseologisms play in linking semantically lexical units.
更多查看译文
关键词
phraseologism, dictionary, corpus, simulation, semantic networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要