A corpus of Australian contract language: description, profiling and analysis

ICAIL '11: Proceedings of the 13th International Conference on Artificial Intelligence and Law (2011)

Abstract
Written contracts are a fundamental framework for economic and cooperative transactions in society. Little work has been reported on the application of natural language processing or corpus linguistics to contracts. In this paper we report the design, profiling and initial analysis of a corpus of Australian contract language. This corpus enables a quantitative and qualitative characterisation of Australian contract language as an input to the development of contract drafting tools. Profiling of the corpus is consistent with its suitability for use in language engineering applications. We provide descriptive statistics for the corpus and show that document length and document vocabulary size are approximately log-normally distributed. The corpus conforms to Zipf's law, and comparative type-to-token ratios are consistent with lower term sparsity (an expectation for legal language). We highlight distinctive term usage in Australian contract language. Results derived from the corpus indicate greater prepositional phrase depth in sentences from contract rules extracted from the corpus than in other corpora.
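The profiling measures named in the abstract (type-to-token ratio and conformance to Zipf's law) can be illustrated with a minimal Python sketch. This is not the authors' method or tooling: the function names `tokenize`, `type_token_ratio` and `zipf_slope`, the regex tokenizer, and the sample sentence are all assumptions made for illustration only.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep runs of ASCII letters as tokens.

    This tokenizer is a simplifying assumption, not the paper's.
    """
    return re.findall(r"[a-z]+", text.lower())


def type_token_ratio(tokens):
    """Number of distinct word types divided by the number of tokens."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def zipf_slope(tokens):
    """Least-squares slope of log(frequency) against log(rank).

    A slope close to -1 is the pattern usually described as Zipfian.
    """
    freqs = sorted(Counter(tokens).values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct word types")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(freq) for freq in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return cov / var


# Hypothetical sample sentence, not taken from the corpus.
sample = ("The supplier shall deliver the goods to the buyer "
          "and the buyer shall pay the supplier")
tokens = tokenize(sample)
print(type_token_ratio(tokens))  # 9 types over 16 tokens
print(zipf_slope(tokens))
```

A lower type-to-token ratio on samples of equal size means that fewer distinct terms account for the same amount of text, which is the sense of "lower term sparsity" in the abstract. A single sentence is far too short for a meaningful Zipf fit, so the slope here only demonstrates the calculation.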
Keywords
contract rule, distinctive term usage, Australian contract language, natural language processing, document vocabulary size, legal language, written contract, corpus linguistics, language engineering application, document length, ontologies, Bayesian belief networks, BATNA, log normal distribution, OWL