Automated patent classification for crop protection via domain adaptation

Applied AI Letters (2023)

Patents show how technology evolves over time in most scientific fields. Making the best use of this valuable knowledge base requires efficient and effective information retrieval and search for related prior art. Patent classification, that is, assigning a patent to one or more predefined categories, is a fundamental step towards synthesizing the information content of an invention. To this end, Transformer‐based architectures, especially those derived from the BERT family, have already been proposed in the literature and have set a new state of the art for the classification task. Here, we study how domain adaptation can push the performance boundaries in patent classification by implementing and rigorously evaluating a collection of recent transfer learning techniques, for example, domain‐adaptive pretraining and adapters. Our analysis shows how leveraging these advancements enables the development of state‐of‐the‐art models with increased precision, recall, and F1‐score. We base our evaluation on both standard patent classification datasets derived from patent‐office‐defined code hierarchies and more practical real‐world use‐case scenarios containing labels from the agrochemical industrial domain. The application of these domain‐adaptation techniques to patent classification in a multilingual setting is also examined and evaluated.
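Because a patent can be assigned to one or more categories, precision, recall, and F1‐score are typically computed over sets of labels per document. The following is a minimal sketch of micro‐averaged scoring, assuming each patent carries a set of classification codes; the function name `micro_prf`, the averaging scheme, and the example labels are illustrative assumptions and are not taken from the paper.

```python
from typing import List, Set, Tuple


def micro_prf(gold: List[Set[str]], pred: List[Set[str]]) -> Tuple[float, float, float]:
    """Micro-averaged precision, recall, and F1 for multi-label classification.

    gold[i] and pred[i] are the true and predicted label sets of patent i.
    Counts are pooled over all patents before the ratios are taken.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same number of documents")

    tp = sum(len(g & p) for g, p in zip(gold, pred))  # labels correctly predicted
    fp = sum(len(p - g) for g, p in zip(gold, pred))  # predicted but not true
    fn = sum(len(g - p) for g, p in zip(gold, pred))  # true but not predicted

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


# Hypothetical example: three patents with illustrative classification codes.
gold = [{"A01N", "C07D"}, {"A01N"}, {"C12N"}]
pred = [{"A01N"}, {"A01N", "C07D"}, {"C12N"}]
precision, recall, f1 = micro_prf(gold, pred)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Micro‐averaging weights every label assignment equally, so frequent categories dominate the score; macro‐averaging per category would be an alternative when rare categories matter as much as common ones.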
Key words
patent classification, domain adaptation, crop protection