Automated population of an i2b2 clinical data warehouse from an openEHR-based data repository.

Journal of Biomedical Informatics(2016)

引用 54|浏览40
暂无评分
摘要
Display Omitted We investigate the feasibility of i2b2 to assist secondary use of openEHR data.We present an import tool to automatically populate i2b2 from an openEHR data repository.We describe data representation, querying, performance and limitations of our approach. BackgroundDetailed Clinical Model (DCM) approaches have recently seen wider adoption. More specifically, openEHR-based application systems are now used in production in several countries, serving diverse fields of application such as health information exchange, clinical registries and electronic medical record systems. However, approaches to efficiently provide openEHR data to researchers for secondary use have not yet been investigated or established. MethodsWe developed an approach to automatically load openEHR data instances into the open source clinical data warehouse i2b2. We evaluated query capabilities and the performance of this approach in the context of the Hanover Medical School Translational Research Framework (HaMSTR), an openEHR-based data repository. ResultsAutomated creation of i2b2 ontologies from archetypes and templates and the integration of openEHR data instances from 903 patients of a paediatric intensive care unit has been achieved. In total, it took an average of 2527s to create 2.311.624 facts from 141.917 XML documents. Using the imported data, we conducted sample queries to compare the performance with two openEHR systems and to investigate if this representation of data is feasible to support cohort identification and record level data extraction. DiscussionWe found the automated population of an i2b2 clinical data warehouse to be a feasible approach to make openEHR data instances available for secondary use. Such an approach can facilitate timely provision of clinical data to researchers. It complements analytics based on the Archetype Query Language by allowing querying on both, legacy clinical data sources and openEHR data instances at the same time and by providing an easy-to-use query interface. However, due to different levels of expressiveness in the data models, not all semantics could be preserved during the ETL process.
更多
查看译文
关键词
Archetypes,Clinical data repository,Clinical information systems,Data warehouse,Detailed clinical models,Healthcare analytics,Secondary use,i2b2,openEHR
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要