Bridging Big Data: Procedures for Combining Non-equivalent Cognitive Measures from the ENIGMA Consortium

Eamonn Kennedy, Shashank Vadlamani,Hannah M Lindsey,Pui-Wa Lei,Mary Jo Pugh,Maheen Adamson,Martin Alda,Silvia Alonso-Lana,Sonia Ambrogi,Tim J Anderson,Celso Arango,Robert Asarnow,Mihai Avram,Rosa Ayesa-Arriola,Talin Babikian,Nerisa Banaj,Laura J Bird,Stefan Borgwardt,Amy Brodtmann,Katharina Brosch,Karen Caeyenberghs,Vince D Calhoun,Nancy D Chiaravalloti,David X Cifu,Benedicto Crespo-Facorro,John C Dalrymple-Alford,Kristen Dams-O'Connor,Udo Dannlowski, David Darby,Nicholas Davenport,John DeLuca,Covadonga M Diaz-Caneja,Seth G Disner,Ekaterina Dobryakova,Stefan Ehrlich,Carrie Esopenko,Fabio Ferrarelli,Lea E Frank,Carol Franz,Paola Fuentes-Claramonte,Helen Genova,Christopher C Giza,Janik Goltermann,Dominik Grotegerd,Marius Gruber,Alfonso Gutierrez-Zotes,Minji Ha,Jan Haavik,Charles Hinkin,Kristen R Hoskinson,Daniela Hubl,Andrei Irimia,Andreas Jansen,Michael Kaess,Xiaojian Kang,Kimbra Kenney,Barbora Kerkova,Mohamed Salah Khlif,Minah Kim,Jochen Kindler,Tilo Kircher,Karolina Knizkova,Knut K Kolskar,Denise Krch,William S Kremen,Taylor Kuhn,Veena Kumari,Jun Soo Kwon, Roberto Langella,Sarah Laskowitz, Jungha Lee,Jean Lengenfelder,Spencer W. Liebel,Victoria Liou-Johnson,Sara M Lippa,Marianne Lovstad,Astri J Lundervold,Cassandra Marotta,Craig A Marquardt,Paulo Mattos,Ahmad Mayeli,Carrie R McDonald,Susanne Meinert,Tracy R Melzer,Jessica Merchan-Naranjo,Chantal Michel,Rajendra A Morey,Benson Mwangi,Daniel J Myall,Igor Nenadi_,Mary R Newsome,Abraham Nunes,Terence O'Brien,Viola Oertel,John Ollinger,Alexander Olsen,Victor Ortiz Garcia de la Foz,Mustafa Ozmen,Heath Pardoe,Marise Parent,Fabrizio Piras,Federica Piras,Edith Pomarol-Clotet,Jonathan Repple,Genevieve Richard, Jonathan Rodriguez, Mabel Rodriguez,Kelly Rootes-Murdy,Jared Rowland,Nicholas P Ryan,Raymond Salvador,Anne-Marthe Sanders,Andre Schmidt,Jair C Soares,Gianfranco Spalletta,Filip _paniel,Alena Stasenko,Frederike Stein,Benjamin Straube,April Thames,Florian Thomas-Odenthal,Sophia I Thomopoulos,Erin Tone,Ivan Torres,Maya Troyanskaya,Jessica A Turner,Kristine M Ulrichsen, Guillermo Umpierrez,Elisabet Vilella,Lucy Vivash,William C Walker,Emilio Werden,Lars T Westlye, Krista Wild,Adrian Wroblewski,Mon-Ju Wu, Glenn R Wylie,Lakshmi N Yatham,Giovana B Zunta-Soares,Paul M Thompson,David F Tate,Frank G Hillary,Emily L Dennis,Elisabeth A Wilde

biorxiv(2023)

引用 0|浏览50
暂无评分
摘要
Investigators in the cognitive neurosciences have turned to Big Data to address persistent replication and reliability issues by increasing sample sizes, statistical power, and representativeness of data. While there is tremendous potential to advance science through open data sharing, these efforts unveil a host of new questions about how to integrate data arising from distinct sources and instruments. We focus on the most frequently assessed area of cognition - memory testing - and demonstrate a process for reliable data harmonization across three common measures. We aggregated raw data from 53 studies from around the world which measured at least one of three distinct verbal learning tasks, totaling N = 10,505 healthy and brain-injured individuals. A mega analysis was conducted using empirical bayes harmonization to isolate and remove site effects, followed by linear models which adjusted for common covariates. After corrections, a continuous item response theory (IRT) model estimated each individual subjects latent verbal learning ability while accounting for item difficulties. Harmonization significantly reduced inter-site variance by 37% while preserving covariate effects. The effects of age, sex, and education on scores were found to be highly consistent across memory tests. IRT methods for equating scores across AVLTs agreed with held-out data of dually-administered tests, and these tools are made available for free online. This work demonstrates that large-scale data sharing and harmonization initiatives can offer opportunities to address reproducibility and integration challenges across the behavioral sciences. ### Competing Interest Statement Dr. Arango has been a consultant to or has received honoraria or grants from Acadia, Angelini, Biogen, Boehringer, Gedeon Richter, Janssen Cilag, Lundbeck, Medscape, Menarini, Minerva, Otsuka, Pfizer, Roche, Sage, Servier, Shire, Schering Plough, Sumitomo Dainippon Pharma, Sunovion and Takeda. Dr. Brodtmann serves on the editorial boards of Neurology and International Journal of Stroke. Dr. Diaz-Caneja has received honoraria from Exeltis and Angelinii. Dr. Giza: consultant for NBA, NFL, NHLPA, Los Angeles Lakers; Advisory Board: Highmark Interactive, Novartis, MLS, NBA, USSF; Medicolegal 1-2 cases annually. Dr. Soares: ALKERMES (Research Grant), ALLERGAN (Research Grant), ASOFARMA (Consultant), ATAI (Stock), BOEHRINGER Ingelheim (Consultant), COMPASS (Research Grant), JOHNSON & JOHNSON (Consultant), LIVANOVA (Consultant), PFIZER (Consultant), PULVINAR NEURO LLC (Consultant), RELMADA (Consultant), SANOFI (Consultant), SUNOVIAN (Consultant). Dr. Thompson received partial research support from Biogen, Inc., for research unrelated to this manuscript. Dr. Yatham has been on speaker or advisory boards for, or has received research grants from, Alkermes, Abbvie, Canadian Institutes of Health Research, Sumitomo Dainippon Pharma, GlaxoSmithKline, Intracellular Therapies, Merck, Sanofi, Sequiris, Servier, and Sunovion, over the past 3 years, all outside this work. The collection of this cohort was partially supported by an investigator-initiated research grant from Biogen (US). Biogen had no role in the analysis or writing of this manuscript. Eisai (JP) and Life Molecular Imaging for research unrelated to this manuscript. Dr. Wylie has received research support from the NJ Commission for brain injury research, from the Dept of Veterans' Affairs, from Biogen, from Bristol, Myers, Squibb, from Genetech, and has served on advisory boards for the CDMRP and the VA. All of these activities are unrelated to this research. The views expressed in this article are those of the author(s) and do not reflect the official policy of the Department of Army/Navy/Air Force, Department of Defense, or U.S. Government.
更多
查看译文
关键词
measures,big data,enigma consortium,non-equivalent
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要