Abstract

Brain imaging genetics aims to uncover associations between genetic markers and neuroimaging quantitative traits. Sparse canonical correlation analysis (SCCA) can discover bi-multivariate associations and select relevant features, and is becoming popular in imaging genetics studies. The ℓ1-norm function is not only convex but also singular at the origin, which is a necessary condition for sparsity. Thus most SCCA methods impose the ℓ1-norm on individual features or on structured groups of features to pursue the corresponding sparsity. However, the ℓ1-norm penalty over-penalizes large coefficients and may incur estimation bias. A number of non-convex penalties have been proposed to reduce this estimation bias in regression tasks, but their use in SCCA remains largely unexplored. In this paper, we design a unified non-convex SCCA model, based on seven non-convex penalty functions, to achieve unbiased estimation and stable feature selection simultaneously. We also propose an efficient optimization algorithm. The proposed method obtains both higher correlation coefficients and better canonical loading patterns. In particular, the SCCA methods with non-convex penalties discover a strong association between the APOE e4 SNP rs429358 and the hippocampus region of the brain. Both are Alzheimer’s disease related biomarkers, indicating the potential and power of the non-convex methods in brain imaging genetics.

Highlights

  • The CCA technique was introduced several decades ago [24]

  • The L1-S2CCA and Smoothly Clipped Absolute Deviation (SCAD) methods identify a weak signal from the parahippocampal gyrus, which was previously reported as an early biomarker of AD [54]

  • We propose a unified non-convex sparse canonical correlation analysis (SCCA) model and an efficient optimization algorithm based on a family of non-convex penalty functions

Introduction

The CCA technique was introduced several decades ago [24]. However, CCA performs well only when the number of observations is larger than the combined number of features of the two views. To induce sparsity while reducing estimation bias, a number of non-convex penalties have been proposed. These include the γ-norm (0 < γ < 1) penalty [42], the Geman penalty [43], the Smoothly Clipped Absolute Deviation (SCAD) penalty [38], the Laplace penalty [44], the Minimax Concave Penalty (MCP) [45], the Exponential-Type Penalty (ETP) [46], and the Logarithm penalty [47]. These non-convex functions have attractive theoretical properties, as they are all singular at the origin and leave larger coefficients unpenalized. Although they have achieved great success in generalized linear models (GLMs), applying them to SCCA models to achieve sparsity and unbiased estimation simultaneously remains an unexplored topic.
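To make the "unpenalized large coefficients" property concrete, the following is a minimal NumPy sketch of two of the penalties named above, SCAD and MCP, using their standard textbook definitions (Fan & Li's SCAD with the conventional a = 3.7, and Zhang's MCP with a concavity parameter γ). The parameter defaults are illustrative choices, not values taken from this paper; the paper's own SCCA formulation is not reproduced here.

```python
import numpy as np

def scad_penalty(t, lam=1.0, a=3.7):
    """SCAD penalty, applied elementwise to |t|.

    Grows like lam*|t| near the origin (singular at 0, so it induces
    sparsity), then flattens: for |t| > a*lam the penalty is constant,
    so large coefficients are left unpenalized.
    """
    t = np.abs(np.asarray(t, dtype=float))
    small = t <= lam
    mid = (t > lam) & (t <= a * lam)
    return np.where(small, lam * t,
           np.where(mid,
                    (2 * a * lam * t - t**2 - lam**2) / (2 * (a - 1)),
                    lam**2 * (a + 1) / 2))

def mcp_penalty(t, lam=1.0, gamma=3.0):
    """Minimax Concave Penalty (MCP), applied elementwise to |t|.

    Quadratically tapers from lam*|t| down to a constant gamma*lam^2/2
    once |t| exceeds gamma*lam, again leaving large coefficients alone.
    """
    t = np.abs(np.asarray(t, dtype=float))
    return np.where(t <= gamma * lam,
                    lam * t - t**2 / (2 * gamma),
                    gamma * lam**2 / 2)
```

For a small coefficient (|t| = 0.5, λ = 1) SCAD matches the ℓ1 penalty exactly (0.5), while for a large one (|t| = 10) SCAD charges only the constant λ²(a+1)/2 = 2.35 versus 10 under ℓ1, which is the source of the reduced estimation bias discussed above.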
