A model-based clustering algorithm with covariates adjustment and its application to lung cancer stratification.

Carlos E M Relvas,Noriko Gotoh,Asuka Nakata,Guoan Chen,David G Beer,Andre Fujita

doi:10.1142/s0219720023500191

Abstract

Usually, the clustering process is the first step in several data analyses. Clustering allows identify patterns we did not note before and helps raise new hypotheses. However, one challenge when analyzing empirical data is the presence of covariates, which may mask the obtained clustering structure. For example, suppose we are interested in clustering a set of individuals into controls and cancer patients. A clustering algorithm could group subjects into young and elderly in this case. It may happen because the age at diagnosis is associated with cancer. Thus, we developed CEM-Co, a model-based clustering algorithm that removes/minimizes undesirable covariates' effects during the clustering process. We applied CEM-Co on a gene expression dataset composed of 129 stage I non-small cell lung cancer patients. As a result, we identified a subgroup with a poorer prognosis, while standard clustering algorithms failed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A model-based clustering algorithm with covariates adjustment and its application to lung cancer stratification.

Abstract

Talk to us

Similar Papers

More From: Journal of bioinformatics and computational biology

Lead the way for us

Similar Papers

Abstract 599: Investigation of PD-L1 expression in circulating tumor cells isolated using the Parsortix system in metastatic lung and breast cancer patients
Mariacristina Ciccioli ... Ofure Alenkhe
Cancer Research | VOL. 81
Mariacristina Ciccioli, et. al.Mariacristina Ciccioli ... Ofure Alenkhe
01 Jul 2021
Cancer Research | VOL. 81

Abstract 634: Combination chemotherapy significantly reduces indoleamine 2,3-dioxygenase activity in NSCLC patients
Cara C Schafer ... Anandi Sawant
Cancer Research | VOL. 74
Cara C Schafer, et. al.Cara C Schafer ... Anandi Sawant
30 Sep 2014
Abstract 634: Combination chemotherapy significantly reduces indoleamine 2,3-dioxygenase activity in NSCLC patients
Cara C Schafer ... Anandi Sawant

Surgical Assessment and Intraoperative Management of Mediastinal Lymph Nodes in Non-Small Cell Lung Cancer
Bryan A Whitson ... Michael A Maddaus
The Annals of Thoracic Surgery | VOL. 84
Bryan A Whitson, et. al.Bryan A Whitson ... Michael A Maddaus
23 Aug 2007
The Annals of Thoracic Surgery | VOL. 84

Normal protein anabolic response to hyperaminoacidemia in insulin-resistant patients with lung cancer cachexia
Aaron Winter ... Stéphanie Chevalier
Clinical Nutrition | VOL. 31
Aaron Winter, et. al.Aaron Winter ... Stéphanie Chevalier
29 May 2012
Clinical Nutrition | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A model-based clustering algorithm with covariates adjustment and its application to lung cancer stratification.

Abstract

Talk to us

Similar Papers

More From: Journal of bioinformatics and computational biology