Segmentation of genomic data through multivariate statistical approaches: comparative analysis

Arfa Anjum,Arpan Bhowmik,Dwijesh Chandra Mishra,Eldho Varghese,Anil Rai,Seema Jaggi,Shwetank Lall

doi:10.56093/ijas.v92i7.118040

Abstract

Segmenting a series of measurements along a genome into regions with distinct characteristics is widely used toidentify functional components of a genome. The majority of the research on biological data segmentation focuses on the statistical problem of identifying break or change-points in a simulated scenario using a single variable. Despite the fact that various strategies for finding change-points in a multivariate setup through simulation are available, work on segmenting actual multivariate genomic data is limited. This is due to the fact that genomic data is huge in size and contains a lot of variation within it. Therefore, a study was carried out at the ICAR-Indian Agricultural Statistics Research Institute, New Delhi during 2021 to know the best multivariate statistical method to segment the sequences which may influence the properties or function of a sequence into homogeneous segments. This will reduce the volume of data and ease the analysis of these segments further to know the actual properties of these segments. The genomic data of Rice (Oryza sativa L.) was considered for the comparative analysis of several multivariate approaches and was found that agglomerative sequential clustering was the most acceptable due to its low computational cost and feasibility.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Segmentation of genomic data through multivariate statistical approaches: comparative analysis

Abstract

Talk to us

Similar Papers

More From: The Indian Journal of Agricultural Sciences

Lead the way for us

Journal: The Indian Journal of Agricultural Sciences	Publication Date: Mar 30, 2022
License type: cc-by-nc-sa

Similar Papers

Assessment of yield stability in sorghum using univariate and multivariate statistical approaches
Asfaw Adugna
Hereditas | VOL. 145
Asfaw AdugnaAsfaw Adugna
25 Apr 2008
Hereditas | VOL. 145

Screening of prevailing processes that drive surface water quality of running waters in a cultivated wetland region of Germany — A multivariate approach
Sebastian Maassen ... Oliver Gabriel
Science of the Total Environment | VOL. 438
Sebastian Maassen, et. al.Sebastian Maassen ... Oliver Gabriel
18 Sep 2012
Science of the Total Environment | VOL. 438

Soil sealing footprint as an indicator of dispersed urban growth: a multivariate statistics approach
Ilaria Tombolini ... Luca Salvati
Urban Research & Practice | VOL. 9
Ilaria Tombolini, et. al.Ilaria Tombolini ... Luca Salvati
29 Apr 2015
Urban Research & Practice | VOL. 9

Effect of Environmental Factors on the Chemical Weathering of Plagioclase in Hawaiian Basalt
Steven J Gordon
Physical Geography | VOL. 26
Steven J GordonSteven J Gordon
01 Jan 2004
Physical Geography | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Segmentation of genomic data through multivariate statistical approaches: comparative analysis

Abstract

Talk to us

Similar Papers

More From: The Indian Journal of Agricultural Sciences