A parametrized approach for linear regression of interval data

Leandro C Souza, Renata M.C.R Souza, Getúlio J.A Amaral, Telmo M Silva Filho

doi:10.1016/j.knosys.2017.06.012

Abstract

Interval symbolic data is a complex data type that can often be obtained by summarizing large datasets. All existing linear regression approaches for interval data use certain fixed reference points to model intervals, such as midpoints, ranges and lower and upper bounds. This is a limitation, because different datasets might be better represented by different reference points. In this paper, we propose a new method for extracting knowledge from interval data. Our parametrized approach automatically extracts the best reference points from the regressor variables. These reference points are then used to build two linear regressions: one for the lower bounds of the response variable and another for its upper bounds. Before the regressions are applied, we compute a criterion to verify the mathematical coherence of predicted values. Mathematical coherence means that the upper bounds are greater than the lower bounds. If the criterion shows that the coherence is not guaranteed, we suggest the use of a novel interval Box-Cox transformation of the response variable. Experimental evaluations with synthetic and real interval datasets illustrate the advantages and the usefulness of the proposed method to perform interval linear regression.
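The abstract does not give the estimation details, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes each response bound is regressed by ordinary least squares on both bounds of every regressor, so that the fitted weights implicitly pick a reference point inside each interval. The coherence check shown here is a simple post-hoc comparison of predicted bounds. It stands in for the paper's criterion, whose exact form is not stated in the abstract. The function names `fit_bounds`, `predict_bounds` and `is_coherent` and the synthetic data are hypothetical.

```python
import numpy as np

def _design(X_low, X_up):
    # Intercept column followed by the lower and upper bounds of each regressor.
    n = X_low.shape[0]
    return np.hstack([np.ones((n, 1)), X_low, X_up])

def fit_bounds(X_low, X_up, y_low, y_up):
    """Fit two least-squares models: one for the lower response bound and
    one for the upper response bound (assumed form, not the paper's exact one)."""
    design = _design(X_low, X_up)
    coef_low, *_ = np.linalg.lstsq(design, y_low, rcond=None)
    coef_up, *_ = np.linalg.lstsq(design, y_up, rcond=None)
    return coef_low, coef_up

def predict_bounds(coef_low, coef_up, X_low, X_up):
    design = _design(X_low, X_up)
    return design @ coef_low, design @ coef_up

def is_coherent(pred_low, pred_up):
    # Mathematical coherence: every predicted upper bound is at least
    # as large as the corresponding predicted lower bound.
    return bool(np.all(pred_up >= pred_low))

# Synthetic interval data (illustrative only): two interval regressors.
rng = np.random.default_rng(0)
centers = rng.normal(size=(50, 2))
half_ranges = rng.uniform(0.1, 1.0, size=(50, 2))
X_low, X_up = centers - half_ranges, centers + half_ranges
weights = np.array([2.0, 0.5])
y_low = 1.0 + X_low @ weights
y_up = 3.0 + X_up @ weights

coef_low, coef_up = fit_bounds(X_low, X_up, y_low, y_up)
pred_low, pred_up = predict_bounds(coef_low, coef_up, X_low, X_up)
print("coherent predictions:", is_coherent(pred_low, pred_up))
```

If the check fails on real data, the abstract suggests transforming the response with the proposed interval Box-Cox transformation before fitting. That transformation is not reproduced here because the abstract does not define it.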

Journal: Knowledge-Based Systems
Publication Date: Jun 8, 2017
Citations: 38
