Local Linear Regression Estimator Research Articles

We introduce a predictive modeling solution that provides high-quality predictive analytics over aggregation queries in Big Data environments. Our predictive methodology is generally applicable in environments in which large-scale data owners may or may not restrict access to their data and allow only aggregation operators like COUNT to be executed over their data. In this context, our methodology uses historical queries and their answers to accurately predict the answers of ad-hoc queries. We focus on the widely used set-cardinality, i.e., COUNT, aggregation query, as COUNT is a fundamental operator for both internal data system optimizations and for aggregation-oriented data exploration and predictive analytics. We contribute a novel, query-driven Machine Learning (ML) model whose goals are to: (i) learn the query-answer space from past issued queries, (ii) associate the query space with local linear regression and associative function estimators, (iii) define query similarity, and (iv) predict the cardinality of the answer set of unseen incoming queries, referred to as the Set Cardinality Prediction (SCP) problem. Our ML model incorporates incremental ML algorithms for ensuring high-quality prediction results. The significance of our contribution is that it (i) is the only query-driven solution applicable over general Big Data environments, which include restricted-access data, (ii) offers incremental learning adjusted for arriving ad-hoc queries, which is well suited for query-driven data exploration, and (iii) offers performance (in terms of scalability, SCP accuracy, processing time, and memory requirements) superior to that of data-centric approaches. We provide a comprehensive performance evaluation of our model, assessing its sensitivity, scalability, and efficiency for quality predictive analytics. In addition, we report on the development and incorporation of our ML model in Spark, showing its superior performance compared to Spark’s COUNT method.
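As a rough illustration of the idea of predicting a COUNT answer from past query-answer pairs with a local linear regression estimator, the following is a minimal sketch, not the authors' model. It assumes each query is encoded as a numeric vector (here a hypothetical range-query center and radius), and the function name `local_linear_predict`, the Gaussian kernel, and the bandwidth value are all assumptions made for the example.

```python
import numpy as np

def local_linear_predict(X, y, x0, bandwidth=1.0):
    """Predict the answer at query x0 by kernel-weighted least squares.

    X: (n, d) array of past query vectors (hypothetical encoding).
    y: (n,) array of the answers observed for those queries.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    x0 = np.asarray(x0, dtype=float)

    diff = X - x0                                  # center the design at x0
    dist2 = np.sum(diff ** 2, axis=1)
    w = np.exp(-0.5 * dist2 / bandwidth ** 2)      # Gaussian kernel weights

    # Local model: y ~ a + b'(x - x0); the intercept a is the fit at x0.
    A = np.hstack([np.ones((X.shape[0], 1)), diff])
    sw = np.sqrt(w)
    coef, *_ = np.linalg.lstsq(A * sw[:, None], y * sw, rcond=None)
    return float(coef[0])

# Hypothetical history: (center, radius) of past range queries and their COUNTs.
past_queries = np.array([[0.2, 0.1], [0.4, 0.1], [0.6, 0.2], [0.8, 0.2], [0.5, 0.3]])
past_counts = np.array([120.0, 150.0, 310.0, 290.0, 480.0])

estimate = local_linear_predict(past_queries, past_counts, [0.5, 0.2], bandwidth=0.3)
print(f"predicted COUNT: {estimate:.1f}")
```

The bandwidth controls how quickly the influence of a past query decays with its distance from the new query; similar queries dominate the fit, which is one simple way to read the role of query similarity in such a scheme.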

Recently, some new techniques have been proposed for the estimation of semi-parametric fixed effects varying coefficient panel data models. These new techniques fall within the class of so-called differencing estimators. In particular, we consider first-differences and within local linear regression estimators. An analysis of their asymptotic properties shows that, while the bias term keeps the same order of magnitude, these estimators exhibit different asymptotic bounds for the variance. In both cases, the consequence is a suboptimal non-parametric rate of convergence. To solve this problem, a one-step backfitting algorithm is proposed that exploits the additive structure of the model. Under fairly general conditions, the resulting estimators show optimal rates of convergence and exhibit the oracle efficiency property. Since both estimators are asymptotically equivalent, it is of interest to analyze their behavior in small samples. In a fully parametric context, it is well known that, under strict exogeneity assumptions, the performance of both first-differences and within estimators depends on the stochastic structure of the idiosyncratic random errors. In the non-parametric setting, however, other factors such as dimensionality or sample size are also of great interest. In particular, we are interested in their relative average mean square error under different scenarios. The simulation results largely confirm the theoretical findings for both local linear regression and one-step backfitting estimators. However, we find that within estimators are rather sensitive to the number of time observations.
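The two differencing transformations named above can be illustrated in the simplest fully parametric case. The sketch below is an assumption-laden toy example, not the semi-parametric varying coefficient estimators studied in the article: it simulates a linear panel with fixed effects that are correlated with the regressor, then removes the effects either by subtracting unit means (within) or by differencing consecutive periods (first differences). All function names and simulation settings are hypothetical.

```python
import numpy as np

def within_transform(Z):
    """Subtract each unit's time average; Z has shape (N units, T periods)."""
    return Z - Z.mean(axis=1, keepdims=True)

def first_difference(Z):
    """Difference consecutive periods within each unit."""
    return Z[:, 1:] - Z[:, :-1]

def pooled_slope(x, y):
    """OLS slope through the origin on already-transformed data."""
    return float(np.sum(x * y) / np.sum(x * x))

rng = np.random.default_rng(0)
N, T, beta = 200, 5, 1.5                       # assumed simulation settings
alpha = rng.normal(size=(N, 1))                # unit-specific fixed effects
x = alpha + rng.normal(size=(N, T))            # regressor correlated with alpha
y = alpha + beta * x + 0.1 * rng.normal(size=(N, T))

beta_within = pooled_slope(within_transform(x), within_transform(y))
beta_fd = pooled_slope(first_difference(x), first_difference(y))
beta_naive = pooled_slope(x - x.mean(), y - y.mean())   # ignores the effects

print(f"within: {beta_within:.3f}  first differences: {beta_fd:.3f}  naive: {beta_naive:.3f}")
```

Both transformations eliminate the time-invariant effect exactly, so both slopes should land near the true value, while the naive pooled slope is biased because the regressor is correlated with the effects. Which transformation is more efficient in practice depends on the error structure, as the abstract notes.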

Articles published on Local Linear Regression Estimator

Scalable aggregation predictive analytics

Nonparametric estimations of the sea state bias for a radar altimeter

Nonparametric geostatistical risk mapping

Functional-coefficient spatial autoregressive models with nonparametric spatial weights

Local linear estimation for regression models with locally stationary long memory errors

Longitudinal Survey, Nonmonotone, Nonresponse, Imputation, Nonparametric Regression

A partially linear single‐index transformation model and its nonparametric estimation

Nonlinearities in the Slovenian apple price transmission

Differencing techniques in semi-parametric panel data varying coefficient models with fixed effects: a Monte Carlo study

Reference Curves Estimation Using Conditional Quantile and Radial Basis Function Network with Mass Constraint

Estimation and inference in regression discontinuity designs with asymmetric kernels

Asymptotic normality for a local composite quantile regression estimator of regression function with truncated data

Does Education Matter for Economic Growth?

On Projection‐type Estimators of Multivariate Isotonic Functions

Asymptotic distributions of two “synthetic data” estimators for censored single-index models

On kernel nonparametric regression designed for complex survey data

Local linear regression for functional predictor and scalar response

Improved double kernel local linear quantile regression

Problems with and alternatives to resolving data sparseness by linear interpolation

Relative error prediction via kernel regression smoothers
