Usage Of Big Data Research Articles

We review tasks and methods most relevant to Big Data analysis. Emphasis is made on the conceptual and pragmatic issues of the tasks and methods (avoiding unnecessary mathematical details). We suggest that all scope of jobs with Big Data fall into four conceptual modes (types): four modes of large-scale usage of Big Data: 1) intelligent information retrieval; 2) massive (large-scale) conveyed data processing (mining); 3) model inference from data; 4) knowledge extraction from data (regularities detection and structures discovery). The essence of various tasks (clustering, regression, generative model inference, structures discovery etc.) are elucidated. We compare key methods of clustering, regression, classification, deep learning, generative model inference and causal discovery. Cluster analysis may be divided into methods based on mean distance, methods based on local distance and methods based on a model. The targeted (predictive) methods fall into two categories: methods which infer a model; "tied to data" methods which compute prediction directly from data. Common tasks of temporal data analysis are briefly overviewed. Among diverse methods of generative model inference we make focus on causal network learning because models of this class are very expressive, flexible and are able to predict effects of interventions under varying conditions. Independence-based approach to causal network inference from data is characterized. We give a few comments on specificity of task of dynamical causal network inference from timeseries. Challenges of Big Data analysis raised by data multidimensionality, heterogeneity and huge volume are presented. Some statistical issues related to the challenges are summarized. Problems in programming 2019; 3: 58-85

Read full abstract

We review directions (avenues) of Big Data analysis and their practical meaning as well as problems and tasks in this field. Big Data Analytics appears a dominant trend in development of modern information technologies for management and planning in business. A few examples of real applications of Big Data are briefly outlined. Analysis of Big Data is aimed to extract useful sense from raw data collection. Big Data and Big Analytics have evolved as computer society’s response to the challenges raised by rapid grows in data volumes, variety, heterogeneity, velocity and veracity. Big Data Analytics may be seen as today’s phase of researches and developments known under names ‘Data Mining’, ‘Knowledge Discovery in Data’, ‘intelligent data analysis’ etc. We suggest that there exist three modes of large-scale usage of Big Data: 1) ‘intelligent information retrieval; 2) massive “intermediate” data processing (concentration, mining), which may be performed during one or two scanning; 3) model inference from data; 4) knowledge discovery in data. Stages in data analysis cycle are outlined. Because of Big Data are raw, distributed, unstructured, heterogeneous and disaggregated (vertically splitted), this data should be prepared for deep analysis. Data preparation may comprise such jobs as data retrieval, access, filtering, cleaning, aggregation, integration, dimensionality reduction, reformatting etc. There are several classes of typical data analysis problems (tasks), including: cases grouping (clustering), predictive model inference (regression, classification, recognition etc.), generative model inference, extracting structures and regularities from data. Distinction between model inference and knowledge discovery is elucidated. We give some suggestion why ‘deep learning’ (one of the most attractive topic by now) is so successive and popular. One of drawbacks of traditional models is they disability to make prediction under incomplete list of predictors (when some predictors are missed) or under augmented list of predictors. One may overcome this drawback using causal model. Causal networks are illuminated in the survey as attractive in that they appear to be expressive generative models and (simultaneously) predictive models in strict sense. This means they pretend to explain how the object at hand is acting (provided they are adequate). Being adequate, causal network facilitates predicting causal effect of local intervention on the object. Methods used in Big Data Analytics will be reviewed in the next paper.

Read full abstract

Usage Of Big Data Research Articles

Articles published on Usage Of Big Data

Fine-grained data-locality aware MapReduce job scheduler in a virtualized environment

Health Professionals' Perception about Big Data Technology in Greece.

A methodology for strategic diagnosis of business corruption behaviours using network analysis

How Healthcare Can Change Based On Big Data and Biomedical Sciences

Big Data Management in Maritime Transport

Picturing diphtheria outbreak in Indonesia using national annual report data: what are the lessons learned?

Big Data Use and Challenges: Insights from Two Internet-Mediated Surveys

Задачі та методи аналізу великих даних (огляд)

Impact of technology in financial reporting: The case of Amazon Go

Spatio Temporal with Scalable Automatic Bisecting-Kmeans for Network Security Analysis in Matagaruda Project

New(s) data for entrepreneurship research? An innovative approach to use Big Data on media coverage

Big Data Analytics Implications for Smart Tourism Destinations Towards the Enrichment of Content Tourism

Accelerating the Internet in the presence of Big Data: Reducing user delays by leveraging historical user request patterns for web caching

Big data for product innovation in manufacturing

The Market for Data Privacy

Usage of the Term Big Data in Biomedical Publications: A Text Mining Approach

Big Data Driven Healthcare Supply Chain: Understanding Potentials and Capabilities

A Game-Based Economic Model for Price Decision Making in Cyber-Physical-Social Systems

Аналітика великих даних: принципи, напрямки і задачі (огляд)

Big Data in Political Communication: Implications for Group Privacy

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Usage Of Big Data Research Articles

Articles published on Usage Of Big Data

Fine-grained data-locality aware MapReduce job scheduler in a virtualized environment

Health Professionals' Perception about Big Data Technology in Greece.

A methodology for strategic diagnosis of business corruption behaviours using network analysis

How Healthcare Can Change Based On Big Data and Biomedical Sciences

Big Data Management in Maritime Transport

Picturing diphtheria outbreak in Indonesia using national annual report data: what are the lessons learned?

Big Data Use and Challenges: Insights from Two Internet-Mediated Surveys

Задачі та методи аналізу великих даних (огляд)

Impact of technology in financial reporting: The case of Amazon Go

Spatio Temporal with Scalable Automatic Bisecting-Kmeans for Network Security Analysis in Matagaruda Project

New(s) data for entrepreneurship research? An innovative approach to use Big Data on media coverage

Big Data Analytics Implications for Smart Tourism Destinations Towards the Enrichment of Content Tourism

Accelerating the Internet in the presence of Big Data: Reducing user delays by leveraging historical user request patterns for web caching

Big data for product innovation in manufacturing

The Market for Data Privacy

Usage of the Term Big Data in Biomedical Publications: A Text Mining Approach

Big Data Driven Healthcare Supply Chain: Understanding Potentials and Capabilities

A Game-Based Economic Model for Price Decision Making in Cyber-Physical-Social Systems

Аналітика великих даних: принципи, напрямки і задачі (огляд)

Big Data in Political Communication: Implications for Group Privacy