Abstract

The Big data challenge includes dealing with a big number of heterogeneous and multidimensional datasets of all possible sizes not only with data of big size. As a result a huge number of Machine Learning (ML) tasks, which must be solved dramatically exceeds the number of data scientists who can solve these tasks. Next many ML tasks require critical input from subject matter experts (SME) and end users/decision makers who are not ML experts. A set of tools that we call a “virtual data scientist” is needed to assist SMEs and end users to construct ML models for their tasks to meet this Big data challenge with a minimal contribution from data scientists. This paper describes our vision of such a “virtual data scientist” based on the visual approach with collocated and shifted paired coordinates. The approach is illustrated with real world data and ML tasks, as well as simulated data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call