Abstract
With the realisation of the Internet of Things (IoT) paradigm, the analysis of the Activities of Daily Living (ADLs), in a smart home environment, is becoming an active research domain. The existence of representative datasets is a key requirement to advance the research in smart home design. Such datasets are an integral part of the visualisation of new smart home concepts as well as the validation and evaluation of emerging machine learning models. Machine learning techniques that can learn ADLs from sensor readings are used to classify, predict and detect anomalous patterns. Such techniques require data that represent relevant smart home scenarios, for training, testing and validation. However, the development of such machine learning techniques is limited by the lack of real smart home datasets, due to the excessive cost of building real smart homes. This paper provides two datasets for classification and anomaly detection. The datasets are generated using OpenSHS, (Open Smart Home Simulator), which is a simulation software for dataset generation. OpenSHS records the daily activities of a participant within a virtual environment. Seven participants simulated their ADLs for different contexts, e.g., weekdays, weekends, mornings and evenings. Eighty-four files in total were generated, representing approximately 63 days worth of activities. Forty-two files of classification of ADLs were simulated in the classification dataset and the other forty-two files are for anomaly detection problems in which anomalous patterns were simulated and injected into the anomaly detection dataset.
Highlights
Recent developments in technology have increased the adoption of smart devices and sensors in smart homes
This paper presents two datasets generated by OpenSHS for classification and anomaly detection problems
Synnott et al [20] conducted a survey of existing simulation tools for generating datasets in a smart home environment
Summary
Recent developments in technology have increased the adoption of smart devices and sensors in smart homes. With the widespread usage of smart devices in smart homes, these environments will generate an enormous amount of streaming data These generated data have the potential to provide novel services to the smart home inhabitants to improve their standards of living. There is less cost and effort involved in the process, and they can cope with new emerging techniques Many of these simulation tools are not available in the public domain as an open-source project, or they lack the flexibility and accessibility for both the researchers and the participants. The generated datasets do not capture realistic and fine-grained interactions that happen in real smart homes. It allows the researcher to design a smart home specific to their research problem and generate a sufficiently large dataset in reasonable time while retaining the fine-grained interactions that the participants are performing.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.