Abstract

With the realisation of the Internet of Things (IoT) paradigm, the analysis of the Activities of Daily Living (ADLs), in a smart home environment, is becoming an active research domain. The existence of representative datasets is a key requirement to advance the research in smart home design. Such datasets are an integral part of the visualisation of new smart home concepts as well as the validation and evaluation of emerging machine learning models. Machine learning techniques that can learn ADLs from sensor readings are used to classify, predict and detect anomalous patterns. Such techniques require data that represent relevant smart home scenarios, for training, testing and validation. However, the development of such machine learning techniques is limited by the lack of real smart home datasets, due to the excessive cost of building real smart homes. This paper provides two datasets for classification and anomaly detection. The datasets are generated using OpenSHS, (Open Smart Home Simulator), which is a simulation software for dataset generation. OpenSHS records the daily activities of a participant within a virtual environment. Seven participants simulated their ADLs for different contexts, e.g., weekdays, weekends, mornings and evenings. Eighty-four files in total were generated, representing approximately 63 days worth of activities. Forty-two files of classification of ADLs were simulated in the classification dataset and the other forty-two files are for anomaly detection problems in which anomalous patterns were simulated and injected into the anomaly detection dataset.

Highlights

  • Recent developments in technology have increased the adoption of smart devices and sensors in smart homes

  • This paper presents two datasets generated by OpenSHS for classification and anomaly detection problems

  • Synnott et al [20] conducted a survey of existing simulation tools for generating datasets in a smart home environment

Read more

Summary

Introduction

Recent developments in technology have increased the adoption of smart devices and sensors in smart homes. With the widespread usage of smart devices in smart homes, these environments will generate an enormous amount of streaming data These generated data have the potential to provide novel services to the smart home inhabitants to improve their standards of living. There is less cost and effort involved in the process, and they can cope with new emerging techniques Many of these simulation tools are not available in the public domain as an open-source project, or they lack the flexibility and accessibility for both the researchers and the participants. The generated datasets do not capture realistic and fine-grained interactions that happen in real smart homes. It allows the researcher to design a smart home specific to their research problem and generate a sufficiently large dataset in reasonable time while retaining the fine-grained interactions that the participants are performing.

Real Datasets
Simulation Tool
OpenSHS
OpenSHS Advantages
Smart Home Design
The Participants
The Anomalies
Participants
Dataset Aggregation
Dataset Description
Findings
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call