Abstract

This paper aims to address the challenges associated with evaluating the impact of neighborhood environments on health outcomes. Google street view (GSV) images provide a valuable tool for assessing neighborhood environments on a large scale. By annotating the GSV images with labels indicating the presence or absence of specific neighborhood features, we can develop classifiers capable of automatically analyzing and evaluating the environment. However, the process of labeling GSV images to analyze and evaluate the environment is a time-consuming and labor-intensive task. To overcome these challenges, we propose using a multi-task classifier to enhance the training of classifiers with limited supervised GSV data. Our multi-task classifier utilizes readily available, inexpensive online images collected from Flickr as a related classification task. The hypothesis is that a classifier trained on multiple related tasks is less likely to overfit to small amounts of training data and generalizes better to unseen data. We leverage the power of multiple related tasks to improve the classifier's overall performance and generalization capability. Here we show that, with the proposed learning paradigm, predicted labels for GSV test images are more accurate. Across different environment indicators, the accuracy, F1 score and balanced accuracy increase up to 6 % in the multi-task learning framework compared to its single-task learning counterpart. The enhanced accuracy of the predicted labels obtained through the multi-task classifier contributes to a more reliable and precise regression analysis determining the correlation between predicted built environment indicators and health outcomes. The R2 values calculated for different health outcomes improve by up to 4 % using multi-task learning detected indicators.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call