Crowd cluster data in the USA for analysis of human response to COVID-19 events and policies
We provide data on daily social contact intensity of clusters of people at different types of Points of Interest (POI) by zip code in Florida and California. This data is obtained by aggregating fine-scaled details of interactions of people at the spatial resolution of 10 m, which is then normalized as a social contact index. We also provide the distribution of cluster sizes and average time spent in a cluster by POI type. This data will help researchers perform fine-scaled, privacy-preserving analysis of human interaction patterns to understand the drivers of the COVID-19 epidemic spread and mitigation. Current mobility datasets either provide coarse-level metrics of social distancing, such as radius of gyration at the county or province level, or traffic at a finer scale, neither of which is a direct measure of contacts between people. We use anonymized, de-identified, and privacy-enhanced location-based services (LBS) data from opted-in cell phone apps, suitably reweighted to correct for geographic heterogeneities, and identify clusters of people at non-sensitive public areas to estimate fine-scaled contacts.
7
- Dec 1, 1981
- The Nebraska medical journal
43
- 10.15585/mmwr.mm6943e1
- Oct 30, 2020
- Morbidity and Mortality Weekly Report
87
- 10.1371/journal.pcbi.1005481
- Apr 7, 2017
- PLOS Computational Biology
1292
- 10.1038/s41586-020-2923-3
- Nov 10, 2020
- Nature
34
- 10.1177/002076407001600107
- Jan 1, 1970
- International Journal of Social Psychiatry
1978
- 10.1145/3068335
- Jul 31, 2017
- ACM Transactions on Database Systems
438
- Jan 1, 1973
- Journal fur Hirnforschung
39
- 10.3168/jds.s0022-0302(87)80307-7
- Nov 1, 1987
- Journal of Dairy Science
10
- 10.1176/ps.46.6.580
- Jun 1, 1995
- Psychiatric Services
11886
- 10.1038/sdata.2016.18
- Mar 15, 2016
- Scientific Data
- Research Article
- 10.1016/j.asoc.2024.112549
- Jan 1, 2025
- Applied Soft Computing
Evaluating the Performance of Countries in COVID-19 Management: A Data-Driven Decision-Making and Clustering
- Research Article
3
- 10.1038/s41597-024-03490-y
- Jun 17, 2024
- Scientific Data
Despite the importance of measuring racial-ethnic segregation and diversity in the United States, current measurements are largely based on the Census and, thus, only reflect segregation and diversity as understood through residential location. This leaves out the social contexts experienced throughout the course of the day during work, leisure, errands, and other activities. The National Experienced Racial-ethnic Diversity (NERD) dataset provides estimates of diversity for the entire United States at the census tract level based on the range of place and times when people have the opportunity to come into contact with one another. Using anonymized and opted-in mobile phone location data to determine co-locations of people and their demographic backgrounds, these measurements of diversity in potential social interactions are estimated at 38.2 m × 19.1 m scale and 15-minute timeframe for a representative year and aggregated to the Census tract level for purposes of data privacy. As well, we detail some of the characteristics and limitations of the data for potential use in national, comparative studies.
- Preprint Article
- 10.2139/ssrn.4868119
- Jan 1, 2024
Device Tracking Privacy Regulations Lead to Unexpected Data Bias in Smartphone Trace Data
- Research Article
3
- 10.5194/ica-abs-1-366-2019
- Jul 15, 2019
- Abstracts of the ICA
Virtual Circular Geofences for Points and Regions of Interests with Spatial Context
- Conference Article
5
- 10.1109/icc.2015.7249500
- Jun 1, 2015
Location Based Service (LBS) is gaining popularity on smart phones. One fundamental LBS is range search, which returns all Point of Interests (POIs) within a user-specified range. However, people also leave their location privacy at risks when using LBS like range search. How can a user invoke such service without revealing his location is an interesting, yet challenging problem to solve. Most existing approaches blur a user's location into a cloaked region, so that LBS cannot figure out the exact location of the requesting user. However, this would make the returning results inaccurate, containing some out-of-range POIs. To this end, we propose PAPERS, a new method to provide location privacy for users of range search. PAPERS leverage homomorphic encryption to let the user encrypt her location, and the LBS server can compute distances on ciphertext. In this way, the returning results by LBS are exactly the POIs within the specified range, while LBS learns nothing about user's real location. We implement a prototype of PAPERS, and evaluate it with real POI set of a large-scale production LBS. Experimental results show that PAPERS can achieve the goal of privacy protection, with reasonable overhead in response time and communication cost.
- Research Article
- 10.1038/s41598-025-00677-0
- May 13, 2025
- Scientific Reports
Studying carbon emissions (CE) at different administrative scales will help facilitate crafting tailored emission reduction policies for China’s regions, which is vital for achieving the dual carbon goals. However, previous studies focused on a single administrative scale, lacking multiscale research. This paper combined energy consumption data with nighttime light data and adopted a spatial autocorrelation, variation coefficient (VC), and decoupling model to study the spatiotemporal dynamics and decoupling effect of CE at the three administrative scales of provinces, prefecture-level cities, and counties in China from 2000 to 2020. The results were as follows: (1) The VC of CE showed different trends at different scales, with its coefficient size successively ranked at the county, prefecture, and province levels. (2) CE at different scales showed positive spatial autocorrelation and the significance was strongest at the county level. (3) The decoupling trend between CE and economic growth has generally shown a positive development across different spatial scales. The average elasticity decoupling index at the provincial, prefectural, and county levels has decreased overall, from 0.88, 1.82, and 3.74 to 0.19, 1.36, and 3.05, respectively. However, the characteristics of these changes differ. The CV of the elasticity decoupling index increased at the provincial and prefectural levels, rising from 1.161 to 1.563 to 1.419 and 2.669, respectively, while it decreased slightly at the county level, from 3.862 to 3.765. (4) The dominant type of decoupling at the provincial level had changed from expansive negative decoupling (ED) to strong decoupling (SD). Meanwhile, at the prefectural and county levels, the ED was still dominant, but the number of SD had increased significantly, rising from 75 to 138 at the prefectural level and from 748 to 1160 at the county level. This study demonstrates China’s carbon emissions sensitivity to scale, emphasizing the importance of adapting emission reduction measures to local conditions.
- Conference Article
1
- 10.1109/icicip53388.2021.9642167
- Dec 3, 2021
With the continuous progress of mobile Internet technology and GPS positioning technology of mobile devices, Social Network and Location Based Services (LBS) are gradually converging to form Location Based Social Network (LBSN). POI (Point of Interest) recommendation systems face the problems of variable user interests, very sparse user and POI check-in matrices, and nonlinear interaction modeling. To address the above problems, a Graph-enhanced Attention Graph Neural Network model is proposed for POI recommendation (POI-GAGN in short). POI-GAGN mines user/POI node representations on user-POI interaction graph, user-user social interaction graph, and POI-POI association interaction graph through interaction node feature extraction module, learns POI attribute information representations through text feature extraction module, and extracts short-term preference representations of users through short-term preference extraction module. A graph-enhanced attention mechanism is designed to correlates node representations, attribute information representations of POI, and short-term preferences of users with each other to achieve better information fusion. Finally, we conduct sufficient experiments on two real datasets to prove that the recommendation effect of POI-GAGN is better than other current advanced POI recommendation methods, and POI-GAGN can better overcome the problems of data sparsity and cold start in recommendations.
- Conference Article
17
- 10.1109/ieec.2009.159
- Jan 1, 2009
This paper analyzes the application of point of interest (POI) data in the location based service (LBS), and reveals the problems of POI data update. Referring to the share intentions of map communities, the paper proposes a user participating POI update model. The model reveals that the millions of mobile handlers can be the POI data sensors, and LBS is the platform of the model. A basic LBS platform is constructed in order to verify this model, and a prototype system of POI query and update based on LBS is implemented. The function modules and technical details of this system are discussed. Result of the system proves the model.
- Research Article
5
- 10.1109/tvt.2017.2737017
- Jun 1, 2018
- IEEE Transactions on Vehicular Technology
Location-based services (LBSs) are becoming an increasingly important component in our social and business life. All existing LBS providers support the nearest place searching via a single point of interest (POI) query. That is, in one query, a user is allowed to search for only one type of service. However, in real life, people usually need to search multiple different types of services and hope that their locations are as close as possible for convenience. For example, one user would like to search for a restaurant with a KTV nearby. To support this application scenario, we propose a novel LBS termed “ $N$ -in-One,” which is the first scheme to extend the function of single-POI LBS to multiple-POI LBS such that a single query can be employed to request multiple POIs that are geographically close. Providing “ $N$ -in-One” is challenging because: 1) serving a “ $N$ -in-One” query is not equivalent to serving $N$ queries independently due to the distance correlation among the $N$ POIs; and 2) the cask effect is getting worse in the service area mode of “ $N$ -in-one” as most of the returned results may be rendered useless when some hot POIs are blocked. To overcome these challenges, we propose several algorithms using computational geometry techniques to identify the best $K$ POIs that are geographically close and the service area (denoted by a given-sized rectangle) that can cover as many the best $Q$ clusters as possible while reducing the cask effect in the service area mode. Extensive simulations based on both synthetic and real world data demonstrate the effectiveness of the proposed algorithms.
- Peer Review Report
- 10.7554/elife.80466.sa0
- Nov 10, 2022
Editor's evaluation: Disentangling the rhythms of human activity in the built environment for airborne transmission risk: An analysis of large-scale mobility data
- Peer Review Report
- 10.7554/elife.80466.sa1
- Nov 10, 2022
Decision letter: Disentangling the rhythms of human activity in the built environment for airborne transmission risk: An analysis of large-scale mobility data
- Dissertation
3
- 10.14264/uql.2017.749
- Aug 11, 2017
With the ubiquity of GPS-enabled smartphones, Location-Based Service (LBS) as a prominent product of social networks has become an essential part of our daily lives. People can easily socialize and share their check-in data (location, time, text, photo and etc.) via Location-Based Social Networks (LBSN). Through mining the check-in dataset, Point-Of-Interest (POI) recommendation systems assist users in exploring new attractive venues.The primary challenge regarding POI recommendation is to suggest a list of new interesting locations to the query user. Specifically, the excessive sparsity observed in user-location matrices is the main problem. The well known Collaborative Filtering (CF) methods are commonly used with other factors like geographical, social, context-oriented (e.g. text contents) and temporal effects to promote the effectiveness of the location recommendation systems. Despite the vital role of temporal influence, an insufficient amount of research has been devoted to considering the time factor in location recommendation. While we have an insufficient number of records regarding a user’s check-in at a particular location, predicting the time of the visit seems more problematic. We have dedicated part of our research to study various aspects of the temporal influence in the recommendation. Additionally, we use Twitter textual contents to extract a set of spatial phrases associated with each region. Such an act enriches the textual contents of the local POIs. This process enhances location recommendation systems, as it facilitates textual similarity among POI tags and the tweet history of the query user.In short, we aim to address the problems involved with both aspects of Location Inference and Recommendation in social networks. Our research in this thesis has four parts. Firstly, we define a problem which merely considers a single temporal aspect to enhance the performance of a location recommendation task. Subsequently, we develop a probabilistic model which detects a user’s temporal orientation based on visibility weights of POIs visited by her during weekday/weekend cycles. While this method is limited to a single temporal scale, the idea can be adapted to other time-related periodic cycles (e.g. daily home-work return trips). Secondly, we argue that the majority of existing methods merely concentrate on a limited number of temporal scales and neglect others. We propose a probabilistic generative model, named after Multi-aspect Time-related Influence (MATI) which employs the user’s check-in log to detect her temporal mobility pattern under various scales (e.g. minute, hour, day and so on). It then performs recommendation using multi-aspect temporal correlations between the query user and proposed locations. Thirdly, we further study the role of the time factor in recommendation models. We define a new problem to jointly model a pair of heterogeneous time-related effects (recency and the subset feature) in location recommendation. To address the challenges, we propose a generative model which computes the probability of the query user visiting a proposed location based on various homogeneous subset attributes. At the same time, the model calculates how likely the newly visited venues will obtain a higher rank compared to others. The model finally performs a POI recommendation through combining the effects learned from both homogeneous and heterogeneous temporal influences. Fourthly, we take textual contents into consideration to tackle the data sparsity problem in location recommendation systems. We propose an approach to detect focal spatial phrases associated with each specific scalable geographical area. For this task, we process GPS-enabled tweets. The main problem here is that Twitter messages are lexically varied and contain limited information. Our model calculates stickiness threshold to exploit the most probable segments out of the tweet contents. We employ a probabilistic model to measure how strongly each of the spatial keywords can be linked to the predefined region.
- Research Article
- 10.2139/ssrn.2671168
- Oct 8, 2015
- SSRN Electronic Journal
A Choice Model with a Diverging Choice Set for POI Data Analysis
- Conference Article
2
- 10.1109/wcnc.2013.6555346
- Apr 1, 2013
Recent developments in mobile techniques have enabled a great variety of Location Based Services (LBS). A high positioning accuracy is a fundamental requirement for precision LBS applications, e.g., precise LBS marketing in shopping malls or indoor emergency evacuation services with mobile devices. However, most offerable commercial positioning systems, such as GPS/GNSS and RF-based systems, can not provide positioning accuracy within one meter. In this work, a new positioning approach is proposed for mobile devices, Called Photo Positioning, it can provide a high accuracy positioning service. The ”positioning” here means to find the location where a photo was taken by investigating the geometric relations between the images of points of interest (POI) in the photo and their location in the real world based on the principle of photo imaging. To implement a photo positioning system, three major components are needed, including a POI database, a method to recognize and locate POIs in a photo and an algorithm to calculate the position where the photo was taken from the POI information. A positioning algorithm based on the geometric similarity in photo imaging is presented in this work, and a prototype system is developed for Android smartphone platforms. Our experimental results shows that the average positioning error of the proposed photo positioning approach can be as low as 74.34 cm.
- Conference Instance
- 10.1145/2025876
- Sep 18, 2011
We would like to welcome you to the first international workshop on mobile location-based service or MLBS 2011! The number of "smart" wireless devices such as mobile phones and iPad-like computers has been rapidly growing. Being able to keep track of locations of moving devices can enhance a number of applications. Location-Based Service (LBS) is quickly becoming the next ubiquitous technology for a wide range of mobile applications, such as location positioning, location navigation, location-aware search, social networks, and ads, just to name a few. LBS was first deployed in the turn of the century by Palm VII, Swisscom, Vodafone, and DoCoMo [1]. These first wave of deployment performed location positioning based on the locations of the nearest cell towers. The accuracy of such approach ranges from one hundred to a few thousand meters, depending on the density of cell towers. In 2004, Global Position System (GPS) was tested successfully to work with a mobile phone by Quadcomm [2]. GPS is now available on most smart phones. GPS can achieve outdoor location positioning with approximately ten-meter accuracy. Its major shortcomings are high power consumption, long TTFF (time to the first fix), and unavailability in urban tunnels and indoor. Some remedies such as AGPS and hybrid cells have recently been researched, experimented, and deployed. Indoor positioning and indoor navigation (IPIN) is a capability that is in high demand in Asia, where most commerce activities take place in high-rise buildings. A shopper may want to find nearby stores selling a particular merchandise, a store may want to deliver coupons to nearby shoppers, and a user may want to locate nearby friends, to name just a few. Existing cell-based, GPS-based and hybrid technologies cannot perform accurate indoor location positioning. Solutions using WIFI signals can achieve sub-five-meter accuracy, both indoor and outdoor. However, access-point density must be high. Furthermore, WIFI-based positioning suffers from both time-consuming site survey (to survey their locations and signal strengths) and signal disturbance [3]. New signal sources such as radar, sonar and inertial navigation systems are now being actively investigated [4]. These devices also suffer from their limitations. A good solution is likely to be one that combines signals from complementary sources. Besides hardware solutions, data-driven software solutions can assist location positioning. For instance, maps and POIs (points of interest) can be mined from users' traveling history. Google recently experiments with a transit alert system based on user moving patterns [5]. It was observed that in a city like Zurich, where transits are always on time and traffic is light, predicting bus arrival time is trivial. In a city like Taipei and Hangzhou, where every bus is equipped with a GPS, bus arrival time can be predicted fairly accurately by taking traffic congestion into consideration. In a city like Bangalore and Beijing, where limited signals are available with transits (to-date) and traffic can be insanely congested, the system can use the position of a user on a bus (assuming the route of the bus is known) to predict the bus's location. We believe that user moving patterns can provide useful information for applications such as information ranking and ad matching. While the opportunities are exciting, preserving user privacy is an issue that must be thoroughly addressed when historical data of users are processed and mined [6]. Another important research area of LBS is data storage and query processing. As location-cognizant devices become ubiquitous, the volume of such data poses unprecedented challenges for LBS providers. Fresh data must be stored and indexed as they arrive, and historical data archived before main memory is full. Spatial query processing of LBS must be both fast and accurate (location updates are reflected in real-time). Traditional spatial data structures such as Quad-trees may not be able to handle such high-frequency current updates and queries. Furthermore, user privacy, including both location privacy and query privacy, must be preserved. In summary, there are at least six key areas of research and development to enhance LBS: Location signal acquisition and processing,Signal fusion,Spatial query processing,Privacy-preserved data mining,Power-conserving algorithms, andLBS-enabled system evaluation. We are excited to organize this first international workshop on mobile location-based service (MLBS). The workshop accepted twelve papers covering the aforementioned areas. We believe MLBS will be an active research direction in the next decade because of the momentum of smart mobile devices, the set of exciting problems to be tackled, and applications to be developed.
- Research Article
24
- 10.1080/13658816.2015.1133820
- Jan 12, 2016
- International Journal of Geographical Information Science
ABSTRACTWith the popularity of mobile devices and smartphones, we have witnessed rapid growth in mobile applications and services, especially in location-based services (LBS). According to a mobile marketing survey, maps/location searches are among the most utilized services on smartphones. Points of interest (POIs), such as stores, shops, gas stations, parking lots, and bus stops, are particularly important for maps/location searches. Existing map services such as Google Maps and Wikimapia are constructed manually either professionally or with crowd sourcing. However, manual annotation is costly and limited in current POI search services. With the abundance of information on the Web, many store POIs can be extracted from the Web. In this paper, we focus on automatically constructing a POI database to enable store POI map searches. We propose techniques that are required to construct a POI database, including focused crawling, information extraction, and information retrieval techniques. We first crawl Yellow Page web sites to obtain vocabularies of store names. These vocabularies are then investigated with search engines to obtain sentences containing these store names from search snippets in order to train a store name recognition model. To extract POIs scattered across the Web, we propose a query-based crawler to find address-bearing pages that might be used to extract addresses and store names. We crawled 1.25 million distinct POI pairs scattered across the Web and implemented a POI search service via Apache Lucent’s search platform, called Solr. The experimental results demonstrate that the proposed geographical information retrieval model outperforms Wikimapia and a commercial app called ‘What’s the Number?’
- Conference Article
8
- 10.1109/icde53745.2022.00020
- May 1, 2022
Context has been recognized as an important factor to consider in personalized recommender systems. Particularly in location-based services (LBSs), a fundamental task is to recommend to a mobile user where he/she could be interested to visit next at the right time. Additionally, location-based social networks (LBSNs) allow users to share location-embedded information with friends who often co-occur in the same or nearby points-of-interest (POIs) or share similar POI visiting histories, due to the social homophily theory and Tobler's first law of geography. So, both the time information and LBSN friendship relations should be utilized for POI recommendation. Tensor completion has recently gained some attention in time-aware recommender systems. The problem decomposes a user-item-time tensor into low-rank embedding matrices of users, items and times using its observed entries, so that the underlying low-rank subspace structure can be tracked to fill the missing entries for time-aware recommendation. However, these tensor completion methods ignore the social-spatial context information available in LBSNs, which is important for POI recommendation since people tend to share their preferences with their friends, and near things are more related than distant things. In this paper, we utilize the side information of social networks and POI locations to enhance the tensor completion model paradigm for more effective time-aware POI recommendation. Specifically, we propose a regularization loss head based on a novel social Hausdorff distance function to optimize the reconstructed tensor. We also quantify the popularity of different POIs with location entropy to prevent very popular POIs from being over-represented hence suppressing the appearance of other more diverse POIs. To address the sensitivity of negative sampling, we train the model on the whole data by treating all unlabeled entries in the observed tensor as negative, and rewriting the loss function in a smart way to reduce the computational cost. Through extensive experiments on real datasets, we demonstrate the superiority of our model over state-of-the-art tensor completion methods.
- Conference Article
2
- 10.1109/i-span.2018.00038
- Oct 1, 2018
This work developed an open accessed Mobile Digital Culture Heritage (M-DCH) platform called Demodulating and Encoding Heritage (DEH) that allows users to download M-DCH contents and upload M-DEH contents. For the M-DCH content concern, DEH interprets M-DEH from the viewpoint of point, line, area and story/site. Point of Interest (POI) is the basic unit of M-DCH and can contain text plus image/pictures, video or audio. A Line of Interest (LOI) is a sequence of POIs in which a visiting sequence of the composed POIs is enforced to have the temporal or spatial view and interpretation of the contained POIs. An Area of Interest (AOI) is a set of POIs in which the composed POIs are related, but the visiting of them can be random. A Story/Site of Interest (SOI) can contain many units, in which each unit can be a POI, a LOI or an AOI, to describe a complete story or a site of culture heritage. For the technical concern, DEH defines a new scenario of the grouping function. A group in DEH is composed of a group creator/leader and some members. In addition to the regular group functions i.e., sharing the content to group members, which belongs to the read action, defined in common social network, e.g., FB, Line, and WeChat, DEH's grouping function allows the group creator/leader to have the privilege of modifying contents created by group members, which belongs to the write action. This function is especially needed for content creation and sharing that is instructed/advised/guided in the teacher-student way. A set of metadata elements was defined for POI, LOI, AOI and SOI respectively such that the system can conveniently manipulate the M-DEH contents. The DEH platform was implemented by utilizing many tools for web and mobile APPs, e.g., Django for web API and Node.js for mobile API, based on the techniques of Location Based Service (LBS) and Geographical Information System (GIS).
- New
- Research Article
- 10.1038/s41597-025-06041-1
- Nov 7, 2025
- Scientific data
- New
- Research Article
- 10.1038/s41597-025-06039-9
- Nov 7, 2025
- Scientific data
- New
- Research Article
- 10.1038/s41597-025-06040-2
- Nov 7, 2025
- Scientific data
- New
- Research Article
- 10.1038/s41597-025-05925-6
- Nov 7, 2025
- Scientific data
- New
- Research Article
- 10.1038/s41597-025-06033-1
- Nov 7, 2025
- Scientific data
- New
- Research Article
- 10.1038/s41597-025-06038-w
- Nov 6, 2025
- Scientific data
- New
- Research Article
- 10.1038/s41597-025-06035-z
- Nov 6, 2025
- Scientific data
- New
- Research Article
- 10.1038/s41597-025-06037-x
- Nov 6, 2025
- Scientific data
- New
- Research Article
- 10.1038/s41597-025-06217-9
- Nov 6, 2025
- Scientific data
- New
- Research Article
- 10.1038/s41597-025-06034-0
- Nov 6, 2025
- Scientific data
- Ask R Discovery
- Chat PDF