Abstract

Understanding segregation plays a significant role in determining the development pathways of a country as it can help governmental and other concerned agencies to prepare better-targeted policies for the needed groups. However, inferring segregation through alternative data, apart from governmental surveys remains limited due to the non-availability of representative datasets. In this work, we utilize Call Data Records (CDR) provided by one of Estonia’s major telecom operators to research the complexities of social interaction and human behavior in order to explain gender segregation. We analyze the CDR with two objectives. First, we study gender segregation by exploring the social network interactions of the CDR. We find that the males are tightly linked which allows information to spread faster among males compared to females. Second, we perform the micro-analysis using various users’ characteristics such as age, language, and location. Our findings show that the prime working-age population (i.e., (24,54] years) is more segregated than others. We also find that the Estonian-speaking population (both males and females) are more likely to interact with other Estonian-speaking individuals of the same gender. Further to ensure the quality of this dataset, we compare the CDR data features with publicly available Estonian census datasets. We observe that the CDR dataset is indeed a good representative of the Estonian population, which indicates that the findings of this study reasonably reflect the reality of gender segregation in the Estonian Landscape.

Highlights

  • Segregation has long been assumed to play a critical role in many developing countries’ socioeconomic structure and overall stability [1]

  • The homophily index (HI) index for the Estonian-speaking population shows that both males and females are inclined towards the same gender and language

  • Since the representation of Estonian and Russian-speaking population covers more than 99% in reality and in Call Data Records (CDR) data, the results of this study using CDR data can be considered useful for understanding gender segregation based on Estonian and Russian language in Estonia

Read more

Summary

Introduction

Segregation has long been assumed to play a critical role in many developing countries’ socioeconomic structure and overall stability [1]. We use the Call Data Records (CDR) to understand segregation in Estonian society. This work investigates gender segregation within society by analyzing the users’ characteristics and their interaction through social network analysis. We analyze anonymized CDR data provided by one of the leading mobile operators in Estonia to cover the following research directions:. Macro-analysis: In this analysis, we explore the social network interactions to understand gender segregation using CDR. Micro-analysis: During this analysis, we investigate users’ characteristics such as age, language, and location to understand gender segregation in detail. We analyze the users’ interactions based on gender, age-groups, language, and locations (Section 4.2). Our analysis using users’ characteristics, such as gender, age, language, and location show that the prime working-age population (i.e., (24,54] years) is more segregated than other age groups.

Related work
Dataset
Descriptive analysis
Macro-analysis
Micro-analysis
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call