We use country reviews in United Nations’s Universal Periodic Review (UPR) to obtain multi-dimensional measures of similarity between countries. UPR is a mechanism that involves a review of all UN member states. The process is designed to treat all states equally and be cognizant of the level of development and specificities of countries under review. One of the unique aspects of the UPR is the peer review, where any state can offer recommendations for the state under review. The first review cycle was conducted from 2008-2012 and the second cycle has finished in November 2016. We obtain measures of similarity based on the similarity of the recommendations that states gave other states and also based on the similarity of the recommendations that states received from other states. In order to obtain these measures of semantic similarity, we use a method that exploits continuous vector representation of words known as word vectors, which is considered the state-of-the-art for measuring textual similarities in natural language processing. We use word-vectors pre-trained on the Google News dataset to ensure a reasonable coverage of political terminology. This method provides many improvements over the existing methods. Most importantly, there are fewer subjective decisions to be made (for example, one does not need to choose the number of latent topics). The proposed method does not need to rely on a huge corpus and can be used on small and large data sets alike. Furthermore, the method is computationally very efficient and does not need to solve a non-convex optimization problem. Our early results indicate that comments received by countries in the UPR process indeed captures real human rights records of countries, especially when one averages over all the comments received from different states.
Read full abstract