Abstract

Information geometry provides a correspondence between differential geometry and statistics through the Fisher information matrix. In particular, given two models from the same parametric family of distributions, one can define the distance between these models as the length of the geodesic connecting them in a Riemannian manifold whose metric is given by the model’s Fisher information matrix. One limitation that has hindered the adoption of this similarity measure in practical applications is that the Fisher distance is typically difficult to compute in a robust manner. We review such complications and provide a general form for the distance function for one parameter model. We next focus on higher dimensional extreme value models including the generalized Pareto and generalized extreme value distributions that will be used in financial risk applications. Specifically, we first develop a technique to identify the nearest neighbors of a target security in the sense that their best fit model distributions have minimal Fisher distance to the target. Second, we develop a hierarchical clustering technique that utilizes the Fisher distance. Specifically, we compare generalized extreme value distributions fit to block maxima of a set of equity loss distributions and group together securities whose worst single day yearly loss distributions exhibit similarities.

Highlights

  • Quantifying the similarity between two probability distributions is a central component of a variety of applications in the statistics, quantitative finance, and engineering literature

  • We will consider the Fisher information distance on a model space of distributions which in turn will be used in both nearest neighbor and clustering applications that focus on the entire risk profiles of financial securities

  • The main practical focus will be to group loss distributions by their risk profiles; how similar are the left tails of the two distributions? With this in mind, we focus on the generalized Pareto and generalized extreme value distributions from extreme value theory

Read more

Summary

Introduction

Quantifying the similarity between two probability distributions is a central component of a variety of applications in the statistics, quantitative finance, and engineering literature. Each of [9,10,11], utilize correlation distance measures to develop hierarchical clustering algorithms in order to explore the network structures in equity markets These techniques take only the return time series of a collection of stocks as input and are able to identify hierarchical sector structure solely from this information. We focus on the generalized Pareto and generalized extreme value distributions from extreme value theory This distance function may be used as an input into a nearest neighbor or hierarchical clustering method, both of which provide a risk based clustering of financial securities analogous to the previously mentioned correlation methods. We provide nearest neighbor and hierarchical clustering applications that group together equity return distributions based on the Fisher distance between them This results in a new risk focused clustering technique of financial securities.

Information Geometry Background
Distance Functions for One-Dimensional Models
Higher Dimensional Examples
Gaussian Distribution
Generalized Pareto Distribution
Generalized Extreme Value Distribution
Computing Geodesic Distances
Applications
Comparison between KL Divergence and Fisher Distance
Generalized Pareto Nearest Neighbor Example
Findings
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call