The pessimistic and optimistic neighborhood multi-granulation rough sets (PNMRS and ONMRS) have been applied to attribute reduction. Nonetheless, the setting of the granulation space in both models is usually based on expert experience or continuous trials, which are poorly interpretable. Besides, the dependency, an important attribute evaluation function in PNMRS and ONMRS, only considers classification information in the lower approximation and ignores that of the upper approximation, affecting its evaluation effect. Thus, we firstly propose the pessimistic and optimistic neighborhood extreme-granulation rough sets (PNERS and ONERS) based on the minimum granulation space, avoiding the unobjective setting of the granulation space. However, PNERS describes knowledge with low accuracy, and therefore we construct two self-information uncertainty measures in ONERS and propose the joint self-information that is more suitable for attribute reduction as it takes full consideration of classification information in the lower and upper approximations. Then, an attribute reduction method using the joint self-information is designed, and Fisher Score is introduced into our method to decrease time cost when processing high dimensional data. Extensive experiments validate that our method is able to achieve superior reduction and classification results on different dimensional datasets.