Abstract

Most cropping-and-segmenting pattern parsers rely on a single metric or scheme to reason about diverse inner correlations, which yields over-general and redundant representations. To make the pattern parsing procedure more streamlined and concise, a part-relation scissor network (PRSN) with multi-constrained attention shifters (MCASs) and multi-head attention expectation-maximization routing agreement (MhAEMRA) is proposed on the basis of the matrix-based capsule network (CapsNet). MCASs detect and prune fragile part-to-whole correlations from the perspectives of inter-part diversity and intra-object cohesion: only those primary entities that satisfy both criteria are allowed to update senior entities. MhAEMRA is defined to screen out redundant capsule voting signals. PRSN gradually parses objective semantic patterns by clustering highly associated secondary entities in a bottom-up “part backtracking” manner. Quantitative and ablation experiments on face and human parsing tasks demonstrate the superiority of PRSN over state-of-the-art methods, especially in delineating fine-grained semantic boundaries.
