A soybean nodulin cDNA clone (E41) hybrid-selected mRNA for three in vitro translation products with apparent molecular weights of 26 kDa, 25 kDa and 24 kDa. Based on Southern analysis of soybean genomic DNA, combined with mapping and sequencing of genomic clones, we identified four genes that are related to E41, one of which was identified to be the previously characterized N-20 gene. Our data indicate the linkage of three of the genes, of which one is a truncated version and suggest that they originated by gene duplication combined with deletion and conversion. The genes are highly expressed and we postulate that the sequence conservation in the 5' and 3' flanking regions of all four genes, has a functional role in their expression. Hybrid-selected translation products of E41 are not immunoprecipitable with antibody to the soluble fraction of nodules suggesting that they are membrane associated. The N-20 gene, which is a member of this gene subfamily, showed sequence similarity to four previously characterized nodulin genes and a phylogenetic tree is proposed based on the extent of sequence similarity.