Multivesicular endosomes (MVEs) sequester membrane proteins destined for degradation within intralumenal vesicles (ILVs), a process mediated by the membrane-remodeling action of Endosomal Sorting Complex Required for Transport (ESCRT) proteins. In Arabidopsis, endosomal membrane constriction and scission are uncoupled, resulting in the formation of extensive concatenated ILV networks and enhancing cargo sequestration efficiency. Here, we used a combination of electron tomography, computer simulations, and mathematical modeling to address the questions of when concatenated ILV networks evolved in plants and what drives their formation. Through morphometric analyses of tomographic reconstructions of endosomes across yeast, algae, and various land plants, we have found that ILV concatenation is widespread within plant species, but only prevalent in seed plants, especially in flowering plants. Multiple budding sites that require the formation of pores in the limiting membrane were only identified in hornworts and seed plants, suggesting that this mechanism has evolved independently in both plant lineages. To identify the conditions under which these multiple budding sites can arise, we used particle-based molecular dynamics simulations and found that changes in ESCRT filament properties, such as filament curvature and membrane binding energy, can generate the membrane shapes observed in multiple budding sites. To understand the relationship between membrane budding activity and ILV network topology, we performed computational simulations and identified a set of membrane remodeling parameters that can recapitulate our tomographic datasets.