Bioinformatics We know that the human microbiome contains a wealth of largely unknown genetic diversity. Whole metagenome shotgun sequencing is needed to make the links between genes and phenotypes. Sberro et al. have computationally analyzed thousands of small protein-coding genes in the Human Microbiome Project data. More than 400,000 clusters of open reading frames (ORFs) of less than 150 nucleotides were constructed de novo. Within the approximately 4000 small ORFs likely to be protein-coding, the majority have no homology with known protein domains, and some are evolutionarily conserved, indicating that they have essential functions. Some peptides are transmembrane and secreted, which hints that they are used in signaling between bacteria and host. Others are implicated in bacterial immunity and horizontal gene transfer. Thus, this effort provides a rich resource for understanding the body's relationship with its microbiota. Cell 178 , 1245 (2019).