TY - JOUR
T1 - Extensive unexplored human microbiome diversity revealed by over 150,000 genomes from metagenomes spanning age, geography, and lifestyle
AU - Pasolli, Edoardo
AU - Asnicar, Francesco
AU - Manara, Serena
AU - Zolfo, Moreno
AU - Karcher, Nicolai
AU - Armanini, Federica
AU - Beghini, Francesco
AU - Manghi, Paolo
AU - Tett, Adrian
AU - Ghensi, Paolo
AU - Collado, Maria Carmen
AU - Rice, Benjamin L.
AU - DuLong, Casey
AU - Morgan, Xochitl C.
AU - Golden, Christopher D.
AU - Quince, Christopher
AU - Huttenhower, Curtis
AU - Segata, Nicola
N1 - Funding Information:
We thank the MAHERY team, particularly Anjaranirina Evelyn Jean Gasta, Hervet Randriamady, and Miadana Vonona Arisoa, and the GeNaPi Project Team, particularly Mari Olcina, Lourdes Larruy, Carla Muñoz, and Cristina Alcántara. We thank Levi Waldron and all the members of the Segata and Huttenhower laboratories for fruitful discussions, the students of the Computational Microbial Genomics master course at University of Trento for help with the manual curation of assemblies, and the HPC and NGS facilities at University of Trento. This work was supported by EU-H2020 ( DiMeTrack-707345 ) to E.P.; by NIH NHGRI ( R01HG005220 ), NIDDK ( R24DK110499 ), NIDDK ( U54DE023798 ), CMIT ( 6935956 ) to C.H.; and by ERC ( MetaPG-716575 ), MIUR ( RBFR13EWWI ), EU-FP7 ( PCIG13-GA-2013-618833 ), CARITRO ( 2013.0239 ), and LEO Pharma Foundation to N.S. Madagascar data and sample collection was supported by the Rockefeller Foundation to C.D.G.
Funding Information:
We thank the MAHERY team, particularly Anjaranirina Evelyn Jean Gasta, Hervet Randriamady, and Miadana Vonona Arisoa, and the GeNaPi Project Team, particularly Mari Olcina, Lourdes Larruy, Carla Muñoz, and Cristina Alcántara. We thank Levi Waldron and all the members of the Segata and Huttenhower laboratories for fruitful discussions, the students of the Computational Microbial Genomics master course at University of Trento for help with the manual curation of assemblies, and the HPC and NGS facilities at University of Trento. This work was supported by EU-H2020 (DiMeTrack-707345) to E.P.; by NIH NHGRI (R01HG005220), NIDDK (R24DK110499), NIDDK (U54DE023798), CMIT (6935956) to C.H.; and by ERC (MetaPG-716575), MIUR (RBFR13EWWI), EU-FP7 (PCIG13-GA-2013-618833), CARITRO (2013.0239), and LEO Pharma Foundation to N.S. Madagascar data and sample collection was supported by the Rockefeller Foundation to C.D.G.
Publisher Copyright:
© 2019 The Author(s)
PY - 2019/1/24
Y1 - 2019/1/24
N2 - The body-wide human microbiome plays a role in health, but its full diversity remains uncharacterized, particularly outside of the gut and in international populations. We leveraged 9,428 metagenomes to reconstruct 154,723 microbial genomes (45% of high quality) spanning body sites, ages, countries, and lifestyles. We recapitulated 4,930 species-level genome bins (SGBs), 77% without genomes in public repositories (unknown SGBs [uSGBs]). uSGBs are prevalent (in 93% of well-assembled samples), expand underrepresented phyla, and are enriched in non-Westernized populations (40% of the total SGBs). We annotated 2.85 M genes in SGBs, many associated with conditions including infant development (94,000) or Westernization (106,000). SGBs and uSGBs permit deeper microbiome analyses and increase the average mappability of metagenomic reads from 67.76% to 87.51% in the gut (median 94.26%) and 65.14% to 82.34% in the mouth. We thus identify thousands of microbial genomes from yet-to-be-named species, expand the pangenomes of human-associated microbes, and allow better exploitation of metagenomic technologies.
AB - The body-wide human microbiome plays a role in health, but its full diversity remains uncharacterized, particularly outside of the gut and in international populations. We leveraged 9,428 metagenomes to reconstruct 154,723 microbial genomes (45% of high quality) spanning body sites, ages, countries, and lifestyles. We recapitulated 4,930 species-level genome bins (SGBs), 77% without genomes in public repositories (unknown SGBs [uSGBs]). uSGBs are prevalent (in 93% of well-assembled samples), expand underrepresented phyla, and are enriched in non-Westernized populations (40% of the total SGBs). We annotated 2.85 M genes in SGBs, many associated with conditions including infant development (94,000) or Westernization (106,000). SGBs and uSGBs permit deeper microbiome analyses and increase the average mappability of metagenomic reads from 67.76% to 87.51% in the gut (median 94.26%) and 65.14% to 82.34% in the mouth. We thus identify thousands of microbial genomes from yet-to-be-named species, expand the pangenomes of human-associated microbes, and allow better exploitation of metagenomic technologies.
KW - human microbiome
KW - metagenomic assembly
KW - metagenomic mappability
KW - metagenomic meta-analysis
KW - metagenomics
KW - non-Westernized microbiomes
KW - unexplored microbial diversity
UR - http://www.scopus.com/inward/record.url?scp=85060181041&partnerID=8YFLogxK
U2 - 10.1016/j.cell.2019.01.001
DO - 10.1016/j.cell.2019.01.001
M3 - Article
C2 - 30661755
AN - SCOPUS:85060181041
VL - 176
SP - 649-662.e20
JO - Cell
JF - Cell
SN - 0092-8674
IS - 3
ER -