• <tr id="yyy80"></tr>
  • <sup id="yyy80"></sup>
  • <tfoot id="yyy80"><noscript id="yyy80"></noscript></tfoot>
  • 99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

    SeSaMe:Metagenome Sequence Classification of Arbuscular Mycorrhizal Fungi-associated Microorganisms

    2020-09-02 00:04:14JeeEunKangAntonioCiampiMohamedHijri
    Genomics,Proteomics & Bioinformatics 2020年5期

    Jee Eun Kang * ,Antonio Ciampi ,Mohamed Hijri *

    1 Institut de Recherche en Biologie Ve′ge′tale,De′partement de Sciences Biologiques,Universite′ de Montre′al,Montre′al,QC H1X 2B2,Canada

    2 Department of Epidemiology,Biostatistics and Occupational Health,McGill University,Montre′al,QC H3A 1A2,Canada

    KEYWORDS SeSaMe;Spore-associated symbiotic microbes;Arbuscular mycorrhizalfungi;Taxonomic classification;3-codon DNA 9-mer

    Abstract Arbuscular mycorrhizal fungi(AMF)are plant root symbionts that play key roles in plant growth and soil fertility.They are obligate biotrophic fungi that form coenocytic multinucleated hyphae and spores.Numerous studies have shown that diverse microorganisms live on the surface of and inside their mycelia,resulting in a metagenome when whole-genome sequencing(WGS)data are obtained from sequencing AMF cultivated in vivo.The metagenome contains not only the AMF sequences,but also those from associated microorganisms.In this study,we introduce a novel bioinformatics program, Spore-associated Symbiotic Microbes (SeSaMe),designed for taxonomic classification of short sequences obtained by next-generation DNA sequencing.A genus-specific usage bias database was created based on amino acid usage and codon usage of a three consecutive codon DNA 9-mer encoding an amino acid trimer in a protein secondary structure.The program distinguishes between coding sequence (CDS)and non-CDS,and classifies a query sequence into a genus group out of 54 genera used as reference.The mean percentages of correct predictions of the CDS and the non-CDS test sets at the genus level were 71% and 50% for bacteria,68% and 73%for fungi (excluding AMF),and 49% and 72% for AMF (Rhizophagus irregularis),respectively.SeSaMe provides not only a means for estimating taxonomic diversity and abundance but also the gene reservoir of the reference taxonomic groups associated with AMF.Therefore,it enables users to study the symbiotic roles of associated microorganisms.It can also be applicable to other microorganisms as well as soil metagenomes.SeSaMe is freely available at www.fungalsesame.org.

    Introduction

    Arbuscular mycorrhizal fungi(AMF)are plant root inhabiting fungi,of the subphylum Glomeromycotina,which form symbioses with more than 80% of vascular plants worldwide [1].They supply plants with essential nutrients particularly phosphorus and nitrogen,protect them against soil-borne pathogens,and alleviate their abiotic stresses [1-3].Therefore,AMF-based inoculants have been applied in agriculture as a biofertilizer and in phytoremediation for cleaning up contaminated soil [2,4-7].Despite the ecological,agricultural,and environmental importance of AMF,their genetics is poorly understood due to their complex genome organization.They form coenocytic hyphae,reproduce through multinucleated asexual spores,and are strict symbionts[8].Furthermore,it is suggested that AMF are heterokaryons,although this is under debate [9].In addition,numerous studies reported that bacteria and fungi inhabit the surface and the interior of mycelia and spores[10-14].In 2012 and 2013,Tisserant et al.[15,16]separately published the transcriptome and genome of the AMFRhizophagus irregulariscultivatedin vitro.However,only a few AMF taxa are able to grow in axenicin vitrosystems with transformed roots as a host.Thus,whole-genome sequencing (WGS) data from AMF spore DNA originating fromin vivocultures (a conventional cultivation method in a pot culture with a host plant),contain a substantial number of non-AMF DNA sequences,but do provide important information on the microbial communities associated with AMF.In contrast,WGS data fromin vitropetri-dishes contain fewer non-AMF sequences,because antibiotics are used to initiate axenic cultures [17].

    Taxonomic classification of WGS data obtained from AMF cultivatedin vivousing current bioinformatics approaches is challenging because these data represent a complex metagenome containing sequences of prokaryotic and eukaryotic microorganisms.Two major approaches for taxonomic classification of random whole metagenome sequencing data (e.g.,whole metagenome shotgun sequencing data)include composition-based methods and similarity-based search methods [18,19].The latter ones include BLAST and its sister programs that are adequate for inferring functions of a query sequence[19,20].Nevertheless,they have limitations in taxonomic classification,because they calculate scores based on a 20 by 20 matrix containing the overall rates of the 20 amino acid substitutions created from the most conserved regions of proteins.The same matrix is applied to all types of query sequences,irrespective of functions,structures,and taxonomic groups.However,due to a lack of bioinformatics tools for analyzing random whole metagenome sequencing data,similarity-based search methods have been commonly used for taxonomic classification.In addition to similaritybased search methods,taxonomic classification pipelines for analyzing targeted metagenome sequencing data (e.g.,16S rRNA gene-based metagenome sequencing data) have been widely used for analyzing random whole metagenome sequencing data in combination with homology search program.Numerous repository databases and pipelines have been developed based on the 16S rRNA gene.However,a previous study has reported the horizontal gene transfer of 16S rRNA genes in prokaryotic organisms and the multiple heterogeneous rRNA genes within a single prokaryotic cell [21].Therefore,they may cause misrepresentation of data if they are not properly dealt with,which may result in erroneous taxonomic classification.

    Composition-based methods utilize unique sequence properties such as codon usage bias,compositional patterns in nucleotide sequences(k-mers),and GC content that have been widely used for studying microbial genome evolution in areas of bioinformatics [18,22-26].K-mers are subsequences of length k in a DNA sequence (e.g.,tetramer or 4-mer:ATGT).Composition-based methods using k-mers have been employed in bioinformatics programs for taxonomic classification of random whole metagenome sequencing data [27].They have a number of advantages over similarity-based search methods.It is estimated that more than 99%of existing microorganisms cannot be cultured in laboratory conditions[28],and microbial sequences available in bioinformatics databases represent only a tiny fraction of the diversity of existing microorganisms.Therefore,composition-based methods,which do not require sequence alignments but make predictions based on microorganism’s unique sequence signatures,supposedly excel in taxonomical classification of novel sequences.However,existing bioinformatics programs based on composition-based methods are designed for prokaryotic organisms and their utilization in fungi is inefficient.

    In this study,we introduce a novel bioinformatics program for random whole metagenome sequence classification,SeSaMe (Spore-associated Symbiotic Microbes).It provides a means for estimating taxonomic diversity and abundance,as well as,the reservoir of genes of reference taxonomic groups in AMF metagenome.It therefore enables users to study symbiotic roles of taxonomic groups associated with AMF.In order to filter complex evolutionary signals and obtain comparable evolutionary footprints,we calculated codon usage bias based on the amino acid usage and the codon usage of a 3-codon DNA 9-mer that encodes three consecutive amino acids located in a protein secondary structure.We joined three consecutive codons into one unit,and calculated the unit’s relative frequency among synonymous 3-codon DNA 9-mers,which will be hereafter referred to as 3-codon usage.3-codon usage has higher resolution than mono codon usage in assessing the differences among taxonomic groups because evolutionary forces acting on a codon and its encoded amino acid vary widely across protein secondary structures as well as across taxonomic groups.For example,the evolutionary forces acting on the codon AAA,encoding the amino acid Lysine (K) in TGGAAAGTG (WKV),have been different from the evolutionary forces acting on the codon AAA in GACAAAGAA(DKE).We found that 3-codon usage of a 3-codon DNA 9-mer belonging to a protein secondary structure is a taxonomically unique sequence property.SeSaMe calculates a score based on six sets of 3-codon DNA 9-mers from all reading frames (Figure 1),and distinguishes between coding sequence(CDS) and non-CDS.It has an advantage over existing composition-based methods that do not identify nucleotide subsequences with structural roles,or do not consider the biological importance of codons and reading frames.SeSaMe is freely available at www.fungalsesame.org.

    Figure 1 Unique advantage of SeSaMe over existing programs

    Method

    Bacterial and fungal sequence databases

    We selected bacterial genera that were dominant in soil based on a literature review [10,28,29-32].While NCBI offered a broad selection of more than 2300 completely sequenced bacterial genomes,we did not have many choices for the majority of fungal phyla.Most of the completely sequenced fungal genomes in NCBI or JGI were Dikarya,while we needed diverse fungal genomes covering Mucoromycotina,AMF,Blastocladiomycota,Neocallimastigomycota,Microsporidia,and Chytridiomycota.We assigned the completely sequenced genomes of 444 bacteria and 11 fungi,includingR.irregularis,to 45 bacterial and 9 fungal genera,respectively,and created CDS and non-CDS databases per genus based on CDS lists provided by NCBI,JGI,and Tisserant et al.[16].The number of genomes per genus varied from 1 to 81,depending on their availability in public databases.The total number of the bacterial genes per genus and the total number of the fungal genes and introns per genus are shown in Tables S1 and S2,respectively.Sequences with an ambiguous nucleotide or with a length shorter than nine—the minimum length of nucleotides required for a 3-codon DNA 9-mer—were excluded.Cryptococcusand Agaricomycetes (Phanerochaete,Scleroderma,andSebacina) belong to the same subdivision,Agaricomycotina,and were grouped together in order to simplify the analysis.

    Database design

    For selecting a parameter k of k-mer,we chose 3-codon DNA 9-mer as the length of amino acids and of nucleotides,considering the approximate number of amino acids required to form a helical turn in helix and a beta-strand.The program consists of two main components:databases and scoring methods.The major distinguishing feature is the trimer reference sequence database (Trimer Ref.DB).126,093 Protein Data Bank(PDB)entry files were processed with in-house developed parsing programs to extract 7674 amino acid trimers (A.A.Trimers),subunits of protein secondary structures,which were assigned to the sequence variable — A.A.Trimer [33].224,383 3-codon DNA 9-mers,encoding 7674 A.A.Trimers,were assigned to the sequence variable — 3-codon DNA 9-mer.In Trimer Ref.DB,the sequence variables,A.A.Char Trimer (amino acid characteristic trimer),A.A.Trimer,and 3-codon DNA 9-mer,form a three-level hierarchy where A.A.Char Trimer is the highest level (Figure 2).To create A.A.Char,first,we assigned amino acids with similar properties into one group according to polarity and charge of their side chain,and secondly subdivided each group according to their volume(Table 1).Cysteine,glycine,histidine,methionine,and proline have special properties.Therefore,each of them was assigned as a sole member of an A.A.Char group.Generally,multiple A.A.Trimers with similar properties belong to one A.A.Char Trimer.An A.A.Char Trimer and an A.A.Trimer have an A.A.Trimer table and a 3-codon DNA 9-mer table containing multiple members,respectively (Figure 2).

    Genus-specific usage bias database (Genus-specific DB)contains the main numerical variable—trimer usage bias.Trimer usage bias is calculated by multiplying the A.A.Trimer usage of A.A.Trimer by the 3-codon usage of 3-codon DNA 9-mer in Trimer Ref.DB (Figure 2).There are 54 CDS Genus-specific DBs and the same number of non-CDS Genus-specific DBs in the program.Each CDS Genusspecific DB contains 1296 A.A.Trimer Usage Tables and 7674 3-codon Usage Tables created based on the CDS database.Non-CDS Genus-specific DB contains the same number of tables derived from the same sequence variables of the Trimer Ref.DB for cost-effective CDS and non-CDS classification.Because SeSaMe compares frequency information of 54 genera calculated based on the same standard genetic code table for the same 3-codon DNA 9-mers,inaccuracy in calculating trimer usage bias of non-CDS is assumed to be insignificant.

    Scoring methods

    We developed two scoring methods,and each equipped with aPvalue scoring method.The trimer usage probability scoring method classifies a query sequence into one out of 54 genus references,while the rank probability scoring method classifies a query sequence into one out of 13 taxonomic groups:Clostridia,Bacilli,Oscillatoriophycideae,Nostocales,Acidobacteriales,Betaproteobacteria,Deltaproteobacteria,Gammaproteobacteria,Alphaproteobacteria,Actinobacteria,AMF (R.irregularis),Agaricomycotina,and Pezizomycotina.To avoid word repetition,these taxonomic groups will be hereafter referred to as 13 taxon groups,and in the same order as shown in the list above.

    Trimer usage probability scoring method

    This method converts 3-codon DNA 9-mers in a query sequence into A.A.Char Trimers and identifies those with structural roles by searching them against Trimer Ref.DB.For each matching A.A.Char Trimer,the method first searches a matching A.A.Trimer,and second,a matching 3-codon DNA 9-mer in Trimer Ref.DB (Figure 3).It retrieves trimer usage biases of the matching 3-codon DNA 9-mers from CDS Genus-specific DB per reading frame of a query sequence.It repeats the same process with non-CDS Genusspecific DB.It then compares scores from CDS and non-CDS Genus-specific DBs,and selects a genus with the highest score (Figure 3).Users are provided with an option to include genera whose scores have little difference from the highest score.

    Rank probability scoring method

    This method measures a standardized 3-codon usage relative to an expected 3-codon usage computed from three individual mono codon usages.The Average A.A.Usage Table(20 amino acids and stop codons for 12 A.A.Char monomers) and the Average Codon Usage Table (64 codons for 20 amino acid monomers and stop codons)were created based on CDS database per genus.1296 Expected A.A.Trimer Usage Tables and 7674 Expected 3-codon Usage Tables were created based on the Average A.A.Usage Table and the Average Codon Usage Table,respectively (Figure 4).

    Figure 2 Database design

    Figure 3 Flow chart of the program

    Figure 4 Creation of expected usage tables for the rank probability scoring method

    A standardized 3-codon usage was calculated by dividing a 3-codon usage in a 3-codon Usage Table by an expected 3-codon usage in an Expected 3-codon Usage Table.Based on trimer usage biases and standardized 3-codon usages,we calculated a group mean for each taxon group and Kruskal Wallis(KW) test’s h-score on ranks of 13 taxon groups,from which we developed a rank probability score per 3-codon DNA 9-mer.In addition to the trimer usage biases,the rank probability scoring method adds another set of 224,383 scores per genus into Genus-specific DB.This method is applicable only to CDS.

    Table 1 Conversion table from A.A.to A.A.Char

    P value scoring method

    We applied the concept of the sum of rolled numbers from a pair of dice to develop thePvalue scoring method (http://www.lucamoroni.it/the-dice-roll-sum-problem/).We drew analogies between the number of faces of a dice and 54 genera and between the number of dices we roll and the number of matching 3-codon DNA 9-mers identified in a reading frame of a query sequence.There were 54 possible ranks computed based on trimer usage biases per matching 3-codon DNA 9-mer.Pvalue scores were calculated based on a sum of ranks of matching 3-codon DNA 9-mers.Computational costs ofPvalues for all possible outcomes,sums of ranks,were too high.To reduce the computational costs,we approximatedPvalues.We obtained sample data per number of matching 3-codon DNA 9-mers based on Equation (1).

    wherepis the sum of ranks,nis the number of dices per roll,sis the number of faces of the dice,54,and maximum ofkis](p-n)/s[ where ]x[ is the floor function (e.g.,]7.9[=7).We created a table ofPvalue scores per number of matching 3-codon DNA 9-mers.If a rank sum was less than one with the highestPvalue score,the approximate mean of all of the rank sums in each table,we multiplied thePvalue score with-1,indicating statistically non-significant outcome.In the test sets,the number of matching 3-codon DNA 9-mers varied widely,with a minimum of 30 and a maximum of 97.We have 624 tables in thePvalue score database covering 2-625 matching 3-codon DNA 9-mers.

    Implementation and program availability

    SeSaMe was implemented using the Java programming language (Java 8).We provided two sets of the programs.One requires Apache commons math3 (3.3) and IO (2.4) libraries(www.apache.org),while the other does not.The programs consist of executable Java JAR files and Java class files for Linux/Unix operating systems.SeSaMe has been tested andconfirmed to work on Linux system—CentOS Linux 7(www.centos.org)and is currently being used at the Biodiversity Center,Institut de Rechercheen Biologie Ve′ge′tale,De′partement de Sciences Biologiques,Universite′ de Montre′al.The trimer usage probability scoring method offered to the public produces output of smaller size,but is sufficient for the purpose of taxonomic classification and is freely available at www.fungalsesame.org.There are no restrictions to use the programs by academic or non-academic organizations as long as they comply with the terms and conditions of the license agreements.

    Input,output,and options

    SeSaMe utilizes a command-line interface.Input files should contain DNA sequence(s) in fasta format.The Java JAR files produce output files with sequence information (seq_id,matching A.A.Char Trimers,A.A.Trimers,and 3-codon DNA 9-mers) and genus information (rank,scores,andPvalue score).The output provides the information per reading frame per sequence.After processing the output file with Java class files,users are able to obtain a summary file containing one predicted outcome per query sequence.Java JAR files require users to give a mandatory command line argument— input file path.Java JAR files with the trimer probability scoring method may produce multiple genera as an answer if their scores have little difference.A user is given the option with 6 choices to select a cut-off value for the difference:0.01,0.05,0.1,0.15,0.2,or 0.3.Users can give the option to the Java class file called compare_result_coding_non_coding.class.The default cut-off value is 0.05.The lower the cut-off value is,the fewer genera will be included in an answer.

    Results

    We assessed the accuracy of the program by conducting classification experiments.We created metagenome test sets,ran the programs with them,and calculated the mean percentages of correct predictions.We showed the relationship between the correct prediction proportion and thePvalue score in order to provide users with useful examples in assessing the statistical significance of predicted outcomes.

    Metagenome test sets

    We randomly chose 100 sequences from each of the CDS and non-CDS databases per genus.We randomly selected a starting base pair position in each of the randomly chosen sequences.From the starting position,we randomly selected an ending base pair position so that a sequence length is within the range of 150-300 bp.Both of the CDS and non-CDS test sets consisted of 4500 bacterial and 900 fungal sequences (including AMF).

    Mean percentages of correct predictions from the trimer usage probability scoring method

    The mean percentages of correct predictions of the CDS and non-CDS test sets at the genus level were 71% and 50% for the bacterial group,68% and 73% for the fungal group (excluding AMF),and 49% and 72% for AMF,respectively.AMF had the lowest prediction percentage in the CDS genus test sets possibly due to a large number of heterogeneous nuclei and horizontal gene transfers from a variety of endobacteria during their evolution[8,10-14].The mean percentages of correct predictions at the genus level and at higher taxonomic ranks of the 13 taxon groups are shown inTable 2.

    SeSaMe produced more than one genus as an answer per query sequence when multiple genera had little difference in their scores.We converted each predicted genus into one of the 13 taxon groups and calculated a proportion of the correct taxon group in answer per query sequence.We calculated the mean and the standard deviation of the proportions in each genus test set;1 represented that answers contained correct taxon groups only,while 0 represented that answers contained incorrect taxon groups only(Tables S3 and S4).The means of the bacterial and fungal CDS test sets were 0.87 and 0.57,respectively (Tables S3 and S4).

    60% and 46% of the correctly predicted sequences contained only one genus as an answer in the bacterial CDS and non-CDS test sets,respectively (Figure S1;Table S5).90%and 76% of the correctly predicted sequences had a correct taxon group in the first rank in the bacterial CDS and non-CDS test sets,respectively (Figure S2;Table S6).Only 1%-5% of the sequences in the non-AMF test sets had AMF in an answer (Table S7).Although the trimer usage probability scoring method provides us not with the individual trimer usage biases but with result of multiplying all of the trimer usage biases,we can often derive general ideas about a query sequence from its answer(Figures S3 and S4).Does it contain only one genus in answer?Or what other genera does it contain in answer? For example,an AMF test sequence that containsClostridiumand AMF in answer may imply that the query sequence might have been acquired by horizontal gene transfer from a bacterium ancestor or thatClostridium’s ancestor might have become a heritable endosymbiont.

    Mean percentages of correct predictions from the rank probability scoring method

    The mean percentages of correct predictions of the CDS test sets were 82% for the bacterial group,72% for the fungal group (excluding AMF),and 42% for AMF.The mean percentages and the standard deviations of correct predictions of the CDS test sets were 64%±4.2%,71%±6.4%,84%±2.5%,70%±2.8%,73%±0%,83%±8%,74%±10%,81%±7.8%,88%±9.2%,85%±5.9%,42%±0%,65%±6.4%,and 79%±6.7% for the 13 taxon groups.Compared to the trimer usage probability scoring method,the rank probability scoring method produced the higher mean and the smaller standard deviation for the bacterial group.In general,the rank probability scoring method showed improvement in performance.Although the means for Clostridia and Gammaproteobacteria were lower,their standard deviations were much smaller in the rank probability scoring method:4.2%vs.22% and 7.8%vs.9.4%,respectively.The trimer usage probability scoring method showed better performance in Actinobacteria that had low withingroup variation of trimer usage bias.In contrast,the rank probability scoring method showed better performance in genera that had relatively flat peakness in a frequency distribution curve of synonymous 3-codon DNA 9-mers,in addition to genera that had relatively large within-group variation of trimer usage bias.

    Relationship between correct prediction proportion and P value score

    The means of the correct prediction proportions per number of matching 3-codon DNA 9-mers calculated based on the result of the trimer usage probability scoring method are shown in Figure S5A and Table S8.The means of correct prediction proportions per base 10 logarithm of an approximated inverse ofPvalue score (log10inverse ofPvalue score) calculated based on result of the trimer usage probability scoring method and of the rank probability scoring method are shown in Figure S5B,Table S9,Figure S5C,and Table S10,respectively.We divided the results of each genus test set into quartiles and calculated the range of log10inverse ofPvalue score and the mean and the standard deviation of the correct prediction proportions in each quartile (Tables S11 and S12).The first ranked genus with the highest probability score always had positivePvalue score.In general,as log10inverse ofPvalue score became higher —i.e.,as positivePvalue score became lower — the correct prediction proportion increased in all test sets.The frequencies of fungal sequences that had a correct taxon group in the 1st,2nd,3rd,4th,or 5th rank in an answer were comparable due to similarity of Dikarya(Figure S2;Table S6).Because the data for Figure S5B were generated based only on the first rank,the fungi showed relatively weak correlation between correct prediction proportion and log10inverse ofPvalue score.

    Classification of an example sequence

    The example sequence(AAATCCCAATGTCAGAATAAAG AAACTACCAGATGATCATCCTGTTTATCCTGGGTAT GGATTATTTGCTAACAAAGATCTTAAAAAATTTAAT CTAGTCGTTTGTTATACTGGCAAAGTTACAAAAAGA GAAATTGGGGGTGAAGAAGGAAGTGA) was selected from the AMF CDS test set and is 156 bp in length.The sequence had the highest trimer usage probability score in the second reading frame in the forward direction,which was then assumed as the open reading frame.SeSaMe identified 49 matching 3-codon DNA 9-mers that were matched to Trimer Ref DB.The program correctly classified the example sequence into CDS of AMF.Firmicutes,Cyanobacteria,Rickettsia,and AMF had higher trimer usage biases than Proteobacteria,Actinobacteria,and Dikarya in a majority of 3-codon DNA 9-mers.Figure 5shows trimer usage biases of the 11th 3-codon DNA 9-mer,GATGATCAT,in 54 genera.GATGATCAT belongs to A.A.Char Trimer,CCB and to A.A.Trimer,DDH.The multidimensional scaling (MDS)method was applied to a matrix containing trimer usage biases;it had 54 genera in rows and matching 3-codon DNA 9-mers identified in the open reading frame in columns (http://www.inf.uni-konstanz.de/algo/software/mdsj/) [34].It visualized proximity relationships among 54 genera in XY axis graph(www.jfree.org).It showed that Actinobacteria,Alphaproteobacteria,and Dikarya were compactly clustered,while Betaproteobacteria were spread out in the left side of the graph(Figure S6).Nostocales,Oscillatoriophycideae,Bacilli,and Clostridia were scattered across in the right side.AMF,Cyanobacterium,andRickettsiawere located in the far-right side.

    Figure 5 Trimer usage biases of the 11th 3-codon DNA 9-mer,GATGATCAT,in 54 genera

    Future work

    Microorganisms contain a number of heterogeneous alternative sigma factors that are selectively induced in response to environmental stress [35].They not only provide functionally specialized RNA polymerase subpopulations,but are also involved in regulating the expression of a set of target genes,or regulons [36,37].Ribosomal heterogeneity also has an important role in governing cellular stress.Sequence comparison of rRNA genes and ribosomal coding genes as well as sigma factor genes will be required in order to study their influence on adaptation of microorganisms [38,39].

    Codon usage and codon context have been documented to play various important roles in microorganism’s adaptation to environment.If we sort the target genes in the CDS and in the non-CDS databases according to the alternative regulators,and create Genus-specific DB per group of the target genes,it may increase the accuracy of taxonomic classification.Moreover,comparative studies on alternative regulator subpopulations may provide useful insights into the development of genetic markers with which we can detect changes in microbial community structures in response to environmental stress(Figure S7).It may lead to new perspectives and strategies for improving the analysis of metagenome data,especially AMF inoculant field data sampled from highly stressful environments.

    Data availability

    SeSaMe is freely available at www.fungalsesame.org.

    CRediT author statement

    Jee Eun Kang:Conceptualization,Methodology,Software,Validation,Software,Writing-original draft.Antonio Ciampi:Supervision,Writing -review & editing.Mohamed Hijri:Supervision,Writing -review & editing.All authors read and approved the final manuscript.

    Competing interests

    The authors have declared no competing interests.

    Acknowledgments

    The authors gratefully acknowledge E′ ducation et de l’Enseignement supe′rieur Quebec (AFE),Faculte′ des e′tudes supe′rieures et postdoctorales de l’UdeM (FESP),and Institut de Recherche en Biologie Ve′ge′tale de l’Universite′ de Montre′al(IRBV) for awarding scholarships to KJE.The authors gratefully acknowledge insightful comments from Bachir Iffis,David Walsh,Etienne Yergeau,Franck Stefani,Ivan de la Providencia,Jesse Shapiro,Sylvie Hamel,and Yves Terrat.We also thank Andrew Blakney for English editing.

    Supplementary material

    Supplementary data to this article can be found online at https://doi.org/10.1016/j.gpb.2018.07.010.

    ORCID

    0000-0003-2475-0474 (Jee Eun Kang)

    0000-0003-4838-8297 (Antonio Ciampi)

    0000-0001-6112-8372 (Mohamed Hijri)

    岛国毛片在线播放| 国产精品蜜桃在线观看| 色哟哟·www| 亚洲国产av新网站| 亚洲,一卡二卡三卡| 啦啦啦在线观看免费高清www| 亚洲综合色网址| 亚洲综合色网址| 五月玫瑰六月丁香| 2018国产大陆天天弄谢| 99热网站在线观看| 美女主播在线视频| 国产高清不卡午夜福利| 色94色欧美一区二区| 午夜视频国产福利| 秋霞伦理黄片| 一个人看视频在线观看www免费| 日韩欧美一区视频在线观看| 欧美老熟妇乱子伦牲交| 久久久久久人妻| 嘟嘟电影网在线观看| 成年av动漫网址| 51国产日韩欧美| 国产精品99久久99久久久不卡 | 一区在线观看完整版| 美女脱内裤让男人舔精品视频| 亚洲精品美女久久av网站| www.av在线官网国产| 尾随美女入室| 韩国高清视频一区二区三区| 中文字幕人妻丝袜制服| .国产精品久久| 欧美最新免费一区二区三区| 亚洲国产av影院在线观看| 国产伦理片在线播放av一区| 欧美+日韩+精品| 国产精品不卡视频一区二区| 国产 精品1| 欧美xxⅹ黑人| 亚洲人成网站在线观看播放| a级片在线免费高清观看视频| 不卡视频在线观看欧美| 亚洲,欧美,日韩| kizo精华| 亚洲av综合色区一区| 精品久久蜜臀av无| 伦理电影免费视频| 亚洲欧美清纯卡通| 女的被弄到高潮叫床怎么办| 91精品一卡2卡3卡4卡| 成人毛片60女人毛片免费| 国产成人91sexporn| 免费观看无遮挡的男女| 日本黄色片子视频| 考比视频在线观看| 午夜福利网站1000一区二区三区| 久久久久久伊人网av| 久久国产亚洲av麻豆专区| 在线观看人妻少妇| 国产毛片在线视频| 久久久午夜欧美精品| 精品久久久久久久久av| 一本一本综合久久| a 毛片基地| 国产精品嫩草影院av在线观看| 汤姆久久久久久久影院中文字幕| 亚洲精品视频女| 日韩av在线免费看完整版不卡| 男女边摸边吃奶| 国产极品天堂在线| 91aial.com中文字幕在线观看| 69精品国产乱码久久久| 亚洲国产av影院在线观看| 国产欧美另类精品又又久久亚洲欧美| 国产精品免费大片| 免费观看性生交大片5| 亚洲国产最新在线播放| 国产老妇伦熟女老妇高清| 天堂俺去俺来也www色官网| 在线观看www视频免费| 亚洲av不卡在线观看| 国产成人精品一,二区| 欧美变态另类bdsm刘玥| 色网站视频免费| 22中文网久久字幕| 国产日韩一区二区三区精品不卡 | 两个人免费观看高清视频| 日韩一本色道免费dvd| 国产亚洲最大av| 99热全是精品| 五月开心婷婷网| 亚洲欧美成人综合另类久久久| 中文字幕av电影在线播放| 18禁观看日本| 精品酒店卫生间| 成人国产麻豆网| 王馨瑶露胸无遮挡在线观看| 亚洲国产毛片av蜜桃av| 亚洲欧美精品自产自拍| 国产综合精华液| 纯流量卡能插随身wifi吗| 亚洲中文av在线| 色婷婷av一区二区三区视频| 我要看黄色一级片免费的| 天天影视国产精品| 美女大奶头黄色视频| 免费久久久久久久精品成人欧美视频 | 91成人精品电影| 中国美白少妇内射xxxbb| 日本与韩国留学比较| 色视频在线一区二区三区| 亚洲国产成人一精品久久久| 欧美三级亚洲精品| 免费大片18禁| 在线观看人妻少妇| 只有这里有精品99| 最后的刺客免费高清国语| 黄色欧美视频在线观看| 久久久午夜欧美精品| 精品少妇久久久久久888优播| 在线亚洲精品国产二区图片欧美 | 日韩伦理黄色片| 亚洲成人手机| 涩涩av久久男人的天堂| 亚洲精品亚洲一区二区| 在线观看免费高清a一片| 伊人久久国产一区二区| 91午夜精品亚洲一区二区三区| 国产精品欧美亚洲77777| 51国产日韩欧美| 久久婷婷青草| 三级国产精品欧美在线观看| 国产成人免费观看mmmm| 欧美+日韩+精品| 精品一区二区三卡| 亚洲精品美女久久av网站| 色94色欧美一区二区| 亚洲激情五月婷婷啪啪| 日本免费在线观看一区| 18在线观看网站| 自拍欧美九色日韩亚洲蝌蚪91| 日本爱情动作片www.在线观看| 亚洲国产精品成人久久小说| 亚洲av综合色区一区| 精品久久久噜噜| 最近中文字幕2019免费版| 制服诱惑二区| 国产一区二区在线观看av| 啦啦啦视频在线资源免费观看| 国产69精品久久久久777片| 久久av网站| 一区二区三区精品91| 欧美精品高潮呻吟av久久| 99热这里只有精品一区| 免费人成在线观看视频色| 欧美日本中文国产一区发布| 成人毛片60女人毛片免费| 成人国产av品久久久| 婷婷成人精品国产| 精品视频人人做人人爽| 国产免费现黄频在线看| 一区二区三区精品91| 亚洲精品色激情综合| av国产精品久久久久影院| 国产精品一区www在线观看| 在线观看人妻少妇| 波野结衣二区三区在线| 亚洲美女黄色视频免费看| 中文精品一卡2卡3卡4更新| 观看av在线不卡| 色吧在线观看| av国产久精品久网站免费入址| 高清在线视频一区二区三区| 免费人妻精品一区二区三区视频| 亚洲综合精品二区| 18+在线观看网站| 国产极品粉嫩免费观看在线 | www.av在线官网国产| 爱豆传媒免费全集在线观看| 久热久热在线精品观看| 免费观看av网站的网址| 在线观看美女被高潮喷水网站| 97在线人人人人妻| 中文欧美无线码| 亚洲国产最新在线播放| 一本一本久久a久久精品综合妖精 国产伦在线观看视频一区 | 国产精品人妻久久久久久| 男人操女人黄网站| av女优亚洲男人天堂| 一个人看视频在线观看www免费| 成人国产av品久久久| 人妻夜夜爽99麻豆av| 国产高清国产精品国产三级| 在线观看人妻少妇| 69精品国产乱码久久久| 免费黄色在线免费观看| 99视频精品全部免费 在线| 黑人高潮一二区| 性色av一级| 国产熟女欧美一区二区| 欧美日韩视频高清一区二区三区二| 免费黄网站久久成人精品| av.在线天堂| 欧美国产精品一级二级三级| 老熟女久久久| 日本与韩国留学比较| 一本久久精品| 日韩,欧美,国产一区二区三区| xxx大片免费视频| 韩国高清视频一区二区三区| 成人毛片60女人毛片免费| 啦啦啦中文免费视频观看日本| 亚洲精品一区蜜桃| 免费观看a级毛片全部| 美女cb高潮喷水在线观看| 国产精品不卡视频一区二区| 又粗又硬又长又爽又黄的视频| 香蕉精品网在线| 欧美xxⅹ黑人| videossex国产| 亚洲欧美成人精品一区二区| 一边亲一边摸免费视频| 狠狠精品人妻久久久久久综合| 亚洲欧美精品自产自拍| 王馨瑶露胸无遮挡在线观看| 久久精品国产亚洲av涩爱| 日韩大片免费观看网站| 国产精品久久久久久久久免| 午夜福利网站1000一区二区三区| 免费观看无遮挡的男女| 久久av网站| 黑人高潮一二区| 久热久热在线精品观看| 免费高清在线观看视频在线观看| 亚洲国产色片| 狠狠婷婷综合久久久久久88av| 精品久久久久久久久av| 久久精品国产鲁丝片午夜精品| 亚洲精品日韩在线中文字幕| 免费日韩欧美在线观看| 色网站视频免费| 成人18禁高潮啪啪吃奶动态图 | 国产精品成人在线| 熟女人妻精品中文字幕| 亚洲欧美一区二区三区黑人 | 男女边吃奶边做爰视频| 亚洲少妇的诱惑av| 街头女战士在线观看网站| 国语对白做爰xxxⅹ性视频网站| 亚洲无线观看免费| 男女高潮啪啪啪动态图| 91成人精品电影| 国产精品一国产av| 少妇熟女欧美另类| 国产女主播在线喷水免费视频网站| 91精品伊人久久大香线蕉| 亚洲精华国产精华液的使用体验| 亚洲国产欧美在线一区| 精品亚洲乱码少妇综合久久| 欧美日韩视频精品一区| 一本一本综合久久| 一级毛片黄色毛片免费观看视频| 91精品伊人久久大香线蕉| 熟女av电影| 97超视频在线观看视频| 高清毛片免费看| 18在线观看网站| 日本欧美国产在线视频| 国产高清不卡午夜福利| xxxhd国产人妻xxx| 欧美激情 高清一区二区三区| 各种免费的搞黄视频| 晚上一个人看的免费电影| 亚洲精品乱码久久久久久按摩| av电影中文网址| 亚洲精品美女久久av网站| 欧美国产精品一级二级三级| 大又大粗又爽又黄少妇毛片口| 精品国产一区二区久久| 妹子高潮喷水视频| 午夜日本视频在线| 亚洲欧美中文字幕日韩二区| 欧美三级亚洲精品| 性色avwww在线观看| 亚洲精品美女久久av网站| 一级毛片电影观看| av专区在线播放| 色婷婷av一区二区三区视频| 久久精品国产亚洲av天美| 在线观看免费日韩欧美大片 | 成人黄色视频免费在线看| 狂野欧美激情性bbbbbb| 我的老师免费观看完整版| 毛片一级片免费看久久久久| 亚洲,欧美,日韩| 多毛熟女@视频| 欧美精品高潮呻吟av久久| 亚洲av成人精品一二三区| 国产精品一二三区在线看| 男女免费视频国产| 国产精品熟女久久久久浪| 性高湖久久久久久久久免费观看| 国精品久久久久久国模美| 中国美白少妇内射xxxbb| 少妇的逼好多水| 高清毛片免费看| 亚洲精品日韩av片在线观看| 啦啦啦啦在线视频资源| 汤姆久久久久久久影院中文字幕| 免费黄色在线免费观看| 亚洲精品亚洲一区二区| 久久精品夜色国产| 久久热精品热| a级毛片黄视频| 女性生殖器流出的白浆| 亚洲色图综合在线观看| 国产探花极品一区二区| 日韩成人av中文字幕在线观看| 国产不卡av网站在线观看| av免费在线看不卡| 久久久a久久爽久久v久久| 亚洲美女视频黄频| 香蕉精品网在线| 日本与韩国留学比较| 久久精品夜色国产| 婷婷色麻豆天堂久久| 欧美日韩av久久| 肉色欧美久久久久久久蜜桃| 国产黄片视频在线免费观看| av免费在线看不卡| 久久久国产欧美日韩av| 少妇人妻久久综合中文| 纯流量卡能插随身wifi吗| 国产精品一国产av| 飞空精品影院首页| 少妇丰满av| 如何舔出高潮| 又大又黄又爽视频免费| 中文字幕制服av| 国产黄频视频在线观看| 青春草国产在线视频| 久久免费观看电影| 在线免费观看不下载黄p国产| 久久久久久久久久人人人人人人| 美女福利国产在线| 日韩熟女老妇一区二区性免费视频| 免费观看性生交大片5| 久久久久久久久久成人| 91在线精品国自产拍蜜月| 欧美+日韩+精品| 成人漫画全彩无遮挡| 国产成人午夜福利电影在线观看| 成人影院久久| 在线播放无遮挡| 国产高清国产精品国产三级| 亚洲国产精品999| av天堂久久9| 精品人妻在线不人妻| 国产成人免费观看mmmm| 一级爰片在线观看| 国产精品国产三级国产av玫瑰| 91成人精品电影| 色婷婷av一区二区三区视频| 国产色爽女视频免费观看| 精品一区二区免费观看| 免费黄网站久久成人精品| 国产国语露脸激情在线看| 国产精品蜜桃在线观看| 亚洲av成人精品一二三区| 蜜桃国产av成人99| 国产女主播在线喷水免费视频网站| 男男h啪啪无遮挡| 亚洲精品日本国产第一区| 久久精品国产a三级三级三级| 国产爽快片一区二区三区| 国产精品成人在线| 插阴视频在线观看视频| 亚洲精品456在线播放app| 亚洲精品亚洲一区二区| 曰老女人黄片| 久久久国产精品麻豆| 人人澡人人妻人| 成人综合一区亚洲| 最新中文字幕久久久久| 国产日韩欧美视频二区| 欧美xxⅹ黑人| 青青草视频在线视频观看| 中文字幕最新亚洲高清| 嘟嘟电影网在线观看| 80岁老熟妇乱子伦牲交| 中国三级夫妇交换| 久热久热在线精品观看| 国产黄色免费在线视频| 热re99久久国产66热| 在线播放无遮挡| 五月玫瑰六月丁香| 乱码一卡2卡4卡精品| 一级毛片黄色毛片免费观看视频| 亚洲精品久久午夜乱码| 久久精品国产鲁丝片午夜精品| 人成视频在线观看免费观看| 丰满迷人的少妇在线观看| 国产日韩欧美亚洲二区| 只有这里有精品99| 一本大道久久a久久精品| 亚洲欧美成人精品一区二区| 久久久久久伊人网av| 丝袜脚勾引网站| www.av在线官网国产| 久久人人爽人人爽人人片va| 蜜桃久久精品国产亚洲av| 亚洲欧美一区二区三区黑人 | 亚洲成色77777| 久久ye,这里只有精品| 乱人伦中国视频| 亚洲精品亚洲一区二区| 丰满少妇做爰视频| 日韩成人伦理影院| 亚洲精品乱久久久久久| 丝袜在线中文字幕| 一级,二级,三级黄色视频| 中文字幕精品免费在线观看视频 | 国产 一区精品| 国产精品久久久久久精品电影小说| 欧美精品一区二区大全| 国产精品一二三区在线看| 免费观看a级毛片全部| 高清毛片免费看| 十分钟在线观看高清视频www| 亚洲av成人精品一区久久| 有码 亚洲区| 熟女av电影| 一区二区三区四区激情视频| 色网站视频免费| 在现免费观看毛片| 国产精品国产三级专区第一集| 国产精品久久久久久精品古装| 久久精品国产自在天天线| 国产乱人偷精品视频| 亚州av有码| 午夜影院在线不卡| 精品一区在线观看国产| 成年人免费黄色播放视频| 国产精品三级大全| 久久婷婷青草| 观看美女的网站| 午夜福利,免费看| 啦啦啦视频在线资源免费观看| 国产男女超爽视频在线观看| 色婷婷久久久亚洲欧美| 免费人妻精品一区二区三区视频| 亚洲人成网站在线播| 美女国产视频在线观看| 日韩精品免费视频一区二区三区 | 男男h啪啪无遮挡| 熟女电影av网| 成年人午夜在线观看视频| freevideosex欧美| 99热这里只有精品一区| 777米奇影视久久| 午夜视频国产福利| 多毛熟女@视频| 国产午夜精品一二区理论片| 国产亚洲欧美精品永久| 少妇熟女欧美另类| 国产在线一区二区三区精| 成人黄色视频免费在线看| 亚洲四区av| 夫妻性生交免费视频一级片| 91久久精品国产一区二区成人| 99国产精品免费福利视频| 91精品国产九色| 亚洲欧美日韩另类电影网站| 满18在线观看网站| 亚洲精品国产av成人精品| 日韩熟女老妇一区二区性免费视频| 视频中文字幕在线观看| 91精品一卡2卡3卡4卡| 丰满乱子伦码专区| 人妻 亚洲 视频| 免费久久久久久久精品成人欧美视频 | 亚洲国产精品专区欧美| 色网站视频免费| 嫩草影院入口| 狠狠婷婷综合久久久久久88av| 狂野欧美白嫩少妇大欣赏| 国产黄色免费在线视频| 高清毛片免费看| 久久久国产一区二区| 久久亚洲国产成人精品v| 91精品伊人久久大香线蕉| 大话2 男鬼变身卡| 亚洲av综合色区一区| 桃花免费在线播放| 日韩一区二区视频免费看| 日本爱情动作片www.在线观看| 99国产综合亚洲精品| 美女cb高潮喷水在线观看| 中文字幕人妻丝袜制服| av不卡在线播放| 满18在线观看网站| 曰老女人黄片| 尾随美女入室| 18在线观看网站| 亚洲四区av| 亚洲欧美精品自产自拍| 一本大道久久a久久精品| 少妇被粗大的猛进出69影院 | 97超碰精品成人国产| 久久午夜综合久久蜜桃| 99久久精品一区二区三区| 精品一区二区免费观看| 亚洲av在线观看美女高潮| 极品人妻少妇av视频| 国产在线一区二区三区精| 婷婷色麻豆天堂久久| 18在线观看网站| 下体分泌物呈黄色| 男女免费视频国产| 久久国内精品自在自线图片| 精品熟女少妇av免费看| 国内精品宾馆在线| 国产免费一级a男人的天堂| 国产黄色视频一区二区在线观看| 2022亚洲国产成人精品| 飞空精品影院首页| 日韩精品有码人妻一区| 97在线人人人人妻| 精品一区二区免费观看| 国产精品麻豆人妻色哟哟久久| 自拍欧美九色日韩亚洲蝌蚪91| 日韩欧美一区视频在线观看| 一级二级三级毛片免费看| 精品人妻熟女毛片av久久网站| 日日摸夜夜添夜夜爱| 精品酒店卫生间| 国产精品久久久久久av不卡| 99re6热这里在线精品视频| 国产成人av激情在线播放 | 在线播放无遮挡| 国产亚洲最大av| 国产午夜精品一二区理论片| 精品一区在线观看国产| 亚洲精品中文字幕在线视频| 欧美日韩视频高清一区二区三区二| 国产欧美日韩一区二区三区在线 | 女性被躁到高潮视频| 久久久国产精品麻豆| av国产精品久久久久影院| 国产男女超爽视频在线观看| 最近手机中文字幕大全| 欧美日韩一区二区视频在线观看视频在线| 中文字幕最新亚洲高清| 一区二区三区免费毛片| 亚洲成色77777| 久久精品久久精品一区二区三区| 亚洲国产日韩一区二区| 久久久久久久精品精品| 黄色配什么色好看| 黄色毛片三级朝国网站| 亚洲人成77777在线视频| 99热网站在线观看| 亚洲国产最新在线播放| 久久久久久久久久久免费av| 日日爽夜夜爽网站| 高清视频免费观看一区二区| 高清av免费在线| 少妇被粗大的猛进出69影院 | 免费高清在线观看日韩| 好男人视频免费观看在线| 亚洲成人av在线免费| 丰满少妇做爰视频| 一级毛片黄色毛片免费观看视频| 综合色丁香网| 高清视频免费观看一区二区| 嫩草影院入口| 美女脱内裤让男人舔精品视频| 一本久久精品| av.在线天堂| 女性被躁到高潮视频| 日本免费在线观看一区| 久久久久国产精品人妻一区二区| 欧美日韩成人在线一区二区| 18禁动态无遮挡网站| 搡女人真爽免费视频火全软件| 国产69精品久久久久777片| 亚洲天堂av无毛| 亚洲精品乱码久久久v下载方式| 亚洲国产欧美在线一区| 国产av一区二区精品久久| 国产男女内射视频| 午夜免费观看性视频| 国产高清国产精品国产三级| 国产一区二区三区综合在线观看 | 亚洲av福利一区| 亚洲丝袜综合中文字幕| 欧美性感艳星| 免费av不卡在线播放| 大香蕉97超碰在线| 日韩人妻高清精品专区| 丰满饥渴人妻一区二区三| 日韩中字成人| 国产欧美日韩一区二区三区在线 | 亚洲在久久综合| 日本av手机在线免费观看| 不卡视频在线观看欧美| 99九九在线精品视频| 大话2 男鬼变身卡| 在线观看www视频免费| 美女国产视频在线观看| 国产午夜精品一二区理论片| 毛片一级片免费看久久久久| 伊人亚洲综合成人网| 欧美 日韩 精品 国产| 国产男人的电影天堂91|