• <tr id="yyy80"></tr>
  • <sup id="yyy80"></sup>
  • <tfoot id="yyy80"><noscript id="yyy80"></noscript></tfoot>
  • 99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

    SeSaMe PS Function:Functional Analysis of the Whole Metagenome Sequencing Data of the Arbuscular Mycorrhizal Fungi

    2020-09-02 00:04:16JeeEunKangAntonioCiampiMohamedHijri
    Genomics,Proteomics & Bioinformatics 2020年5期

    Jee Eun Kang * ,Antonio Ciampi ,Mohamed Hijri *

    1 Institut de Recherche en Biologie Ve′ge′tale,De′partement de Sciences Biologiques,Universite′ de Montre′al,QC H1X 2B2,Canada

    2 Department of Epidemiology,Biostatistics and Occupational Health,McGill University,Montre′al,QC H3A 1A2,Canada

    KEYWORDS SeSaMe;Spore-associated symbiotic microbes;Position-specific function;Outlier;Metagenome

    Abstract In this study,we introduce a novel bioinformatics program, Spore-associated Symbiotic Microbes Position-specific Function(SeSaMe PS Function),for position-specific functional analysis of short sequences derived from metagenome sequencing data of the arbuscular mycorrhizal fungi.The unique advantage of the program lies in databases created based on genus-specific sequence properties derived from protein secondary structure,namely amino acid usages,codon usages,and codon contexts of 3-codon DNA 9-mers.SeSaMe PS Function searches a query sequence against reference sequence database,identifies 3-codon DNA 9-mers with structural roles,and creates a comparative dataset containing the codon usage biases of the 3-codon DNA 9-mers from 54 bacterial and fungal genera.The program applies correlation principal component analysis in conjunction with K-means clustering method to the comparative dataset.3-codon DNA 9-mers clustered as a sole member or with only a few members are often structurally and functionally distinctive sites that provide useful insights into important molecular interactions.The program provides a versatile means for studying functions of short sequences from metagenome sequencing and has a wide spectrum of applications.SeSaMe PS Function is freely accessible at www.fungalsesame.org.

    Introduction

    Arbuscular mycorrhizal fungi(AMF)are plant root colonizing symbiotic microorganisms that promote plant growth and improve soil quality [1-3].AMF increase the effectiveness of phytoremediation and improve crop yields in agroecosystems[1,4-10].Despite the importance of AMF,their genetics is poorly understood,due in large part to their coenocytic multinucleate nature and strict symbiotic partnership with plants[11].A number of studies reported strong evidence that AMF interact closely — tightly adhering to the surface or in the interior of mycelia and spores — or loosely with a myriad of microorganisms covering major bacterial and fungal taxa[6,12-16].These microorganisms can be removed from AMF by using cocktails of antibiotics in axenic cultivation systems[17].Yet,only few AMF taxa are able to be cured and cultivatedin vitro,and most successful isolates in such systems mainly belong to the genusRhizophagus[18].Given that the majority of AMF have not been successfully cultured axenically,it is possible that AMF may be meta-organisms,inseparable from their bacterial and fungal partners.

    Whole genome sequencing (WGS) of AMF taxa has been achieved exclusively from those grownin vitro.Although they provide important insights into AMF genetics,they have limitations in serving as reference genome due to large intra-and inter-isolate genome variations[19,20].Furthermore,sequence analysis of the WGS of AMF taxa grownin vivo,typically in a pot culture with a host plant,can be challenging because the sequencing data contain a large proportion of sequences belonging to AMF-associated microorganisms;the WGS data of AMF represent a complex metagenome [16,21].However,they provide invaluable information about the associated microbial community because a great majority of the associated microorganisms cannot be cultured in laboratory conditions.Taxonomic classification of the whole metagenome sequencing (WMS) data is essential for studying AMF genomics and their interactions with the associated microorganisms.We introduced the bioinformatics program,Sporeassociated Symbiotic Microbes (SeSaMe),for taxonomic classification of the WMS of AMF[22].In this study,we introduce a novel bioinformatics program — SeSaMe Position-specific Function (SeSaMe PS Function).It predicts important position-specific functional sites in a query sequence,based on amino acid usages,codon usages,and codon contexts of 3-codon DNA 9-mers derived from protein secondary structures extracted from Protein Data Bank (PDB) (https://www.rcsb.org/) [23].

    Previous studies have documented the multiple regulatory roles of codon usage and codon context in transcription and translation (e.g.,regulation of gene expression,diversification of gene products,translational efficiency and accuracy,and protein degradation efficiency) [24-30].Several studies have emphasized the regulatory roles of codon usage and codon context of multiple consecutive codons[25,29,30].In addition,synonymous codons are believed to be a key factor in determining the active folding state of a gene product in response to environmental changes.One study has shown that a gene with multiple synonymous mutations produces a protein with increased tolerance to abiotic stresses [31].Moreover,nonoptimal codons serve specific roles in regulating circadian rhythms in response to changes of environmental conditions[32,33].Therefore,codon usage and codon context must have been playing important roles in the adaptation of microorganisms to abiotic stresses[34,35].We are beginning to scratch the surface of the regulatory roles of codon usage and codon context,and these studies appear to be just a tip of iceberg.

    The main variable of the program — trimer usage bias —takes usages and contexts of both amino acids and nucleotides into consideration;it is the product of amino acid usage and 3-codon usage of 3-codon DNA 9-mer.Generally,trimer usage bias has a broad range of variations among taxonomic groups but low variations among microorganisms belonging to the same taxonomic group.Trimer usage bias reflects the important attributes of multiple consecutive codons.Codon composition (i.e.,codon context of three consecutive codons) is an important determinant of properties of RNA structures that plays key roles in regulating gene expression.Codon usage is associated with pauses in translation and determines biochemical properties of gene products.Both of the attributes affect protein folding.

    SeSaMe PS Function identifies 3-codon DNA 9-mers with structural roles in a query sequence,and creates a comparative dataset based on their trimer usage biases that are retrieved from 54 genus-specific usage bias databases (genus-specific DBs)(Figure 1).SeSaMe PS Function applies correlation Principal Component Analysis (PCA) in conjunction with Kmeans clustering method (PCA-Kmeans) to the comparative dataset.It enables users to identify 3-codon DNA 9-mers with distinctive characteristics:outliers.Outliers are often important position-specific functional sites that provide useful insights into molecular interactions.

    Figure 1 Dynamic creation of a comparative dataset per query sequence

    In this study,we analyzed one example sequence to demonstrate how to use the program for studying the structure and the function of a query sequence:one of the program’s various applications.The program helped to identify the outliers with potentially important functions.Existing bioinformatics programs predicted that most of the outliers belonged to stemloops,stems,and stem transitions in RNA structures [36].Some of the outliers were matched to elements that play roles in promotor regions or incis-regulatory mechanisms [37-39].Other bioinformatics programs predicted that the example sequence may bind to DNA/RNA [23,40].These results suggest that the outliers may contribute to binding activities in undiscovered mechanisms that may have attributes similar tocis-regulatory mechanisms.

    A majority of existing bioinformatics tools for positionspecific sequence annotation rely on sequence alignments,which have low sensitivity toward hypervariable sequence motifs with flexible structures and various functions.Although they provide important information about a query sequence,their usage is limited to a particular set of motifs with known functions.In contrast,SeSaMe PS Function employs PCA to identify outliers based on internal structure of a comparative dataset that contains usage information of structural units of a query sequence measured in 54 genera.Therefore,it may reveal important molecular interaction sites not only in known but also in undiscovered mechanisms.It has been only several decades since advances have been made in molecular biology.Therefore,it is believed that only a small fraction of mechanisms in biological systems have been discovered.SeSaMe PS Function provides a useful tool for studying unknown functions of short sequences from metagenome sequencing data.It is freely accessible at www.fungalsesame.org.

    Method

    Database design and comparative dataset creation

    The databases were originally created for the metagenome taxonomic classifier — SeSaMe,and then incorporated into SeSaMe PS Function[22].While NCBI offered a large number of completely sequenced bacterial genomes,only a small number of fungal genomes were completely sequenced.The completely sequenced genomes of 444 bacteria and 11 fungi,known to be present in soil,were downloaded and assigned into 45 bacterial and 9 fungal genera,respectively.CDS database per genus was created based on CDS lists provided by NCBI,JGI,or Tisserant et al.[19].

    The program consists of two types of databases and a PCAKmeans method.126,093 structure files were downloaded from PDB.7674 amino acid trimers (A.A.Trimers) were selected among protein secondary structures from PDB,and then assigned to the sequence variable — A.A.Trimer in the trimer reference sequence database(Trimer Ref.DB)(Figure 2)[41-44].Amino acid characteristic (A.A.Char) is defined as a group of amino acid(s)with similar property(s),and consists of 12 groups:A [Lysine (K),Arginine (R)],B [Histidine (H)],C[Aspartic acid (D),Glutamic acid (E)],D [Serine (S),Threonine (T)],E [Asparagine (N),Glutamine (Q)],F [Cysteine(C)],G [Glycine (G)],H [Proline (P)],I [Methionine (M)],J[Alanine (A),Isoleucine (I),Leucine (L),Valine (V)],K[Phenylalanine (F),Tryptophan (W),Tyrosine (Y)],and L(stop codons).Trimer Ref.DB consists of three sequence variables that form a three-level hierarchy:amino acid characteristic trimer (A.A.Char Trimer),A.A.Trimer,and 3-codon DNA 9-mer (Figure 1).

    Genus-specific usage bias database(genus-specific DB)contains the numerical variables — A.A.Trimer usage of A.A.Trimer and 3-codon usage of 3-codon DNA 9-mer.The main numerical variable,trimer usage bias,is calculated by multiplying A.A.Trimer usage by 3-codon usage.There are 54 genus-specific DBs where each genus-specific DB consists of 1296 A.A.Trimer Usage Tables and 7674 3-codon Usage Tables created based on the CDS database (Figure 2).

    For each reading frame of a query sequence,the program uses a query sequence to search against Trimer Ref.DB,identifying matching A.A.Char Trimers,A.A.Trimers,and 3-codon DNA 9-mers.It retrieves the trimer usage biases of the matching 3-codon DNA 9-mers from 54 Genus-specific DBs,and creates a comparative dataset of 54 genera(Figure 1).The input matrix to the correlation PCA method is the comparative dataset with 54 genera in rows (observations) and the matching 3-codon DNA 9-mers in columns.The input matrix will be called hereafter Z (I×J).

    Figure 2 Database design

    Annotation for catalytic and allosteric sites

    According to Catalytic Site Atlas and Allosteric Database,A.A.Trimers were divided into four subgroups based on the property of their second amino acids:catalytic site (CSA),allosteric site(ASD),both CSA and ASD(BothCA),and none of them (None) [45,46].An A.A.Trimer in CSA,ASD,or BothCA groups was annotated with the list of functions of PDB molecules that contained the A.A.Trimer.This feature is for making inferences about functionality not of a query sequence but of its A.A.Trimers.

    Implementation of the correlation PCA-Kmeans method

    The correlation PCA method was implemented based on the method reported by Abdi et al.[47],which provides important definitions and multiple examples to help readers understand the concepts underlying PCA [47].Interpretation of the result from SeSaMe PS Function also relies on Abdi et al.[47]because eigenvalue decomposition is mathematically closely related to singular value decomposition and has similar underlying concepts.Pearson’s correlation method is applied to the centered Z and produces a correlation matrix X(J×J).Eigenvalue decomposition is applied to X and produces components.V is an eigenvector matrix with J×J dimensions and is also called a loading matrix.

    Loadings:elements of the loading eigenvector matrix V

    The program calculates an eigenvector matrix V.Loading is defined as the element of V.V has matching 3-codon DNA 9-mers in rows and the same number of components in columns.The program examines loadings on components whose sum accounts for 80% of inertia (80% components) in addition to loadings on the first principal component and the second component (the First/Second components) [47].The program creates two different input matrices based on V,called L1 and L2.They have the same number of 3-codon DNA 9-mers in the rows.L1 has 80%components in columns while L2 has the First/Second components in columns.The program separately applies the K-means clustering method(default k=13) to L1 and L2.

    Taxon scores of 54 genera in component spaces

    The program calculates taxon scores of 54 genera observations.Taxon score matrix(I×J)results from multiplying centered Z by V.Inertia of a component is defined as a sum of squared taxon scores in corresponding component column[47].The program creates two matrices based on taxon score matrix called T1 and T2.They have 54 genera observations in rows.T1 has 80% components in columns,while T2 has the First/Second components in columns.The program separately applies the K-means clustering method (default k=10) to T1 and T2.

    Program availability

    The program was implemented in Java programming language(Java8).We used the Pearson’s correlation,the eigenvalue decomposition,and the K-means clustering methods in the Apache Commons Math3 library (3.3).The program requires the Apache Commons Math3 (3.3) and IO (2.4) libraries(www.apache.org).The program has been made to run on Linux/Unix operating systems,packaged into an executable Java JAR file,and tested and confirmed to work on Linux system—CentOS Linux 7(www.centos.org).The program that is being introduced in this study is version 1 and was implemented with the correlation PCA only.The program (version 2) was implemented both with the covariance PCA and with the correlation PCA.They have been used at the Biodiversity Center,Institut de Recherche en Biologie Ve′ge′tale,De′partement de Sciences Biologiques,Universite′ de Montre′al.They are freely accessible at www.fungalsesame.org.There are no restrictions for using the programs by academic or nonacademic organizations as long as a user complies with the license agreement.

    Input,output,and options

    The program has a command-line interface.Input files should contain DNA sequence(s) in fasta format.It requires a command-line argument,input file path.SeSaMe PS Function produces three different types of outputs per query sequence.One is the standard PCA output:the sequence information of matching 3-codon DNA 9-mers,the percentage of an explained inertia by a component,and the contribution of an observation to a component [47].Another is the loading cluster output with the loading information.3-codon DNA 9-mers are annotated with subgroups — CSA/ASD/BothCA/None and the functions of PDB molecules.The other is the genus cluster output with the taxon scores.It should be noted that the cluster result is different for every run,because the Kmeans clustering method in the Apache Commons Math library randomly chooses initial centers for multiple iterations to decrease chances of poor clustering.

    SeSaMe PS Function version 1 and version 2 have an option to specify the k parameter in the K-means clustering method both for genus clusters and for loading clusters (e.g.,11_15).The program version 2 has an additional option called‘‘a(chǎn)uto”.If a user wants to run SeSaMe PS Function for a large number of query sequences with varying lengths,he can use the prefix ‘‘a(chǎn)uto”to set the k parameter for loading clusters according to a simple equation:the number of matching 3-codon DNA 9-mers divided by a user specified number.For example,if the user gives the following option ‘‘a(chǎn)uto_14_8”,it will automatically set one eighth of the number of matching 3-codon DNA 9-mers as the k parameter for loading clusters while it will set 14 as the k parameter for genus clusters.A suitable k value may vary widely depending on the length and the complexity of a query sequence.User can supply the option following the input file path(e.g.,/home/input-file auto_14_8).

    Demonstration of the program usage

    Selection of the example sequence

    We selected 25 correctly predicted sequences out of 100 AMF CDS test sequences that were used for evaluating the accuracy of the metagenome taxonomic classifier,SeSaMe[22].From 25 sequences,we selected one example sequence that had the largest number of 3-codon DNA 9-mers where AMF had the highest trimer usage bias among 54 genera.The example query sequence is TGAGTTTAAAAACTGGACCAGTGAAAAT GAAATAATTGATAATCTTATTTTAGAAATGCAATTA AAAATTAATAGTACATATGATAAAATAGTTGAATG GATACCATACAATCAGTTTATTAACATTAACGAAAT AGGAAAAGTTGGTGATAATACTGCTGTATATTCAG CAATATGGAAAAATGGTCCACTATATTATAGAAAG AAATGGATAAGGAAATCCAATGAAAAAGTTGTATT AAATTACTTAACATTAGATATTAAGGAATT.

    Outlier’s unique pattern of the trimer usage bias and of the 3-codon usage

    Landscape pattern is the comparison of 54 genera based either on the trimer usage bias or on the 3-codon usage of a 3-codon DNA 9-mer.It provides an accurate way to estimate the relative measure of the usage information across 54 genera.In this study,we abbreviate 3-codon DNA 9-mer according to the order of its position in DNA sequence and its A.A.Trimer(Table S1).For example,AATACTGCT is the 51st matching 3-codon DNA 9-mer and encodes for the amino acids NTA.Because the program is zero-based,its abbreviation is 50 NTA.Graphs showing the landscape patterns of 3-codon usages and of trimer usage biases retrieved from 54 genera were generated for 17 EMQ and 67 KKW and for 18 MQL and 3 NWT,respectively.

    Comparison of the frequencies of a nucleotide among 13 loading clusters

    We counted the frequencies of the nucleotide—adenine(A)—in each of the individual 3-codon DNA 9-mers and applied a one-way ANOVA test to compare the means among 13 clusters.We repeated the same process for the nucleotides cytosine(C),guanine (G),and thymine (T).

    Comparison between the trimer usage bias and the 3-codon usage in functional segment

    We assigned matching 3-codon DNA 9-mers into functional segments(FSs)based on the loading clusters with 80%components and based on the prediction result of the protein secondary structure from a bioinformatics tool—SCRATCH[48].We created two matrices per FS:one was based on the 3-codon usage,and the other was based on the trimer usage bias.Each matrix consisted of the usage information of the matching 3-codon DNA 9-mers retrieved from 54 genera;it had the 3-codon DNA 9-mers of an FS in rows and the 54 genera in columns.After centering each matrix,we applied Pearson’s correlation to the matrix to yield a correlation matrix (I×I),and calculated the mean of the correlations per pair of taxonomic groups—Clostridia,Bacilli,Oscillatoriophycideae,Nostocales,Acidobacteria,Alphaproteobacteria,Betaproteobacteria,Deltaproteobacteria,Gammaproteobacteria,AMF,Agaricomycotina,and Pezizomycotina.From the mean of the correlations of a pair of genera belonging to the same taxonomic group in each FS,we calculated the mean and the standard deviation per taxonomic group.In the same way,we calculated the mean of the correlations for pairs of taxonomic groups—Firmicutes,Cyanobacteria,Proteobacteria,Actinobacteria,AMF,a group of 7 Dikarya,andPhanerochaetein each FS.

    Results

    Loading clusters

    The example sequence had 270 bp.When we ran the metagenome taxonomic classifier — SeSaMe — with the example sequence,it had the highest trimer usage probability score in the 2nd reading frame translation [22].It had 87 matching 3-codon DNA 9-mers in the 2nd reading frame translation.The PCA method applied to the comparative dataset showed that 51 components represented 80% components,while the First/Second components explained approximately 29% of total inertia.

    The K-means clustering method (k=13) applied to the loadings of 80% components identified outliers,14 3-codon DNA 9-mers.One major cluster had 73 members.12 clusters had 14 outliers:10 clusters had a sole member (50 NTA,63 LYY,72 KSN,4 WTS,69 WIR,73 SNE,24 STY,30 VEW,80 NYL,and 51 TAV),while 2 clusters had 2 members (33 IPY and 61 GPL,and 39 INI and 86 IKE).

    Structural homology search in PDB and inference of DNA-binding residues in DRNApred suggested that the example sequence may be a DNA/RNA binding protein[23,40].We used the outliers to search publicly available bioinformatics databases containing DNA motifs with known functions.RSAT indicated that the outlier and its adjacent 3-codon DNA 9-mer (i.e.,4 WTS and 3 NWT) were matched to motifs involved incis-regulatory mechanisms,one in the+strand and the other in the -strand [37].BPROM (Prediction of bacterial promoters) predicted that the outliers 30 VEW and 33 IPY were promoter-related elements [38].GPMiner indicated that three outliers (4 WTS,33 IPY,and 61 GPL) were matched to statistically significant overrepresented oligonucleotides in the promoter region [39].RNA structure prediction tools predicted that most outliers formed stem-loops,stems,and transition routes to stem in RNA structure of the example sequence (Figure S1) [36].A large number of studies have documented stem-loop and stem structures in RNAs as important regulatory sites and binding sites [49,50].Considering that we are just beginning to understand the regulatory roles of codon usage and codon context,considerable portions of outliers and their adjacent 3-codon DNA 9-mers identified by the program may serve important roles in undiscovered mechanisms.

    The loading clusters with the First/Second components based on the trimer usage bias are shown in Table S1.It should be noted that Table S1 indicates the 3-codon usages for comparison purpose,which will be discussed in another section.The loadings of 3-codon DNA 9-mers with the catalytic or allosteric site in the second amino acid were plotted on the space of the First/Second components (Figure 3).A majority of 3-codon DNA 9-mers where Firmicutes,Cyanobacteria,Rickettsia,or AMF had the highest 3-codon usage were aggregately located on the far-right side (Figure 3).In contrast,those where Deltaproteobacteria,Gammaproteobacteria,or Actinobacteria had the highest 3-codon usage were dispersed across the left side and the middle of the graph.For example,3 NWT whereKocuriahad the highest value was located on the far-left side (Table S1).

    Genus clusters

    The genus clusters based on 80% components indicated that genera with close phylogenetic relationships were assigned to the same cluster.In the scatter plot of taxon scorers on the space of First/Second components,Firmicutes,Cyanobacteria,Rickettsia,and AMF that frequently had high trimer usage biases were located on the right,while most members of Actinobacteria and Proteobacteria (cluster 1) that frequently had low values were located on the far-left side(Figure 4).

    Figure 3 Loading clusters of the example sequence

    Outlier’s unique landscape pattern of trimer usage bias and of 3-codon usage

    For each of the 3-codon DNA 9-mers in loading clusters with the First/Second components,we ranked 54 genera in order of decreasing 3-codon usages.We then ranked the 3-codon DNA 9-mers in each subgroup (CSA/ASD/BothCA/None) of the clusters based on a maximum of the 3-codon usages(Table S1).The mean of the maxima was 0.256.AMF,Clostridium,andRickettsiafrequently had the maximum.

    Most of 3-codon DNA 9-mers in the major cluster demonstrated similar landscape patterns of the 3-codon usage and of the trimer usage bias.For example,17-CIE-EMQGAAATGCAA and 18-IEJ-MQL-ATGCAATTA had the frequently demonstrated landscape pattern (Figures S2 and S3).Outliers had a unique landscape pattern;for example,genera belonging to Dikarya had a higher value than AMF both in 67-AAK-KKW-AAGAAATGG and 3-EKD-NWTAACTGGACC (Figure 5,Figure S4).

    Comparison of the frequencies of a nucleotide among 13 loading clusters

    One-way ANOVA tests showed that the means of the frequencies of G and C in each of the individual 3-codon DNA 9-mers were significantly different among 13 clusters;F-statistics andPvalues of A,T,G,and C among 13 clusters were 0.69(0.76),1.26 (0.26),1.91 (0.047),and 3.09 (0.0014),respectively.

    Comparison between the trimer usage bias and the 3-codon usage in FS

    We merged some of the outliers in 12 clusters according to their proximity in the example sequence,which produced 8 groups.The merged outliers were 50 NTA with 51 TAV,33 IPY and 61 GPL with 63 LYY,as well as 69 WIR with 72 KSN and 73 SNE.This was done to simplify the analysis,and was not recommended for real case analyses.Examining the protein tertiary structure predicted by SCRATCH,we added another group(20 LKI),a member of alpha helix,which made a total of 9 groups[48].We assigned 87 3-codon DNA 9-mers into 9 FSs according to the outliers.These include 4 WTS(3-codon DNA 9-mer:0-12)in FS1;20 LKI(alpha helix1:13-21)in FS2;24 STY(22-29)in FS3;30 VEW(30-32)in FS4;33 IPY,61 GPL,and 63 LYY(33-35,52-65)in FS5;39 INI and 86 IKE(36-41,82-86)in FS6;50 NTA and 51 TAV(42-51)in FS7;69 WIR,72 KSN,and 73 SNE(66-73)in FS8;as well as 80 NYL (alpha helix2:74-81) in FS9 (Figure S5).

    Figure 4 Genus clusters of the example sequence

    Figure 5 Landscape pattern of the 3-codon usage of 67-AAK-KKW-AAGAAATGG

    Generally,the mean of the correlations of a pair of genera belonging to the same taxonomic group was the highest in each taxonomic group for all 9 FSs (Tables S2 and S3).Table S4 shows the mean and the standard deviation of 9 FSs calculated from the mean of the correlations of a pair of genera belonging to the same taxonomic group in a FS.

    The mean of the correlations of a pair of taxonomic groups based on 3-codon usage (left) and the mean based on trimer usage bias (right) are shown in Table S3.Most of them had strong correlations in both alpha helices—FS2 and FS9.This may suggest that roles of amino acids and of codons in alpha helices may be relatively more conserved across taxonomic groups due to functional and structural constraints compared to those in random coils and loops of which flexible structures are equipped for a variety of functions.

    Comparable properties of 25 selected sequences in AMF CDS test set

    In order to show that the program provided outliers of 3-codon DNA 9-mers in loading clusters based on 80% components not only in the example sequence but also in all 25 sequences,we included the cluster results of 5 additional query sequences.Genus clusters and loading clusters of the sequences are shown in Tables S5 and S6,respectively.The early diverged bacteria and AMF were often clustered as a sole member or with each other.A great majority of 3-codon DNA 9-mers were grouped together into one major cluster,while outliers were clustered as a sole member or with only one other member.

    Future work

    It has been decades since important functions of non-coding RNAs (ncRNAs) were discovered.A previous study has documented that long non-coding RNAs (lncRNAs) play important roles in various cellular processes [51].Because a large number of lncRNAs contain putative ORFs,it is challenging to distinguish them from protein CDS,especially in AMF CDS database created based on results from a number of gene prediction programs.In the future,we may take a different approach depending on whether a query sequence transcribes either a coding or a non-coding transcript,or both.In addition,we will take into consideration the presence of interaction sites with ncRNAs in a query sequence,because structures required for interactions may impose considerable constraints on codon usages and codon contexts.

    Recent studies have documented that codon usage and mRNA structure regulate protein folding [25,26,28,30].For example,some studies have shown associations between rare codons or double stranded mRNA structures and a decrease of translational speed [26,30].Other studies have documented relationships between protein secondary structure and mRNA structure;double stranded mRNA regions tend to have an association with alpha helix and beta strand,while single stranded mRNA regions tend to have an association with random coils[52,53].However,the roles of the codons involved in these rules may vary widely across taxonomic groups.Furthermore,while we need defined structures across various taxa,they are mostly from a small number of model organisms.Therefore,it is challenging to study associations between mRNA structures and their corresponding protein structures in metagenome sequencing data.We may be able to improve SeSaMe PS Function by incorporating a new feature that predicts mRNA single and double stranded regions in a query sequence.

    Data availability

    SeSaMe PS Function is freely accessible at www.fungalsesame.org.

    CRediT author statement

    Jee Eun Kang:Conceptualization,Methodology,Software,Validation,Writing -original draft.Antonio Ciampi:Supervision,Writing -review & editing.Mohamed Hijri:Supervision,Writing-review& editing.All authors read and approved the final manuscript.

    Competing interests

    The authors have declared no competing interests.

    Acknowledgments

    The authors gratefully acknowledge AFE(Aide financie`re aux e′tudes),FESP (Faculte′ des e′tudes supe′rieures et postdoctorales de l’UdeM),and IRBV(Institut de Recherche en Biologie Ve′ge′tale de l’Universite′ de Montre′al) for awarding scholarships to Jee Eun Kang.We thank David Morse for editing and commenting on the manuscript.

    Supplementary material

    Supplementary data to this article can be found online at https://doi.org/10.1016/j.gpb.2018.07.011.

    ORCID

    0000-0003-2475-0474 (Jee Eun Kang)

    0000-0003-4838-8297 (Antonio Ciampi)

    0000-0001-6112-8372 (Mohamed Hijri)

    特大巨黑吊av在线直播| 夜夜躁狠狠躁天天躁| 欧美精品啪啪一区二区三区| 久久久久亚洲av毛片大全| 精品乱码久久久久久99久播| 99riav亚洲国产免费| 精品久久久久久成人av| 欧美 亚洲 国产 日韩一| 97碰自拍视频| 色综合站精品国产| 桃红色精品国产亚洲av| √禁漫天堂资源中文www| 日本熟妇午夜| 少妇人妻一区二区三区视频| 久久中文字幕人妻熟女| 三级毛片av免费| 制服人妻中文乱码| 麻豆成人av在线观看| 国产又色又爽无遮挡免费看| 岛国在线免费视频观看| 国产亚洲欧美98| 成人欧美大片| www.自偷自拍.com| 国产精品爽爽va在线观看网站| 1024手机看黄色片| 久久久精品大字幕| 免费在线观看影片大全网站| 免费高清视频大片| 别揉我奶头~嗯~啊~动态视频| 日本一本二区三区精品| 妹子高潮喷水视频| 深夜精品福利| 午夜精品在线福利| 99精品在免费线老司机午夜| 男人舔女人的私密视频| 90打野战视频偷拍视频| 夜夜夜夜夜久久久久| 久久精品国产综合久久久| 成人国产一区最新在线观看| 九九热线精品视视频播放| 99精品久久久久人妻精品| 国产在线精品亚洲第一网站| 999久久久精品免费观看国产| 国产精品,欧美在线| 欧美成人一区二区免费高清观看 | 成人国语在线视频| 久久久久国产精品人妻aⅴ院| 午夜福利18| 国产一区二区三区在线臀色熟女| e午夜精品久久久久久久| 色综合婷婷激情| 男人舔女人的私密视频| av在线天堂中文字幕| 精品一区二区三区四区五区乱码| 国产亚洲精品一区二区www| 99国产极品粉嫩在线观看| 90打野战视频偷拍视频| 久久久久久亚洲精品国产蜜桃av| 天天一区二区日本电影三级| 男女做爰动态图高潮gif福利片| 人成视频在线观看免费观看| svipshipincom国产片| 国产精品国产高清国产av| 搡老岳熟女国产| 亚洲欧美一区二区三区黑人| 国产精品久久电影中文字幕| 岛国在线观看网站| 国产精品香港三级国产av潘金莲| 欧美黑人巨大hd| 国产成人啪精品午夜网站| 国产精品一区二区精品视频观看| 宅男免费午夜| 在线观看免费午夜福利视频| 国产精品亚洲美女久久久| 国产成人系列免费观看| 久久天躁狠狠躁夜夜2o2o| 亚洲国产精品久久男人天堂| 一二三四在线观看免费中文在| 久久精品国产亚洲av香蕉五月| 久久亚洲真实| 国产aⅴ精品一区二区三区波| 国产视频一区二区在线看| 他把我摸到了高潮在线观看| 成年人黄色毛片网站| 国产在线精品亚洲第一网站| 久久欧美精品欧美久久欧美| 丝袜美腿诱惑在线| 精品久久久久久成人av| 精品国产亚洲在线| 天堂动漫精品| 无人区码免费观看不卡| 国产成+人综合+亚洲专区| 亚洲欧美日韩无卡精品| 99久久无色码亚洲精品果冻| 欧美一级毛片孕妇| 国产精品1区2区在线观看.| 免费av毛片视频| 欧美成人一区二区免费高清观看 | 亚洲电影在线观看av| 波多野结衣高清作品| 成人av在线播放网站| 免费一级毛片在线播放高清视频| 国产在线观看jvid| 亚洲一码二码三码区别大吗| 精品一区二区三区av网在线观看| av天堂在线播放| 制服丝袜大香蕉在线| 成人国语在线视频| 中文亚洲av片在线观看爽| 岛国视频午夜一区免费看| 99在线视频只有这里精品首页| 精品国内亚洲2022精品成人| 亚洲美女黄片视频| 日韩大码丰满熟妇| 中文字幕高清在线视频| 国产野战对白在线观看| 久久精品国产清高在天天线| 久久久国产成人免费| 久久久久久久久中文| 亚洲全国av大片| 日韩欧美免费精品| 丰满人妻熟妇乱又伦精品不卡| 成年版毛片免费区| 中文字幕人妻丝袜一区二区| 最新在线观看一区二区三区| 久久香蕉精品热| 国产黄a三级三级三级人| 日韩有码中文字幕| 国内精品一区二区在线观看| 丰满的人妻完整版| 国产麻豆成人av免费视频| 亚洲一区二区三区不卡视频| 啦啦啦免费观看视频1| 欧美成人性av电影在线观看| aaaaa片日本免费| 国产单亲对白刺激| 国产精品av视频在线免费观看| 国产欧美日韩精品亚洲av| 欧美黑人巨大hd| svipshipincom国产片| 欧美精品啪啪一区二区三区| 日本免费一区二区三区高清不卡| 日韩 欧美 亚洲 中文字幕| 90打野战视频偷拍视频| 岛国在线免费视频观看| 欧美黑人巨大hd| 中出人妻视频一区二区| 国产激情偷乱视频一区二区| 亚洲精品一卡2卡三卡4卡5卡| 99国产极品粉嫩在线观看| 久久婷婷人人爽人人干人人爱| 伊人久久大香线蕉亚洲五| 亚洲精品美女久久久久99蜜臀| 老鸭窝网址在线观看| aaaaa片日本免费| 日日摸夜夜添夜夜添小说| 欧美成人午夜精品| 中文字幕人成人乱码亚洲影| 亚洲人成77777在线视频| 最近在线观看免费完整版| 天堂影院成人在线观看| 亚洲一区中文字幕在线| 国产三级中文精品| 国产三级黄色录像| 日本一二三区视频观看| 欧美中文综合在线视频| 亚洲专区字幕在线| 日本五十路高清| 变态另类丝袜制服| 又大又爽又粗| 国产精品 欧美亚洲| 少妇熟女aⅴ在线视频| 成人三级黄色视频| 真人一进一出gif抽搐免费| 欧美日本视频| 丰满人妻一区二区三区视频av | 久久亚洲精品不卡| 在线观看免费日韩欧美大片| 不卡av一区二区三区| 久久99热这里只有精品18| 欧美色视频一区免费| 欧美乱色亚洲激情| www.999成人在线观看| 午夜精品久久久久久毛片777| 久久精品亚洲精品国产色婷小说| 十八禁网站免费在线| av片东京热男人的天堂| 国产精品98久久久久久宅男小说| or卡值多少钱| 亚洲人成网站高清观看| 国内揄拍国产精品人妻在线| 国产成人aa在线观看| 欧美不卡视频在线免费观看 | 中文资源天堂在线| 国产免费av片在线观看野外av| 最近在线观看免费完整版| 老司机深夜福利视频在线观看| 欧美午夜高清在线| 性欧美人与动物交配| 中文亚洲av片在线观看爽| 亚洲国产欧美人成| av片东京热男人的天堂| 亚洲成av人片在线播放无| 欧美日韩黄片免| 婷婷六月久久综合丁香| 国产精品久久视频播放| 丰满人妻熟妇乱又伦精品不卡| 欧美成人午夜精品| 国产精品,欧美在线| 午夜福利在线观看吧| 中出人妻视频一区二区| 国产99久久九九免费精品| 亚洲熟女毛片儿| 国产一区在线观看成人免费| 国产精品av久久久久免费| 在线看三级毛片| 欧美日韩国产亚洲二区| 午夜影院日韩av| 欧美成人性av电影在线观看| 国产视频内射| netflix在线观看网站| 99国产极品粉嫩在线观看| 国产精品精品国产色婷婷| 国内精品久久久久久久电影| 欧美高清成人免费视频www| 亚洲av成人不卡在线观看播放网| 最近视频中文字幕2019在线8| 无人区码免费观看不卡| 在线观看舔阴道视频| 色噜噜av男人的天堂激情| 亚洲一卡2卡3卡4卡5卡精品中文| 小说图片视频综合网站| 亚洲av第一区精品v没综合| 欧美最黄视频在线播放免费| 精品国产美女av久久久久小说| 99热6这里只有精品| 亚洲专区国产一区二区| 亚洲一码二码三码区别大吗| 在线观看舔阴道视频| 欧美黄色片欧美黄色片| 国产激情久久老熟女| 91字幕亚洲| 久久精品91无色码中文字幕| 一本综合久久免费| 天堂动漫精品| 久久午夜综合久久蜜桃| avwww免费| 美女 人体艺术 gogo| 一二三四在线观看免费中文在| av在线天堂中文字幕| 夜夜躁狠狠躁天天躁| 午夜a级毛片| 一级毛片高清免费大全| 99热只有精品国产| 午夜成年电影在线免费观看| www日本黄色视频网| 国产成人aa在线观看| 精品不卡国产一区二区三区| 啪啪无遮挡十八禁网站| 亚洲专区字幕在线| 亚洲性夜色夜夜综合| 男女那种视频在线观看| 日本 av在线| 亚洲国产精品sss在线观看| 精品欧美国产一区二区三| 欧美不卡视频在线免费观看 | 国产精品国产高清国产av| 露出奶头的视频| 日韩欧美国产在线观看| 午夜福利欧美成人| 性欧美人与动物交配| 国内少妇人妻偷人精品xxx网站 | 脱女人内裤的视频| 99热只有精品国产| 国产成人啪精品午夜网站| 男女午夜视频在线观看| 精品无人区乱码1区二区| 日本黄大片高清| 亚洲国产欧美一区二区综合| 女人爽到高潮嗷嗷叫在线视频| 十八禁网站免费在线| 国内久久婷婷六月综合欲色啪| 国产亚洲精品第一综合不卡| 三级毛片av免费| 999久久久精品免费观看国产| 免费搜索国产男女视频| 人人妻人人看人人澡| 宅男免费午夜| 中文字幕最新亚洲高清| 亚洲美女视频黄频| 美女扒开内裤让男人捅视频| 国产成人系列免费观看| 可以免费在线观看a视频的电影网站| 日韩免费av在线播放| 久久性视频一级片| 亚洲中文字幕日韩| 嫁个100分男人电影在线观看| 又黄又爽又免费观看的视频| 日本免费a在线| 中文字幕高清在线视频| 亚洲av日韩精品久久久久久密| 精品一区二区三区av网在线观看| 岛国视频午夜一区免费看| 久久伊人香网站| 久久国产乱子伦精品免费另类| 在线观看日韩欧美| 三级国产精品欧美在线观看 | 日韩大尺度精品在线看网址| 97超级碰碰碰精品色视频在线观看| 色av中文字幕| 男女下面进入的视频免费午夜| 久久精品国产亚洲av高清一级| e午夜精品久久久久久久| 日韩大尺度精品在线看网址| 一进一出抽搐gif免费好疼| 成人国产一区最新在线观看| 激情在线观看视频在线高清| 村上凉子中文字幕在线| 亚洲中文av在线| 老鸭窝网址在线观看| 精品一区二区三区四区五区乱码| 99久久99久久久精品蜜桃| 午夜视频精品福利| 久久久水蜜桃国产精品网| 黄色片一级片一级黄色片| 人妻丰满熟妇av一区二区三区| 麻豆成人av在线观看| 色综合亚洲欧美另类图片| 禁无遮挡网站| 国产在线精品亚洲第一网站| 两个人的视频大全免费| 色综合站精品国产| 狂野欧美白嫩少妇大欣赏| 好看av亚洲va欧美ⅴa在| 男女午夜视频在线观看| 最近视频中文字幕2019在线8| 麻豆成人午夜福利视频| 亚洲av成人精品一区久久| 国产高清视频在线播放一区| 视频区欧美日本亚洲| 夜夜夜夜夜久久久久| 亚洲欧美日韩东京热| 中文字幕人成人乱码亚洲影| 久久亚洲真实| 校园春色视频在线观看| 亚洲最大成人中文| 国产在线观看jvid| 久久久久久国产a免费观看| av视频在线观看入口| 欧美日韩中文字幕国产精品一区二区三区| 1024手机看黄色片| 成人一区二区视频在线观看| 好男人电影高清在线观看| 免费av毛片视频| 香蕉丝袜av| 桃色一区二区三区在线观看| 观看免费一级毛片| 国产精品国产高清国产av| 国产av在哪里看| 精品乱码久久久久久99久播| 久久性视频一级片| 最新美女视频免费是黄的| 特级一级黄色大片| av免费在线观看网站| 日本黄大片高清| 日韩欧美三级三区| 男女视频在线观看网站免费 | 亚洲国产欧美网| 久久久久亚洲av毛片大全| 亚洲精品国产精品久久久不卡| 黄片大片在线免费观看| 久久久精品大字幕| 久久精品91蜜桃| 99久久精品热视频| 日韩欧美免费精品| 亚洲成av人片免费观看| 免费在线观看亚洲国产| 蜜桃久久精品国产亚洲av| 天堂av国产一区二区熟女人妻 | 99久久久亚洲精品蜜臀av| 麻豆成人av在线观看| 天天躁夜夜躁狠狠躁躁| 国产精品av视频在线免费观看| 亚洲美女黄片视频| 欧美av亚洲av综合av国产av| 久久香蕉国产精品| 亚洲五月天丁香| 黄频高清免费视频| 精品熟女少妇八av免费久了| 午夜精品在线福利| 色av中文字幕| 日韩欧美在线乱码| 日本黄大片高清| 美女高潮喷水抽搐中文字幕| cao死你这个sao货| 精品久久蜜臀av无| 脱女人内裤的视频| 精品国内亚洲2022精品成人| 久久人妻av系列| 中文资源天堂在线| 国产精品影院久久| 国产黄片美女视频| 日本 av在线| 久久久久久久精品吃奶| 又爽又黄无遮挡网站| 免费在线观看日本一区| 在线国产一区二区在线| 亚洲va日本ⅴa欧美va伊人久久| 亚洲熟妇熟女久久| 国产激情偷乱视频一区二区| 香蕉av资源在线| 日本三级黄在线观看| 男女做爰动态图高潮gif福利片| 亚洲avbb在线观看| 婷婷六月久久综合丁香| 天堂影院成人在线观看| 五月玫瑰六月丁香| 日日摸夜夜添夜夜添小说| 国产一区二区激情短视频| 免费一级毛片在线播放高清视频| 国产一区二区三区在线臀色熟女| 国产成+人综合+亚洲专区| 国产伦人伦偷精品视频| 国产免费男女视频| 人成视频在线观看免费观看| 亚洲精品一卡2卡三卡4卡5卡| 精品久久久久久久末码| 中文亚洲av片在线观看爽| 久久久久九九精品影院| 免费在线观看影片大全网站| 黄色片一级片一级黄色片| 97碰自拍视频| 性欧美人与动物交配| 国产在线观看jvid| 黄色丝袜av网址大全| 宅男免费午夜| 啦啦啦韩国在线观看视频| 国产一区二区在线av高清观看| 亚洲一码二码三码区别大吗| 天堂动漫精品| 91国产中文字幕| 欧美高清成人免费视频www| 日韩欧美精品v在线| 亚洲黑人精品在线| 特大巨黑吊av在线直播| 777久久人妻少妇嫩草av网站| 一进一出好大好爽视频| 99精品久久久久人妻精品| 最近最新中文字幕大全电影3| 精品高清国产在线一区| 日本黄大片高清| 久久久水蜜桃国产精品网| 欧美zozozo另类| 久久九九热精品免费| 国产精品电影一区二区三区| 婷婷精品国产亚洲av在线| 亚洲国产欧美一区二区综合| 人妻丰满熟妇av一区二区三区| 国产三级在线视频| 丝袜人妻中文字幕| 一级毛片女人18水好多| 99久久久亚洲精品蜜臀av| 国产欧美日韩精品亚洲av| 亚洲国产欧美一区二区综合| 久久人妻av系列| 久久久国产欧美日韩av| 日韩大码丰满熟妇| 欧美乱码精品一区二区三区| 少妇被粗大的猛进出69影院| 18禁黄网站禁片午夜丰满| 老司机在亚洲福利影院| 亚洲av日韩精品久久久久久密| 色av中文字幕| 国内少妇人妻偷人精品xxx网站 | 国产爱豆传媒在线观看 | 亚洲午夜理论影院| av在线播放免费不卡| 久久性视频一级片| 亚洲成av人片在线播放无| 午夜免费观看网址| 午夜激情福利司机影院| 一二三四在线观看免费中文在| av在线播放免费不卡| 听说在线观看完整版免费高清| 国产精品久久电影中文字幕| 亚洲国产高清在线一区二区三| 久久中文字幕一级| 黑人欧美特级aaaaaa片| 天天躁狠狠躁夜夜躁狠狠躁| 成人手机av| 国产精品一区二区三区四区免费观看 | 亚洲色图av天堂| 亚洲国产欧美一区二区综合| 白带黄色成豆腐渣| 亚洲一码二码三码区别大吗| 欧美精品啪啪一区二区三区| 欧美zozozo另类| 成年版毛片免费区| 大型av网站在线播放| 成人高潮视频无遮挡免费网站| 午夜亚洲福利在线播放| 亚洲自偷自拍图片 自拍| 精品国产乱码久久久久久男人| 亚洲中文日韩欧美视频| 麻豆国产97在线/欧美 | 日韩免费av在线播放| 久久久国产成人免费| 久久婷婷成人综合色麻豆| 可以在线观看的亚洲视频| 午夜视频精品福利| 桃红色精品国产亚洲av| 日本黄大片高清| 真人一进一出gif抽搐免费| 身体一侧抽搐| 欧美乱妇无乱码| 精品不卡国产一区二区三区| 欧美zozozo另类| 曰老女人黄片| 亚洲aⅴ乱码一区二区在线播放 | 亚洲av第一区精品v没综合| 超碰成人久久| 搡老妇女老女人老熟妇| 国产伦一二天堂av在线观看| 国产亚洲精品久久久久5区| 亚洲一区二区三区不卡视频| a级毛片a级免费在线| 在线观看舔阴道视频| 国产一区二区三区在线臀色熟女| 91在线观看av| 最近视频中文字幕2019在线8| 91九色精品人成在线观看| 欧美不卡视频在线免费观看 | 久久久久久久久免费视频了| 极品教师在线免费播放| 老熟妇乱子伦视频在线观看| bbb黄色大片| av欧美777| 亚洲av中文字字幕乱码综合| 最近最新免费中文字幕在线| 久久精品人妻少妇| www.熟女人妻精品国产| 国产精品自产拍在线观看55亚洲| 精品福利观看| 美女 人体艺术 gogo| 色在线成人网| 怎么达到女性高潮| 国产成人精品无人区| 成人av一区二区三区在线看| 欧美日韩亚洲国产一区二区在线观看| 亚洲精品国产精品久久久不卡| 国产乱人伦免费视频| 18禁黄网站禁片午夜丰满| 国产精品香港三级国产av潘金莲| 国产精品一区二区免费欧美| 91av网站免费观看| 妹子高潮喷水视频| 久久久国产成人精品二区| 88av欧美| 天天一区二区日本电影三级| 亚洲人成电影免费在线| 久久精品国产亚洲av香蕉五月| 真人做人爱边吃奶动态| 这个男人来自地球电影免费观看| 色在线成人网| 看免费av毛片| 一个人观看的视频www高清免费观看 | 天天一区二区日本电影三级| 99精品久久久久人妻精品| 婷婷六月久久综合丁香| 国产熟女xx| 亚洲色图 男人天堂 中文字幕| 亚洲国产看品久久| 真人做人爱边吃奶动态| 丝袜人妻中文字幕| 丁香欧美五月| 99精品在免费线老司机午夜| av超薄肉色丝袜交足视频| ponron亚洲| 精品久久久久久久久久久久久| 香蕉丝袜av| 一进一出抽搐gif免费好疼| 久久伊人香网站| 精品久久久久久久毛片微露脸| 午夜两性在线视频| 国产成人欧美在线观看| 国产一区二区在线av高清观看| 一级黄色大片毛片| 桃色一区二区三区在线观看| 老汉色av国产亚洲站长工具| 国产精品1区2区在线观看.| 99久久精品国产亚洲精品| 丰满的人妻完整版| 亚洲一区二区三区不卡视频| 久久香蕉激情| 久久99热这里只有精品18| 国产伦在线观看视频一区| 国产真实乱freesex| 久久精品国产亚洲av香蕉五月| 国产亚洲精品一区二区www| tocl精华| 午夜a级毛片| 国产亚洲精品一区二区www| 久久香蕉激情| 国产一区二区在线观看日韩 | 在线观看免费午夜福利视频| ponron亚洲| 精品无人区乱码1区二区| 国产成人系列免费观看| 高清在线国产一区| 九色成人免费人妻av| 国产亚洲精品av在线| 岛国在线免费视频观看| 久久精品91蜜桃| 很黄的视频免费| 亚洲美女视频黄频|