• <tr id="yyy80"></tr>
  • <sup id="yyy80"></sup>
  • <tfoot id="yyy80"><noscript id="yyy80"></noscript></tfoot>
  • 99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

    The Birth of Bio-data Science:Trends,Expectations,and Applications

    2020-07-29 05:34:34WilsonWenBinGohLimsoonWong
    Genomics,Proteomics & Bioinformatics 2020年1期

    Wilson Wen Bin Goh ,Limsoon Wong

    1School of Biological Sciences,Nanyang Technological University,Singapore 637551,Singapore

    2Department of Computer Science,National University of Singapore,Singapore 117417,Singapore

    Components of bio-data science

    Biology is becoming increasingly digitized and has now taken on the sheen of a quantitative scientific discipline.A key driving factor is the increasing pervasiveness of high-throughput technological platforms in biological research,allowing millions of data points on genes,proteins,and other biological moieties across thousands of tissues and organisms to be compiled,cleaned,stored,and integrated for the purpose of systematic studies. In this data-rich landscape, it is not an exaggeration to say that the future of biological(and where deployed on clinical samples, biomedical) research lies in strategic maximization of data.

    Big bio-data is not a distant fantasy.Not only have we already been living in the age of big bio-data,biological data is also being generated and accrued in an increasingly accelerated manner.Between 1990 and 2003,unraveling the human genome cost approximately$2.7 billion and took several years with many teams involved for completion[1].By 2016,the same experiment now costs less than$1500 and requires only an afternoon within a single laboratory.Similarly,mapping a single tomato genome initially took an international consortium 5 years[2];but today,150 different tomato genomes may be completed within a year[3].The big bio-data landscape has also spurred the development of big data management systems such as the Expression Atlas[4]and proteomics identification(PRIDE)database[5].

    The rise of big bio-data needs to be leveraged upon for understanding diseases and improving health.Problems in the generation,management,analysis,visualization,and interpretation of data should assume a leading role,requiring a paradigm shift in attitude and know-how.Moreover,addressing larger data volumes requires advances in database management platforms and also improved algorithm efficiency.Where large amounts of data are accrued,issues with regard to veracity and complexity also emerge and need to be tackled with more urgency than ever.Traditional disciplines such as bioinformatics and computational biology are now more challenged than ever.In today’s technological landscape,data science and artificial intelligence(AI)have already acted as innovation drivers in areas such as business and finance,where data scientists take helm in converting data into practicable insights instead of working behind the scenes in operations.Examples include AI-driven algorithmic trading and stock recommendation systems in financial technology(fintech)and automated engine design, system maintenance, and robotics in engineering.Given the recent data explosion of and concomitant advances in data science in other disciplines such as business,finance,and computing,we predict that alongside the rapid and voluminous generation of biological data,a new variant of data science,which will specifically address domain-specific issues pertinent to biology,will emerge.We term this variant of data science as‘‘bio-data science(BDS).”

    BDS comprises three core disciplinary areas: biology(which constitutes the application domain),computer science,as well as mathematics and statistics(Figure 1).The biology core area is concerned with questions regarding biological origin,such as the cause of a disease or understanding the diagnostic utility of an inferred biomarker.The computer science core area is concerned with devising appropriate algorithms for problem-solving,dealing with repetition(e.g.,running the same algorithm on large subsets of data many times over),and resolving data storage issues,especially if the data to be analyzed is large.The mathematics and statistics core area is concerned with issues such as data summarization,normalization,and modeling.Although descriptive and exploratory statistical data analysis is by no means unique to BDS(also being an essential component of biostatistics and,to a lesser degree,bioinformatics),BDS has an added focus on prediction using emerging technology based on applying AI/machine learning(ML)on big data.

    Thinking of BDS additively in terms of the disciplinary cores is a mistake.BDS is more than the sum of its parts.Data science is often likened to storytelling with data.And to tell a good story requires one to have in-depth domain knowledge,such that these idiosyncrasies are carefully considered during data interpretation.In other words,BDS requires synergy amongst its disciplinary core areas.To give an example of the importance of domain and synergy with statistics,proteins do not operate independently but rather,as functional units called protein complexes.For a complex to function,its components must be co-expressed tightly,so that the complex can form in the first place.However,when we interpret a matrix of gene or protein expression from a purely statistical viewpoint,we mistakenly assume that each gene or protein operates independent of each other,a fundamental assumption of many statistical tests.This means that when we try to limit false positive rates,we make corrections based on the total number of genes being considered,even though the genes are not independent of each other(e.g.,two proteins in a protein complex tend to be correlated in their expression profiles).Assuming independence results in overcorrection, causing loss of statistical power.In such cases,a more reasonable approach would have been to make corrections based on the potential number of protein complexes that can be formed instead[6-8].Therefore,the biological domain does not merely create the questions that need to be answered,but it also provides constraints that must be understood and incorporated to create robust models.

    We may also categorize BDS by analytical outcomes.Borrowing from Gartner(www.gartner.com),data science outcomes may be categorized into four levels in the order of difficulty and value:descriptive,diagnostic,predictive,and prescriptive.We have summarized these outcomes and levels in Table 1.

    Currently,most modern-day investigations are at the first two levels.Descriptive analytics is concerned with simple data exploration and data description by plotting basic graphs such as pie charts and line graphs,as well as calculating simple statistics such as mean and median.Diagnostic analytics goes a step further and is concerned with identifying potential underlying causes that can explain why something happens.For example,if the stock market crashes today,we may examine existing data to identify potential causes.It may so happen that a political crisis occurs somewhere else.We know that in general,political uncertainty leads towards economic instability;so,this is a potential explanation,even though it may not in fact,be the correct explanation.The purpose of diagnostic analytics is to attempt,given evidence constraints,to figure out the true cause.To see why it is so hard to determine the true cause in,say,a stock market crash,we only have empirical data showing correlations in the past linking uncertainty and market crashes;it may be just that these two phenomena tend to happen together,that’s all.The more direct way to determine the true cause with certainty is to test for causality;however,it would be unethical and unfeasible to deliberately cause a crisis,just to observe its impact on the stock market.

    Figure 1 The core areas of bio-data science

    Table 1 The four levels of a bio-data science analysis goal or achieved outcome

    Predictive analytics is concerned with translating what we currently know, into judgements on future phenomena.Unlike diagnostic analytics, which retrospectively analyzes the possible explanatory causes,predictive analytics goes a step further and attempts to predict the phenomena before it happens.In order to do so,it needs to have a good grasp on the potential causes and appropriate indicators.But this is all it requires,a good grasp on the causes and indicators.It may be able to predict that something will happen;however,without knowing how the causes and indicators actually work together, it is helpless to change what will eventually happen.

    Being able to control outcome is the realm of prescriptive analytics.Here,a good grasp on the causes and indicators is not enough.Prescriptive analytics demands that you know how the causes work together,and how changes in specific factors will result in a change consistent with the desired outcome.When working with complex systems,although prescriptive analytics is incredibly difficult to achieve,it is also powerful.Prescriptive analytics requires a deep and detailed understanding of the system.In a complex system where many alternative pathways exist,several factors need to be targeted simultaneously in order to achieve an intended effect.Hence,network modeling in biology has proven to be especially vital for prescriptive analytics[9].

    Categorization of BDS by core area or by outcome is useful for theoretical discourse but has otherwise limited practical value.Moreover,in the case of core areas,notions of what should constitute core skills and expertise for data scientists is rapidly evolving.As we enter the‘‘third wave”(at the point of writing),strategic and leadership skills are being increasingly touted as critical areas for enablement and empowerment,which is hardly surprising,as without any charm or charisma,it is difficult to convince other stakeholders to act on advice.As bio-data scientists are probably less concerned with the exigent needs of the business sector,such revisions in the core skill set are useful but nonessential.We do hold the viewpoint that regardless of anyone’s beliefs regarding what should be a disciplinary core area,being an effective bio-data scientist is less about what one knows,than what one does with it.Therefore,emergent skills and behaviors that arise from such divergent multidisciplinary training is more important than the core content(knowledge and skills)themselves.We also cannot emphasize enough that to be an effective bio-data scientist, it is critical to leverage on idiosyncrasies and informative contexts drawn from domain knowledge and use these creatively for problem-solving.

    As far as analytical outcome level is concerned,there are also some gray areas.For example,descriptive analytics may also involve denoising and normalization approaches to some extent without the use of any correlation analysis.Also,an intended‘‘prescriptive”analysis may fall short,perhaps due to unresolvable technical errors or other reasons,such that the predictive model cannot generalize and therefore has to be abandoned at the‘‘diagnostic”level.

    Ultimately,these divisions and classifications,no matter by disciplinary component or by analytical outcome, are arbitrary.

    Despite its seemingly‘‘new”status,BDS is ultimately a science of inquiry,and in this respect,not different from any typical scientific investigation. In the example shown in Figure 2,as a simplified mode of BDS inquiry,we may use the following seven steps to help us answer the question of whether alterations in gene expression correlate meaningfully with mental states.The main difference is that BDS requires strong ability in meaningful data manipulation and analysis,with less emphasis on lower-throughput or underpowered physical experiments.

    Bioinformaticians and computational biologists can be bio-data scientists

    Figure 2 A bio-data science inquiry requires a well-defined question

    We define BDS as the application of data science principles and associated technologies for deriving insights from biodata.This has important implications for drug development,personalized medicine,automated diagnosis,and health service monitoring systems.Currently,some bioinformaticians,depending on their scope and/or research question,already function as bio-data scientists.

    Bioinformatics is the application of information technology(IT)and computer science(CS)to biology.It emerged and evolved in response to the growth of digital biological information,which creates new analytical problems.For example,when full-length DNA or protein sequences became more common, data storage, organization, and representations emerged,paving the way toward pioneering databases such as Dayhoff’s Atlas of protein sequence in 1966[10].In the dawn of the Human Genome Project(HGP)and the emergence of DNA-sequencing technologies,it was unnecessarily arduous to identify overlapping DNA fragments by eye.Such tasks are highly repetitive and can be automated by designing and implementing appropriate algorithms. Bioinformatics emerged in a time to provide support for these emerging analytical requirements.Some successes of bioinformatics include the provision of algorithms for assembling a full genome or performing highly intensive annotation tasks,such as marking approximately 10 million single nucleotide polymorphism(SNP)locations in the human genome.Bioinformatics also includes algorithms for noise removal and bias correction.This includes normalization procedures such as robust microarray analysis(RMA)[11]in microarrays,base-calling[12],and gene length-based correction approaches,e.g.,transcripts per million(TPM)and reads per kilobase million(RPKM)[13],in RNA sequencing.

    Bioinformatics draws upon IT and CS concepts to identify suitable parallels,create reasonable models,and then solve the biological problem.In this respect,bioinformatics acts as a support discipline that solves a technical issue,so that the biologist may move forward in dissection of some biological problems,such as unraveling causal mechanisms that give rise to a phenotype.However,a bioinformatician acting in this respect does not take the lead in generating actionable interventions or building possible explanatory models that lead directly toward understanding the biological problem.

    This is not to say that all bioinformaticians do not care about developing models that explain biological phenomena.Certainly,within many laboratories,many bioinformaticians double up to provide explanatory models by collaborating closely with biologists.We regard activities requiring a bioinformatician to translate the digitized data output into biological insight as the realm of computational biology.A bioinformatician can therefore act as a computational biologist.

    Both bioinformaticians and computational biologists may act as bio-data scientists,provided they use similar skillsets associated with the data science field.This includes being able to tweak,optimize,and deploy ML and AI technologies,and being well trained in applied statistics.Notably,these are not formal training requirements for computational biologists and bioinformaticians currently.

    Computational biologists,bioinformaticians,and bio-data scientists will occupy and share the analytical space in this new digital biology landscape.The distinctions can be muddy,but there is certainly no barrier for a skilled individual to occupy all three professional spaces.Moreover,we do not think that there will be any form of superseding amongst the three professions:bioinformaticians will certainly continue to play important roles as frontline data generators and aggregators.This becomes increasingly important,given the large volumes of data being generated.Computational biologists will evolve,and the biological questions that interest them will change as new possibilities open with the advent of big biodata.Computational biologists may have already been trained bioinformaticians,and certainly,they may cross the barrier/divide to leverage on data science,thus becoming bio-data scientists themselves and using the new know-how to create new solutions to their interested biological problems.

    Finally,bio-data scientists,no matter self-professed or by professional designation,will emerge as new players.They may be existing purveyors from a bioinformatics or computational biology background.Nonetheless,they may also include new players with non-biological backgrounds such as mathematics,physics,and engineering,or even,pure data science or AI training backgrounds.Just as data scientists are transforming other fields,we foretell the emergence of a new breed of bio-data scientists,who will actively shape and lead the narrative for research and development direction in their chosen biological domains or disease contexts.

    Drivers for BDS

    BDS is accelerated by three main drivers:the emergence of big bio-data,the second coming of AI,and a revolution in statistical thinking.

    Emergence of big data in biology

    HGP marked the start of numerous large-scale data acquisition initiatives such as the International HapMap Project[14],1000 Genomes[15],the Cancer Genome Atlas(https://www.cancer.gov/tcga), as well as the recently announced Human Brain Project[16]and the Human Proteome Project[17]. These ambitious initiatives require advances in approaches for data generation,data flow,data storage,data access,and data representation.To address this need,new cloud technologies provide powerful methods for data storage and access beyond the limitations of our local hard drives[18].Parallel/distributed computing methods such as Hadoop provide powerful ways of performing analysis on the cloud[19].

    Today,genotypic data based on DNA and RNA sequences is the major driving force for the evolution of biology into a data science.There are more than 2.7 million samples that are currently available from the Gene Expression Omnibus database(at the point of writing:November 19,2018).Assuming the size of each file is approximately 1 GB(a very modest estimate),size of these samples can easily add up to the amount of 2.7 petabytes(PB).Improvements in RNA sequencing technologies will accelerate data explosion.For instance,the current Illumina HiSeq X sequencing platform can generate 900 billion nucleotides of raw DNA sequence within 3 days.It is estimated that by 2025,the storage of human genomes alone will require 2-40 exabytes(EB)[20,21].Besides genotyping data,other sources of big bio-data are also emerging.These include medical records,phenotyping and trait-based measurements collectively referred to as‘‘phenome,”imaging and microscopy data,as well as network-based information garnered from various interaction-based experiments.

    This changing data landscape did not go by unnoticed.In a survey across 704 National Science Foundation investigators,the unanimous response was that biology is awash with big data[22].Respondents also ranked training on integration of multiple data types,data management and metadata,as well as scaling analysis to cloud/high-performance computing as the three greatest unmet needs critical to advancement in their research fields[22].It appears that the problem is the growing gap between the accumulation of big data and the limited knowledge of researchers about how to use it effectively[22].

    Second coming of AI

    Despite AI being heralded as the technological game changer that will drive the digital economy of the future,this is not the first time such high expectations have been heaped on the AI technology.During the 1970s-1980s,AI was expected to usher in the age of the self-driving car and other technological marvels;these unfortunately did not come to pass,eventually leading to a period known as the ‘‘AI winter” [23].Improvements in AI-based learning platforms,particularly neural networks[24],and newly revitalized paradigms such as deep learning(DL)[25]and reinforcement learning(RL)[26]have created new opportunities and applications.

    RL is loosely defined as learning that does not require perfect or large amounts of data.Encapsulated as an AI system,RL is about making appropriate decision and then taking action to maximize reward in a particular situation or acting under specific constraints(e.g.,chess playing rules).

    DL is loosely defined as architectures that facilitate complex decision-making by modeling AI as neural networks,not unlike the neural connections found in the human brain.DL is compatible with big datasets with high levels of complexities as it aims at learning feature hierarchies from the data,where higher-level features of the hierarchy are formed by composition of lower-level features.We may think of these multi-level features as abstractions,allowing a system to learn complex inputs without necessarily depending completely on pre-defined human-based inputs.DL is gaining great popularity in biological research,with novel applications in proteomics[27],genomics[28],and biomedicine[29-31].

    While there is much anticipated potential that has led to several high-profile tie-ups between IBM’s Watson AI and various pharmaceutical giants(see the section‘‘Trends and expectations for BDS”),it is important to remember that AI and ML techniques are intimately connected,of which the latter is already commonplace in bioinformatics.Algorithms for gene finding based on hidden Markov models(HMMs),e.g.,GENSCAN[32],and neural networks for motif finding[33]are just a few notable examples.A key difference between the AI applications of old days,and today’s new applications is scale,wherein AI is expected to identify long-range patterns and perform multi-omics integration across various levels of big bio-data,such as the genome and proteome,thus proposing mechanisms and/or testable targets.

    A revolution in statistical thinking

    The field of statistics is undergoing a major transformation.Scientific arguments based solely on P values are no longer viewed as sufficiently robust.For example,a replication study across leading psychology journals has revealed that <50%of the studies examined are replicated[34].Halsey et al demonstrate the instability and variability of P values;even as sample size increment and exact replication experiments(EREs)converge on the true effect size,there lacks any concomitant reduction in the variability of P values[35].Halsey et al’s work partially explains the high non-replication rates in Ioannidis’experiment[34]and warns against the use of the convenient yet ill-founded strategy of claiming conclusive research findings solely on the basis of P values,despite it being a commonly accepted practice.

    Relatively simple mitigating measures against P value instability include using confidence intervals(CIs)[35](although this viewpoint has also been confronted by van Helden[36]),ranking variables by effect sizes[37],reporting the P value replicability or p-rep[38,39],and performing repeated subsampling on the data to determine if the findings are consistent[40].There has already been much discussion regarding the nature of P values;therefore,we will not elaborate this further.

    A very useful, and in our opinion, a more balanced approach is to incorporate Bayesian thinking,when it comes to reasoning about the P values.The Bayesian perspective says that instead of only considering the evidence that suggests support for a true effect,we should consider the evidence in totality,which also includes considering the same evidence that suggests support for a non-true effect.

    We may express the probability[P(T|e)]for a true effect(T),given some evidence(e).By Bayes’theorem,the probability is expressed as follows:

    The right hand side is the probability of obtaining a true effect,P(T),which is multiplied by the probability of obtaining some evidence,e,given a true effect,P(e|T),and divided by the probability of observing the evidence,e,independently.We also need to consider the probability[P(-T|e)]for a non-true effect(-T),given the same evidence(e).Accordingly,the probability is expressed via Bayes’s theorem as follows:

    The right hand side is the probability of obtaining a nontrue effect,P(-T),which is multiplied by the probability of obtaining some evidence,e,given a non-true effect,P(e|-T),and divided by the probability of observing the evidence,e,independently.

    Given some evidence e,we may then calculate the odds of obtaining true effects against non-true effects as follows:

    When people observe strong effect(e.g.,a significant P value)in support of their hypothesis,they will think that there is a true effect.However,they often fail to consider the alternative possibility that a significant P value can also arise when there is no true effect.Thus,the Bayesian perspective is more balanced.We can use this perspective in practical settings.For example,when a gene is reported as significantly correlated with a phenotype,we will be less inclined to immediately declare this finding as important without first estimating the likelihood that the same gene will also be reported as significantly correlated,even if it has no true correlation with the phenotype.This perspective can also be usefully extended toward situations beyond‘‘no effect”to situations wherein a significant result is due to a confounder(e.g.,batch effects)as well.

    A second important and changing statistical perspective is the movement against blind use of centralities such as mean,mode,and median.In symmetrical distributions,the arithmetic mean and median,combined with a sense of the underlying dispersion such as the standard deviation or interquartile range,are generally useful metrics.However,there are many instances wherein the use of centralities is unwarranted and extreme metrics such as minimum and maximum values are actually more useful[41]in situations,including adverse environments where a biological phenomenon is rare[42].To provide an example,suppose we are interested in examining the optimal configuration for fire resistance,given a fixed number of trees and lakes in a simulation model.The model that provides the maximum number of surviving trees would be the optimal configuration we want.Suppose we simulate the random placement of trees and lakes and return the number of surviving trees each round.In this case,reporting the average values of the models only tells us on average what is the remaining number of trees but is otherwise pointless.

    The third perspective is the recognition of the gap between theoretical and applied statistics.The studies of Halsey et al and Ioannidis et al,wherein the former reports P value instability leading to the Winner’s curse(the analogy pertains to one winning the lottery out of sheer chance,just as a false positive but spectacular finding also arises due to chance)and the latter shows that >50%of real-world studies in psychology are not reproducible,have demonstrated that theoretical statistical perspectives do not work well in practice[34,35].Similarly,in our own practice,we have also found that statistical significance is abundant,due to the presence of confounders and other irrelevant factors[43].This is also known as the Anna Karenina effect.Such problems are remediable by performing statistical analysis more logically and considering disparities and idiosyncrasies associated with both statistical techniques and data[43].

    Trends and expectations for BDS

    The rise of data science,AI,and ML has led toward several high-level collaborations between industry and computing firms,spawned new biotechnology companies,and created new opportunities for advancing scientific discovery.

    We list a few examples.In late 2016,pharmaceutical giant company Pfizer announced a collaboration with IBM,involving the use of the latter’s Watson AI for immuno-oncological research. In June 2017, GNS Healthcare and Genentech(Roche)announced a collaboration to use the causal ML and simulation platform of GNS Healthcare to power development of novel cancer therapies.In that same month,Novartis also announced a collaboration with IBM Watson to use AI for improving health outcomes in patients with breast cancer.New enterprises are also emerging rapidly. For example,XtalPi is a pharmaceutical technology company that is reinventing the industry’s approach toward drug R&D with its Intelligent Digital Drug Discovery and Development(ID4)platform,which integrates quantum mechanics,AI,and cloud computing, thus allowing pharmaceutical companies to increase their efficiency,accuracy,and success rates at critical stages of drug R&D.Since 2016,Bayer has been offering money through its grant programs,with clear preference for AI medical startups working on cancer(Turbine)and preventable diseases(xbird).

    Besides pharmaceuticals,there are also instances of AI-led advances in biological research. For example, Allen Cell Explorer uses ML to predict stem cell topology based on thousands of images;BenevolentAI and Microsoft Academic AI are learning algorithms that process natural language,formulate new ideas from what they read,and sift through vast chemical libraries,medical databases,and conventionally presented scientific papers to establish connections across knowledge networks.

    Biological education is also expected to benefit from the advancement of AI/ML and data science.Smart learning platforms based on adaptive learning models are emerging[44].

    Risks for BDS

    BDS will be a challenging field,but its difficulties are not necessarily distinct from those of bioinformatics or computational biology.Biological systems are highly complex,while the technological platforms intended for assaying these biological systems are in themselves also highly sophisticated.Moreover,technological instruments developed for measuring biological entities are subject to technical uncertainty,while the components of biological systems change and vary naturally over time.Big bio-data is not a natural solution for such issues,and it presents new difficulties.While big bio-data may facilitate data science endeavors,such as the process of identifying conserved patterns over very large numbers of observations,it may only do so if appropriate analytical pipelines are developed.This task is non-trivial.One may imagine such an analytical pipeline as an end-to-end integration of various approaches,forming an analysis stack starting with data collection and continuing through computational and statistical evaluations toward higher-level biological interpretations and insights.A simplified pipeline for biomarker analysis from high-throughput omics data and the associated key considerations are shown in Figure 3.

    Analytical pipelines need to be very flexible and change according to the needs of the research question.Since we lack perfect knowledge,it is also usual to iterate and refine,moving back and forth across several steps,to achieve some sense of optimization and reproducibility.For example,suppose in the normalization step,we find that the use of two different normalization procedures results in very different and nonoverlapping differential gene sets.It is possible that the normalization procedure makes erroneous assumptions about the data or that it may have been wrongly implemented.The key considerations shown in Figure 3 are non-exhaustive.The purpose of showing the steps with examples of considerations is to demonstrate that while there is no perfect system or pipeline,given each step,there are many considerations,with each decision point having consequence for the steps that come afterward.We also need to evaluate compatibility issues,such as whether a particular normalization approach works well with a downstream statistical procedure.Other issues include whether a particular procedure might lead toward over-cleaning and overcorrection(problems associated with batch effect correction algorithms[45,46]and some multiple test correction methods[47]).BDS may be likened to recipe development in the kitchen,requiring multiple rounds of trial-and-error,while keeping a close eye on the intended endpoint or objective.There is no route map or standard operating procedure that guarantees a universally good result.In this regard,BDS is as much an art as it is a science[43,48].

    Suppose we are able to reach our intended analytical objective;it still should not be forgotten that the output is ultimately based on inference.And inferences,when based on massive data wherein we are less able to control heterogeneity and variability,run the risk of generating errors(both false positives and false negatives).These errors in turn lead to overfitting,that is,the predictive models are over-tuned to work well only on the training data but not on future independently generated datasets.

    Figure 3 An analysis pipeline for biomarker analysis using a data-centric approach

    In practice,good research and development should include an accurate evaluation of error rates, and good methods should minimize error rates where practical.However,there is always a trade-off between getting only correct answers(higher false negative rate)and getting all the correct answers(higher false positive rate).Furthermore,estimations of error rates may be off,if the statistical model is a poor fit with the data,for example,using reference models that assume normality of distribution when the data is clearly non-normally distributed.

    Toward a unified BDS curriculum

    There are many insertion points into BDS.A computer scientist,statistician,or fintech analyst may enter the field by increasing their biological domain knowledge.A practicing computational biologist and bioinformatician may strengthen their statistical knowledge,learn parallel computing platforms such as Apache Spark or Hadoop,and learn how to use ML and AI implementations such as TensorFlow.Professional training in the BDS landscape will prove highly heterogeneous.It would take more work(and time)for a pure biologist to crossover,as fundamental training in mathematics,statistics,and computing would be required.

    The increased momentum toward data science has led to education reforms internationally.In recent years,the University of California,Berkeley and Carnegie Mellon University have sought to make digital literacy(basic programming and data science)a core component of all undergraduate education. Where BDS is concerned, the School of Biological Sciences,Nanyang Technological University(NTU),in consultation with other stakeholders,has proposed the following curricula(Figure 4)for BDS.The basic purpose is to equip biological science undergraduates with timely computational thinking and digital literacy skills essential for the modern economy.This set of courses is also meant to provide an insertion point for undergraduates to pursue further training as biodata scientists.

    At the graduate level,a handful of Masters/PhD-level programs using the term BDS have emerged as well.The University of Wisconsin-Madison has launched a pre-doctoral training program and Master’s program in BDS,with emphasis on statistics,mathematics,data visualization,and ML.Other similar and related programs include the Masters of health data science,available in the Faculty of Biology,Medicine and Health,University of Manchester,UK;the Masters of biomedical research(data science),available in the Faculty of Medicine,Imperial College London,UK;and the Masters of biostatistics and data science,available in the Graduate School of Medical Sciences,Cornell University,USA.Beginning from 2020,the School of Biological Sciences,Nanyang Technological University(NTU)will also offer a Masters program in biomedical data science.

    We believe what should constitute the core curriculum of BDS is still being formulated and may take several more years before the field matures and stabilizes.We have noticed that the term BDS is now being marketed in some graduate programs.In several cases,besides a name change,the distinction between BDS and bioinformatics/biostatistics is not explicit.While we agree that bioinformatics knowledge is essential in BDS training,it is less clear as to exactly which aspects of bioinformatics are relevant and must be included.The advent of BDS will also drive changes in bioinformatics education,as educators re-examine the course content for timely relevance,and explore areas for synergistic collaboration.Indeed,given the rise of big data,educators are questioning if current bioinformatics curriculum include sufficient components to address this issue.After all,many bioinformatics programs were established before big data became a prominent area of focus.

    Figure 4 Current digital literacy offerings in SBS,NTU to facilitate immersion into the bio-data science

    For biology educators seeking to implement a BDS curriculum,we feel that it is crucial not to just teach programming and employing existing software tools such as TensorFlow[49].Educational components incorporating abstract,algorithmic,and logical thinking(computational thinking),which are important for problem-solving,are absolutely necessary.

    Some analytical situations requiring BDS

    BDS will emerge as a new discipline in light of novel challenges stemming from big bio-data,an increasing recognition of the gulf between applied and theoretical statistics,and expectations heaped upon it given the rise of AI.In this section,we describe some interesting challenges for BDS.

    Creating new perspectives in doing cross-validation right

    In our‘‘Turning straw into gold”paper,it is shown that about 50%of randomly generated(and therefore meaningless)gene signatures work well on a given breast cancer survival dataset,with some even outperforming published signatures[50].On the surface,this would imply rather dramatically that all manuscripts focusing on finding prognostic signatures on breast cancer survival are a waste of effort(and therefore,that all manuscripts with focus on finding such signatures should be rejected without review).Of course,that would be too drastic.However,it does suggest that if we rethink more deeply,even higher stringency should be placed on validation than currently practiced by data mining or ML researchers.In particular, given the observation that a random signature has about 50%chance to be significant in a dataset,more independent datasets must be used to ensure that the observed associations are not due to chance.

    Assuming that the datasets are fully independent,we also observed that seven datasets are needed to ensure that a random signature has <1%chance to be universally significant in all seven datasets.This requirement(of seven independent test datasets)is much higher than the common practice of simple cross-validation on a training dataset and a single independent test dataset in the data mining and ML communities.In other words,biology demands higher proof of generalizability.

    Perceived interdependence of datasets in independent validation

    In our meta-analysis of various breast cancer datasets,we also observed that the number of independent datasets in which a randomly generated gene signature is significant is not distributed according to the binomial distribution,although the mode of the distribution is preserved and accentuated[50].This suggests that the independent datasets might not be fully independent despite being collected from different independent groups.Perhaps there are some shared intrinsic population characteristics that confound the random signatures(besides the effects of proliferation-associated genes,which is reportedly a major source of confounding effects).A deeper investigation into the meta-characteristics of these datasets is therefore useful and may reveal the existence of yet,unreported confounders.In other words,while existing ML and AI practitioners may use only one independent validation approach,there are instabilities associated with this extremely crucial step.Just because an independent validation proved positive does not mean that the gene signature is truly good.It could also be because it so happens that the independent validation dataset has some commonalities with the training data,and that therefore,data leakage has occurred.

    Stop to question even when prediction accuracy is good

    Suppose we train a neural network W,on a training set and test it on a test set only to get a high accuracy(e.g.,90%).Next,we randomly remove two edges in W to get a new network W′and train/test it on the same training/testing set as W,it is very likely to get a high accuracy similar to that of W.

    Now,suppose we randomly generate lots of new test data and feed these to both W and W′.Although we have no idea what the true class labels on the new test data are,we still can determine whether W and W′agree on these test data(i.e.,W and W′agree—both predict‘‘yes”or both predict‘‘no,”or W and W′disagree—one predicts‘‘yes”and the other‘‘no”).It can be observed quite often that W and W′would drastically disagree on the new test data(with disagreement rates that may be >50%of the new test instances).This means that despite having very similar and common origins,we may have the following findings.(1)W and W′are drastically different rules/models;(2)a single test dataset is insufficient to validate W and ensure that it is meaningful;and(3)there is often significant sampling gap/bias in a test dataset.A corollary is:(4)it is critical to carefully analyze W to obtain/derive a full explanation of the set of rules it represents and to properly ascertain the biological meaningfulness of these rules.

    In short,the ability to achieve a good prediction accuracy may have little to do with the true biological meaning.This is also a major stumbling block when transcending from‘‘predictive”toward‘‘prescriptive”analytical levels.While we offer no direct solution to this problem,it is important to realize that ML and AI are but tools with high tuneability and many performance exceptions.It is therefore important that aspiring bio-data scientists to train hard on logical thinking processes instead of merely relying on feeding massive heaps of data into an algorithmic blackbox.If good results are obtained,the good performance may be misleading.If bad results are obtained,knowing the likely factors to consider and test is crucial,instead of trial-and-error,which may prove daunting when there are many more variables in big data to consider.

    Conclusion

    Biology has a golden opportunity to ride on the current data science wave.This will inevitably give rise to a new subfield—BDS. We are beginning to see new initiatives and achievements as a result.We foresee the rise of bio-data scientists as a new breed of specialists who will act as navigators and overseers in directing and leading future innovation from a data-centric/informed perspective.

    Competing interests

    The authors have declared no competing interests.

    Acknowledgments

    WWBG gratefully acknowledges support from the Accelerating Creativity and Excellence(ACE)and EdeX grants from Nanyang Technological University,Singapore.WWBG also acknowledges the support from a National Research Foundation of Singapore-National Natural Science Foundation of China (Grant No. NRF2018NRF-NSFC003SB-006). LW acknowledges support from a Kwan Im Thong Hood Cho Temple Chair Professorship and the National Research Foundation Singapore under its AI Singapore Programme(Grant Nos. AISG-100E-2019-027 and AISG-100E-2019-028).WWBG and LW also acknowledge Alex Bateman for close reading and contribution of ideas that helped improve the manuscript.

    ORCID

    0000-0003-3863-7501(Goh WWB)

    0000-0003-1241-5441(Wong L)

    免费高清在线观看日韩| 国产av国产精品国产| 美女国产高潮福利片在线看| 亚洲国产毛片av蜜桃av| 老司机影院毛片| 欧美日韩一级在线毛片| www.自偷自拍.com| 国产精品二区激情视频| 69精品国产乱码久久久| 国产av国产精品国产| 日本黄色视频三级网站网址 | 色94色欧美一区二区| 在线观看免费日韩欧美大片| 我的亚洲天堂| 老熟妇仑乱视频hdxx| 国产91精品成人一区二区三区 | 国产一区二区三区视频了| 一区二区日韩欧美中文字幕| 日韩大码丰满熟妇| 最近最新中文字幕大全电影3 | 国产无遮挡羞羞视频在线观看| av视频免费观看在线观看| 日本a在线网址| 国产区一区二久久| 欧美日韩国产mv在线观看视频| 男人舔女人的私密视频| 露出奶头的视频| 国产欧美日韩综合在线一区二区| 99久久99久久久精品蜜桃| 亚洲专区中文字幕在线| 叶爱在线成人免费视频播放| 婷婷丁香在线五月| 亚洲欧美日韩另类电影网站| 人人妻人人爽人人添夜夜欢视频| 亚洲人成电影免费在线| 咕卡用的链子| 美女午夜性视频免费| 黄频高清免费视频| 午夜久久久在线观看| 久久久久网色| kizo精华| 亚洲天堂av无毛| 精品一区二区三区av网在线观看 | 18禁美女被吸乳视频| 午夜福利在线观看吧| 国产精品秋霞免费鲁丝片| 美女扒开内裤让男人捅视频| av国产精品久久久久影院| 老司机靠b影院| 欧美黄色片欧美黄色片| 久久精品成人免费网站| 50天的宝宝边吃奶边哭怎么回事| 男人舔女人的私密视频| 久久人妻熟女aⅴ| 国产精品亚洲av一区麻豆| 香蕉久久夜色| 国产xxxxx性猛交| 黄频高清免费视频| 国产人伦9x9x在线观看| 蜜桃国产av成人99| 老司机影院毛片| 两人在一起打扑克的视频| 久久精品国产综合久久久| www.自偷自拍.com| 国产成人精品无人区| 亚洲综合色网址| 女性被躁到高潮视频| 美女午夜性视频免费| 欧美在线一区亚洲| av天堂久久9| 激情视频va一区二区三区| 亚洲,欧美精品.| 中文欧美无线码| 国产在线精品亚洲第一网站| 好男人电影高清在线观看| 欧美日韩中文字幕国产精品一区二区三区 | 国产成人精品久久二区二区91| 老司机福利观看| 女性被躁到高潮视频| av欧美777| 久久免费观看电影| 国产99久久九九免费精品| 欧美 日韩 精品 国产| 亚洲精品国产精品久久久不卡| 一进一出抽搐动态| 欧美人与性动交α欧美软件| 亚洲视频免费观看视频| bbb黄色大片| 亚洲国产欧美网| 无遮挡黄片免费观看| 亚洲欧美一区二区三区黑人| 最近最新中文字幕大全电影3 | 亚洲国产毛片av蜜桃av| 大型av网站在线播放| 操美女的视频在线观看| 精品国产国语对白av| 蜜桃国产av成人99| 搡老岳熟女国产| 久久天堂一区二区三区四区| 国产三级黄色录像| 久久久久久久精品吃奶| 久久香蕉激情| 老司机福利观看| 精品卡一卡二卡四卡免费| 亚洲欧美激情在线| 丁香六月天网| 欧美激情极品国产一区二区三区| 男女下面插进去视频免费观看| 黄色片一级片一级黄色片| 久久影院123| 在线十欧美十亚洲十日本专区| 久久久久久久大尺度免费视频| 亚洲精品一二三| 男女下面插进去视频免费观看| 极品人妻少妇av视频| 高潮久久久久久久久久久不卡| 成人永久免费在线观看视频 | 欧美激情高清一区二区三区| 多毛熟女@视频| 亚洲欧美精品综合一区二区三区| 国产成人系列免费观看| 精品福利永久在线观看| 夫妻午夜视频| 欧美日韩av久久| 18禁国产床啪视频网站| 精品国产一区二区久久| 欧美日韩精品网址| 亚洲精品成人av观看孕妇| 另类亚洲欧美激情| 黄色片一级片一级黄色片| 午夜福利影视在线免费观看| 亚洲精品一二三| 69精品国产乱码久久久| a级毛片黄视频| 国产一卡二卡三卡精品| 99热网站在线观看| 一边摸一边做爽爽视频免费| 国产精品自产拍在线观看55亚洲 | 久久国产亚洲av麻豆专区| 97在线人人人人妻| 免费看十八禁软件| 国产成人精品久久二区二区免费| 国产主播在线观看一区二区| 日韩有码中文字幕| 欧美日韩av久久| 99精品欧美一区二区三区四区| 国产欧美日韩综合在线一区二区| 美女福利国产在线| 国产高清激情床上av| 国产精品成人在线| 9191精品国产免费久久| 黄片小视频在线播放| 日韩有码中文字幕| 欧美精品亚洲一区二区| videosex国产| 黄色丝袜av网址大全| 少妇粗大呻吟视频| 欧美精品人与动牲交sv欧美| 99国产精品99久久久久| 午夜福利一区二区在线看| 波多野结衣一区麻豆| 午夜91福利影院| 久久精品熟女亚洲av麻豆精品| 大香蕉久久成人网| 又黄又粗又硬又大视频| 人人妻人人爽人人添夜夜欢视频| 一本—道久久a久久精品蜜桃钙片| 国产精品免费视频内射| 极品教师在线免费播放| 国产在线精品亚洲第一网站| 日本欧美视频一区| 久久午夜亚洲精品久久| 欧美国产精品va在线观看不卡| 日韩欧美一区二区三区在线观看 | 女人精品久久久久毛片| 999久久久精品免费观看国产| 国产三级黄色录像| 人妻久久中文字幕网| 亚洲三区欧美一区| 亚洲av电影在线进入| 国产成人欧美在线观看 | 亚洲成av片中文字幕在线观看| 久久久精品国产亚洲av高清涩受| 在线观看66精品国产| 岛国毛片在线播放| 久久久精品94久久精品| 日本五十路高清| 久久天躁狠狠躁夜夜2o2o| 亚洲欧美日韩高清在线视频 | 国产aⅴ精品一区二区三区波| 三上悠亚av全集在线观看| 亚洲av第一区精品v没综合| 久久人妻av系列| 国产深夜福利视频在线观看| 亚洲精品自拍成人| 一级毛片精品| 国产欧美日韩一区二区三| 最黄视频免费看| av不卡在线播放| 日韩视频在线欧美| 免费观看a级毛片全部| 99国产精品一区二区三区| 亚洲精品一二三| 亚洲成a人片在线一区二区| 麻豆乱淫一区二区| 欧美在线一区亚洲| 国产在视频线精品| 国产在线免费精品| 久久影院123| 在线观看人妻少妇| 99精品久久久久人妻精品| 女性被躁到高潮视频| 99在线人妻在线中文字幕 | 97人妻天天添夜夜摸| 日本黄色日本黄色录像| √禁漫天堂资源中文www| 中文字幕制服av| 亚洲一码二码三码区别大吗| 亚洲av片天天在线观看| 十八禁网站网址无遮挡| 国产片内射在线| 午夜免费成人在线视频| 国产熟女午夜一区二区三区| 五月天丁香电影| 亚洲熟妇熟女久久| 黄色丝袜av网址大全| 午夜免费成人在线视频| 日本av免费视频播放| 欧美 亚洲 国产 日韩一| 日本精品一区二区三区蜜桃| 欧美日韩亚洲国产一区二区在线观看 | 欧美性长视频在线观看| 国产一区二区三区综合在线观看| av一本久久久久| 精品人妻1区二区| 高清毛片免费观看视频网站 | 精品久久蜜臀av无| 999久久久国产精品视频| 一级,二级,三级黄色视频| 啦啦啦在线免费观看视频4| 亚洲国产欧美日韩在线播放| 亚洲精华国产精华精| 91九色精品人成在线观看| 一本一本久久a久久精品综合妖精| 久久久久国产一级毛片高清牌| 久久久久久久大尺度免费视频| 亚洲国产欧美一区二区综合| 欧美精品啪啪一区二区三区| 久久人妻福利社区极品人妻图片| 国产av国产精品国产| 国产免费av片在线观看野外av| 老司机午夜十八禁免费视频| 每晚都被弄得嗷嗷叫到高潮| 国产黄频视频在线观看| 久热爱精品视频在线9| 丝袜美腿诱惑在线| 久久国产亚洲av麻豆专区| 国产不卡av网站在线观看| 日韩中文字幕视频在线看片| 亚洲少妇的诱惑av| 久久这里只有精品19| 午夜两性在线视频| 亚洲国产欧美一区二区综合| 欧美激情久久久久久爽电影 | 一二三四在线观看免费中文在| 亚洲性夜色夜夜综合| 两个人看的免费小视频| 蜜桃在线观看..| 狂野欧美激情性xxxx| 在线亚洲精品国产二区图片欧美| 国产精品成人在线| 午夜两性在线视频| 亚洲一区二区三区欧美精品| 国产精品1区2区在线观看. | 成人手机av| 久久久久网色| 狠狠狠狠99中文字幕| 法律面前人人平等表现在哪些方面| 国产在线一区二区三区精| 成人手机av| 国产精品一区二区精品视频观看| 国产成人精品久久二区二区91| 9191精品国产免费久久| 91精品三级在线观看| 色尼玛亚洲综合影院| 色在线成人网| 性少妇av在线| 精品人妻1区二区| av福利片在线| 12—13女人毛片做爰片一| 久久精品成人免费网站| 在线观看66精品国产| 亚洲国产精品一区二区三区在线| 两性午夜刺激爽爽歪歪视频在线观看 | 三上悠亚av全集在线观看| 桃红色精品国产亚洲av| 黑人巨大精品欧美一区二区mp4| 久久人妻熟女aⅴ| 99久久99久久久精品蜜桃| 成年人免费黄色播放视频| 国产高清videossex| 在线观看人妻少妇| 国产区一区二久久| 亚洲美女黄片视频| 又大又爽又粗| av电影中文网址| 考比视频在线观看| 中文字幕人妻熟女乱码| 国产成+人综合+亚洲专区| 国产不卡av网站在线观看| 中文字幕人妻熟女乱码| 午夜福利影视在线免费观看| 久久天堂一区二区三区四区| 午夜老司机福利片| 五月天丁香电影| 91精品三级在线观看| 最新在线观看一区二区三区| 又紧又爽又黄一区二区| 欧美日韩中文字幕国产精品一区二区三区 | 国产成人精品无人区| 怎么达到女性高潮| 丁香欧美五月| 欧美乱妇无乱码| 香蕉久久夜色| 欧美激情 高清一区二区三区| 日韩一卡2卡3卡4卡2021年| 国产精品1区2区在线观看. | 午夜福利乱码中文字幕| 日本av免费视频播放| netflix在线观看网站| 久热这里只有精品99| 久久久国产成人免费| 亚洲自偷自拍图片 自拍| 亚洲,欧美精品.| 两人在一起打扑克的视频| 国产亚洲av高清不卡| 成人亚洲精品一区在线观看| 免费久久久久久久精品成人欧美视频| 国产一区二区三区视频了| 久久人妻熟女aⅴ| 我的亚洲天堂| 欧美精品高潮呻吟av久久| 国产色视频综合| 欧美性长视频在线观看| 亚洲久久久国产精品| 777米奇影视久久| 国产成人欧美| 在线看a的网站| 啦啦啦中文免费视频观看日本| 国产精品香港三级国产av潘金莲| 一区二区av电影网| 亚洲va日本ⅴa欧美va伊人久久| 欧美黑人欧美精品刺激| 精品午夜福利视频在线观看一区 | 黄色视频,在线免费观看| 精品国产乱子伦一区二区三区| 涩涩av久久男人的天堂| 亚洲五月婷婷丁香| 老司机深夜福利视频在线观看| 久久久欧美国产精品| 一区二区av电影网| 亚洲专区国产一区二区| 午夜福利在线免费观看网站| 搡老乐熟女国产| 无遮挡黄片免费观看| 一区在线观看完整版| www日本在线高清视频| 国产在线视频一区二区| 老司机福利观看| 久久人人97超碰香蕉20202| 麻豆av在线久日| 欧美精品啪啪一区二区三区| 大香蕉久久网| 中文字幕最新亚洲高清| 国产男靠女视频免费网站| 亚洲色图 男人天堂 中文字幕| 亚洲一区中文字幕在线| 色94色欧美一区二区| av天堂久久9| 天天影视国产精品| 99香蕉大伊视频| 97人妻天天添夜夜摸| 日韩视频一区二区在线观看| 黄片播放在线免费| 精品一品国产午夜福利视频| 丰满人妻熟妇乱又伦精品不卡| 久久狼人影院| 一级毛片电影观看| 高清视频免费观看一区二区| 日日摸夜夜添夜夜添小说| 国产一区二区激情短视频| 成人精品一区二区免费| 99精国产麻豆久久婷婷| 成人永久免费在线观看视频 | 大片免费播放器 马上看| 蜜桃在线观看..| av又黄又爽大尺度在线免费看| 免费看a级黄色片| 99香蕉大伊视频| 免费在线观看视频国产中文字幕亚洲| 欧美日本中文国产一区发布| 久久久久精品国产欧美久久久| 日韩视频在线欧美| 十分钟在线观看高清视频www| 男人舔女人的私密视频| 久久九九热精品免费| 亚洲欧美一区二区三区久久| 啦啦啦 在线观看视频| 国产精品一区二区在线不卡| 欧美精品高潮呻吟av久久| 国产欧美日韩综合在线一区二区| 一边摸一边做爽爽视频免费| 精品免费久久久久久久清纯 | 热99国产精品久久久久久7| 久久久国产精品麻豆| 丰满迷人的少妇在线观看| 国产在线观看jvid| 99久久国产精品久久久| 精品一区二区三区视频在线观看免费 | 十分钟在线观看高清视频www| 成人av一区二区三区在线看| 黄片大片在线免费观看| av免费在线观看网站| 一边摸一边抽搐一进一小说 | 考比视频在线观看| 久久久久久亚洲精品国产蜜桃av| 水蜜桃什么品种好| 国产精品电影一区二区三区 | 国产人伦9x9x在线观看| av线在线观看网站| 一区二区三区精品91| 热99国产精品久久久久久7| 亚洲中文av在线| 黄色丝袜av网址大全| 国产一区有黄有色的免费视频| 女警被强在线播放| 亚洲国产欧美一区二区综合| 汤姆久久久久久久影院中文字幕| 蜜桃国产av成人99| 人人妻人人澡人人看| 亚洲av第一区精品v没综合| 免费看a级黄色片| 精品人妻熟女毛片av久久网站| av超薄肉色丝袜交足视频| 在线观看免费日韩欧美大片| 国产亚洲欧美精品永久| 老司机亚洲免费影院| 美女视频免费永久观看网站| 在线观看一区二区三区激情| 狠狠婷婷综合久久久久久88av| 免费在线观看黄色视频的| 色播在线永久视频| 日本欧美视频一区| 纵有疾风起免费观看全集完整版| 国产高清videossex| 热re99久久精品国产66热6| 宅男免费午夜| 在线观看免费午夜福利视频| 99国产精品99久久久久| 久久精品熟女亚洲av麻豆精品| 老鸭窝网址在线观看| 精品国产乱子伦一区二区三区| 亚洲精品在线观看二区| 亚洲av日韩在线播放| 久久人人爽av亚洲精品天堂| 久久毛片免费看一区二区三区| 99国产精品免费福利视频| 视频区图区小说| 我要看黄色一级片免费的| 啦啦啦免费观看视频1| 免费人妻精品一区二区三区视频| 国精品久久久久久国模美| 日本欧美视频一区| av福利片在线| 高清欧美精品videossex| av视频免费观看在线观看| 日本一区二区免费在线视频| 成人永久免费在线观看视频 | 大片电影免费在线观看免费| 不卡一级毛片| 成人亚洲精品一区在线观看| 老司机福利观看| 精品国产亚洲在线| 精品久久久久久久毛片微露脸| 高清在线国产一区| av天堂在线播放| 欧美精品一区二区大全| 啦啦啦在线免费观看视频4| 国产真人三级小视频在线观看| 免费不卡黄色视频| 成人特级黄色片久久久久久久 | 免费观看a级毛片全部| 国产老妇伦熟女老妇高清| 美女高潮喷水抽搐中文字幕| 十分钟在线观看高清视频www| 动漫黄色视频在线观看| 欧美激情高清一区二区三区| 正在播放国产对白刺激| 丁香六月天网| 一区在线观看完整版| 亚洲色图综合在线观看| 女人被躁到高潮嗷嗷叫费观| 欧美精品人与动牲交sv欧美| 国产亚洲精品一区二区www | 蜜桃在线观看..| 亚洲avbb在线观看| 亚洲av成人不卡在线观看播放网| 亚洲精品自拍成人| 欧美乱码精品一区二区三区| 久久中文看片网| 欧美日韩一级在线毛片| 国产精品一区二区精品视频观看| 一个人免费在线观看的高清视频| 久久久久久久久免费视频了| 亚洲国产看品久久| 色婷婷av一区二区三区视频| 在线观看免费视频日本深夜| 在线看a的网站| 亚洲欧美一区二区三区久久| 成年人免费黄色播放视频| 欧美大码av| 日韩欧美一区视频在线观看| 国产亚洲欧美精品永久| 亚洲国产看品久久| 久久久久视频综合| 欧美精品高潮呻吟av久久| 国产极品粉嫩免费观看在线| 飞空精品影院首页| 亚洲人成电影观看| 国产精品久久久久成人av| 丝袜美腿诱惑在线| 国产精品久久电影中文字幕 | 亚洲国产中文字幕在线视频| 男女午夜视频在线观看| 视频区图区小说| av线在线观看网站| 精品国产一区二区久久| 999久久久国产精品视频| 成年人黄色毛片网站| 亚洲精品国产区一区二| 欧美精品人与动牲交sv欧美| a级毛片黄视频| 国产精品av久久久久免费| 精品高清国产在线一区| 国产一卡二卡三卡精品| 国产亚洲午夜精品一区二区久久| 精品亚洲成国产av| 一区二区av电影网| 乱人伦中国视频| 久9热在线精品视频| 午夜两性在线视频| 久久久久久久久久久久大奶| 成年版毛片免费区| 久久中文字幕人妻熟女| 精品少妇久久久久久888优播| 欧美激情 高清一区二区三区| 首页视频小说图片口味搜索| 亚洲成人免费av在线播放| 久9热在线精品视频| 亚洲 国产 在线| 极品人妻少妇av视频| 精品亚洲成国产av| 亚洲成a人片在线一区二区| 999久久久国产精品视频| 人人妻人人澡人人看| 亚洲成人免费av在线播放| 日本vs欧美在线观看视频| 999久久久国产精品视频| 国产极品粉嫩免费观看在线| 日韩免费av在线播放| 欧美日韩福利视频一区二区| 婷婷成人精品国产| 久久香蕉激情| 每晚都被弄得嗷嗷叫到高潮| 成人手机av| 亚洲中文字幕日韩| 91国产中文字幕| 久久久精品区二区三区| 超色免费av| 亚洲欧美精品综合一区二区三区| 69av精品久久久久久 | 欧美激情极品国产一区二区三区| 50天的宝宝边吃奶边哭怎么回事| 日韩三级视频一区二区三区| av天堂久久9| 国产欧美日韩一区二区三| 午夜成年电影在线免费观看| 国产一区有黄有色的免费视频| 色婷婷av一区二区三区视频| 极品少妇高潮喷水抽搐| 两人在一起打扑克的视频| 国产野战对白在线观看| 狂野欧美激情性xxxx| 亚洲欧美色中文字幕在线| 热99re8久久精品国产| 亚洲国产成人一精品久久久| 免费在线观看日本一区| 国产免费av片在线观看野外av| 国产片内射在线| 高清视频免费观看一区二区| 亚洲精品av麻豆狂野| 亚洲av日韩精品久久久久久密| 国产成人精品久久二区二区免费| 精品国产一区二区三区久久久樱花| 不卡av一区二区三区| 日本a在线网址| 麻豆成人av在线观看| 一区二区三区精品91| 欧美日韩黄片免| 亚洲天堂av无毛| 不卡一级毛片| 欧美日韩成人在线一区二区| 一级a爱视频在线免费观看| 亚洲一区二区三区欧美精品| 精品久久久精品久久久|