• <tr id="yyy80"></tr>
  • <sup id="yyy80"></sup>
  • <tfoot id="yyy80"><noscript id="yyy80"></noscript></tfoot>
  • 99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

    Constructing Large Scale Cohort for Clinical Study on Heart Failure with Electronic Health Record in Regional Healthcare Platform: Challenges and Strategies in Data Reuse

    2019-07-12 10:02:38DaowenLiuLiqiLeiTongRuanPingHe
    Chinese Medical Sciences Journal 2019年2期
    關(guān)鍵詞:背痛早搏鈉離子

    Daowen Liu, Liqi Lei, Tong Ruan*, Ping He

    1School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China

    2Shanghai Hospital Development Center, Shanghai 200041, China

    Key words: electronic health records; clinical terminology knowledge graph;clinical special disease case repository; evaluation of data quality;large scale cohort study

    Abstract Regional healthcare platforms collect clinical data from hospitals in specific areas for the purpose of healthcare management. It is a common requirement to reuse the data for clinical research. However, we have to face challenges like the inconsistence of terminology in electronic health records (EHR) and the complexities in data quality and data formats in regional healthcare platform. In this paper, we propose methodology and process on constructing large scale cohorts which forms the basis of causality and comparative effectiveness relationship in epidemiology. We firstly constructed a Chinese terminology knowledge graph to deal with the diversity of vocabularies on regional platform. Secondly, we built special disease case repositories (i.e., heart failure repository) that utilize the graph to search the related patients and to normalize the data. Based on the requirements of the clinical research which aimed to explore the effectiveness of taking statin on 180-days readmission in patients with heart failure, we built a large-scale retrospective cohort with 29647 cases of heart failure patients from the heart failure repository. After the propensity score matching, the study group (n=6346) and the control group (n=6346) with parallel clinical characteristics were acquired. Logistic regression analysis showed that taking statins had a negative correlation with 180-days readmission in heart failure patients. This paper presents the workflow and application example of big data mining based on regional EHR data.

    R EGIONAL healthcare platforms collect clinical data from hospitals in specific areas for the purpose of healthcare management. It is a common requirement to reuse the data for clinical researches. Shah et al. set up a typical example of such reuse.1They found associations of type 2 diabetes with a wide range of incident cardiovascular diseases based on regional electronic health records.The dataset they used is from CArdiovascular disease research using LInked Bespoke studies and Electronic health Records (CALIBER) program. Denaxas et al.2gave a detailed description of CALIBER. They integrated data from difference sources, including the primary care data from the Clinical Practice Research Datalink(CPRD), disease registration data from the Myocardial Ischaemia National Audit Project (MINAP), the secondary care data from the Hospital Episodes Statistics(HES), and mortality and social deprivation data from the Office for National Statistics (ONS), upon which they constructed a regional dataset with populations in UK. Rea et al. explored an end to end data processing flow, normalizing disparate data to common objects with standard terminologies. They also implemented a prototype platform to perform transport, data normalization and common phenotyping services on disparate electronic health record (EHR) data. Abrah?o et al.3proposed a method to allow a peproducible cohort extraction for use of secondary data in observational studies. Especially, they described the process of cleaning EHR data and constructing cohort, and finally get a cohort for cardiovascular disease with 27,698 patients. The main difference between our work and theirs is that we focus on the EHRs from regional healthcare platform but their data comes from one hospital merely.

    In order to conduct observational studies with the EHR data more effectively, considerable efforts have been made to apply predictive modeling to mine EHR data. For example, Jin et al.4employed word vectors to model the diagnosis events and predict the risk of heart failure events using long short-term memory network model. Lei et al.5applied recurrent neural network based denoising autoencoder to learn patients’ representation with time series information in EHR data, and used the representation to predict health status, such as mortality prediction, disease risk prediction and similarity analysis. Google6proposed a representation of patients’ EHRs based on the Fast Healthcare Interoperability Resources (FHIR) format.Deep learning models using their representation are capable of accurately and effectively predicting multiple medical events from multiple centers. Shanghai Regional Healthcare Platform has stored EHRs from 38 top hospitals in Shanghai for more than ten years.We tried to find the methodology to reuse the data for clinical researches. We built an interdisciplinary team of clinicians, clinical information experts, computer engineers and data analysts. The following challenges were encountered.

    (1) The overall process. The research process can be driven from three different aspects, i.e., research topic-driven, method-driven and data-driven.For research topic-driven, clinicians raise the research topic firstly. As to data-driven, the computer engineers collect and clean all the required data, and then data analysts analyse the data with help from clinicians for medical discoveries. If the computer programs are intelligent enough, we may use method-driven approach to re-organize the noise data to find possible medical discoveries.

    (2) The terminologies issue. The regional EHR data consists of data from various hospitals. Hospitals have different data dictionaries and each clinician use his own terminologies. For example, although disease coding specifications such as ICD10 are enforced by the government, most clinicians do not use the disease codes correctly. As to the disease names, they may use different expressions to describe the disease. The similar problems happen to procedures, and ICD-9-CM codes are used. The other clinical terminologies such as laboratory tests and symptoms are worse, and they even do not have standards. In such cases, it is diffi-cult to find all patients who have abnormal laboratory test results or special diseases. Therefore, it is an inevitable task to unify the terminologies of hospitals and clinicians to construct a clean disease case repository.

    (3) Data quality and data format issue. EHR data is generated from the patient’s actual diagnoses and treatments, and its purpose does not directly target at scientific researches. In other words, a patient record in good quality for clinical practice may not necessarily meet the needs of scientific researches. The EHR data does not contain enough data columns, for example,death records are not complete, since not all patients died in hospitals. Besides, some data are redundant for research purpose, for example, the disease diagnoses appear in multiple places, such as the discharge abstract, the admission records, and the first page of medical records.

    (4) Data mining and analysis methods. Randomized clinical trials are used in traditional medical researches for comparative effectiveness research.However, there is no clinical control group in the big data scenario. How to construct the causal relationship between medical events becomes important. At the same time, although deep learning methods outperform traditional machine learning methods on disease prediction and treatment decision, their interpretability is relatively poor and difficult to be understood by human.

    In this paper, we address the above problems. We aslo demonstrate the end-to-end process from constructing heart failure repository for building and utilizing a large scale cohort to conduct research of comparative effectiveness on the treatment of heart failure.

    THE OVERALL PROCESS OF CLINICAL BIG DATA MINING BASED ON REGIONAL MEDICAL EHRS

    The EHR data for the regional platform consists of patient data from multiple hospitals. Each hospital integrates data from Hospital Information System(HIS), Laboratory Information System (LIS), Radiology Information System (RIS), and Picture Archiving and Communication System (PACS). Each hospital integrates EHR data and uploads it to the regional platform. Specifically, the regional platform defines a series of data standard, and then hospitals uploads corresponding EHR data to the front-end machine of each hospital, the regional platform collect, merge and store the data into a centralized database. Based on the patient’s identification information, namely Medical Insurance Card or ID card, the patient’s information from different hospitals has been integrated in regional platform. Subsequently, the regional platform operates data masking to protect the patients privacy for consequent research usage. Therefore data privacy is not discussed in this paper. The overall process of clinical big data mining based on the regional EHRs is illustrated in Figure 1.

    Constructing special disease case repositories

    Special disease case repositories are constructed from EHRs on regional platform. We build a repositoriy for heart failure. Firstly, we selected patients by disease names and its ICD codes, and all information related to the patients is extracted. After that, data quality assessment of the repository is executed to check whether the information contained is good enough for subsequent data mining. Evaluation metrics we used included data completeness, data consistency, medical code consistency, and data accuracy. If the special disease case repository meets the assessment requirements, especially about data completeness, we will start the second step, namely data cleaning step.

    Figure 1. The overall process of clinical big data mining based on the regional EHRs. HIS, hospital information system;LIS, laboratory information system; PACS, picture archiving and communication system; RIS, radiology information system; GBDT, gradient boosting decision tree; ICD, International Classification of Diseases; SNOMED, Systematized Nomenclature of Medicine; LOINC, Logical Observation Identifier Names and Code; CRF, case report form.

    Data cleaning

    The second step is performed to clean the special disease case repository. Firstly, clinical experts define the Case Report Forms (CRFs) or disease model which hold all the required features of each patient in order to conduct researches on the disease. Then, feature preprocessing rules are defined according to the information in CRFs. Afterwards, Chinese terminology knowledge graphs are built to normalize different vocabularies of the same terminology by multiple hospitals. The Chinese terminology knowledge graphs learn from various existing ontologies, including Systematized Nomenclature of Medicine-clinical Terminology(SNOMED-CT),7International Classification of Diseases (ICD), and Logical Observation Identifiers Names and Codes (LOINC).8In addition, in order to comply with usage habits of clinicians, we integrate disease,diagnosis and other data from regional health platforms into Chinese terminology knowledge graphs to better normalize terminologies. With the CRFs and the terminology graphs, a cleansed special disease case repository containing all cases and related features is constructed.

    Cohorts construction for particular research topics

    In the third step, cohorts9are built for special research topics. First of all, the goal of the research should be determined by clinical experts, and then inclusion and exclusion criteria10are defined to select patients with the study. Study variables and outcome events are also described. After construction of the cohort, analysis on the baseline characteristics may be performed to get the statistical description about the cohort.

    Model selection and experiments implementation

    The last step is to select the appropriate big data mining model11as well as to design and implement the experiments. Typically, there are two kinds of models,one is based on traditional regression models to discover associations in cohorts, for example, COX model;the other is machine-learning algorithms such as risk prediction. For the former one, we use the Propensity Score Matching (PSM)12to control and eliminate the selection bias caused by confounding variables. For the machine-learning algorithms, there are lots of issues such as feature selection and feature engineering. In our previous publication on about risk prediction on EHR data,13the major challenges encountered were discussed.

    Currently, regional platform in Shanghai contains only structure data, the healthcare management bureaus in shanghai plan to collect clinical text data and follow-up data in the near future. In such cases, natural language processing techniques will be required to structure texts, such as named entity recognition,14entity linking,15and relationship extraction16techniques.

    CONSTRUCTING A CLINICAL SPECIAL DISEASE REPOSITORY BASED ON CHINESE TERMINOLOGY KNOWLEDGE GRAPH

    Complexity of medical terminology in EHRs data

    The same medical terminologies are expressed differently by different clinicians in EHR data. For example,a symptom may have multiple expressions, such as“pre-contraction(期間收縮)”, “premature beat (過(guò)早搏動(dòng))”, and “premature beat (早搏)”. We call them synonyms. Furthermore, a symptom is often modified by different words to express a slightly different semantic meaning, such as “acute back pain (急性背痛)”, and“chronic back pain (慢性背痛)”. They are hyponyms of“back pain(背痛)”.

    While there are many terminology systems such as ICD codes and LOINC, the medical terminology graph lacks commonly used vocabularies for symptoms, laboratory tests, and diseases. Take laboratory indicators as an example, “serum sodium (血清鈉)” has more than 10 different expressions in real EHR data,such as “sodium ion concentration (鈉離子濃度)” , “NA+”,and “arterial blood sodium (動(dòng)脈血鈉)”. Because of the lack of a complete commonly used dictionary for laboratory indicators, different descriptions of the same laboratory indicators by different hospitals have also brought difficulties to regional clinical research.

    Therefore, it is necessary to establish a complete and comprehensive knowledge graph of Chinese medical terminologies, especially to include vocabularies that have been used in daily clinical practice. The vocabularies can be expressed as synonyms, hypernym and hyponym relationships to normalize terminology in the Chinese terminology graph. Then the terminology in EHRs can be linked and normalized to standard expression more easily, which promotes the construction of specific disease repositories.

    Construction of Chinese medical terminology knowledge graph

    A variety of medical classification systems and ontologies have been established abroad. Common medical classification systems and ontologies include Unified Medical Language System (UMLS),17Medical Subject Headings (MeSH),18and SNOMED-CT. There are also several fine-grained ontologies and systems, such as RxNorm19which distinguishes different type of medicine concepts, LOINC for laboratory tests, gene ontology and ICD systems. Moreover, based on these systems, oversea researchers have built multiple medical dataset platforms and published a large number of open-link datasets. The well-known open-link datasets include Linked Open Drug Data (LODD),20Linked Life Data21and Bio2RDF.22The publishment of these openlink datasets has greatly facilitated the researches in the field of medicine.

    Currently, the Chinese version of SNOMED-CT has been released. However, the SNOMED-CT system is quite different from the Chinese clinical system and does not meet the usage habits of Chinese clinicians.To process EHR data of regional platform, the first task is to establish Chinese terminology knowledge graph that is consistent with the practical habit of Chinese healthcare workers.

    Combined with the actual situation of Chinese hospitals, we built a Chinese terminology knowledge graph based on some international medical knowledge graphs.We extended ICD system with commonly used vocabularies for diseases. We choose frequently used terminologies from LOINC and maped them with vocabularies in EHR. We also used some terminologies from medical web sites, as described in our previous paper.13Based on automatic extraction method, we first obtained the different descriptors for diseases, symptoms and laboratory tests from the regional EHR data. Then we utilized synonym detection algorithms23and hypernym detection algorithms24,25to link those different medical terminologies to the terminology knowledge graph. We will introduce the schema diagram of the Chinese terminology knowledge graph and the synonym detection algorithm for laboratory tests in this section.

    Schema diagram

    With the help of clinical experts, we manually created a schema of medical terminology knowledge graph based on medical knowledge, including concepts,conceptual attributes, and hierarchical relationships between concepts. Figure 2 shows the schema of the Chinese medical terminology knowledge graph. We defined eight top-level concepts including symptoms,diseases, medicines, departments, examinations, body structures, laboratory tests, and procedures. “Medicine” is subdivided into two sub-concepts of “Chinese medicine” and “Western medicine”. The concepts are connected through relationships such as “disease-related medicines”, “disease-related departments”, “disease-related tests”, and “finding sites”. Each concept is given several instances which may be defined by“synonym” relationship because they are the same meaning form different sources, or a “hypernym and hyponym” relationship. For example, “meningeal carcinomatosis(腦膜癌)” recorded in a hospital and “meningiomas(腦膜瘤)” in ICD10 are synonymous; those two words are the hyponym of “tumor”.

    The synonym detection algorithm for laboratory tests

    In terms of laboratory tests, aiming at laboratory indicator normalization in regional medical health platform,we proposed a normalization algorithm framework for laboratory indicators. The overall flow of the laboratory indicator normalization algorithm is shown in Figure 3. First, the preprocessing steps for laboratory indicators include unit normalization and extracting indicators reference. Then, using the character features of the laboratory indicators, the density-based clustering algorithm is used to cluster different laboratory indicators into groups to narrow the scope of the laboratory indicators for normalization. A standard name is determined for each group of laboratory indicators, and binary classification algorithms are used to find synonyms of the standard name within the group. For the remaining non-synonym indicators, a new standard name is selected, and the binary classification algorithm is used to search for the synonymous indicators.This process iterates till all groups are synonymous or only one laboratory indicator remains in the group.Finally, the clinical experts check and correct the laboratory indicators normalization results.

    It should be noted that clustering algorithm has two problems because it is an unsupervised learning process. 1) The laboratory indicators are clustered into one group owing to similar names or similar abbreviations, but they actually have different medical meanings; 2) some outliers are not core objects, so that they are not clustered. Therefore, the clustering results need post-processing. Post-processing steps are as follows:

    Figure 2. Schema diagram of Chinese medical terminology knowledge graph.

    Figure 3. The framework for normalization of laboratory indicators.

    1. Unit verification. Assuming that the units of the synonym indicators are the same, unit verification can be performed for each group, and the indicators of different units will be separated into different groups.

    2. Outlier recommended. For each outlier that are not clustered, it is likely to be a completely new laboratory indicator because the outlier is far away from other groups.

    The results of laboratory indicators clustering are shown in Table 1. It can be seen that the Density-Based Clustering Algorithm (DBSCAN) has significantly higher F1-score than the other four clustering algorithms, and the improvement range is above 10%.However, the recall of our method can reach 91.36%,and the precision is not good enough. In order to improve precision, it is necessary to conduct the binary classification mapping.

    To investigate the influence of different features and the performance of different classification models,we select the various combinations of three features including name, abbreviation (Abbr.) and reference value (Ref.). They are compared with different classification models such as Logistic Regression (LR), naive Bayes (NB), k-Nearest Neighbor (kNN), Support Vector Machine (SVM), Random Forest (RF), and Gradient Boosting Decision Tree (GBDT). By comparing the experimental results with F1-score, the GBDT algorithm is superior to other classification algorithms, so the GBDT model is used for binary classification.

    We compared our method with the knowledge graph fusion method (KG Fusion), diagnostic alignment method (Diag. Alignment) and knowledge base alignment method (KB Alignment). The results are shown in Table 2. Although our method is slightly lower in precision than KB Alignment method, it has the best recall and F1-score.

    With the help of synonym detection algorithms and hypernym detection algorithms, we have constructed the Chinese terminology knowledge graph.The data we fetched from regional health platform contains 57,729 disease phrases. Though ICD system contains 31,898 unique diseases, we only match 9,995 diseases by synonym detection algorithm. Besides, we supplement 28,692 hyponyms by hypernym detection algorithm. In regional health platform, there exists 12,624 original laboratory test concepts. Filtered and merged by synonym detection algorithm, there exists 2,395 unique concepts. In addition, the Chinese terminology knowledge graph also contains 35,279 symptom concepts and 13,370 operation concepts which are manually checked by clinicians.

    Table 1. Performance comparison of different clusteringalgorithms

    Table 2. Performance comparison with existing entity alignment methods

    The construction and data cleaning of clinical heart failure repository

    In order to analyze special disease, it is preferred to get all data stored in one repository. Here, we take heart failure for an example. There are three steps in the construction and data cleaning of the heart failure repository:

    Step1: Determining patients of heart failure

    The patients are selected based on the ICD codes and the disease names “heart failure”. Considering the different versions of the hospital information systems, all the codes related to heart failure in ICD9 and ICD10 are used to extract the patients. However, disease names of heart failure recorded in EHRs may not have a corresponding ICD code. It is difficult to extract all the medical records of the disease by using its ICD code alone, so the disease name of heart failure and its synonyms are considered as well. This process can be assisted by Chinese terminology knowledge graph that we built before.

    Step 2: Identifying rules of feature preprocessing

    The definition of the feature preprocessing rules is derived from the information in case report forms(CRFs).26CRF is a specialized document in clinical research, but the terminologies in CRF are not totally the same as those in the literatures. So the terms in clinical record texts need to be normalized. Meanwhile,the CRF can also be regarded as a disease model to describe all the possible related features. However, we use the term CRF instead of disease model because it is easier to understand by clinicians.

    Part of Heart Failure CRF is shown in Table 3.The first column describes the category of features in CRF, such as population information, prescription, and laboratory test. The second column records the features defined by clinical experts in the CRF of heart failure. The third column introduces the format of possible feature values. For example, the information of heart function level is necessary for heart failure patients. It can be extracted from the first page of medical record.

    According to the CRF, the preprocessing rules of regional data are defined in Table 4. The first column is the source table name in regional EHR database,and the second column is the feature name of the source table from which the target features is extracted. The preprocessing rules are described in the third column. For instance, we obtain the hospitalization date and birth date in patient information table, and the patient’s age equals to hospitalization date minus birth date. Similarly, the heart function level should be derived from the diagnostic instructions in diagnostic details table, and heart function level I, II, III, and IV are mapped as “1”, “2”, “3”, and “4” respectively.

    Step 3: data cleaning based on terminology normalization

    Based on the preprocessing rules, we obtain the information of medications, diagnoses, procedures and laboratory tests that the doctor wishes to get from the EHR data. There are two types of data cleaning methods. One is to obtain information directly from the special case repository, such as the results of laboratory tests, diagnoses and medications. In particular, we need to normalize the name of medical terminologies before extracting information directly. The synonym and hypernym and hyponym relationships of Chinese terminology knowledge graph are used in this step.The second data cleaning method is to perform calculations on some features to obtain derived information.For example, the information of readmission time does not exist in the special case repository, and it can be derived by certain calculations (i.e., the next admission date minus the discharge date).

    EHR data quality assessment

    Several factors may lead to quality problem of the special disease case repository, which prevents the disease case repository being reused. For example, inconsistency and incompleteness of EHR data. Therefore,data quality assessment27of EHRs is a crucial step forclinical analysis.The quality assessment processes we use in this paper has been proposed in a special paper.28In detail, the process consists of six steps: (1) using the Delphi process29,30to collect assessment requirements; (2) identifying and collecting EHR data based on the assessment requirements, and the dataset (i.e.,the special disease case repository) is constructed to be evaluated; (3) mapping assessment requirements to the dataset; (4) proposing quality assessment met

    Table 3. Sample of case report form for heart failure

    rics. Metrics are selected or defined based on the purpose of using the dataset; (5) performing data quality assessment. Each quality assessment metric is given a score based on the scoring criteria; (6) analyzing the assessment results. The quality of the dataset is analyzed to determine if the dataset is suitable for research.

    A part of evaluation contents is described in Table 5. Take the disease code for an example, we evaluate the completeness and consistency of it. In details,patients in heart failure repository must have the information of disease code to identify the realated disease.Moreover, the expressions of the disease code need to satisfy the Chinese standrad.

    Table 4. The preprocessing rules to convert features from source table to target CRF

    DATA ANALYSIS BASED ON COHORTS

    Cohort construction based on heart failure repository

    Research topic is defined according to the requirements of clinicians who design the study. For instance,they want to find out the effect of taking statins or traditional Chinese medicine in patients with heart failure. Although benefits of statin treatment have been demonstrated in many patient groups, its effects in heart failure patients with reduced ejection fraction are still controversial.31Therefore, we conduct an experiment to verify whether statins are associated with improved outcomes utilizing the regional EHR data.

    Based on this goal, we define the inclusion and exclusion criteria to construct a cohort based on the EHR data in the heart failure repository that we built before. The heart failure repository has totally 178,628 patients from 38 top hospitals in Shanghai, among which, 75,598 patiens from January 2012 to June 2016 was used for the research. The inclusion criteria for the cohort are : 1) patients are 40 years old or older; 2)patients have at least two inpatient records. Patients who have only one inpatient record are excluded because it is impossible to determine the time of readmission for those patients. Figure 4 shows the flow of patients selection for heart failure cohort based on the heart failure repository. The cohort of heart failure resulted in a dataset of 29,647 patients, which is 16.6%of all patients in the heart failure repository.

    Table 5. Evaluation contents of heart failure repository

    Figure 4. The flowchat of patients selection for the cohort of heart failure.

    Comparison of therapeutic effects based on propensity score analysis

    As the research aims to investigate the effect of taking statins in heart failure patients, based on this objective, whether patients take statins is the study variable, and patient’s age, gender, medication, and other information are confounding variables. The outcome event is 180-day readmission. The time period between the date of discharge and the date of next admission is the readmission time. If the readmission time exceeds 180 days, the label of 180-day readmission is set to 0, otherwise it is set to 1.

    According to the study variable, the cohort patients population is divided into the study group(i.e.,patient who take statins) and control group (i.e.,patients who do not take statins). The ideal situation is that the confounding variables between the study group and the control group are parallel. Unfortunately,statisital analysis found that the value of confounding variables in the two groups were typically unparalles,>which may affect the outcome event as well.

    To eliminat the affect of the confusing variables,we conducted the propensity score matching (PSM)proposed in the work of Caliendo et al.32The five steps are performed as follows: (1) Run logistic regression analysis and get the propensity score; (2)check that propensity score is balanced across study group (use Statin) and the control groups;(3) match each patient in the study group to one or more patients in the control group on propensity score; (4)verify that confounding variables are balanced across study and control groups in the matched sample;(5)multivariate analysis based on new sample. We used all the records of patients who met the inclusion and exclusion criteria in executing PSM. We used Caliper matching33in this paper to reduce bias. In particular,we randomly extracted a patient namely Pafrom the study group, and then selected one or more patients Pbin the control group by the following condition: the difference between the propensity score of Paand Pbmust within the given caliper (i.e., a threshold such as 0.05). We repeated the operation until all the patients in the study group got the matched ones in the control group.

    As shown in Figure 5, there is a significant difference in the propensity scores between treatment group and control group for the original cohort population,which indicates the unbance of the confounding variables between the two groups. The propensity scores of the two group in the mached cohort were parallel,although the total number of patients decreased from 29,647 to 12,698.

    Figure 5. Box plot presentation of propensity scores for statin use in the unmatched and matched cohorts. Boxes represent median and interquartile range; whiskers represent minimum and maximum (if not outliers). Outliers are displayed with circles and are defined as observations>1.5 times the interquartile range from the first or third quartile, respectively.

    Logistic regression analysis was then performed based on the matched cohort. The P value of taking statins on patient’s 180-day readmission is 0.0123. We used the same method to match and analyze patients who take Chinese medicine, and P value of taking Chinese medicine on the patient’s 180-day readmission is 0.1798 instead. Taking significance level as 0.05, the result showed that taking statins had a significant effect on 180-day readmission for patients with heart failure. The logistic regression coefficient was -0.2239, which represents a negative correlation between taking statin and 180-day readmission. That is to say, taking statins can reduce the risk of 180-day readmission.

    SUMMARY

    Although there are huge amount of EHR data on regional health platform, the medical terminologies of each hospital are extremely inconsistent. To address these shortcomings, this paper presents the workflow and application example of big data mining based on regional EHR data.The major contributions of our work are as follows:

    We present a methodology and an overall process for second-use of the electronic healthcare data on a regional platform. The methodology combines topic-driven approach and data-drive approach. Clinicians give CRFs to describe all related medical events about a special disease, and computer programs extract data from regional platform according to the CRFs to form special disease case repositories. Then, a data quality profile is returned to the clinicians to give them a detailed understanding of the data. The clinicians can conduct different clinical studies based on these data,such as cross-sectional studies, longitudinal studies,and artificial intelligence applications studies such as disease risk predictions. In this paper, we focus on building prospective cohort since it forms the basis of discovering associations between events.

    The unified terminology system is an important basis for dealing with the diversity of vocabulary issues. We construct a Chinese clinical terminology knowledge graph containing symptoms and hyponymies between clinical terminologies. We build the special disease case repositories with the knowledge graph selecting special diseases and normalizing the data.

    For future work, to keep the special disease case repository containing more complete information, we may use natural language processing techniques to process medical texts in regional data. Follow-up data may be collected as well when there is data missing in the special disease case repository. In addition, more big data mining algorithms will be applied to clinical researches.

    Conflict of intrests statement

    All authors declared no conflicts of interest.

    猜你喜歡
    背痛早搏鈉離子
    肩背痛竟然也是肺癌信號(hào)
    祝您健康(2022年1期)2022-01-10 04:56:23
    SWAN在線(xiàn)鈉離子分析儀的使用及維護(hù)研究
    查出早搏,不等于得了心臟病
    背痛, 你需要 了解更多
    索羅斯的背痛和梅西的點(diǎn)球
    查出早搏,不等于得了心臟病
    后背痛怎么辦
    大家健康(2016年4期)2016-12-23 00:47:16
    不必談“早搏”色變
    查出早搏,不等于得了心臟病
    鈉離子通道與慢性心力衰竭
    免费女性裸体啪啪无遮挡网站| 国产深夜福利视频在线观看| 欧美 亚洲 国产 日韩一| 18+在线观看网站| 国产欧美日韩一区二区三区在线| 久久国内精品自在自线图片| 中文字幕av电影在线播放| 国产97色在线日韩免费| 国产一区二区在线观看av| 中文字幕人妻熟女乱码| 国产精品一区二区在线观看99| 久久狼人影院| 自线自在国产av| 日韩 亚洲 欧美在线| 青青草视频在线视频观看| 久久午夜综合久久蜜桃| 精品久久蜜臀av无| 免费在线观看黄色视频的| 69精品国产乱码久久久| 亚洲精品aⅴ在线观看| 蜜桃在线观看..| 欧美av亚洲av综合av国产av | 边亲边吃奶的免费视频| 欧美日韩一区二区视频在线观看视频在线| 国产精品.久久久| 亚洲精品美女久久久久99蜜臀 | 国产成人精品婷婷| 日本wwww免费看| 亚洲伊人久久精品综合| 国产精品一国产av| 麻豆av在线久日| 伊人亚洲综合成人网| 人体艺术视频欧美日本| 久久久久久久久久久免费av| 老汉色av国产亚洲站长工具| 蜜桃在线观看..| 国产精品不卡视频一区二区| 亚洲男人天堂网一区| 欧美日本中文国产一区发布| 欧美日韩综合久久久久久| 久久久国产精品麻豆| 亚洲中文av在线| 18禁国产床啪视频网站| 亚洲精品一二三| 午夜福利视频在线观看免费| 一个人免费看片子| 午夜影院在线不卡| av视频免费观看在线观看| av在线app专区| 在线观看人妻少妇| 精品亚洲成国产av| 黑人巨大精品欧美一区二区蜜桃| 婷婷成人精品国产| 亚洲,一卡二卡三卡| 国产高清国产精品国产三级| 国产精品 欧美亚洲| 91久久精品国产一区二区三区| 久久久久视频综合| av一本久久久久| 亚洲精品国产一区二区精华液| 亚洲综合色网址| 国产精品人妻久久久影院| 亚洲成国产人片在线观看| 国产一区亚洲一区在线观看| 18禁裸乳无遮挡动漫免费视频| 亚洲成国产人片在线观看| 国产亚洲欧美精品永久| 夫妻午夜视频| 欧美另类一区| 国产片内射在线| 我的亚洲天堂| 啦啦啦中文免费视频观看日本| 久久97久久精品| 久久午夜福利片| 婷婷色综合www| 黄色视频在线播放观看不卡| 中文乱码字字幕精品一区二区三区| 97精品久久久久久久久久精品| 国产日韩欧美视频二区| 妹子高潮喷水视频| 亚洲av在线观看美女高潮| 日日啪夜夜爽| 久久精品熟女亚洲av麻豆精品| 国产av一区二区精品久久| 国产精品二区激情视频| 成人亚洲欧美一区二区av| 大陆偷拍与自拍| 亚洲第一区二区三区不卡| 纵有疾风起免费观看全集完整版| 视频在线观看一区二区三区| 欧美日韩视频精品一区| 五月伊人婷婷丁香| 少妇的丰满在线观看| √禁漫天堂资源中文www| 一级毛片电影观看| 精品少妇久久久久久888优播| 天美传媒精品一区二区| 美女xxoo啪啪120秒动态图| 午夜福利视频精品| 又黄又粗又硬又大视频| 丝袜脚勾引网站| 热re99久久精品国产66热6| 伊人久久大香线蕉亚洲五| 国产深夜福利视频在线观看| 精品一品国产午夜福利视频| 免费看不卡的av| 捣出白浆h1v1| 夫妻性生交免费视频一级片| 男女下面插进去视频免费观看| 最新的欧美精品一区二区| 免费观看a级毛片全部| 久久久久视频综合| 成人18禁高潮啪啪吃奶动态图| 好男人视频免费观看在线| 亚洲欧美中文字幕日韩二区| 久久精品久久久久久久性| 女性被躁到高潮视频| 男人爽女人下面视频在线观看| 侵犯人妻中文字幕一二三四区| 啦啦啦啦在线视频资源| 秋霞在线观看毛片| 晚上一个人看的免费电影| 女人高潮潮喷娇喘18禁视频| 久久毛片免费看一区二区三区| 国产欧美亚洲国产| 色94色欧美一区二区| 精品国产一区二区三区久久久樱花| 成人毛片a级毛片在线播放| 亚洲熟女精品中文字幕| 精品久久久久久电影网| 久久久久视频综合| 色视频在线一区二区三区| 日韩欧美精品免费久久| 亚洲国产精品成人久久小说| 免费看av在线观看网站| 国产日韩欧美亚洲二区| av视频免费观看在线观看| 国产在线免费精品| 精品国产乱码久久久久久男人| 亚洲激情五月婷婷啪啪| 日韩中字成人| 大片电影免费在线观看免费| 日本av手机在线免费观看| 亚洲四区av| a 毛片基地| 天天躁夜夜躁狠狠躁躁| 在线观看免费视频网站a站| 亚洲在久久综合| 天堂中文最新版在线下载| 亚洲精品日韩在线中文字幕| 99香蕉大伊视频| 成人国产av品久久久| 极品人妻少妇av视频| 99香蕉大伊视频| 午夜久久久在线观看| 国产免费视频播放在线视频| 人妻人人澡人人爽人人| www日本在线高清视频| 国产日韩欧美视频二区| 天天躁狠狠躁夜夜躁狠狠躁| kizo精华| 伦精品一区二区三区| 午夜福利在线观看免费完整高清在| 麻豆av在线久日| 桃花免费在线播放| www.熟女人妻精品国产| 日日摸夜夜添夜夜爱| 午夜激情av网站| tube8黄色片| 考比视频在线观看| 免费大片黄手机在线观看| 午夜福利在线观看免费完整高清在| 日韩av不卡免费在线播放| 少妇人妻 视频| 国产片内射在线| 久久午夜福利片| 侵犯人妻中文字幕一二三四区| 91成人精品电影| 亚洲第一青青草原| 精品国产露脸久久av麻豆| 91国产中文字幕| 国产精品秋霞免费鲁丝片| 国产成人精品婷婷| 久久 成人 亚洲| 久久久久精品性色| 熟妇人妻不卡中文字幕| 成人午夜精彩视频在线观看| 国产在线视频一区二区| 中文字幕人妻丝袜制服| 亚洲熟女精品中文字幕| 精品午夜福利在线看| 丰满乱子伦码专区| 久久97久久精品| 在线免费观看不下载黄p国产| 天天躁狠狠躁夜夜躁狠狠躁| 欧美+日韩+精品| 男女下面插进去视频免费观看| 成人国产麻豆网| √禁漫天堂资源中文www| 日日爽夜夜爽网站| 在现免费观看毛片| 亚洲av综合色区一区| av在线老鸭窝| 精品少妇黑人巨大在线播放| 久久精品国产亚洲av高清一级| 大香蕉久久成人网| 国产男人的电影天堂91| 色视频在线一区二区三区| 日韩免费高清中文字幕av| 如何舔出高潮| 亚洲第一av免费看| 九草在线视频观看| 日韩av在线免费看完整版不卡| 少妇人妻久久综合中文| 亚洲国产精品一区三区| 免费在线观看视频国产中文字幕亚洲 | 成年美女黄网站色视频大全免费| 麻豆av在线久日| av片东京热男人的天堂| 色哟哟·www| 午夜福利影视在线免费观看| 亚洲国产最新在线播放| 亚洲精品日本国产第一区| 午夜精品国产一区二区电影| 考比视频在线观看| 熟女电影av网| 超色免费av| 亚洲内射少妇av| 久久久久久久精品精品| 下体分泌物呈黄色| 999久久久国产精品视频| 亚洲精品av麻豆狂野| 国产精品一国产av| 一级毛片电影观看| 亚洲婷婷狠狠爱综合网| 国产日韩欧美视频二区| 久久精品国产鲁丝片午夜精品| 国产不卡av网站在线观看| 一级毛片黄色毛片免费观看视频| 91国产中文字幕| 日本欧美国产在线视频| 免费黄色在线免费观看| 国语对白做爰xxxⅹ性视频网站| 日本午夜av视频| 大话2 男鬼变身卡| 汤姆久久久久久久影院中文字幕| 精品亚洲成国产av| 日韩欧美一区视频在线观看| 欧美亚洲日本最大视频资源| 热99久久久久精品小说推荐| 日韩人妻精品一区2区三区| 亚洲欧洲日产国产| 免费日韩欧美在线观看| 国产爽快片一区二区三区| 国产在线视频一区二区| 成人手机av| 久久午夜综合久久蜜桃| 亚洲欧美中文字幕日韩二区| 午夜影院在线不卡| 国产有黄有色有爽视频| 少妇被粗大猛烈的视频| 人人澡人人妻人| 亚洲精品成人av观看孕妇| 精品少妇一区二区三区视频日本电影 | 国产不卡av网站在线观看| 黄频高清免费视频| 极品人妻少妇av视频| 免费观看性生交大片5| 亚洲欧美日韩另类电影网站| 国产精品久久久av美女十八| 久久久久久久久久人人人人人人| 亚洲色图 男人天堂 中文字幕| 大片电影免费在线观看免费| 日韩欧美精品免费久久| 黑人巨大精品欧美一区二区蜜桃| 久久久久国产网址| 免费高清在线观看视频在线观看| 777米奇影视久久| 日本欧美国产在线视频| 一级爰片在线观看| 美女脱内裤让男人舔精品视频| 免费在线观看黄色视频的| 久久人人爽av亚洲精品天堂| 麻豆精品久久久久久蜜桃| 午夜影院在线不卡| 狠狠婷婷综合久久久久久88av| 亚洲国产av影院在线观看| 观看美女的网站| 久久人人爽人人片av| 欧美激情 高清一区二区三区| 亚洲精品自拍成人| 日本猛色少妇xxxxx猛交久久| 亚洲欧美中文字幕日韩二区| 日韩一区二区视频免费看| av网站免费在线观看视频| 欧美精品av麻豆av| 在线免费观看不下载黄p国产| 午夜免费男女啪啪视频观看| 国产黄频视频在线观看| 亚洲情色 制服丝袜| 亚洲国产色片| 国产xxxxx性猛交| 老司机影院毛片| 美女国产视频在线观看| 美国免费a级毛片| 欧美av亚洲av综合av国产av | 国产免费现黄频在线看| 亚洲av成人精品一二三区| 免费观看无遮挡的男女| 国产爽快片一区二区三区| 国产白丝娇喘喷水9色精品| 精品福利永久在线观看| 亚洲av男天堂| 婷婷色综合www| 欧美97在线视频| 亚洲精品aⅴ在线观看| 午夜久久久在线观看| 欧美精品亚洲一区二区| 少妇的逼水好多| 青草久久国产| 校园人妻丝袜中文字幕| 中文字幕制服av| 久久久久久久久久人人人人人人| 日韩中文字幕视频在线看片| 欧美人与性动交α欧美软件| 美女中出高潮动态图| 中文天堂在线官网| 亚洲av电影在线观看一区二区三区| 久久久久久免费高清国产稀缺| 久久精品国产亚洲av涩爱| 中文字幕人妻熟女乱码| 大片免费播放器 马上看| 波多野结衣av一区二区av| 久久99一区二区三区| 男女国产视频网站| 麻豆精品久久久久久蜜桃| 国产欧美亚洲国产| 久久久精品国产亚洲av高清涩受| 亚洲精品国产av成人精品| 精品一区二区三卡| 久久99一区二区三区| 午夜久久久在线观看| av女优亚洲男人天堂| 成年女人在线观看亚洲视频| 国产精品 国内视频| 国产男女内射视频| 三级国产精品片| 精品99又大又爽又粗少妇毛片| 在线观看国产h片| 欧美日韩亚洲高清精品| 成人亚洲精品一区在线观看| 国产麻豆69| 欧美精品亚洲一区二区| 国产一区二区激情短视频 | 精品久久久精品久久久| 久久久久精品性色| 欧美成人午夜免费资源| 在线观看一区二区三区激情| 亚洲成av片中文字幕在线观看 | 在线天堂最新版资源| 男人操女人黄网站| 熟女少妇亚洲综合色aaa.| 热re99久久国产66热| 一级黄片播放器| 在线观看一区二区三区激情| 伦理电影大哥的女人| 国产熟女午夜一区二区三区| 丰满乱子伦码专区| 丰满少妇做爰视频| 蜜桃国产av成人99| 精品一品国产午夜福利视频| 大香蕉久久网| 一级,二级,三级黄色视频| 可以免费在线观看a视频的电影网站 | 尾随美女入室| 99精国产麻豆久久婷婷| 69精品国产乱码久久久| 午夜老司机福利剧场| av线在线观看网站| 午夜日本视频在线| 亚洲婷婷狠狠爱综合网| 久久国产精品男人的天堂亚洲| 欧美亚洲 丝袜 人妻 在线| 精品卡一卡二卡四卡免费| 80岁老熟妇乱子伦牲交| 久久精品国产自在天天线| 免费在线观看视频国产中文字幕亚洲 | 国产亚洲av片在线观看秒播厂| 男女边摸边吃奶| 男男h啪啪无遮挡| 免费少妇av软件| 免费在线观看黄色视频的| 欧美日韩精品网址| 日韩,欧美,国产一区二区三区| 色吧在线观看| 久久久精品区二区三区| 午夜福利在线观看免费完整高清在| 在线观看美女被高潮喷水网站| 久久久久久久亚洲中文字幕| 街头女战士在线观看网站| 亚洲欧美一区二区三区久久| 18禁国产床啪视频网站| 国产一区二区在线观看av| 一级a爱视频在线免费观看| 国产在线免费精品| av网站在线播放免费| 韩国高清视频一区二区三区| 美女高潮到喷水免费观看| 国产亚洲欧美精品永久| 18禁动态无遮挡网站| 伊人亚洲综合成人网| 啦啦啦在线免费观看视频4| 一区福利在线观看| 日韩中字成人| 免费日韩欧美在线观看| 久久久久精品人妻al黑| 精品午夜福利在线看| 老汉色∧v一级毛片| 久久99一区二区三区| 亚洲 欧美一区二区三区| 国产免费又黄又爽又色| 观看av在线不卡| 不卡av一区二区三区| 亚洲精品美女久久av网站| 极品人妻少妇av视频| 一二三四在线观看免费中文在| 在线天堂中文资源库| 久久久久国产一级毛片高清牌| 一边摸一边做爽爽视频免费| 久久精品人人爽人人爽视色| 精品一区在线观看国产| 黑人巨大精品欧美一区二区蜜桃| 久久久亚洲精品成人影院| av在线播放精品| 国产野战对白在线观看| 国产精品偷伦视频观看了| 王馨瑶露胸无遮挡在线观看| 国产淫语在线视频| 最新的欧美精品一区二区| 国产免费一区二区三区四区乱码| h视频一区二区三区| 中文字幕人妻熟女乱码| 9热在线视频观看99| 大香蕉久久网| 多毛熟女@视频| 乱人伦中国视频| 最新的欧美精品一区二区| 在现免费观看毛片| 黄色视频在线播放观看不卡| 国产成人精品福利久久| 欧美少妇被猛烈插入视频| 亚洲国产精品国产精品| 只有这里有精品99| 热99国产精品久久久久久7| 欧美+日韩+精品| 亚洲av.av天堂| 亚洲精品成人av观看孕妇| 中文字幕另类日韩欧美亚洲嫩草| 我的亚洲天堂| 午夜福利乱码中文字幕| 精品一区二区三区四区五区乱码 | 五月开心婷婷网| 搡老乐熟女国产| 国产精品麻豆人妻色哟哟久久| av免费观看日本| 最近的中文字幕免费完整| av.在线天堂| 久久 成人 亚洲| 亚洲男人天堂网一区| 午夜激情久久久久久久| 黄色 视频免费看| 一级,二级,三级黄色视频| 亚洲欧美成人综合另类久久久| 麻豆乱淫一区二区| 国产女主播在线喷水免费视频网站| 少妇猛男粗大的猛烈进出视频| 国产成人一区二区在线| 色哟哟·www| 国产日韩欧美亚洲二区| 欧美日韩精品成人综合77777| 亚洲欧美精品综合一区二区三区 | 亚洲欧美成人综合另类久久久| 麻豆乱淫一区二区| 天天操日日干夜夜撸| 国产免费一区二区三区四区乱码| 亚洲在久久综合| 久久久精品94久久精品| 国产一区有黄有色的免费视频| 久久这里只有精品19| 韩国高清视频一区二区三区| av网站免费在线观看视频| av国产久精品久网站免费入址| 日本黄色日本黄色录像| 亚洲精品国产色婷婷电影| 日韩视频在线欧美| 菩萨蛮人人尽说江南好唐韦庄| 国产精品蜜桃在线观看| 91国产中文字幕| 亚洲精品成人av观看孕妇| 男女免费视频国产| 午夜福利影视在线免费观看| 久久久国产精品麻豆| 青春草亚洲视频在线观看| 国产日韩欧美亚洲二区| 亚洲av成人精品一二三区| 亚洲国产欧美日韩在线播放| 亚洲一区二区三区欧美精品| 亚洲国产色片| 天堂中文最新版在线下载| 99久久综合免费| 欧美激情极品国产一区二区三区| 色视频在线一区二区三区| 亚洲精品一二三| 91久久精品国产一区二区三区| 青春草视频在线免费观看| 一本色道久久久久久精品综合| 女人被躁到高潮嗷嗷叫费观| 汤姆久久久久久久影院中文字幕| 午夜激情av网站| 亚洲一码二码三码区别大吗| 免费日韩欧美在线观看| 久久久久久久久久久免费av| 免费观看性生交大片5| 91精品三级在线观看| 中文欧美无线码| 国语对白做爰xxxⅹ性视频网站| 亚洲av电影在线进入| 国产精品av久久久久免费| 少妇猛男粗大的猛烈进出视频| 精品视频人人做人人爽| 中文字幕人妻熟女乱码| 97精品久久久久久久久久精品| 午夜福利网站1000一区二区三区| 欧美 亚洲 国产 日韩一| 精品国产露脸久久av麻豆| 亚洲 欧美一区二区三区| 亚洲第一青青草原| 国产精品欧美亚洲77777| 久久久国产欧美日韩av| 菩萨蛮人人尽说江南好唐韦庄| 亚洲欧美日韩另类电影网站| 男的添女的下面高潮视频| 国产精品久久久久成人av| 久久女婷五月综合色啪小说| 高清不卡的av网站| 日韩欧美一区视频在线观看| 久久久国产一区二区| 日韩制服丝袜自拍偷拍| 波多野结衣一区麻豆| 国产精品99久久99久久久不卡 | 欧美 日韩 精品 国产| 国产不卡av网站在线观看| 26uuu在线亚洲综合色| 卡戴珊不雅视频在线播放| 日韩欧美一区视频在线观看| 街头女战士在线观看网站| 亚洲四区av| 免费看不卡的av| 亚洲伊人久久精品综合| 欧美人与性动交α欧美软件| 国产成人一区二区在线| freevideosex欧美| 婷婷色综合www| www.av在线官网国产| 中文字幕人妻熟女乱码| 一本大道久久a久久精品| 午夜免费观看性视频| 又大又黄又爽视频免费| 国产精品久久久久成人av| 欧美黄色片欧美黄色片| 国产欧美日韩综合在线一区二区| 成人国产av品久久久| 中文字幕人妻丝袜制服| 自拍欧美九色日韩亚洲蝌蚪91| 美女国产高潮福利片在线看| 亚洲成av片中文字幕在线观看 | 99久久中文字幕三级久久日本| 麻豆精品久久久久久蜜桃| 亚洲人成电影观看| 国产精品99久久99久久久不卡 | 18+在线观看网站| 爱豆传媒免费全集在线观看| 免费少妇av软件| 最近最新中文字幕免费大全7| 热re99久久精品国产66热6| 丝袜人妻中文字幕| 王馨瑶露胸无遮挡在线观看| 十八禁网站网址无遮挡| 毛片一级片免费看久久久久| 国产男女超爽视频在线观看| av国产精品久久久久影院| 亚洲经典国产精华液单| 亚洲精品av麻豆狂野| 午夜福利在线免费观看网站| 啦啦啦在线免费观看视频4| 精品酒店卫生间| 国精品久久久久久国模美| 久久女婷五月综合色啪小说| 啦啦啦视频在线资源免费观看| 十八禁网站网址无遮挡| 丰满迷人的少妇在线观看| 十八禁高潮呻吟视频| 国产一区二区三区av在线| 久久人人爽人人片av| 欧美日韩综合久久久久久| 欧美 亚洲 国产 日韩一| 最近中文字幕2019免费版| 美女国产高潮福利片在线看| 18在线观看网站| 一二三四在线观看免费中文在| 香蕉国产在线看| 妹子高潮喷水视频| 看非洲黑人一级黄片| 久久久久久人人人人人| 亚洲国产av影院在线观看|