• <tr id="yyy80"></tr>
  • <sup id="yyy80"></sup>
  • <tfoot id="yyy80"><noscript id="yyy80"></noscript></tfoot>
  • 99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

    Improving Parallel Corpus Quality for Chinese-Vietnamese Statistical Machine Translation

    2018-04-16 06:50:02HuuanhTranYuhangGuoPingJianShuminShiandHeyanHuang

    Huu-anh Tran, Yuhang Guo, Ping Jian, Shumin Shi and Heyan Huang,,

    (1.Department of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China; 2.Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Application, Beijing Institute of Technology, Beijing 100081, China)

    Large scale parallel corpora play an important role in the language study and statistical machine translation (SMT) researches. They mainly provide training data for translation models[1-2]and represent resources for automatic lexical obtainment and enrichment[3-4]as well as for grammar introduction[5].

    Previous researches focused on building bilingual corpora from the Internet. However, few researches about bilingual corpora were related to Vietnamese. Ref.[6] collected French-Vietnamese bilingual data from theVietnamNewsAgencywebsite and aligned the documents based on filtering steps (publishing date, special words and sentence alignments results). They used the Champollion toolkit to align the sentences. Eventually, they obtained 12 100 pairs of bilingual documents and 50 300 bilingual sentence pairs in the news form. Ref.[7] designed a Chinese-Vietnamese bilingual parallel corpus management platform and collected more than 110 000 Chinese-Vietnamese sentence pairs from the Internet. 53 000 words in several fields were aligned manually in their work. In order to extract an English-Vietnamese bilingual corpus from the web, Ref.[8]designed two content-based features: cognates and translation segments. These content-based features together with the structure of the web pages were fed into a machine learning model to extract the bilingual texts. For bilingual e-books sources, they used some forms of linkages between the blocks of the text in two languages. Three steps were adopted: pre-processing, paragraph alignment and sentence alignment.

    All the above works have one feature in common, they have to search for the websites containing bilingual data, and then collect, process and align the paragraphs, sentences and words. These methods may produce a quality corpus but it is difficult to construct the bilingual corpus in a large scale.

    In addition, there are several sentence-alignment tools which have been published, such as Refs.[9-12]. However, all of them do not support Vietnamese sentence alignment from other languages and vice versa.

    Fortunately, there is a kind of large scale multilingual corpus, in ever growing and easily accessible recources available on-line for SMT, i.e., movie subtitles. It’s easy to access a million-sentence scale of a movie or TV subtitles bilingual corpus in various languages pairs, and the best part is that the data possesses natural rough alignment in the sentence level. The work of Refs.[13-15] focused on this direction and all the sources adopted in these works were obtained from Open-Subtitles.

    The Open-Subtitles have many different versions (years: 2011, 2012, 2013, 2016), and the latest is Open-Subtitles-2016. From a linguistic perspective, subtitles cover a wide and interesting breadth of genres, from colloquial language or slang to narrative and expository discourse (e.g. documentaries). Open-Subtitles-2016 includes a total of 1 689 bi-texts extracted from a collection of subtitles containing 2.6 billion sentences (17.2 billion tokens) distributed over 60 languages[16].

    Ref.[16] pre-processed the raw subtitle files to generate XML subtitle files, and then performed a cross-lingual alignment to generate XML alignment files (1 XML file per language pair, encoded as a collection of alignments). Finally, they produced a bilingual corpus in the Moses format.

    Although the amount of this kind of data is very large, there are still too many problems such as sentence mismatching, translation errors, free translations, font errors, etc. In order to take full advantage of this large scale of subtitle corpus, three filtering methods are introduced: ① the sentence length difference, ② the semantic similarity, and ③ the machine learning to pick out the sentence pairs of good quality.

    The rest of this paper is organized as follows.Section 1 presents our proposed model. Section 2 presents a filtering method based on the sentence length. Section 3 presents a filtering method based on machine translation references (based on semantic similarity and machine learning methods). Section 4 describes our experiments and experimental results. Finally, section 5 made the conclusions and future work directions are given.

    1 Proposed Model

    The raw data in this paper is obtained from the Open-Subtitles-2016(http:∥opus.lingfil.uu.se/OpenSubtitles2016.php). To clean up the noises in this corpus, pre-processing are conducted as: ① remove unnecessary symbols such as:, §, #, [ , ], ※, -, *, @, 「,」, 「, 」, ② remove the sentence pairs that contain font errors, ③ remove the sentences that contain English words, and ④ convert traditional Chinese characters to simplified Chinese characters. After pre-processing, a baseline corpus called C0-corpus is obtained. Fig.1 illustrates the architecture of the system and the pre-processing is in the first part of the architecture. In this section the remaining parts of the system will be described in detail.

    C0-corpus is obtained through the pre-processing stage. Next, we do the first filtering which is based on a Chinese-Vietnamese dictionary. At this step, we filter to remove all pairs of sentences that have big differences in length. Then, C1-corpus is obtained. However, there are still some sentences with very poor quality translation; or some translated sentences do not match with the meaning of the original sentences. So, we will apply a further filter to remove those poor-quality translations by employing a method based on a machine translation reference. Since the number of sentence pairs in C1-corpus is still too big, we randomly take 2 000 and 5 000 sentence pairs representatives from the C1-corpus to find the most appropriate threshold values, which will be described below. It is done by combining two methods, namely automatic labelling (based on measures: Cosine, Jaccard, Dice and smoothed-BLEU (Smoothing 3)[17]) and manual labelling. With each measure, we will keep C1-V1 sentence pairs in the C1-coprus to group them into the C2-corpus, if the semantic similarity of V1 and V1-google(V1-google is translated from C1 by using https:∥translate.google.com/) is greater than or equal to the threshold value. Moreover, for utility of the measures from above steps, we use them as features in machine learning methods (support vector machine-SVM, logistic regression-LR) for classification. In the classification, Yes is denoted for good quality sentence pairs, and No is denoted for being removed sentence pairs. The Yes results will be considered as the C2-corpus.

    Fig.1 Architecture of corpus filtering system

    2 Filtering Based on Sentence Length Difference

    2.1 Characteristics of Vietnamese and Chinese

    In terms of language typology, Chinese and Vietnamese are the same type of isolated form, so there are some similarities between them. The basic unit of Vietnamese is syllable. In writing, syllables are separated from each other by a white space. The white space itself cannot be used to determine the word because words often include one or more syllables. Tab.1 presents an example of a Vietnamese sentence and its corresponding Chinese one segmented into syllables and words.

    Tab.1Example of a Chinese-Vietnamese sentence pair segmented into syllables and words

    With some similarities in the type of language as above,it is recognized that Chinese sentences and their Vietnamese translation often have proportional sentence lengths. Therefore, the length of sentences in a sentence pair will be a very good criterion to determine whether they are accurate translations of one another.

    2.2 Sentence relative length based filtering

    As mentioned above, the length of the sentence is a good benchmark to determine whether they are accurately translated from each other. It is intuitive in the bilingual corpus. Tab.2 shows the examples of sentence alignment errors in C0-corpus.

    In detail: ① Sentence alignment errors, where one Vietnamese sentence is aligned with several Chinese sentences (1st row) and vice versa. ② One Chinese sentence is aligned with several Vietnamese sentences (2nd row). ③ A short Chinese sentence is aligned with a very long Vietnamese sentence (3rd row), and vice versa. ④ A long Chinese sentence is aligned to a very short Vietnamese sentence one (4th row). These errors cause the length of Chinese-Vietnamese sentence pairs to have a big difference, which indicates that they are not translated properly.

    Tab.2Examples of sentence alignment errors in C0-corpus

    Here,a Chinese-Vietnamese dictionary with 340 000 word pairs is taken as the reference, the average difference of Chinese-Vietnamese word pairs is calculated by the following formula:

    (1)

    where dicthresholdis the threshold value of the dictionary,lch_wordis the length of Chinese words,lvi_wordis the corresponding length of Vietnamese words, andldicis the number of the word pairs in this dictionary.

    In C0-corpus, the difference of each sentence pair is calculated by

    (2)

    where dif(ch, vi)is the length difference of a Chinese-Vietnamese sentence pairs,lch_senis the length of a Chinese sentence,lvi_senis the length of a Vietnamese sentence, min(lch_sen,lvi_sen) is the smallest value oflch_senandlvi_sen.

    Sentence pairs whose dif(ch, vi)is greater than dicthresholdare removed from C0-corpus. To estimate the impact of sentence length on C1-corups quality, several values of dicthresholdare tried as follows. Then the original dicthresholdvalue is multiplied by different coefficients, such as coefficients of 1, 2, …, 6 (as mentioned in section 4.2.1) to get new threshold values. From each new dicthresholdvalue, a corresponding C1-corpus will be obtained from C0-corpus.

    3 Filtering Based on Machine Translation Reference

    After the previous step, although we have collected sentence pairs in the C1-corpus that have similar sentence lengths, these pairs still contain several mis-translated ones as exemplified below.

    Tab.3 shows some common mis-translations: ① sentence pairs in line 1 and line 2 that are aligned but differ completely in meaning; ② mis-alignment, i.e., the Chinese sentence in line 3 that’s aligned to the Vietnamese sentence in line 4, the Chinese sentence in line 4 is aligned to the Vietnamese sentence in line 5,which also leads to; ③ free translation, i.e., the Vietnamese sentence in line 3 and the Chinese sentence in line 5 have no matching translation aligned to them.

    Tab.3Examples of mis-translated sentences in C1-corpus

    Therefore, in this step we aim to eliminate these sentence pairs, which include translation errors, sentence mismatch, free translations, etc. relying on semantic criterions.

    3.1 Filtering based on semantic similarity

    As the size of the C1-corpus is very big, it is infeasible for manual evaluation of the translating quality of each sentence pair in C1-corpus. Instead, first we randomly sample 2 000 sentence pairs which represent C1-corpus. They are notated as 2K_ch, 2K_vi and manually labelled: Yes for the good quality sentence pairs and No for bad quality sentence pairs that will be removed. These 2 000 sentence pairs are used as a training data set.

    First, Google Translate(https:∥translate.google.com/) is utilized to translate the extracted 2 000 Chinese sentences (2K_ch) to Vietnamese (2K_google). Then, the semantic similarity of the 2 000 sentence pairs (2K_vi, 2K_google) is measured by using the measures: Cosine, Jaccard, Dice, and smoothed-BLEU.

    At each measure, the most appropriate threshold value is identified to differentiate good-bad quality sentence pairs from C1-corpus. Only sentence pairs (C1-V1) which satisfy the semantic similarity of V1 and V1-google are greater than or equal to the threshold value of that measure will be kept in C2-corpus.

    Each Vietnamese sentence pair of the 2K_vi-2K_google set has similarity of 0 if they are semantically completely different and of 1 if they are completely identical. Typically, the range of the similarity is [0,1].

    In this step, we search for the most appropriate threshold value for each measure. The most appropriate threshold value is the value at whichF1reaches max value. Taking Cosine measure for example, the following table is considered (the other measure was similarity performed).

    As the threshold value is in [0, 1], the original temporary threshold value is set att0(=0.0) and the final threshold value at 1. For the original temporary threshold (t0), the corresponding values in theTt0and theNt0columns are calculated as in Tab.4, whereiis the number of sentence pairs that are manually labelled,i=1,2,…,2 000.

    Tab.4 Cosine measure of threshold value

    Here, sentence pairs where Cosine similarity is greater thant0will haveTt0set at Yes, and No otherwise. In theNt0column, we combine Manual labelling column value withTt0column value.Nt0is set at Yes when both these two columns are Yes, and No otherwise. With the other temporary threshold values (fromt1to 1), the corresponding values in theTt1and theNt1columns, etc. are similarity calculated.

    (3)

    (4)

    Finally, we calculate the Precision (P), Recall (R),F1and the threshold value for Cosine measure at temporary thresholdt0as follows:

    (5)

    (6)

    (7)

    Cosinethresholdis calculated as:

    Cosinethreshold=max(F1;j=0,1,2,…,10)

    (8)

    wherej=0,1,2,…,10 corresponds to the temporary threshold values from 0.0 to 1 and with each repeating the threshold is increased by 0.1. If the original temporary threshold values are set at 0.00 or 0.000, (e.g,j=0,1,2,…,100 orj=0,1,2,…,1 000) respectively with each repeat, the threshold is increased by 0.01 or 0.001, etc.

    Cosinethresholdis the threshold value at whichF1reaches a maximum value when combining the two methods of manual and automatic labelling. Similarly, we also calculate the Jaccardthreshold, Dicethresholdand smoothed-BLEUthresholdvalues.

    In order to test the stability of the classifier, similarly, from C1-corpora we also randomly sample 5 000 sentences pairs(5K_ch, 5K_vi) to find the threshold value for each measure. Tab.5 is the corresponding parameters for each measure of 2 000 and 5 000 sentence pairs. These values are the criteria to determine whether a new sentence pair would be labelled Yes or No.

    Tab.5 Each measure and their parameters

    3.2 Filtering based on machine learning

    Above, we have manually labelled (Yes, No) for 2 000 sentence pairs (2K_ch, 2K_vi), 5 000 sentence pairs (5K_ch, 5K_vi), and calculated their similarity (2K_vi, 2K_google) (5K_vi, 5K_google) using the Cosine, Jaccard, Dice and smoothed-BLEU measures (2K_google, 5K-google are translated from 2K-ch, 5K-ch by using https:∥translate.google.com/.). In this step, we build SVM and LR classifiers to conduct machine learning methods, based on these labelled data and measures. The classification features are as follows:

    SVM: Cosine, Jaccard, Dice, smoothed-BLEU, absdif, reldif

    LR: Cosine, Jaccard, Dice, smoothed-BLEU, absdif, reldif

    where absdifand reldifare calculated using the formulas:

    absdif=lch-lvi

    (9)

    (10)

    wherelchis the length of the Chinese sentence andlviis the length of the Vietnamese sentence. absdif, reldifis the absolute and relative difference of each Chinese-Vietnamese sentence pair in the C1-corpus.

    With each C1-corpus corresponding to each threshold value calculated using Eq.(1), we differentiate (Yes/No) sentence pairs from C1-corpus and obtain respective C2-corpus.

    4 Experiments

    4.1 Data setting

    We redraw the subsequent steps of the method in Fig.2 and distinguish the data after each filtering step.

    The initial corpus includes 1 120 000 sentence pairs. After pre-processing (see section 1) and manually selecting relatively good 5 000 sentence pairs, we obtained C0-corpus that consists of 997 424 sentence pairs. We combined 5 000 sentence pairs just extracted with other 5 000 sentence pairs collected from: vietnamese.cri.cn to create develop_set and test_set.

    For the C1-corpus (C1-V1), Google Translate is used to translate Chinese sentences (C1) into Vietnamese sentences (V1-google) and then the measures are used: Cosine, Jaccard, Dice, smoothed-BLEU to measure the similarity of sentence pairs (V1, V1-google). In the experiments below, depending on each threshold value calculated using Eq. (1), from C0-corpus we extract the number of pairs (Chinese, Vietnamese) for each measure (Cosine, Jaccard, Dice, smoothed-BLEU) accordingly to create training data sets for C1-corpus and C2-corpus.

    Next, 2 000 and 5 000 pairs are extracted from the C1-corpus randomly (see section 3.1) to build training_set for the next experiments. Note that the only difference of all the experiments is the training_set, develop_set and test_set are completely identical. Thus, the corpus preparation includes 6 categories in Tab.6.

    Tab.6 Data sets for the experiments

    In this study, Moses[18]is used as a decoder, SRILM is applied to build the language model, Giza++ for word alignment process, and BLEU for scoring the translation.

    4.2 Experimental results

    In order to empirically evaluate the quality of the C0-corpus, C1-corpora and C2-corpora, we used them as the training set for a SMT system based on Moses. First, a SMT system is built for evaluating C0-corpus by the BLEU score. With the train set of 997 424 sentence pairs, the C0-BLEU result is 18.78 (baseline).

    4.2.1C1-BLEU scores

    Filtering by the different thresholds in the filtering step based on sentence length, different C1-corpora can be obtained from C0-corpus. Tab.7 shows the number of sentence pairs of each corpus and the corresponding BLEU score.

    Tab.7 Change of BLEU score with their threshold

    The initial threshold calculated by Eq. (1) is 0.236 (level 1). Then this threshold is multiplied with the coefficients from 2 to 6, corresponding to the threshold values from 0.472 to 1.416. For convenience, the levels (1, 2, …,6) are introduced to indicate the different C1-corpora below.

    Fig.3 shows that, at level 1, C1-BLEU score is 18.41, which is smaller than the baseline C0-BLEU score (18.78). This is due to the reason that the overly strict filtering will discard too many translation pairs and the rest is insufficient for training a translation model.

    Fig.3 Thresholds and their corresponding BLEU scores

    From level 2 onward, at each threshold, most of C1-BLEU scores are higher than the C0-BLEU score and the highest value occurs at level 3. This demonstrates that the filtering step based on sentence length is effective.

    However, from level 4 to level 6,it can be observed that C1-BLEU tends to decrease. The reason is that the filter conditions are being loosened and the amount of data added to the training set at the next level is containing more and more poor quality sentence pairs. Therefore, it is completely logical that the lowest C1-BLEU at level 6 is 19.15.

    4.2.2C2-BLEU scores

    Now, based on the above C1-corpora with various threshold values, further filtering is conducted based on the measures (Cosine, Jaccard, Dice and smoothed-BLEU) of the threshold value and the machine learning methods (SVM, LR). Representatives of thresholds at levels 1, 2, 3, 4, 5, 6 are taken and obtained the results shown in Fig.4.

    This chart (Fig.4) has six intervals for six levels (level 1, 2, … , 6) respectively. The first point of each interval is the value of C1-BLEU score. The six remaining points in each interval are the C2-BLEU scores with the measures:Cosine, Jaccard, Dice, smoothed-BLEU, SVM and LR respectively. The horizontal line is C0-BLEU score.

    Fig.4 Thresholds and their C2-BLEU scores

    According to this chart, when the threshold at level 1, the value of C1-BLEU score and C2-BLEU score are smaller than C0-BLEU score. Due to the fact that at the filtering step based on the length of the sentence, the threshold value is too small (0.236), and too many good quality sentence pairs are removed. Therefore, in the next step, the number of good quality sentence pairs obtained in C1- corpora and C2- corpora is smaller in the C0-corpus, thus driving down BLEU-score.

    Thus, from the level 1 to level 3, C1-BLEU scores tend to increase and reach the highest value at level 3 (19.6), then decrease to level 6 (19.15) in Fig.3. Note that the two data sets (2 000 and 5 000 pairs) we extracted in section 3.1 have no effect on C1-BLEU scores at levels, but only on C2-BLEU scores. Similarly, C2-BLEU scores (with both data set: 2 000 and 5 000 pairs) are also neither good nor stable at level 1 (as above discussed). From level 2 onwards, C2-BLEU scores are always stable and higher than the C1-BLEU scores at the respective levels. C2-BLEU reaches the highest value at level 3 (SVM), which is 20.01 with the data set of 2 000 pairs and level 3 (SVM), which is 20.10 with the data set of 5 000 pairs.

    The BLEU score, percentage of sentence pairs and experimental results are presented in Tab.8.

    Tab.8 Original corpus, C0-corpus, C1-corpus, C2-corpus and their BLEU scores

    Tab.8 compares results of our experiments in different filtering steps (i.e. C1-corpus and C2-corpus at level 3) with the baseline (C0-corpus) and original corpus. From the results, it is clear that the selected data using our method could obtain better quality corpora and get a higher score using small amount of data.

    Noticeably, we could realize the competitive performance of much less data, and this will reduce the computation load in process. To illustrate, C1-corpus uses 74.70% of the baseline data and attain a BLEU score of 19.60. Meanwhile, C2-corpus uses 55.47% (with 2 000 pairs) and 57.70% (with 5 000 pairs) of the baseline data and get a BLEU score of 20.01 and 20.10 (i.e. higher BLEU score in C1-corpus and baseline). It means that only above half of the data can get a competitive performance compared to using all the data.

    5 Conclusions and Future Work

    This paper proposes the filtering methods based on the sentence length difference, the semantic of sentence pairs, and machine learning for the Chinese-Vietnamese parallel corpus construction. In the filtering step based on the sentence length difference, with all of the various threshold values, the C1-BLEU scores (from level 2) are higher than the C0-BLEU score. Next, two manual data sets are used for the remaining two filter steps. In the filtering step based on the semantic similarity of sentence pairs, the filter quality and quantity depend on the threshold values in step 1. The C2-BLEU scores are consistently higher than the C1-BLEU scores.

    In addition, the machine learning methods (SVM, LR) also achieve more stable and higher BLEU scores compared with the single feature methods (based on each measure). Using the manual data sets having the size of 2 000 or 5 000 did not significantly affect C2-BLEUs, as they are extracted randomly to represent the C1-corpora. These methods can also be easily transferred to other language pairs.

    In the future, we will attempt to calculate C2-BLEU scores at the remaining threshold values. It is hopeful that at these thresholds,the C2-BLEU scores could reach values higher than the current value of 20.10.

    [1] Melamed D I. Models of translational equivalence among words[J]. Computational Linguistics,2000,26(2):221-249.

    [2] Jin R, Chai J Y. Study of cross lingual information retrieval using on-line translation systems[C]∥International ACM Sigir Conference on Research & Development in Information Retrieval,2005:619-620.

    [3] Gale W A, Church K W. Identifying word correspondences in parallel texts[C]∥Speech and Natural Language, Proceedings of a Workshop Held at Pacific Grove, California, USA, DBLP, 1991: 152-157.

    [4] Widdows D, Dorow B, Chan C K. Using parallel corpora to enrich multilingual lexical resources[C]∥International Conference on Language Resources & Evaluation, 2002:240-245.

    [5] Kuhn J. Experiments in parallel-text based grammar induction[C]∥Meeting on Association for Computational Linguistics, Association for Computational Linguistics, 2004:470. https:∥dl.acm.org/citation.cfm?id=1218955.1219015.

    [6] Do Thi-Ngoc-Diep, Le Viet-Bac, Bigi Brigitte,et al. Mining a comparable text corpus for a Vietnamese-French statistical machine translation system[C]∥Proceedings of the Fourth Workshop on Statistical Machine Translation, Association for Computational Linguistics, 2009:165-172. https:∥dl.acm.org/citation.cfm?id=1626466.

    [7] Luo L, Guo J Y, Yu Z T, et al. Construction of a large-scale Sino-Vietnamese bilingual parallel corpus[C]∥IEEE International Conference on System Science and Engineering, IEEE, 2014:154-157.

    [8] Le Q H, Le A C. Extracting parallel texts from the web[C]∥Second International Conference on Knowledge and Systems Engineering, IEEE, 2010:147-151.

    [9] Gale W A, Church K W. A program for aligning sentences in bilingual corpora[C]∥Meeting on Association for Computational Linguistics, Association for Computational Linguistics, 1991:177-184.

    [10] Moore R C. Fast and accurate sentence alignment of bilingual corpora[C]∥Conference of the Association for Machine Translation in the Americas on Machine Translation: From Research to Real Users, Springer-Verlag, 2002:135-144.

    [11] Braune F, Fraser A. Improved unsupervised sentence alignment for symmetrical and asymmetrical parallel corpora[C]∥COLING 2010, International Conference on Computational Linguistics, Posters Volume, 23-27 August 2010, Beijing, China, DBLP, 2010:81-89.

    [12] Sennrich R, Volk M. MT-based sentence alignment for OCR-generated parallel texts[C]∥Proc of Amta, 2010: 175-182.

    [13] Tiedemann J. Building a multilingual parallel subtitle corpus[J]. International Journal of Multilingualism, 2009, 11(2):266-268.

    [14] Tiedemann J. Synchronizing translated movie subtitles[C]∥International Conference on Language Resources and Evaluation, Lrec 2008, 26 May-1 June 2008, Marrakech, Morocco, DBLP, 2012:1902-1906.

    [16] Lison Pierre, J?rg Tiedemann. OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles[C]∥ LREC, 2016. https:∥www.duo.uio.no/handle/10852/50459.

    [17] Chen B, Cherry C. A systematic comparison of smoothing techniques for sentence-level BLEU[C]∥The Workshop on Statistical Machine Translation, 2014:362-367.

    [18] Koehn Philipp Hoang. Moses: open source toolkit for statistical machine translation[C]∥Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, Association for Computational Linguistics, 2007:177-180.

    黄色日韩在线| 日本黄大片高清| 国产精品蜜桃在线观看| 一本久久精品| 18禁在线无遮挡免费观看视频| 18禁在线播放成人免费| 黄片wwwwww| 亚洲精品成人久久久久久| av在线观看视频网站免费| 精品人妻偷拍中文字幕| 日韩一本色道免费dvd| av在线蜜桃| av网站免费在线观看视频 | 亚洲国产日韩欧美精品在线观看| 国产成人精品一,二区| 国产在线一区二区三区精| 国产91av在线免费观看| 国产v大片淫在线免费观看| 日韩亚洲欧美综合| 嫩草影院入口| 欧美激情在线99| 国产视频首页在线观看| 久久久国产一区二区| 国产精品人妻久久久影院| 免费电影在线观看免费观看| 欧美bdsm另类| 26uuu在线亚洲综合色| 日韩中字成人| 搡女人真爽免费视频火全软件| 街头女战士在线观看网站| 91午夜精品亚洲一区二区三区| 交换朋友夫妻互换小说| 岛国毛片在线播放| 老汉色av国产亚洲站长工具| 老汉色av国产亚洲站长工具| 捣出白浆h1v1| 亚洲成av片中文字幕在线观看 | 日韩一本色道免费dvd| 久久精品亚洲av国产电影网| 这个男人来自地球电影免费观看 | 国产成人精品久久久久久| 欧美日韩一区二区视频在线观看视频在线| 欧美日韩综合久久久久久| 欧美日本中文国产一区发布| 久久久欧美国产精品| 国产精品国产av在线观看| 赤兔流量卡办理| 国产淫语在线视频| 青春草视频在线免费观看| 国产又色又爽无遮挡免| 午夜免费男女啪啪视频观看| 少妇被粗大的猛进出69影院| 午夜免费男女啪啪视频观看| 精品一品国产午夜福利视频| 国产又爽黄色视频| 午夜福利乱码中文字幕| 18禁国产床啪视频网站| 国产野战对白在线观看| 久久精品国产亚洲av天美| 伊人久久大香线蕉亚洲五| 涩涩av久久男人的天堂| 国产欧美日韩一区二区三区在线| 韩国av在线不卡| 99热全是精品| 久久久久精品性色| 欧美国产精品va在线观看不卡| av不卡在线播放| 在现免费观看毛片| 成年人午夜在线观看视频| 9热在线视频观看99| 我要看黄色一级片免费的| 女性被躁到高潮视频| 欧美精品亚洲一区二区| 少妇 在线观看| 热re99久久精品国产66热6| 欧美日韩亚洲高清精品| 成人二区视频| 美女视频免费永久观看网站| 老汉色∧v一级毛片| 亚洲美女搞黄在线观看| 免费黄网站久久成人精品| 涩涩av久久男人的天堂| 日韩一区二区视频免费看| 久久毛片免费看一区二区三区| 亚洲精品一二三| 十八禁网站网址无遮挡| 大陆偷拍与自拍| 成人毛片a级毛片在线播放| 欧美亚洲 丝袜 人妻 在线| 免费观看a级毛片全部| 桃花免费在线播放| 久久午夜综合久久蜜桃| av又黄又爽大尺度在线免费看| av免费观看日本| 中国三级夫妇交换| 中国国产av一级| 亚洲精品av麻豆狂野| 亚洲欧美一区二区三区久久| 免费在线观看视频国产中文字幕亚洲 | 欧美日韩综合久久久久久| 成人毛片60女人毛片免费| 欧美精品一区二区免费开放| 免费在线观看黄色视频的| 少妇的丰满在线观看| 中文字幕色久视频| 国产视频首页在线观看| 91精品国产国语对白视频| 热99国产精品久久久久久7| 777久久人妻少妇嫩草av网站| 女的被弄到高潮叫床怎么办| 97在线人人人人妻| 两个人免费观看高清视频| 大香蕉久久网| 99精国产麻豆久久婷婷| 亚洲av欧美aⅴ国产| 亚洲色图 男人天堂 中文字幕| 极品人妻少妇av视频| www.熟女人妻精品国产| 91精品国产国语对白视频| 人人妻人人澡人人看| 国产麻豆69| 国产亚洲最大av| 热re99久久国产66热| 亚洲av免费高清在线观看| 亚洲av综合色区一区| 在线观看人妻少妇| 亚洲视频免费观看视频| 久久久久久久久久人人人人人人| 美女脱内裤让男人舔精品视频| 亚洲精品自拍成人| av福利片在线| 国产高清国产精品国产三级| 国产精品久久久久久av不卡| 欧美人与性动交α欧美软件| 侵犯人妻中文字幕一二三四区| 午夜精品国产一区二区电影| 午夜福利视频在线观看免费| 国产成人一区二区在线| 岛国毛片在线播放| 天天操日日干夜夜撸| 日韩中字成人| 国产亚洲午夜精品一区二区久久| 久久青草综合色| 亚洲一区中文字幕在线| 中文字幕亚洲精品专区| 成年女人在线观看亚洲视频| 色94色欧美一区二区| 人成视频在线观看免费观看| 观看av在线不卡| xxx大片免费视频| 两个人看的免费小视频| 精品国产国语对白av| 麻豆乱淫一区二区| 免费黄网站久久成人精品| 午夜福利乱码中文字幕| 亚洲国产日韩一区二区| 中文字幕另类日韩欧美亚洲嫩草| 香蕉国产在线看| 亚洲内射少妇av| 卡戴珊不雅视频在线播放| 女性被躁到高潮视频| 亚洲美女搞黄在线观看| 人妻 亚洲 视频| 国产精品.久久久| 亚洲精品aⅴ在线观看| 欧美日韩视频精品一区| av又黄又爽大尺度在线免费看| 日本av手机在线免费观看| 热99国产精品久久久久久7| 亚洲国产欧美日韩在线播放| 美女午夜性视频免费| 黄色配什么色好看| 1024视频免费在线观看| 国产成人91sexporn| 我的亚洲天堂| 女性被躁到高潮视频| 三上悠亚av全集在线观看| 欧美激情 高清一区二区三区| 成人亚洲欧美一区二区av| 欧美亚洲 丝袜 人妻 在线| 成年女人毛片免费观看观看9 | 国产精品无大码| 各种免费的搞黄视频| 看免费av毛片| 777久久人妻少妇嫩草av网站| 伊人久久国产一区二区| 1024香蕉在线观看| 午夜福利在线观看免费完整高清在| 欧美日韩视频精品一区| 国产精品免费大片| 国产成人a∨麻豆精品| 国产乱来视频区| 国产日韩欧美亚洲二区| 亚洲在久久综合| 一级,二级,三级黄色视频| 美女国产视频在线观看| 天美传媒精品一区二区| 色婷婷久久久亚洲欧美| av在线app专区| 男女无遮挡免费网站观看| 午夜福利在线免费观看网站| 免费高清在线观看视频在线观看| 十分钟在线观看高清视频www| 91在线精品国自产拍蜜月| 国产精品免费视频内射| 免费少妇av软件| 色94色欧美一区二区| 免费高清在线观看视频在线观看| 2018国产大陆天天弄谢| 美女福利国产在线| 另类精品久久| 狠狠精品人妻久久久久久综合| 80岁老熟妇乱子伦牲交| 久久久久久久久久人人人人人人| 91aial.com中文字幕在线观看| 高清av免费在线| 色播在线永久视频| 亚洲成人一二三区av| 亚洲精华国产精华液的使用体验| 免费观看在线日韩| 欧美日韩视频精品一区| √禁漫天堂资源中文www| 99久久精品国产国产毛片| 黄色 视频免费看| 国产伦理片在线播放av一区| 美女大奶头黄色视频| 少妇的丰满在线观看| 久久精品国产综合久久久| 丝袜在线中文字幕| 国产av一区二区精品久久| 欧美黄色片欧美黄色片| 美女福利国产在线| 高清黄色对白视频在线免费看| 美女视频免费永久观看网站| 天堂8中文在线网| 青草久久国产| 色94色欧美一区二区| 国产在线免费精品| 极品人妻少妇av视频| 国产成人a∨麻豆精品| 久久久久久久大尺度免费视频| 黄片小视频在线播放| 91午夜精品亚洲一区二区三区| 亚洲综合色网址| 欧美国产精品va在线观看不卡| 久久精品国产a三级三级三级| 国产在线视频一区二区| 久久99蜜桃精品久久| 多毛熟女@视频| 中文字幕精品免费在线观看视频| 亚洲伊人色综图| 久久久久精品人妻al黑| 久久99蜜桃精品久久| 一级毛片电影观看| 欧美精品亚洲一区二区| 天天影视国产精品| 久久精品人人爽人人爽视色| 2022亚洲国产成人精品| 亚洲一级一片aⅴ在线观看| 咕卡用的链子| 在线亚洲精品国产二区图片欧美| 久久毛片免费看一区二区三区| 美女大奶头黄色视频| 欧美日韩亚洲国产一区二区在线观看 | 九草在线视频观看| 国产有黄有色有爽视频| 色吧在线观看| 人妻人人澡人人爽人人| 涩涩av久久男人的天堂| 伦理电影大哥的女人| 波多野结衣一区麻豆| 天堂8中文在线网| 美女视频免费永久观看网站| 午夜福利,免费看| 亚洲美女黄色视频免费看| 有码 亚洲区| videossex国产| 女人被躁到高潮嗷嗷叫费观| 久久久久国产网址| 看非洲黑人一级黄片| 高清不卡的av网站| 九九爱精品视频在线观看| 90打野战视频偷拍视频| 少妇人妻 视频| 啦啦啦在线免费观看视频4| 999精品在线视频| 国产在线视频一区二区| 两性夫妻黄色片| 一区二区三区四区激情视频| 九九爱精品视频在线观看| 性高湖久久久久久久久免费观看| 欧美日韩视频精品一区| 欧美日韩视频高清一区二区三区二| 建设人人有责人人尽责人人享有的| kizo精华| 国产极品天堂在线| 9色porny在线观看| 韩国高清视频一区二区三区| 人人妻人人爽人人添夜夜欢视频| 亚洲天堂av无毛| 啦啦啦中文免费视频观看日本| 中文字幕亚洲精品专区| videosex国产| 亚洲伊人久久精品综合| 美女福利国产在线| 极品少妇高潮喷水抽搐| 国产av码专区亚洲av| 亚洲综合色惰| 国产精品嫩草影院av在线观看| 亚洲欧美色中文字幕在线| 国产成人精品久久二区二区91 | 狠狠婷婷综合久久久久久88av| 中文字幕亚洲精品专区| 午夜福利乱码中文字幕| av在线观看视频网站免费| 久久人人爽av亚洲精品天堂| 青春草国产在线视频| 久久av网站| 又大又黄又爽视频免费| 午夜老司机福利剧场| 在线观看一区二区三区激情| 天天操日日干夜夜撸| 夫妻午夜视频| 亚洲欧美一区二区三区国产| 一本一本久久a久久精品综合妖精 国产伦在线观看视频一区 | 国产精品秋霞免费鲁丝片| 日日摸夜夜添夜夜爱| 午夜福利视频在线观看免费| 免费观看av网站的网址| 欧美精品国产亚洲| 少妇熟女欧美另类| 亚洲在久久综合| 女人精品久久久久毛片| 人妻人人澡人人爽人人| 国产淫语在线视频| 天天躁夜夜躁狠狠久久av| 黑人欧美特级aaaaaa片| 一边摸一边做爽爽视频免费| 国产探花极品一区二区| 日韩中字成人| 2021少妇久久久久久久久久久| 男女无遮挡免费网站观看| 久久精品国产自在天天线| 不卡av一区二区三区| 18禁国产床啪视频网站| 黄片无遮挡物在线观看| 99久久综合免费| www.自偷自拍.com| 欧美成人午夜精品| 在线观看免费视频网站a站| 成人黄色视频免费在线看| 性色avwww在线观看| 少妇的逼水好多| 亚洲视频免费观看视频| 免费播放大片免费观看视频在线观看| 日韩欧美精品免费久久| 成人午夜精彩视频在线观看| 免费观看a级毛片全部| www日本在线高清视频| 亚洲,欧美,日韩| 五月伊人婷婷丁香| 超碰成人久久| 黄片小视频在线播放| 精品一区二区免费观看| 亚洲精品成人av观看孕妇| 色94色欧美一区二区| 亚洲精品国产一区二区精华液| 999精品在线视频| 亚洲经典国产精华液单| av又黄又爽大尺度在线免费看| 亚洲成人一二三区av| 中文字幕人妻丝袜制服| 久久影院123| 欧美亚洲日本最大视频资源| 国产成人精品在线电影| 国产亚洲欧美精品永久| 婷婷成人精品国产| 大陆偷拍与自拍| 中国国产av一级| 国产日韩欧美亚洲二区| 亚洲人成电影观看| 好男人视频免费观看在线| 国产成人午夜福利电影在线观看| 成人国产麻豆网| 最近的中文字幕免费完整| 国产一区二区三区综合在线观看| av又黄又爽大尺度在线免费看| 亚洲欧美一区二区三区国产| 亚洲经典国产精华液单| 亚洲综合色网址| 丝袜在线中文字幕| 天天操日日干夜夜撸| 精品国产国语对白av| 国产精品不卡视频一区二区| 一本—道久久a久久精品蜜桃钙片| 亚洲三区欧美一区| av.在线天堂| 观看美女的网站| 巨乳人妻的诱惑在线观看| 如日韩欧美国产精品一区二区三区| www日本在线高清视频| 伦精品一区二区三区| 七月丁香在线播放| 韩国高清视频一区二区三区| 国产探花极品一区二区| 女人精品久久久久毛片| 国产成人aa在线观看| 久久精品人人爽人人爽视色| 国产精品三级大全| 国产深夜福利视频在线观看| 老鸭窝网址在线观看| 综合色丁香网| 少妇精品久久久久久久| 国产精品偷伦视频观看了| 午夜老司机福利剧场| 国产精品麻豆人妻色哟哟久久| 只有这里有精品99| 9热在线视频观看99| 国产成人精品在线电影| av天堂久久9| 午夜福利视频在线观看免费| 91aial.com中文字幕在线观看| 我要看黄色一级片免费的| 久久av网站| 在线观看三级黄色| 熟妇人妻不卡中文字幕| 亚洲精品aⅴ在线观看| 精品少妇内射三级| 亚洲成av片中文字幕在线观看 | 一区二区三区四区激情视频| 久久亚洲国产成人精品v| 中文字幕色久视频| 亚洲伊人色综图| 国产精品偷伦视频观看了| 一级a爱视频在线免费观看| 免费高清在线观看日韩| 啦啦啦中文免费视频观看日本| 亚洲三级黄色毛片| 国产国语露脸激情在线看| 天堂俺去俺来也www色官网| 欧美精品一区二区大全| 日韩免费高清中文字幕av| 深夜精品福利| 日韩三级伦理在线观看| 亚洲av在线观看美女高潮| 在线观看免费日韩欧美大片| 最近手机中文字幕大全| 亚洲精品美女久久av网站| 黄色怎么调成土黄色| 精品视频人人做人人爽| 亚洲国产av影院在线观看| 亚洲欧美精品综合一区二区三区 | 男女高潮啪啪啪动态图| 下体分泌物呈黄色| 看免费成人av毛片| 激情五月婷婷亚洲| 在线观看一区二区三区激情| 精品久久久久久电影网| 香蕉精品网在线| 亚洲精品久久久久久婷婷小说| 五月天丁香电影| 国产白丝娇喘喷水9色精品| 国产日韩一区二区三区精品不卡| 美女中出高潮动态图| 99热网站在线观看| 亚洲成人一二三区av| 丝袜人妻中文字幕| a 毛片基地| videosex国产| 制服人妻中文乱码| 97在线人人人人妻| 哪个播放器可以免费观看大片| 九色亚洲精品在线播放| 亚洲国产精品一区二区三区在线| 午夜91福利影院| 在线天堂中文资源库| 国产综合精华液| 亚洲视频免费观看视频| 欧美精品国产亚洲| 国产乱人偷精品视频| 成人国产麻豆网| 女人被躁到高潮嗷嗷叫费观| 亚洲经典国产精华液单| 天堂8中文在线网| 欧美变态另类bdsm刘玥| 久久久久视频综合| 男女午夜视频在线观看| 欧美日韩一级在线毛片| 黄网站色视频无遮挡免费观看| 男的添女的下面高潮视频| 中文字幕精品免费在线观看视频| 女性生殖器流出的白浆| 亚洲精品av麻豆狂野| 欧美黄色片欧美黄色片| 午夜久久久在线观看| 免费少妇av软件| 日本vs欧美在线观看视频| 黑人巨大精品欧美一区二区蜜桃| 汤姆久久久久久久影院中文字幕| 曰老女人黄片| 亚洲第一区二区三区不卡| videossex国产| 美女高潮到喷水免费观看| 日韩成人av中文字幕在线观看| 欧美+日韩+精品| 国产精品一区二区在线不卡| 考比视频在线观看| 欧美日韩亚洲国产一区二区在线观看 | 看非洲黑人一级黄片| 老汉色∧v一级毛片| 久久精品亚洲av国产电影网| 一级黄片播放器| 亚洲一区二区三区欧美精品| 一二三四在线观看免费中文在| 肉色欧美久久久久久久蜜桃| 精品久久久精品久久久| 久久久久国产网址| 最近中文字幕高清免费大全6| 高清视频免费观看一区二区| 精品一区在线观看国产| 亚洲人成网站在线观看播放| 91久久精品国产一区二区三区| 国产av一区二区精品久久| 日本猛色少妇xxxxx猛交久久| 美女午夜性视频免费| 一本色道久久久久久精品综合| 亚洲国产精品国产精品| 精品久久久久久电影网| 99久久人妻综合| 人人妻人人澡人人看| 精品午夜福利在线看| 黄色视频在线播放观看不卡| 黑人巨大精品欧美一区二区蜜桃| 亚洲成人av在线免费| 校园人妻丝袜中文字幕| 国产精品.久久久| 亚洲欧美一区二区三区久久| 天堂中文最新版在线下载| 色婷婷久久久亚洲欧美| 中文天堂在线官网| 一边摸一边做爽爽视频免费| 久久久精品国产亚洲av高清涩受| 男人添女人高潮全过程视频| 人人澡人人妻人| 波野结衣二区三区在线| 国产老妇伦熟女老妇高清| 两性夫妻黄色片| 新久久久久国产一级毛片| 欧美少妇被猛烈插入视频| 黄色一级大片看看| 99九九在线精品视频| 国产精品 国内视频| 女的被弄到高潮叫床怎么办| 十分钟在线观看高清视频www| 午夜免费男女啪啪视频观看| 午夜福利视频精品| 精品国产一区二区三区久久久樱花| 国产精品免费视频内射| 又大又黄又爽视频免费| 免费少妇av软件| 国产成人av激情在线播放| 人体艺术视频欧美日本| 99热全是精品| 十分钟在线观看高清视频www| 久久av网站| 亚洲婷婷狠狠爱综合网| 91午夜精品亚洲一区二区三区| 国产欧美日韩一区二区三区在线| 啦啦啦在线观看免费高清www| 国产精品二区激情视频| 99久久精品国产国产毛片| 亚洲精品,欧美精品| av在线app专区| 搡老乐熟女国产| 亚洲人成77777在线视频| 久久久久国产精品人妻一区二区| 观看av在线不卡| 国产亚洲av片在线观看秒播厂| av天堂久久9| 黑丝袜美女国产一区| 久久久久国产精品人妻一区二区| 日韩一卡2卡3卡4卡2021年| 亚洲人成网站在线观看播放| 99久久综合免费| 亚洲精品自拍成人| 亚洲伊人色综图| 国产人伦9x9x在线观看 | 久久久a久久爽久久v久久| 在线观看国产h片| 99国产综合亚洲精品| 国产精品国产三级国产专区5o| 久久久久久伊人网av| 欧美老熟妇乱子伦牲交| videossex国产| 亚洲人成电影观看| 黑人巨大精品欧美一区二区蜜桃| 国产高清国产精品国产三级| 大陆偷拍与自拍| 三级国产精品片| 日本wwww免费看| 国产不卡av网站在线观看| videossex国产| 国产一区二区三区av在线| 日本欧美视频一区| 大香蕉久久网| av女优亚洲男人天堂| 国产极品粉嫩免费观看在线| 日韩欧美一区视频在线观看| 热re99久久国产66热| 亚洲一区二区三区欧美精品| 丝袜喷水一区| 久久久久久久精品精品| 人成视频在线观看免费观看| 免费黄网站久久成人精品| 伦理电影免费视频|