• <tr id="yyy80"></tr>
  • <sup id="yyy80"></sup>
  • <tfoot id="yyy80"><noscript id="yyy80"></noscript></tfoot>
  • 99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

    Design of a clustered data-driven array processor for computer

    2020-11-27 09:17:16ShanRuiDengJunyongJiangLinZhuYunWuHaoyueHeFeilong
    High Technology Letters 2020年4期

    Shan Rui(山 蕊), Deng Junyong, Jiang Lin, Zhu Yun, Wu Haoyue, He Feilong

    (*School of Electronic and Engineering, Xi’an University of Posts and Telecommunications, Xi’an 710121, P.R.China)(**Integrated Circuit Laboratory, Xi’an University of Science and Technology, Xi’an 710054, P.R.China)

    Abstract

    Key words: array processor, data-driven, adjacent interconnection, distributed memory, computer vision (CV)

    0 Introduction

    Computer vision (CV) is widely expected to be the next significant technique in emerging applications[1-3]. CV algorithms can be divided into 3 types according to complexity: low-level, intermediate-level and high-level[4]. Low-level vision is the estimation of depth, motion, shape, and other physical scene properties from visual measurements[5]. Since these algorithms are fully predetermined and act identically on all input data and the initial and final data structures are fixed arrays the algorithms are ideally suited to single instruction, multiple data (SIMD) machines[6]. Intermediate-level and high-level are the critical stage in a computer vision system. The data structures are not fixed in size, and no simple one-to-one or many-to-one fixed and even distribution of data to processors exists[6]. Thus, parallel implementation of these algorithms is more suited to a multiple instruction, multiple data (MIMD) architecture. From above, it can be seen that different levels employ different methods to accelerate for getting the best acceleration. So, many heterogeneous architectures for CV emerge[7-10]recently. However, the high delay of data communication in heterogeneous architectures becomes a main problem to limit the computing efficiency for CV applications.

    In order to reduce data communication delay and enhance the computing efficiency of CV application, a clustered data-driven array processor (CD-DAP) is proposed, which can support SIMD and MIMD at the same time. Multiple processing elements (PEs) are arranged in MESH structure, and connected by short line adjacent interconnection. Single PE is designed with pipelining and works as the mode of dataflow. The main contributions are summarized as follows:

    ? A double-buffer interface structure based on dataflow driven is designed, and it can realize ultra-low data transfer delay between adjacent PEs.

    ? A four-way shared pipeline transcendental function accelerator is designed. The resource utilization and computing efficiency are enhanced.

    ? A distributed shared memory structure based on unified addressing is employed. Through a way of highly efficient interconnection, the data accessing delay can be reduced dramatically.

    This paper is organized as follows. Section 1 introduces related work and the motivation. After that, clustered data-driven array processor will be proposed in Section 2. Section 3 describes the mapping of computer vision algorithm. In Section 4, simulation and performance analysis will be given. The conclusion is given in Section 5.

    1 Related work and motivation

    Many researchers pay attention to acceleration architectures for computer vision. CPU+GPUs is one of widely employed structure. For example, Ref.[3] presented an implementation of OpenVX directed at CPUs and GPUs platforms, and discussed these analytical techniques in detail. Ref.[11] discussed that how to use GPUs’ hundreds of gigaflops of processing power to realize image processing and computer vision through a way of parallel programming. Ref.[12] presented a method which can adapt CPU-GPU task parallelism, sliding window parallelism, scale image parallelism, dynamic allocation of threads, and local memory optimization. Meanwhile a face detection algorithm was realized using proposed method to accelerate computing time. CPU is mainly used to execute serial part of CV algorithms, and GPU is mainly used to compute parallel part of CV algorithms. So, CPU+GPUs can accelerate the computing part of CV algorithms well, but it is inevitable that plenty of data need to be transferred between CPU memory and GPU memory. The performance improvement is limited by the long data transfer delay.

    For satisfying the necessary real-time performance for many computer vision applications, dedicated hardware accelerators are often designed, such as in Ref.[13], a hardware-based stereo matching architecture was proposed which aims to provide high accuracy and concurrently high performance in embedded vision applications. The architecture integrated an image filter and an edge-preserving filter. Dedicated hardware accelerators can satisfy the requirement of real time application well, but it cannot adapt to variety algorithms. Once algorithm changed, the hardware must be redesigned. The cost is very high and the time to market is long.

    Field programmable gate arrys (FPGAs) are also used in accelerating computer vision applications, such as in Ref.[14], an FPGA-based emulation framework was proposed that can provide dynamic vulnerability analysis for hardware-accelerated computer vision applications. Ref.[15] proposed a FPGA-based accelerator architecture that can tackle a range of standard CV algorithms. The architecture consists of pipelined processing elements that can be configured to support various belief propagation settings for different CV tasks. FPGAs can shorten the time to market slightly, but it also needs to be redesigned once algorithm changes.

    For adapting to the fast-changed computer vision algorithms, multi-core array architectures are introduced, such as in Ref.[16], a polymorphous array processor was proposed that consists of several levels of clusters of processors and seamlessly integrates data parallelism, thread parallelism, operation parallelism, and distributed instruction parallelism. And OpenVX is implemented on this architecture. Ref.[17] proposed a novel framework that is for fast prototyping and optimization of OpenVX applications for heterogeneous SoCs with many-core accelerators. Ref.[18] presented a method for early parallel performance estimation on embedded multiprocessors from sequential application traces. Ref.[19] proposed a dynamically reconfigurable hybrid architecture for vision processing. So multi-core array architectures are very promising target for CV applications.

    Based on above, CD-DAP architecture is proposed. On one hand, it can support data-level parallelism well, and it can execute serial computation well. At the same time, it supports flexible programming. One the other hand, the data transfer delay can be reduced sharply because data do not need to be transferred between different frameworks.

    2 Clustered data-driven array processor

    CD-DAP architecture consists of PE array, memory banks (MBs) array, fast switching unit and router as shown in Fig.1. PE array, MB array, and one fast switching unit are constructed one cluster. PE array includes 16 pipelined PEs and 4 transcendental function accelerators. PEs in array are connected with adjacent interconnection. One transcendental function accelerator is shared by 4 adjacent PEs. Data communication in one cluster is realized through fast switching unit and adjacent interconnection. Data communication between clusters is realized through routers. The proposed structure can be easily scaled with routers.

    Fig.1 CD-DAP architecture

    2.1 Pipelined PE based on two-buffer dataflow driven interface

    Three-level pipelining architecture is employed in PE design. The first level is mainly used to read code from configurable RAM. The second level is mainly used to receive data from 4 input ports, register file, or h-register. The third level is mainly used to process buffered data from up-level and send processing results to corresponding destination, may be to 4 output ports, register file, or h-register. The third level can be fired only when the needed data arrive.

    Its detail architecture is shown in Fig.2. It consists of 4 two-buffer units, Pc fresh unit, instruction RAM, data receiving unit, data processing unit, generate ready unit, data fan out unit, control unit, pipeline code, register file and h-register.

    Fig.2 Architecture of pipelined PE

    Two-buffer unit is in charge of receiving data from east, west, south, or north direction. In order to reduce data transfer delay, double buffers are used, which are alternately working. Simultaneously the buffer is realized through dataflow driven. When the input data is arriving, and at least one buffer is empty or buffers is full but the buffered data is already being received by next level unit, the buffer is fired to save. The delay of data transfer between adjacent PEs can be near zero.

    Pc fresh unit is used to refresh the value of program point, which decides the next code address. It is controlled by current stage of pipelining. If the code which is already in third level is processing finished, Pc fresh can be done immediately. The detail value is decided by current finished code. If the code is jumping or branch instruction, it is refreshed by corresponding jumping address, or Pc is just added by one.

    Data receiving unit is in charge of saving data for data processing unit, including left data and right data. Left data may be from 4 input ports, register file, or h-register, and right data may be from 4 input ports, register file, immediate or h-register. And saving can be done only when the next pipeline is finished.

    Data processing unit is in charge of processing data from data receiving unit. Some operations are supported, such as add, add immediate, sub, sub immediate, shift, and, or, xor, not, branch, jump, multiply, nop. For satisfying requirements of computer vision application better, sub-word parallelism operation is also supported, including four 8 bits add/sub operations in one code, two 16 bits add/sub operations in one code.

    Control unit is the key part, which controls Pc refreshing, data receiving and processing, ready signals generating, and register file writing. Pipeline code is used to buffer code from instruction RAM. And the buffered code is used to control data processing, result fan out, ready signals generating, and register file writing. Instruction RAM is mainly used to save application codes, which are from H-tree network. The H-tree network can be seen in Fig.3. Register file includes 32 generate registers. H-register includes 2 specific registers, which are used to save the result from multiply. Hi is used to save upper 32 bits of result, and ho is used to save lower 32 bits of result. Some auxiliary circuits are also used to realize by-pass for avoiding data hazard.

    Fig.3 The architecture of H-tree

    Generate ready unit is used to generate ready signals to east, west, south, or north direction, indicating the data from east, west, south, or north direction whether be received already. Data fan out unit is used to send result from data processing unit to corresponding destination according to code. There are 4 direction output registers. Only the register is empty or the register is full but the data is already received, the result can be written.

    2.2 Transcendental function accelerator

    Transcendental function accelerator supports sin, cos, square, square-root, log, exponent, and arctan computing. One transcendental function accelerator is shared by 4 adjacent PEs, as shown in Fig.4.

    Fig.4 The shared architecture

    The architecture of transcendental function accelerator is also shown in Fig.5. It comprises one pre-processing unit, 2 sin/cos function pipeline units, 2 other function pipeline units, and one fan out unit.

    Fig.5 Architecture of transcendental function accelerator

    Pre-processing unit is used to distribute requests from 4 PEs to computing pipeline. When conflicting between requests occurs, a random arbitration algorithm is employed. Fan out unit is used to send computation result to corresponding output ports according to the result of arbitration. So, the result of arbitration from pre-processing unit must be pipelined as the same as data.

    Sin/cos function pipeline unit is used to execute sin and cos computation. The processing delay can reach one cycle, because piecewise linear algorithm is employed. According to the result of software statistic, the style of 16 segments has lower error and fast speed hardware, so it is employed in this work. Its circuit can be seen in Fig.6.

    Fig.6 Architecture of sin/cos function pipeline unit

    Other function pipeline unit has three-level pipeline structure. Logarithmic system is employed for simple computation. The input data must be translated into logarithm form, and then execute corresponding add, sub, shift operations, finally translate the result into exponent form. For example, the processing of square is shown as Eq.(1).

    (1)

    So other function pipeline unit has one log convertor, some auxiliary circuits, and one pow convertor, which can be seen in Fig.7. In this paper, the log and pow convertors are realized which employs piecewise linear algorithm. A style of 16 segments is also used, and the convertor errors can be reached 0.011% and 0.013% for log and pow respectively.

    Fig.7 Architecture of other function pipeline unit

    2.3 Distributed shared memory structure

    A distributed shared memory structure is designed in this paper. Its architecture is shown in Fig.8(a). For long distance data communication, a network on chip (NoC) is used. A 4 virtual channels router is designed, supporting packet switching. Four 32 bits data will be transferred once.XYrouting algorithm is employed. The delay of virtual channel router can be reached at one cycle.

    For data accessing in cluster, a fast switching unit is designed. Allowing for data accessing frequency, the request from local MB can be responded immediately, and the request from other MBs must be judged through line-row 2 level controllers, and finally arrives at destination MB. The detail structure of fast switching unit is shown in Fig.8(b).

    Fig.8 Architecture of distributed shared memory

    Judge unit is used to receive request from PE, and dispatch the request to local MB or line controller according to request address. Line controller is mainly used to receive requests from the same line PEs, and dispatch 4 requests to corresponding row controller or router controller. If more than one requests need to access the same row controller, a random arbitration algorithm is employed to judge these requests. No replied requests will be postponed. Router controller is in charge of receiving requests from 4 line’s controllers and choosing one to router. Row controller is in charge of receiving requests from different lines and dispatching these requests to corresponding MB according to request address. If more than one requests need to the same MB, a random arbitration algorithm is employed to judge these requests. No replied requests will be also postponed.

    3 Computer vision algorithm mapping

    To study the efficiency of proposed architecture, 3 computer vision algorithms are realized on proposed architecture: Sobel edge detection, Canny edge detection, and Harris corner detection.

    3.1 Sobel edge detection

    Edge detection is a fundamental technique for image segmentation, feature extraction and object tracking[20]. In edge detection, Sobel operator is a kind of commonly used template. The process of Sobel edge detection is shown in Fig.9(a). The Sobel filtering matrices for theXandYdirections are shown in Eq.(2). And the process of filtering can be seen in Eq.(3).

    sobel-x=[-1, -2, -1; 0,0,0;1,2,1]

    sobel-y=[-1,0,1;-2,0,2;-1,0,1]

    (2)

    filtering-x=(m6+2×m7+m8)

    -(m0+2×m1+m2)

    filtering-y=(m2-m0)-(2×m5-2×m3)

    +(m8-m6)

    (3)

    Wherem0-m9 stands for 3×3 source image data. For mapping Sobel edge detection on proposed architecture, one cluster is used. The mapping details are shown in Fig.9(b). PE00, PE01, and PE02 are used to load 3×3 source image data, and send them to PE10, PE11, and PE12 to executexandydirections filtering. For taking full advantage of data reusability, PE00, PE01, PE02 all need read data of 3×3 matrixes at the starting of each line. Next, only PE02 needs read data from memory until this line is finished, and PE00 and PE01 just receive data from PE02. They work like a sliding window.

    Fig.9 Sobel edge detection

    PE10 is mainly used to receive data from PE00, sends the result ofm2-m0 to PE11 firstly, computesm0+2×m1+m2 next, receives the result ofm6+2×m7+m8 from PE11, and executes sub operation getting the final result ofxdirection filteringGx.

    PE11 is mainly used to receive data from PE01, computes 2×m5-2×m3 firstly, receives the result ofm2-m0 from PE10, receives the result ofm8-m6 andm8+2×m7+m6 from PE12, sends the result ofm6+2×m7+m8 to PE10, executes add operation getting the final result ofydirection filteringGy.

    PE12 is mainly used to receive data from PE02, computesm8-m6 andm8+2×m7+m6, and sends the results to PE11. PE20 and PE21 are used to execute absolute operation. PE31 is used to execute |Gx|+|Gy|. PE32 is used to compare the threshold value. PE33 is used to write back results.

    3.2 Canny edge detection

    Canny edge detection[21]is a kind of classic image edge detection algorithm, and it has wide application. The process of Canny edge detection is shown in Fig.10(a). Gussian filtering is implemented on cluster 01. Each of PE00 and PE32 is used to process 6 lines image data, each of PE01-PE31 is used to process 4 lines image data. PE33 is used to send result to next cluster.

    Gradient computation and threshold segmentation are implemented on cluster 02. PE33 is used to receive filtering results from upper cluster. PE32, PE23, PE22, PE21, PE11, PE12 are used to compute gradient. PE10 is used to compare the threshold value and write back the results. The details can be shown in Fig.10(b).

    Fig.10 Canny edge detection 3.3 Harris corner detection

    Harris corner detection[22]is widely used in the area of feature extraction of computer vision algorithm. It is simple and has strong stability. The process of Harris corner detection is shown in Fig.11 (a).

    Three clusters are employed to map it. Cluster 0 is used to implement Sobel filtering. There PE00, PE01, PE02 are used to compute the results ofxdirection filteringGx. PE30, PE31, PE32 are used to compute the results ofydirection filteringGy. PE12 is used to compute the result ofGx2. PE22 is used to compute the result ofGy2. PE23 is used to judge writing back or not. PE33 is used to send results to cluste 1. The details can be shown in Fig.11(b).

    Fig.11 Harris corner detection

    Cluster 1 is in charge of Gussian filtering. The processing of Gussian filtering can be shown in Eq.(4).

    Gussian=(m0+2×m1+m2)

    +(m6+2×m7+m8)

    +2×(m3+2×m4+m5)

    (4)

    In cluster 1, PE00, PE01, PE02 are used to read 3×3 image data, and send them to PE10-PE12 respectively. PE10-PE12 receive data from upper PEs. Simultaneously, PE10 computesm0+2×m1+m2. PE11 computesm6+2×m7+m8. PE12 computes 2×(m3+2×m4+m5). PE21 is used to compute (m0+2×m1+m2)+(m6+2×m7+m8), and send the result to right PE. The final filtering result can be gotten from PE23. PE33 is used to send the results to cluster 2. The details can be shown in Fig.11(b).

    Cluster 2 is in charge of corner value computation and corner judging. PE00-PE13 are used to compute corner value. When they finish, PE21-PE33 will start to work and judge whether a corner or not. The details can be shown in Fig.11(b).

    4 Simulation and performance analysis

    Some computing vision algorithms are realized on proposed structure. The results can be seen in Fig.12. Image in Fig.12(a) is original image. Images in Figs12 (b)-(g) are the result of Sobel edge detection, Canny edge detection, Harris corner detection, Fast corner detection, Gassion pyramid, and Histogram respectively.

    Fig.12 The running results of computer vision algorithms

    Computing time of Sobel, Canny, Harris, Fast, Gaussian pyramid, and Histogram realized on CD-DAP and the results of peek signal to noise ratio (PSNR) and root-mean-square error (RMSE) are shown in Table 1.

    Table 1 Performance statics of some algorithms

    Meanwhile, computing time of Sobel, Canny, Harris realized on CD-DAP compared with which realized on FPGA and GPU can be seen in Table 2. From this table, it can be seen that totally processing time of a 512×512 image needs 0.283 ms for Sobel, 0.682 ms for Canny, 4.985 ms for Harris on CD-DAP, which is faster than FPGA and GPU.

    Table 2 Computing time of Sobel, Canny, Harris (unit: ms)

    CD-DAP based on one cluster including 16 PEs, one transcendental function accelerator, one fast switch unit and one router has been implemented on Xilinx ZC 706 develop board. The source occupation can be seen in Table 3. The synthesis result of 4 clusters and the result compared with other implementation architectures can be shown in Table 4. The circuit can be run at 100 MHz.

    Table 3 Synthesized results for one cluster of CD-DAP

    Table 4 Performance and resource usage

    The same circuit has been synthesized using SMIC 130 nm COMS technology. The circuit can be run at 100 MHz. Area is 26.58 mm2. Compared with Refs[19,30,31], the proposed architecture has higher frequency. Area in Ref.[19,30] are larger than the proposed architecture. The architecture in Refs[19,30,31] has 32 kB, 171 kB, 16 kB data memory respectively, however CD-DAP based on 4 clusters has 256 kB data memory. 4096, 3072, 80 PEs are integrated in Ref.[19,30,31] respectively. The proposed architecture has 64 PEs, and can be scaled easily, meanwhile compared with Ref.[19,30,31], the PE is more complicated and functional. 8 bit, 16 bit, 32 bit data width can be supported simultaneously in this work, however only 10 bit data width is supported in Ref.[19], 8 bit data width is supported in Ref.[31] and 11 bit data width is supported in Ref.[30].

    5 Conclusions

    A clustered data-driven array processor for computer vision is proposed. To reduce data transfer delay, a double buffer dataflow driven interface is designed. For improving data parallel computation, 8 bits, 16 bits, 32 bits subtext parallel computation has been supported. For accelerating computer vision applications further, a four-way shared pipelining transcendental function accelerator based on Y-intercept adjusted piecewise linear segment algorithm is designed. Simultaneously, a distributed shared memory structure based on unified addressing is also employed. Through employing fast switching unit realizing data transfer in cluster and NoC using for data communication between clusters, data accessing delay is reduced dramatically.

    Sobel, Canny, Horris, Fast, Gaussian pyramid, and Histogram algorithms are implemented on proposed architecture. The computing time is statistic. Simultaneously, CD-DAP based on 4 clusters has been implemented on Xilinx ZC 706 develop board. The same circuitry has been synthesized using SMIC 130 nm COMS technology. The circuitry can be run at 100 MHz. Area is 26.58 mm2.

    熟女少妇亚洲综合色aaa.| 成人亚洲精品一区在线观看| 欧美成狂野欧美在线观看| 成人三级做爰电影| 中文字幕高清在线视频| 久久人人97超碰香蕉20202| 亚洲精品一二三| 黄频高清免费视频| 可以免费在线观看a视频的电影网站| 中文字幕高清在线视频| 国产深夜福利视频在线观看| 亚洲av电影在线进入| 桃红色精品国产亚洲av| 黄片播放在线免费| 一二三四在线观看免费中文在| 国产xxxxx性猛交| 精品国产乱码久久久久久男人| 久99久视频精品免费| 伦理电影免费视频| 国产精品亚洲一级av第二区| 午夜福利,免费看| 免费看十八禁软件| 国产高清国产精品国产三级| 又紧又爽又黄一区二区| 亚洲成人国产一区在线观看| 超色免费av| av一本久久久久| 777久久人妻少妇嫩草av网站| av天堂在线播放| av线在线观看网站| 欧美成人免费av一区二区三区 | 精品午夜福利视频在线观看一区| 国产不卡av网站在线观看| 久久精品亚洲精品国产色婷小说| 在线国产一区二区在线| 婷婷精品国产亚洲av在线 | 亚洲欧美日韩高清在线视频| 成人av一区二区三区在线看| 成人亚洲精品一区在线观看| 国产亚洲欧美98| 99久久99久久久精品蜜桃| 成人国产一区最新在线观看| 99精国产麻豆久久婷婷| 可以免费在线观看a视频的电影网站| av电影中文网址| 黄色视频不卡| 丝袜人妻中文字幕| 黄色丝袜av网址大全| 亚洲精品一卡2卡三卡4卡5卡| 精品视频人人做人人爽| 大型av网站在线播放| 亚洲综合色网址| 超碰成人久久| 欧美黑人精品巨大| 十八禁人妻一区二区| 亚洲精品国产一区二区精华液| 欧美 亚洲 国产 日韩一| 色在线成人网| 成人亚洲精品一区在线观看| 18禁裸乳无遮挡免费网站照片 | 高清视频免费观看一区二区| 宅男免费午夜| 国产免费现黄频在线看| 在线观看免费午夜福利视频| 欧美日韩国产mv在线观看视频| 日韩欧美国产一区二区入口| 久久人妻熟女aⅴ| 少妇的丰满在线观看| 亚洲自偷自拍图片 自拍| 久久久国产欧美日韩av| 交换朋友夫妻互换小说| 最新美女视频免费是黄的| 亚洲欧美日韩另类电影网站| 99re6热这里在线精品视频| 香蕉国产在线看| 高潮久久久久久久久久久不卡| 亚洲精品中文字幕在线视频| 在线天堂中文资源库| 视频区图区小说| 色综合婷婷激情| 国产在线一区二区三区精| 亚洲av电影在线进入| 国产成人免费观看mmmm| 搡老熟女国产l中国老女人| 国产男靠女视频免费网站| 91av网站免费观看| 亚洲国产中文字幕在线视频| 超碰97精品在线观看| 亚洲熟妇中文字幕五十中出 | 黄色怎么调成土黄色| 欧美日韩亚洲高清精品| 国产精品一区二区在线观看99| 18禁黄网站禁片午夜丰满| 亚洲av片天天在线观看| 日本欧美视频一区| 一区二区三区精品91| 精品国产乱子伦一区二区三区| 热re99久久精品国产66热6| 在线天堂中文资源库| 国产成人一区二区三区免费视频网站| 在线十欧美十亚洲十日本专区| 国产xxxxx性猛交| 中亚洲国语对白在线视频| 正在播放国产对白刺激| 天天躁夜夜躁狠狠躁躁| 久久精品国产亚洲av高清一级| 91麻豆av在线| 亚洲精品国产区一区二| 国产有黄有色有爽视频| www.熟女人妻精品国产| 动漫黄色视频在线观看| 国产精品av久久久久免费| 国产精品亚洲av一区麻豆| 欧美国产精品va在线观看不卡| 亚洲国产看品久久| 一边摸一边抽搐一进一小说 | 日韩欧美免费精品| 伦理电影免费视频| av线在线观看网站| 久久精品91无色码中文字幕| 91精品国产国语对白视频| 乱人伦中国视频| 香蕉丝袜av| 自拍欧美九色日韩亚洲蝌蚪91| 手机成人av网站| 亚洲成人国产一区在线观看| 日韩一卡2卡3卡4卡2021年| 亚洲专区字幕在线| 免费在线观看完整版高清| 欧美人与性动交α欧美精品济南到| 一区二区三区国产精品乱码| 精品视频人人做人人爽| 在线观看免费日韩欧美大片| 91在线观看av| 亚洲第一青青草原| 性少妇av在线| 老司机在亚洲福利影院| 啦啦啦 在线观看视频| 欧美日韩成人在线一区二区| 夫妻午夜视频| 999精品在线视频| 久久狼人影院| 最近最新免费中文字幕在线| 少妇猛男粗大的猛烈进出视频| 村上凉子中文字幕在线| 超色免费av| 一级毛片高清免费大全| 久久久精品免费免费高清| 欧美色视频一区免费| 精品高清国产在线一区| 精品人妻在线不人妻| 嫁个100分男人电影在线观看| 窝窝影院91人妻| 精品亚洲成国产av| 大香蕉久久成人网| 热re99久久国产66热| 久久国产乱子伦精品免费另类| 亚洲国产欧美一区二区综合| 怎么达到女性高潮| 波多野结衣av一区二区av| a级毛片在线看网站| 免费在线观看日本一区| 亚洲成人免费av在线播放| 国产精品久久久久久人妻精品电影| 人人澡人人妻人| 韩国精品一区二区三区| 久久久久国内视频| 91精品国产国语对白视频| 在线观看一区二区三区激情| 色精品久久人妻99蜜桃| netflix在线观看网站| avwww免费| 欧美乱妇无乱码| 亚洲av成人一区二区三| 欧美激情 高清一区二区三区| 99热国产这里只有精品6| 久久精品亚洲熟妇少妇任你| 亚洲久久久国产精品| 国产麻豆69| 欧美日韩瑟瑟在线播放| 黄片小视频在线播放| 视频区图区小说| 亚洲色图av天堂| 国产精品二区激情视频| av中文乱码字幕在线| a级毛片在线看网站| 窝窝影院91人妻| 免费少妇av软件| 青草久久国产| 9热在线视频观看99| 久久久水蜜桃国产精品网| 欧美丝袜亚洲另类 | 无限看片的www在线观看| 在线观看一区二区三区激情| 国产国语露脸激情在线看| 老司机亚洲免费影院| 在线免费观看的www视频| 久久这里只有精品19| 曰老女人黄片| 大陆偷拍与自拍| 欧美国产精品一级二级三级| 波多野结衣av一区二区av| 一区二区三区精品91| av电影中文网址| 亚洲,欧美精品.| 国产麻豆69| 麻豆成人av在线观看| 女性生殖器流出的白浆| 人人澡人人妻人| 国产一区有黄有色的免费视频| 国产成人影院久久av| 色播在线永久视频| 日本黄色日本黄色录像| 丰满人妻熟妇乱又伦精品不卡| 久久久久久久国产电影| a在线观看视频网站| 女人爽到高潮嗷嗷叫在线视频| 国产精品久久久人人做人人爽| 国产极品粉嫩免费观看在线| 一区二区三区激情视频| 大香蕉久久网| 一进一出抽搐动态| 国产有黄有色有爽视频| 午夜老司机福利片| 欧美乱色亚洲激情| 99re在线观看精品视频| 制服诱惑二区| 男男h啪啪无遮挡| 亚洲美女黄片视频| 亚洲五月天丁香| 91老司机精品| 每晚都被弄得嗷嗷叫到高潮| 久久久国产精品麻豆| 国产亚洲欧美精品永久| 高清欧美精品videossex| 久热这里只有精品99| 免费观看人在逋| 18禁裸乳无遮挡动漫免费视频| 成年版毛片免费区| 99精品欧美一区二区三区四区| 中文字幕高清在线视频| 亚洲专区中文字幕在线| 一区二区三区精品91| 老司机深夜福利视频在线观看| 国产一区二区三区综合在线观看| 成人18禁高潮啪啪吃奶动态图| 无遮挡黄片免费观看| tocl精华| av免费在线观看网站| 色94色欧美一区二区| 久久国产精品人妻蜜桃| 久久久精品免费免费高清| 亚洲成a人片在线一区二区| 精品国产亚洲在线| 69av精品久久久久久| 国产精品久久电影中文字幕 | 一级a爱视频在线免费观看| 国产亚洲精品第一综合不卡| 亚洲国产欧美一区二区综合| 欧美最黄视频在线播放免费 | 欧美人与性动交α欧美软件| 午夜福利欧美成人| 超色免费av| 脱女人内裤的视频| 国产日韩欧美亚洲二区| 色94色欧美一区二区| 多毛熟女@视频| 99国产极品粉嫩在线观看| 欧美精品一区二区免费开放| 变态另类成人亚洲欧美熟女 | 一个人免费在线观看的高清视频| 精品少妇久久久久久888优播| 久热这里只有精品99| 国产97色在线日韩免费| av天堂在线播放| 欧美色视频一区免费| 一级a爱视频在线免费观看| 中文字幕人妻丝袜制服| 精品卡一卡二卡四卡免费| 中文字幕人妻丝袜一区二区| 久久午夜综合久久蜜桃| av视频免费观看在线观看| 久久久久视频综合| 国产野战对白在线观看| 国产成人av教育| 动漫黄色视频在线观看| 一区二区三区精品91| 下体分泌物呈黄色| 狠狠狠狠99中文字幕| 亚洲av成人一区二区三| 国产成人精品在线电影| 国产成人精品久久二区二区免费| 亚洲熟妇熟女久久| 搡老乐熟女国产| 日本欧美视频一区| 精品久久蜜臀av无| 亚洲国产精品sss在线观看 | 正在播放国产对白刺激| 国产精品久久久人人做人人爽| 可以免费在线观看a视频的电影网站| 韩国精品一区二区三区| 久久久精品国产亚洲av高清涩受| 欧美+亚洲+日韩+国产| 欧美精品一区二区免费开放| 在线观看www视频免费| av网站免费在线观看视频| 久久精品亚洲精品国产色婷小说| 国产有黄有色有爽视频| 午夜91福利影院| 日韩三级视频一区二区三区| 最近最新中文字幕大全电影3 | 亚洲精品乱久久久久久| 国产男靠女视频免费网站| 亚洲精品自拍成人| 国产成人一区二区三区免费视频网站| 免费观看人在逋| 亚洲一卡2卡3卡4卡5卡精品中文| 黄色片一级片一级黄色片| 国产在视频线精品| 欧美黄色片欧美黄色片| av片东京热男人的天堂| 老汉色∧v一级毛片| 久久国产精品大桥未久av| 黑人猛操日本美女一级片| 18禁观看日本| 母亲3免费完整高清在线观看| 看免费av毛片| 在线视频色国产色| 亚洲午夜理论影院| 新久久久久国产一级毛片| 国产精品久久视频播放| 一级毛片高清免费大全| 欧美老熟妇乱子伦牲交| 免费在线观看亚洲国产| 久久香蕉精品热| 久久中文字幕一级| 精品第一国产精品| 婷婷精品国产亚洲av在线 | e午夜精品久久久久久久| netflix在线观看网站| 自拍欧美九色日韩亚洲蝌蚪91| 亚洲一区中文字幕在线| 国产一区二区三区综合在线观看| 自拍欧美九色日韩亚洲蝌蚪91| 少妇 在线观看| 久久精品人人爽人人爽视色| 免费不卡黄色视频| 黑人巨大精品欧美一区二区mp4| 久久中文字幕一级| 啦啦啦视频在线资源免费观看| 亚洲国产看品久久| 日韩一卡2卡3卡4卡2021年| 亚洲精品国产区一区二| 亚洲国产欧美一区二区综合| 久久精品亚洲精品国产色婷小说| 欧美日本中文国产一区发布| 最新美女视频免费是黄的| 久久国产乱子伦精品免费另类| 久久久国产成人免费| 九色亚洲精品在线播放| 国产av精品麻豆| 亚洲欧美日韩高清在线视频| 一进一出好大好爽视频| 中出人妻视频一区二区| 韩国精品一区二区三区| 69精品国产乱码久久久| 一二三四在线观看免费中文在| 在线天堂中文资源库| 可以免费在线观看a视频的电影网站| 国产精品99久久99久久久不卡| 国产一卡二卡三卡精品| 免费在线观看黄色视频的| 岛国在线观看网站| 欧美黄色片欧美黄色片| 久久久国产成人精品二区 | 午夜成年电影在线免费观看| 成人av一区二区三区在线看| 交换朋友夫妻互换小说| 人妻久久中文字幕网| 一边摸一边抽搐一进一出视频| 日本五十路高清| 午夜福利在线观看吧| 91在线观看av| 亚洲精品美女久久久久99蜜臀| 最新在线观看一区二区三区| 老司机深夜福利视频在线观看| 国产精品乱码一区二三区的特点 | 一级作爱视频免费观看| 欧美日韩瑟瑟在线播放| 黄色视频不卡| 日韩一卡2卡3卡4卡2021年| 美女国产高潮福利片在线看| 每晚都被弄得嗷嗷叫到高潮| 99精国产麻豆久久婷婷| 欧洲精品卡2卡3卡4卡5卡区| 女人被躁到高潮嗷嗷叫费观| 国产免费现黄频在线看| 自拍欧美九色日韩亚洲蝌蚪91| 精品少妇一区二区三区视频日本电影| 狠狠婷婷综合久久久久久88av| 多毛熟女@视频| 国产亚洲精品第一综合不卡| 两性夫妻黄色片| 国产精品免费视频内射| 叶爱在线成人免费视频播放| 精品少妇久久久久久888优播| 日韩成人在线观看一区二区三区| 久久午夜亚洲精品久久| 九色亚洲精品在线播放| 黄频高清免费视频| 亚洲精品av麻豆狂野| 国产91精品成人一区二区三区| 狂野欧美激情性xxxx| 一本大道久久a久久精品| 亚洲少妇的诱惑av| 麻豆国产av国片精品| 午夜两性在线视频| 俄罗斯特黄特色一大片| 成人18禁高潮啪啪吃奶动态图| 精品亚洲成国产av| 99久久国产精品久久久| 欧美亚洲日本最大视频资源| 亚洲国产欧美一区二区综合| 看免费av毛片| 91老司机精品| 国产精品久久久久久精品古装| 国产免费现黄频在线看| www.熟女人妻精品国产| 亚洲 国产 在线| 免费人成视频x8x8入口观看| 丝袜美足系列| 亚洲精品美女久久av网站| 亚洲在线自拍视频| 国产不卡av网站在线观看| 国产精品 欧美亚洲| 久久精品熟女亚洲av麻豆精品| 99久久综合精品五月天人人| 亚洲少妇的诱惑av| 日韩一卡2卡3卡4卡2021年| 好男人电影高清在线观看| 欧美日韩福利视频一区二区| 黄色 视频免费看| 男女之事视频高清在线观看| 国产激情欧美一区二区| 成年人免费黄色播放视频| 精品无人区乱码1区二区| av电影中文网址| 成在线人永久免费视频| 91精品国产国语对白视频| 大香蕉久久网| 人人妻人人添人人爽欧美一区卜| 久久人人97超碰香蕉20202| 99riav亚洲国产免费| 手机成人av网站| 久久中文字幕一级| 成人精品一区二区免费| 国产精品免费一区二区三区在线 | 天堂中文最新版在线下载| 中文字幕人妻丝袜一区二区| 视频区图区小说| 午夜两性在线视频| 男女免费视频国产| 国产成人精品久久二区二区免费| 两个人免费观看高清视频| 亚洲专区国产一区二区| 亚洲av成人av| 色尼玛亚洲综合影院| 老熟妇仑乱视频hdxx| 男人的好看免费观看在线视频 | 午夜久久久在线观看| 色在线成人网| 久久这里只有精品19| 久久久久久免费高清国产稀缺| 久久精品国产亚洲av高清一级| 久久国产亚洲av麻豆专区| 午夜福利一区二区在线看| 国产麻豆69| 国产亚洲一区二区精品| 国产三级黄色录像| 久久午夜综合久久蜜桃| 两个人看的免费小视频| 99re在线观看精品视频| 99热网站在线观看| 老汉色∧v一级毛片| 欧美乱妇无乱码| 亚洲成人免费电影在线观看| 久久亚洲真实| 激情在线观看视频在线高清 | 国产日韩一区二区三区精品不卡| 深夜精品福利| 久久午夜综合久久蜜桃| 人人妻人人爽人人添夜夜欢视频| 黄片播放在线免费| 亚洲熟妇熟女久久| 免费观看精品视频网站| 午夜福利,免费看| 国产在线精品亚洲第一网站| 男人舔女人的私密视频| 人人妻,人人澡人人爽秒播| 丝袜人妻中文字幕| 欧美日韩精品网址| 1024视频免费在线观看| 激情视频va一区二区三区| 国产黄色免费在线视频| 色婷婷久久久亚洲欧美| 亚洲欧美一区二区三区黑人| 王馨瑶露胸无遮挡在线观看| 亚洲九九香蕉| 亚洲国产精品合色在线| 亚洲av欧美aⅴ国产| 亚洲精品国产一区二区精华液| 亚洲精品成人av观看孕妇| 大香蕉久久成人网| 一进一出抽搐gif免费好疼 | 嫁个100分男人电影在线观看| 在线观看www视频免费| 18禁裸乳无遮挡免费网站照片 | 国产一区在线观看成人免费| 1024视频免费在线观看| 色老头精品视频在线观看| 国产精品av久久久久免费| 日日爽夜夜爽网站| 91成年电影在线观看| 欧美人与性动交α欧美软件| 老鸭窝网址在线观看| 热re99久久精品国产66热6| 久99久视频精品免费| 亚洲伊人色综图| 老司机福利观看| 国产一卡二卡三卡精品| 中文字幕av电影在线播放| 99精国产麻豆久久婷婷| av不卡在线播放| 亚洲国产精品一区二区三区在线| 午夜激情av网站| 青草久久国产| 午夜精品国产一区二区电影| 成人国语在线视频| 啦啦啦免费观看视频1| 国产精品永久免费网站| 精品乱码久久久久久99久播| 精品国产乱子伦一区二区三区| 国产精品一区二区在线观看99| 亚洲一区二区三区不卡视频| 满18在线观看网站| 国产精品亚洲一级av第二区| 别揉我奶头~嗯~啊~动态视频| 久久久久久免费高清国产稀缺| 欧美人与性动交α欧美精品济南到| 不卡一级毛片| 操出白浆在线播放| 亚洲少妇的诱惑av| 免费在线观看黄色视频的| 女人被躁到高潮嗷嗷叫费观| 又紧又爽又黄一区二区| 90打野战视频偷拍视频| 精品卡一卡二卡四卡免费| 性色av乱码一区二区三区2| 九色亚洲精品在线播放| 亚洲色图 男人天堂 中文字幕| 天天躁夜夜躁狠狠躁躁| 久久久精品国产亚洲av高清涩受| 国产成人系列免费观看| 高清在线国产一区| 日韩欧美三级三区| 一级毛片精品| 99国产综合亚洲精品| 欧美人与性动交α欧美软件| 18禁美女被吸乳视频| 激情在线观看视频在线高清 | 日本wwww免费看| 久久婷婷成人综合色麻豆| 国产精华一区二区三区| 婷婷成人精品国产| 香蕉国产在线看| 午夜视频精品福利| 国产精品秋霞免费鲁丝片| 日韩制服丝袜自拍偷拍| 性少妇av在线| 欧美在线黄色| 久久国产精品人妻蜜桃| 国产一区二区激情短视频| 免费在线观看视频国产中文字幕亚洲| 老鸭窝网址在线观看| 欧美日韩黄片免| 热99re8久久精品国产| 99re在线观看精品视频| 国产精品一区二区免费欧美| 成人亚洲精品一区在线观看| 又黄又粗又硬又大视频| 亚洲一区二区三区欧美精品| 少妇粗大呻吟视频| 亚洲成国产人片在线观看| 在线永久观看黄色视频| 9色porny在线观看| 午夜福利免费观看在线| 久久精品亚洲熟妇少妇任你| 久久天躁狠狠躁夜夜2o2o| www日本在线高清视频| 欧美av亚洲av综合av国产av| 狂野欧美激情性xxxx| 国产高清激情床上av| 成人18禁高潮啪啪吃奶动态图| 女性生殖器流出的白浆| 国产成人精品在线电影| 国产精品久久久久成人av| 人成视频在线观看免费观看| 精品国产亚洲在线| 中文字幕人妻熟女乱码| 欧美av亚洲av综合av国产av| 18禁裸乳无遮挡动漫免费视频| 又黄又粗又硬又大视频| 久久亚洲真实| 中文字幕制服av| 亚洲avbb在线观看|