• <tr id="yyy80"></tr>
  • <sup id="yyy80"></sup>
  • <tfoot id="yyy80"><noscript id="yyy80"></noscript></tfoot>
  • 99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

    An Optimized SW/HW AVMF Design Based on High-Level Synthesis Flow for Color Images

    2021-12-14 06:03:40TurkiAlanaziAhmedBenAtitallahandImenAbid
    Computers Materials&Continua 2021年9期

    Turki M.Alanazi,Ahmed Ben Atitallah,,*and Imen Abid

    1Department of Electrical Engineering,Jouf University,Sakaka,Aljouf,Saudi Arabia

    2LETI(E.N.I.S.),University of Sfax,Sfax,Tunisia

    Abstract:In this paper,a software/hardware High-level Synthesis(HLS)design is proposed to compute the Adaptive Vector Median Filter(AVMF) in realtime.In fact,this filter is known by its excellent impulsive noise suppression and chromaticity conservation.The software(SW)study of this filter demonstrates that its implementation is too complex.The purpose of this work is to study the impact of using an HLS tool to design ideal floating-point and optimized fixed-point hardware(HW) architectures for the AVMF filter using square root function (ideal HW) and ROM memory (optimized HW),respectively,to select the best HLS architectures and to design an efficient HLS software/hardware(SW/HW)embedded AVMF design to achieve a trade-off between the processing time,power consumption and hardware cost.For that purpose,some approximations using ROM memory were proposed to perform the square root and develop a fixed-point AVMF algorithm.After that,the best solution generated for each HLS design was integrated in the SW/HW environment and evaluated under ZC702 FPGA platform.The experimental results showed a reduction of about 65%and 98%in both the power consumption and processing time for the ideal SW/HW implementation relative to the ideal SW implementation for an AVMF filter with the same image quality,respectively.Moreover,the power consumption and processing time of the optimized SW/HW are 70%and 97%less than the optimized SW implementation,respectively.In addition,the Look Up Table(LUTs)percentage,power consumption and processing time used by the optimized SW/HW design are improved by nearly 45%,18% and 61%compared the ideal SW/HW design,respectively,with slight decrease in the image quality.

    Keywords:AVMF filter;SW/HW design;HLS flow;ZC702 FPGA platform

    1 Introduction

    The contamination of the color images by “salt and pepper” impulse noise presents the most common problem in image processing.In fact,starting from communication field to the field of security and health,image processing has a vital role to play.Therefore,it is very important to determine the corruption produced by this noise and ameliorate the quality of the image before further processing.

    Image filtering is the most important stage in the image processing operation [1,2].It helps to suppress the noise and to restore and ameliorate the image quality.Image filtering is used in a vast array of applications such as satellite images where the noise can affect the image quality through the capture and transmission processes [3].Thus,it is vital to eliminate the noise from satellite imagery because it is used in several vital fields such as security,water bodies,changing lands and planet health [4].Besides,in the medical profession,images can contain “salt and pepper” noise,which affects image quality,especially in ultrasound imaging and Magnetic Resonance Imaging(MRI) [5].It is,hence,critical to eliminate noise from medical images as crucial information may be affected.

    The earliest filters were based on linear approaches which cannot take account of the nonlinearity of human vision [6]and cannot be suitable for the non-linearities of transmission channels.For that,the non-linear filters are appropriate to the digital color images.Many nonlinear filters have been proposed in the literature such as the Vector Median Filter (VMF) [7],the Adaptive Vector Median Filter (AVMF) [8],the Vector Median Rational Hybrid Filter(VMRHF) [9],etc.

    Many researchers have noted the time consuming and the high complexity of non-linear filters [10,11].However,to reduce this complexity,researchers have adopted hardware acceleration as a solution.In fact,Trivedi et al.[12]propose a hardware implementation of the median filtering on Field-Programmable Gate-Array (FPGA) which consumes less power and less hardware area.Hu et al.[13]propose two hardware architectures to implement the median filtering counting standard and multi-level median filters.In [14],an optimized hardware architecture based on systolic array is developed to implement median filtering.This architecture uses the pipeline structure which requires seven clock cycles to determine the median value.Lee et al.[15]detail a 3×3 window median filtering based on a bit serial sorting algorithm,which has high speed of operation and less hardware complexity.In [16],a hardware implementation of the VMRHF for color images is described.This hardware architecture uses some approximations to reduce the implementation complexity of the relational function.Boudabous et al.[10]suggest an efficient fast parallel architecture to implement the VMF.This architecture uses approximation to implement an L2norm for the VMF filter.

    But these hardware architectures miss the flexibility of design updating and take more development time.Indeed,the development and implementation of these architectures are done by Low-Level Synthesis (LLS) using hardware description language (HDL) on an FPGA circuit.With LLS design,it is possible to adjust the Register Transfer Level (RTL) description to give a very great,optimized netlist.However,producing such an RTL description requires a lot of effort and time to describe the operations of each low-level circuit and,especially for complex applications [17,18].Nevertheless,designing a complex system is only possible for hardware designers who have specific knowledge and skills.Therefore,there is a real need to raise the design space abstraction level from LLS to High-Level Synthesis (HLS) [19,20]in order to reduce the FPGA design complexity.In fact,HLS allows designers to formalize algorithms using software high-level language (systemC,C/C++,etc.) and synthesizes them via the HLS tool to form a behavioral and structural of the RTL hardware description.In this context,several academic and commercial HLS tools are developed such as Xilinx Vivado HLS,Intel OpenCL [21],Catapult-C [22],and ROCCC [23].However,without any expertise and skills in hardware design,designers can develop and generate automatically from high-level language a complex hardware design which permits designers to explore and simulate a large design space in the shortest of times,identify design performances (power consumption,processing time and hardware cost) and eliminate the source of many design errors.Unfortunately,to design an optimized and high performant RTL circuit with HLS tools,the code should be restructured in a specific style.Without such restructuring,the HLS tools can still generate an RTL circuit but with poor performance [24].

    Given this context,our goal in this work is to use HLS flow to design various hardware architectures for the AVMF filter and integrate these architectures as intellectual property (IPs)blocks with Hardcore ARM processor on Xilinx Zynq FPGA in order to design an efficient software/hardware (SW/HW) embedded system.The SW/HW design should reduce this filter’s complexity and power consumption as well as speed up the execution time.However,HW solution is used for performance (processing speed and power consumption).In contrast,SW solution is used for design flexibility [25,26].

    The remainder of the paper is organized as follows.In Section 2 below an overview of the AVMF filter is presented.The description of Vivado HLS tool and directives are given in Section 3.The proposed HLS AVMF designs are described in Section 4.Section 5 discusses the experimental results in terms of hardware cost,power consumption and processing time of the SW/HW AVMF implementation on ZC702 platform.Finally,conclusion is given in Section 6.

    2 Overview of the AVMF Filter

    In [8],the author presents an Adaptive Vector Median Filter (AVMF) which is based on the VMF filter.It is enhanced by using a threshold to detect the probability of the pixel to be noisy as shown in Fig.1.

    Figure 1:AVMF filter algorithm

    We define byV=(xi∈Zl;i=1,2,...,N)the size(N)of the filtering window.The noisy pixels are presented byx1,x2,...,xN.The position of the filtering window is determined by central pixel which isx(N+1)/2.We consider that each multichannel pixelxiis associated with a distance measurediwhich is calculated by Eq.(1).

    The outputyAVMFof the AVMF is expressed in (2) below:

    where the vectorx(1)represents the VMF output obtained byx(1)≤x(2)≤...≤x(N).It corresponds to the minimum vector distanced(1)∈{d1,d2,...,dN} inside the filtering window.d(1)is expressed by (3).

    The vectorx(N+1)/2corresponds to the distance measured(N+1)/2of the center pixel.ξ(N+1)/2defines the threshold value given in (4):

    whereλAVMFallows the adjustment of the proposed method’s smoothing properties.ΨAVMFis the estimated variance which is defined in (5).

    The approximation presented in (5) determines the mean distance between the vector median and the different pixels held inV.In this equation,d(1)is divided by(N-1)that gives the number of distances fromx(1)to all other pixels inV.However,from Fig.1,we can see that if the distanced(N+1)/2is greater than the thresholdξ(N+1)/2,thereforex(N+1)/2is noisy and is changed by the vectorx(1).But,if the distanced(N+1)/2is less than or equal toξ(N+1)/2,thenx(N+1)/2remains unchanged.

    3 Xilinx Vivado HLS Tool

    The purpose of HLS methodology is to simplify and accelerate the hardware implementation specially for FPGA circuits.For that,an HLS tool is developed by Xilinx to help engineers to rapidly implement algorithms on the FPGA with gains in resource,power,and performance.This tool is called Vivado HLS which gives a design environment to interpret,analyze,optimize,and transform a software language like C/C++to RTL design.This design is synthesized and implemented for Xilinx FPGA.Indeed,with the Vivado HLS tool,it is possible to apply different optimizations to increase the hardware design performances by using several directives such as pipelining,loop unrolling,resource,etc.Some optimizations lead to decreasing the hardware area by applying the ALLOCATION directive.This directive can minimize the number of resources using in design by sharing resource between several functions.Moreover,RESOURCE directive can be used to replace vectors and arrays by specific memory blocks (BRAMs).But,to raise the data rate and achieve a higher throughput,the UNROLL or PIPELINE directives can be used.In this case,by unrolling loops,several hardware blocks are built in parallel to operate the loop iterations in parallel.Otherwise,the PIPELINE directive performs pipelining to reach higher throughput.In fact,the pipeline technique permits the loop iteration to begin before the completion of its predecessor.For that,the data dependencies should be satisfied.Furthermore,the ARRAY PARTITION directive divides the large memory into individual registers or multiple smaller memory blocks for parallel data accesses.However,these optimizations lead to an excessive use of FPGA resources.For that,the level of pipelining or parallelism should be customized.

    With Vivado HLS tool,some steps should be followed to generate an RTL description.In Step 1,the C/C++code should be written in a specific style to permit the HLS tool to create an optimized RTL description.In Step 2,the source code is explored to extract the control path and dataflow.In Step 3,a various specific directive is applied for each algorithm for better hardware optimization.In the last step,the Export RTL tool is used to export as an IP module the created RTL design to the Xilinx Vivado tool in order to generate the bitstream file.

    4 HLS Architecture of the AVMF Filter

    The developed AVMF C code is given as input to the Vivado HLS tool 18.1 in order to generate a hardware architecture for AVMF algorithm.The generated architecture is illustrated in Fig.2.This architecture is optimized to reconstruct the filtered color image in minimum of clock cycles.In fact,to optimize the load of pixels,tree lines of image are sent in parallel to the AVMF coprocessor.In order to form a(3×3)filtering window,three pixels from each line are selected to be stored in the register bank.Each pixel is composed of three colors (R,G,B).However,the 81 Elementary Distances (ED) which aredij(xi,xj)should be calculated for each(3×3)filtering window.Thedij(xi,xj)is given by Eq.(6).

    From Fig.2,we can see that the EDs are implemented based on Eq.(6) and computed by using 81 loop iterations.In fact,the loop 1 is used nine times to accumulate nine EDs(di1,di2,di3,di4,di5,di6,di7,di8,di9)and loop 2 is used also nine times to calculate the nine Euclidean distancesdi.When the nine Euclidean distances are ready,the comparator determines the minimum distance from these nine distances.With the search for the minimum of nine distancesdi,the filtered pixel is supplied and another filtering window for another pixel is started.At the end,to optimize the memory access,the three colors (R,G,B) for the filtered pixel which is determined based on minimum distancediin the filtering window are concatenated in 24-bits and stored in image memory.All these steps are repeatedN×Nloop iterations which depend on the image size in order to filter all pixels in the image.

    Figure 2:AVMF coprocessor

    The bottleneck of this architecture is the implementation of the square root (SQRT) which is used to calculate the EDs for the AVMF filter and the floating-point values.Thus,the purpose of this work is to generate two HLS architectures for the AVMF filter.The first architecture is based on the SQRT function.The second architecture is based on the approximation of the SQRT function in order to use the fixed-point values only and reduce the hardware complexity.Our main goal is to design floating-point and fixed-point architectures using Vivado HLS tool and compares the power consumption,the processing time and the area cost of the designed architectures.

    4.1 HLS Floating-Point AVMF Design

    Several designs are generated from AVMF C floating-point code.These designs are generated by adding incrementally specific directives through the Vivado HLS tool and synthesized for Xilinx XC7Z020 FPGA.After that,we are compared the performance in terms of number of clock cycles and hardware resources (LUTs,FFs,BRAMs and DSPs).

    #Design 1:In this first design,the software code is implemented under the Xilinx XC7Z020 FPGA without any optimizations.The synthesis results are given in Tab.1 for hardware resources and Fig.3 for number of clock cycles.From Tab.1,we can notice that this design uses 14%LUTs,5% FFs,93% BRAMs and 10% DSPs and can reach a maximum 261265418 clock cycles.

    Figure 3:Number of clock cycles of the HLS floating-point AVMF designs

    Table 1:Synthesis results of the HLS floating-point AVMF designs

    #Design 2:In the second experiment,the ARRAY PARTITION directive is applied to the filtering window array in order to partition this array into multiple smaller memory modules.This allows a data parallel access.The experimental results record an increase in the percentage of LUTs by 16% and a decrease of about 5% in the number of clock cycles relative to #Design 1.

    #Design 3:In this design,the PIPELINE directive is applied to the loop iterations with an interval equal to 1 to decrease time latency.This optimization allows a decrease by 97% in number of clock cycles compared to #Design 2,but with an increase of about 79% in the percentage of LUTs and 80% in the number of DSP blocks.

    #Design 4:For this last design,the ALLOCATION directive is added to process the multiplication for good improvement in the FPGA resources by the fact that it permits the sharing of the hardware resources.This optimization shows a reduction in the percentage of LUTs and DSPs by 39% and 10%,respectively,compared to #Design 3,but with an increase of about 6%in clock cycles.

    From these experimental results,#Design 4 is selected for HLS AVMF implementation using the SQRT function (ideal HW).Indeed,it provides a good compromise between number of clock cycles and FPGA area cost.This implementation is done by the Xilinx HLS Vivado tool under the Xilinx XC7Z020 FPGA.

    4.2 HLS Fixed-Point AVMF Design

    The AVMF filter is based on the calculation of the SQRT which should be approximated to decrease the complexity of the hardware architecture.In fact,to simplify the implementation of the AVMF filter,a ROM (Read-Only Memory) is used to store the obtained values of the SQRT [27].Accordingly,a quantity A in Eq.(7) is defined as follows:

    The SQRT(A) is computed and stored in a ROM memory.To determine the size of memory and the precision of the fixed-point values of the SQRT(A),we have conducted an experiment simulation for two standard images (Sailboat and Peppers) which are contaminated with 3% of impulsive noise.In this simulation,we have measured the image quality of the filtered image using Normalized Color Difference (NCD) for several memory sizes to store 512,1024 and 2048 fixed-point values of the SQRT(A) with precision from 1-bit to 12-bit.The simulation results are presented in Fig.4 where we notice that the NCD is decreased when the memory size and precision bit increase.To have a compromise between the memory size and quality of image,we have chosen to store 1024 values in the ROM memory with the 9-bit as the precision bit.With these parameters,we can see that the NCD of approximated AVMF is nearer than the ideal AVMF.To justify this,the relative error is calculated for NCD between ideal and approximated AVMF.The relative error is given by Eq.(8):

    Figure 4:Approximation of the ROM memory size and bit precision of the SQRT function

    From Tab.2,it is worth noting that the average of the relative error for several standard images is equal to 0.05.This error is very small and ensures the quality of image for the approximated AVMF.Consequently,SQRT(A) is established in the manner presented in Fig.5 in which we remark that the first step consists in subdividing the interval [0,195075]into five subintervals.Subsequently,when A is in the range of [0,1024],we read directly the value of the SQRT(A) from the ROM memory.But,when A is in the intervals 2,3,4,or 5,the ROM value is divided by 4,16,64 or 256,respectively.The new value is determined from this interval[0,1024].Then,it is multiplied by 2,4,8,16,respectively,to obtain the desired values of the SQRT(A).To reduce the hardware cost,the multiplication and division operations are done by using shifting operations.

    Figure 5:Approximation of the SQRT(A) computation [27]

    Table 2:Comparison of NCD values for ideal and approximated AVMF filters

    The proposed AVMF C code is used to generate and implement different designs for the AVMF algorithm based on ROM memory.This implementation is realized with the Xilinx Vivado HLS tool.

    #Design 1:In the first experiment,the fixed-point AVMF C code is synthesized for the XC7Z020 FPGA without any optimizations.The experimental results in terms of FPGA resources and the number of clock cycles are given in (Tab.3) and (Fig.6),respectively.As evident in Fig.6,this design can reach a maximum of 44150282 cycles.Furthermore,the FPGA resources are distributed between 4.8% LUTs,2% FFs,95% BRAMs and 4% DSP blocks as shown in Tab.3.

    Table 3:Synthesis results of the HLS fixed-point AVMF designs

    Figure 6:Number of clock cycles of the HLS fixed-point AVMF designs

    #Design 2:In this design,the ARRAY PARTITION directive is applied for parallel data accesses.The synthesized results show an increase in the percentage of LUTs by 25% and a diminution of about 32% in the number of clock cycles relative to #Design 1.

    #Design 3:In the third design,the PIPELINE directive is added to the loop iterations.The experimental results reveal a gain about 91% in clock cycles compared to #Design 2 with an increase of 56% in percentage of LUTs and 89% in number of DSP blocks.

    #Design 4:In this last design,to reduce the hardware cost for the multiplication operations,the ALLOCATION directive is used.This optimization provides a reduction in the percentage of FFs by 4% and 7% in the number of BRAM blocks relative to #Design 3 with an increase of about 17% in clock cycles.

    From synthesized results,we can notice that #Design 4 offers the best compromise between FPGA area cost and number of clock cycles.For that,this design is selected for HLS AVMF implementation using ROM memory (optimized HW).

    From these experimental results,we can conclude that the PIPELINE and the ARRAY PARTITION directives are mainly used to decrease the processing time,but with a concomitant increase in hardware cost.In contrast,the ALLOCATION directive is exploited to decrease the hardware cost but with an increase in the processing time.The purpose of the next section is to investigate the HLS approach in an SW/HW environment to design and verify a standalone IP (Intellectual property) of the AVMF filter (ideal HW and optimized HW) on the ZC702 development board [28].

    5 SW/HW Performance Validation of AVMF Filter Architecture

    ZC702 is a Zynq 7000 development board.The Zynq 7000 is a Xilinx programmable SoC which is used for quickly prototyping and evaluating the functionality of any designed system in SW/HW environment.The inside of the Zynq architecture contains two main parts:The Programmable Logic (PL) for hardware implementation and the Processor System (PS).In the PS part,we find a 32-bit dual Hardcore ARM processor,32 KBL1 data and instruction caches per core,512 KBL2 cache and 1GB DDR3.The PS is operating at 667 MHz and supports operating systems or software routines.In Zynq architecture,the connection between the PL and the PS parts are realized using the Advanced eXtensible Interface (AXI4) of the Advanced Microcontroller Bus Architecture (AMBA) protocols.

    Fig.7 illustrates the designed SW/HW AVMF architecture.This architecture is developed using the Xilinx Vivado 2018.1 tool and evaluated on the ZC702 development board which is based on the Xilinx XC7Z020 FPGA.In this architecture,the AVMF coprocessor is connected to the SW parts (ARM Cortex-A9 processor) through Direct Memory Access (DMA) by using an AXI4-stream interface which is designed for maximum bandwidth access to DDR memory of the PS [26].This mode of transfer supports unlimited data burst sizes and offers point-to-point streaming data without using any addresses.However,in our SW/HW AVMF architecture three DMAs are used.Indeed,the DMA1 is used in read/write mode while the DMA2 and DMA3 are configured in read mode only.

    Figure 7:The SW/HW AVMF design

    Initially,the color image (RGB format) is stored in DDR memory.Then,as shown in Fig.8,when theStart_transfertandTREADYsignals will be asserted,the PS starts to send the noisy pixels to the AVMF coprocessor.TREADYsignal indicates that the AVMF coprocessor is ready to receive data.However,three DMAs (DMA1,DMA2,and DMA3) are used to send three image lines in parallel from DDR memory to the hardware coprocessor.The AVMF coprocessor receives the valid data whenTVALIDsignal will be asserted by AXI stream interface and start to perform the AVMF algorithm for the noisy pixels as soon as the nine pixels for the first 3×3 filtering window are provided.Then,the coprocessor calculates nine Euclidean distances,determines the RGB filtered pixel which reduces the distance between all pixels in a filtering window and stores the concatenated RGB filtered pixel in the internal image memory.From Fig.8,we can see that all these steps are done in pipeline to decrease the processing time.To construct the next filtering window,we select the last 6 pixels from the previous window and adding 3 new pixels.Once the AVMF coprocessor finishes the filtering of all pixels in the image,theTREADYandTLASTsignals will be asserted and the PS starts to receive RGB filtered pixels through DMA1,disconcatenates and stores them in the DDR memory to construct the filtered image (Fig.7).Our SW/HW design,proposed in Fig.7,uses the AXI interface and 3 DMAs to increase the throughput.Further,our design supports various image sizes (i.e.,:32×32,64×64,128×128,256×256).The image size can be increased by increasing the memory size of the image output.

    To evaluate the proper functioning of the SW/HW AVMF design for the HLS ideal and optimized HW IPs blocks,we have followed the design flow which is presented in Fig.9.In fact,the Vivado HLS is used to apply directives and create a stream interface in order to connect the IP blocks with the processor.After that,when the HLS synthesis is completed,the compressed file(.ZIP) including all HDL files is generated and exported to the Xilinx Vivado tool which is used to implement a multiple accelerators blocks connected to the embedded processor across an AXI interface.Then,the Xilinx Vivado tool is used to synthesis,implement the SW/HW design and generate and load the Bitstream file (.bit) in the FPGA platform.Besides,the SW is carried out using the ARM Cortex-A9 processor and compiled with a standalone application using the Xilinx software development toolkit (SDK) to generate the executable file (.elf) which will be performed per the embedded processor.

    Figure 8:Behavior of the SW/HW AVMF design

    Figure 9:HLS flow of the SW/HW AVMF design

    Tab.4 reports the implementation results for the ideal and optimized SW/HW AVMF design under the XC7Z020 FPGA.It is obvious from this table that the optimized SW/HW AVMF design presents a decrease per nearly 45% in the number of LUTs and 22% in the number of DSPs compared to the ideal SW/HW design.

    The next step consists in evaluating the performance in terms of the processing time,power consumption and image quality parameters under the ZC702 FPGA board.The processing time measurement is done by means of the processor timer while power consumption is measured by the Texas Instruments fusion digital power designer software using the Texas Instruments device which is connected to the ZC702 board through a USB interface adapter.Accordingly,taking subjective measurement as an effective way of judging the efficiency of the filter,from Fig.10,we can see that the implemented filter conserves the chromaticity components as well as the fine details of a color image (Monalisa) using 3% of impulsive noise.Consequently,no differences are noticeable between the images filtered by the SW/HW solutions and those in the output of the SW solutions (ideal and optimized).

    Figure 10:(1) Original RGB images,(2) noisy RGB images,filtered RGB images with (3) ideal SW,(4) ideal SW/HW,(5) optimized SW,and (6) optimized SW/HW

    To prove that,the NCD and the Peak Signal to Noise Ratio (PSNR) are used in these simulations and performed for different standard test images (Lena,Flower,Peppers,Sailboat,Mandrill,Monalisa).The size of these color images is 256 × 256 and are contaminated by “salt and pepper” impulsive noise with an intensity equal to 3%.Tab.5 and 6 present the PSNR and the NCD values of test standard test images for the ideal and optimized SW/HW and SW implementations of the AVMF filter.Otherwise,Fig.11 and Tab.7 illustrate the power consumption and the processing time for the SW/HW and the SW ideal and optimized implementations of the AVMF filter,respectively.Fig.11 and Tab.7 reflect a reduction of about 65% and 98% in the power consumption and the processing time for the ideal SW/HW implementation relative to the ideal SW implementation of the AVMF filter,respectively,with the same value of PSNR (Tab.5)and NCD (Tab.6).Besides,we notice that the power consumption and the processing time of the optimized SW/HW are 70% and 97% less than the optimized SW implementation for same image quality,respectively.These results prove not only the efficiency of the HLS approach,but also the reliability of the proposed SW/HW AVMF design.Furthermore,the LUTs percentage,the power consumption and the processing time required by the optimized SW/HW design are improved by nearly 45%,18% and 61%,respectively,compared with the ideal SW/HW design for the AVMF filter,with slight decrease in the quality of image.As obvious from the results when performing the HLS to a software code that includes elementary functions of floating-point numbers,the processing time and the hardware size of the converted hardware increase compared to the fixed-point number.Yet,the floating-point architecture yields the same quality of results(QoR) relative to the ideal SW and accelerates design time and time-to-market (TTM) which is not the case for the fixed-point architecture.However,with the fixed-point architecture,a long process is needed to transform a floating-point algorithm and implement it as fixed-point while taking into account the QoR.

    Figure 11:Power consumption measurement for the SW/HW and SW AVMF designs

    Table 4:SW/HW implementation results

    Table 5:Comparisons of the PSNR values for the SW/HW and the SW AVMF designs

    Compared to other realizations,we can notice from Tab.8 that our design produces better performance results than in [8]for the floating-point implementation of the AVMF filter.As a matter of fact,the throughput of our design is 6.8 times faster than [8].Moreover,our design has better performance than results in [29,30]which present the floating-point implementation of AVMF and VMF filters in Intel Core (TM) i7-4790 at 3.2 GHz and DSP TMS320C6701 at 150 MHz,respectively.It is crucial to mention that our design is implemented on the ZC702 board which has an ARM cortex-A9 core running on 667 MHz and the IP was implemented with a 100 MHz clock frequency.On the other hand,the throughput of our design is 4.9 times faster than [27]for the fixed-point implementation of the AVMF filter in SW/HW environment.Knowing that,it is important to remind that the LLS method is used in [27].

    Table 6:Comparisons of NCD values for the SW/HW and the SW AVMF designs

    Table 7:Comparisons of processing time for the SW/HW and SW AVMF designs

    In light of the above findings,the HLS approach presents a good solution to elevate the abstraction level from RTL to algorithms and accelerate both the design time and the time to market (TTM).But,we can notice that with HLS approach,the reference software should be rewrite in a specific format and select the right directives to attain a better performance in terms of FPGA area cost,power consumption and processing time.Thus,for example with the Xilinx Vivado HLS,the ALLOCATION directive allows to reduce the FPGA area cost.Moreover,the PIPELINE and ARRAY PARTITION directives can be used to improve pipeline and parallel processing between loop iterations which helps to reach higher throughput.Besides,it becomes possible to design a HW floating-point architecture where the performance in terms of processing time and power consumption is better than that of the SW solution and with the same QoR.Further,HLS approach can be combined with SW/HW design methodology to guarantee on the one hand the acceleration of the conception and the flexibility to update the design,on the other hand the performance in terms of processing time and power consumption.

    Table 8:Performance comparisons of the filter implementation

    6 Conclusion

    In this work,the HLS approach is used to design floating-point and fixed-point hardware architectures for the AVMF filter by applying specific directives (ALLOCATION,PIPELINE and ARRAY PARTITION) to the AVMF C codes.The first design was based on the square root function (ideal HW).The second design was based the ROM memory (optimized HW).These designs are integrated as coprocessor with ARM cortex-A9 processor in the SW/HW environment.The AXI-stream interface is used to speed up the data transfer between the PL part and the DDR memory.The experimental results under the ZC702 FPGA platform show that the SW/HW AVMF designs give a better performance in terms of processing time,power consumption and hardware cost relative to the SW implementation with the same QoR.These results prove not only the efficiency of HLS tool,but also the reliability of the proposed SW/HW AVMF design which can be used for several image sizes.

    Acknowledgement:The authors would like to express their deepest gratitude to Prof.Ali M.AMRI of ENET’Com,University of Sfax,Tunisia,for his meticulous proofreading of their paper.

    Funding Statement:The authors extend their appreciation to the Deanship of Scientific Research at Jouf University (Kingdom of Saudi Arabia) for funding this work through research Grant No.DSR2020-06-3663.

    Conflicts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.

    免费观看性生交大片5| 久久精品国产a三级三级三级| 日韩精品免费视频一区二区三区 | 国产精品免费大片| 人妻人人澡人人爽人人| 内地一区二区视频在线| 国产成人午夜福利电影在线观看| 1024视频免费在线观看| 又粗又硬又长又爽又黄的视频| 国精品久久久久久国模美| 在线免费观看不下载黄p国产| 国产亚洲av片在线观看秒播厂| 国产一区二区激情短视频 | 青春草亚洲视频在线观看| 热99国产精品久久久久久7| 黑人巨大精品欧美一区二区蜜桃 | 久久午夜福利片| 韩国高清视频一区二区三区| 天堂俺去俺来也www色官网| 亚洲欧美清纯卡通| 国产精品一区二区在线不卡| 18禁裸乳无遮挡动漫免费视频| 搡女人真爽免费视频火全软件| 亚洲天堂av无毛| 国产黄频视频在线观看| 欧美日韩视频精品一区| 日韩不卡一区二区三区视频在线| 亚洲av男天堂| 有码 亚洲区| 天堂8中文在线网| 国产精品久久久久久av不卡| 久久精品国产综合久久久 | 日本黄大片高清| 免费观看在线日韩| 又粗又硬又长又爽又黄的视频| 大香蕉97超碰在线| 内地一区二区视频在线| 两个人免费观看高清视频| 亚洲av成人精品一二三区| 水蜜桃什么品种好| 日韩一区二区三区影片| 国产高清三级在线| 五月开心婷婷网| 婷婷色麻豆天堂久久| 一本—道久久a久久精品蜜桃钙片| 国产精品久久久久久久电影| 精品久久久久久电影网| 嫩草影院入口| 亚洲国产精品国产精品| 波野结衣二区三区在线| 亚洲美女视频黄频| 美女内射精品一级片tv| 男女午夜视频在线观看 | 亚洲av在线观看美女高潮| 人妻一区二区av| 一级,二级,三级黄色视频| 黑人高潮一二区| 最后的刺客免费高清国语| 美女视频免费永久观看网站| 十八禁网站网址无遮挡| 黄片无遮挡物在线观看| 亚洲 欧美一区二区三区| 建设人人有责人人尽责人人享有的| 男女无遮挡免费网站观看| 夫妻性生交免费视频一级片| 色吧在线观看| 日本欧美国产在线视频| 日本欧美视频一区| 女性生殖器流出的白浆| 久久这里只有精品19| 国精品久久久久久国模美| 亚洲av综合色区一区| 欧美激情 高清一区二区三区| 亚洲av欧美aⅴ国产| 最近最新中文字幕免费大全7| 日本爱情动作片www.在线观看| 一边摸一边做爽爽视频免费| 欧美激情国产日韩精品一区| 波多野结衣一区麻豆| 国产综合精华液| 国产精品一二三区在线看| 18+在线观看网站| 亚洲精品aⅴ在线观看| 99久久中文字幕三级久久日本| 国产免费视频播放在线视频| 18禁动态无遮挡网站| h视频一区二区三区| 男女下面插进去视频免费观看 | 免费黄网站久久成人精品| 日韩精品免费视频一区二区三区 | 久久97久久精品| 久久99热6这里只有精品| 久久久欧美国产精品| 满18在线观看网站| 国产亚洲最大av| 一级黄片播放器| 亚洲精品一二三| 国产精品国产三级专区第一集| 91久久精品国产一区二区三区| 狂野欧美激情性bbbbbb| 亚洲,欧美精品.| 街头女战士在线观看网站| 亚洲av电影在线观看一区二区三区| 麻豆乱淫一区二区| 欧美激情极品国产一区二区三区 | 国产又爽黄色视频| 日本欧美视频一区| 亚洲欧美精品自产自拍| 亚洲综合色网址| 中文乱码字字幕精品一区二区三区| 久久婷婷青草| 久久99蜜桃精品久久| 国产综合精华液| 欧美日韩亚洲高清精品| 色网站视频免费| av免费在线看不卡| 日韩 亚洲 欧美在线| 日韩欧美一区视频在线观看| 满18在线观看网站| 亚洲情色 制服丝袜| 另类精品久久| 成人亚洲精品一区在线观看| 久久人人爽人人爽人人片va| a 毛片基地| 日韩成人伦理影院| 成人国产麻豆网| 咕卡用的链子| 国产午夜精品一二区理论片| 日韩中字成人| 中文字幕精品免费在线观看视频 | 精品一区在线观看国产| 国产一级毛片在线| 免费高清在线观看日韩| 爱豆传媒免费全集在线观看| 亚洲人与动物交配视频| 大香蕉久久网| 在线观看三级黄色| 啦啦啦啦在线视频资源| 美女脱内裤让男人舔精品视频| 久久久久视频综合| a级毛色黄片| 亚洲精品美女久久久久99蜜臀 | 丁香六月天网| 免费人妻精品一区二区三区视频| 亚洲成人手机| 免费久久久久久久精品成人欧美视频 | 九草在线视频观看| 亚洲综合色惰| 久久人人爽人人片av| 久久亚洲国产成人精品v| 国产精品成人在线| 一区二区三区乱码不卡18| 在线观看一区二区三区激情| av播播在线观看一区| 亚洲精品av麻豆狂野| 亚洲av在线观看美女高潮| 日本黄大片高清| 国产成人精品婷婷| 蜜桃国产av成人99| 人体艺术视频欧美日本| 亚洲精品国产色婷婷电影| 成人二区视频| 精品国产露脸久久av麻豆| 又粗又硬又长又爽又黄的视频| 国产精品熟女久久久久浪| 成人午夜精彩视频在线观看| 免费播放大片免费观看视频在线观看| 国产精品久久久久久精品电影小说| 中国美白少妇内射xxxbb| 26uuu在线亚洲综合色| 啦啦啦视频在线资源免费观看| 高清毛片免费看| 亚洲精品美女久久久久99蜜臀 | 日本猛色少妇xxxxx猛交久久| 免费黄色在线免费观看| 免费av不卡在线播放| 777米奇影视久久| 建设人人有责人人尽责人人享有的| 伦理电影大哥的女人| av又黄又爽大尺度在线免费看| 97精品久久久久久久久久精品| 黄色一级大片看看| av一本久久久久| 免费黄网站久久成人精品| 边亲边吃奶的免费视频| videossex国产| 考比视频在线观看| 亚洲国产精品专区欧美| 国产成人aa在线观看| 国产在线视频一区二区| 国产欧美另类精品又又久久亚洲欧美| 蜜桃国产av成人99| 久久免费观看电影| 精品久久久精品久久久| kizo精华| 国产精品人妻久久久影院| 欧美3d第一页| 亚洲成人av在线免费| 日本猛色少妇xxxxx猛交久久| 如何舔出高潮| 母亲3免费完整高清在线观看 | 人妻人人澡人人爽人人| 久久毛片免费看一区二区三区| 少妇人妻 视频| 男人爽女人下面视频在线观看| 宅男免费午夜| 九色成人免费人妻av| 999精品在线视频| 香蕉丝袜av| 国产片内射在线| 女人被躁到高潮嗷嗷叫费观| 天天操日日干夜夜撸| 日韩成人av中文字幕在线观看| 97在线人人人人妻| 日本色播在线视频| 久久久久国产精品人妻一区二区| 国产av码专区亚洲av| 亚洲国产精品一区三区| 夜夜爽夜夜爽视频| 亚洲色图 男人天堂 中文字幕 | 五月开心婷婷网| 一级a做视频免费观看| 纯流量卡能插随身wifi吗| 国产高清不卡午夜福利| 天美传媒精品一区二区| 成人午夜精彩视频在线观看| 大码成人一级视频| 高清毛片免费看| 亚洲精品乱码久久久久久按摩| 国产免费视频播放在线视频| 欧美人与善性xxx| 又粗又硬又长又爽又黄的视频| 18禁动态无遮挡网站| 亚洲av欧美aⅴ国产| 少妇猛男粗大的猛烈进出视频| 成年女人在线观看亚洲视频| 日日撸夜夜添| 免费黄网站久久成人精品| 亚洲精品自拍成人| 亚洲欧美清纯卡通| 日韩熟女老妇一区二区性免费视频| 精品一区在线观看国产| 国产淫语在线视频| av免费在线看不卡| 国产深夜福利视频在线观看| 午夜av观看不卡| www日本在线高清视频| 看免费成人av毛片| 国产精品一区二区在线不卡| 22中文网久久字幕| 大片免费播放器 马上看| 亚洲精华国产精华液的使用体验| 久久人人爽人人爽人人片va| 国产高清三级在线| 最新中文字幕久久久久| a级毛色黄片| 男女免费视频国产| 亚洲国产最新在线播放| www.av在线官网国产| 亚洲综合色网址| 黑人巨大精品欧美一区二区蜜桃 | 欧美成人午夜免费资源| 五月伊人婷婷丁香| 三级国产精品片| 黑丝袜美女国产一区| 在线观看国产h片| tube8黄色片| 五月伊人婷婷丁香| 亚洲av综合色区一区| 夫妻午夜视频| 欧美精品高潮呻吟av久久| 亚洲美女黄色视频免费看| 久久久久网色| 亚洲国产精品一区二区三区在线| 又黄又粗又硬又大视频| 亚洲人成77777在线视频| 黑人欧美特级aaaaaa片| 少妇的逼水好多| 十分钟在线观看高清视频www| 精品久久国产蜜桃| 国产日韩欧美视频二区| 天天躁夜夜躁狠狠久久av| 高清欧美精品videossex| 久久鲁丝午夜福利片| 一级a做视频免费观看| 91成人精品电影| 免费在线观看黄色视频的| 最近的中文字幕免费完整| 亚洲第一区二区三区不卡| 日日爽夜夜爽网站| 久久久欧美国产精品| 国产男人的电影天堂91| 免费少妇av软件| 国产麻豆69| 9热在线视频观看99| 一级毛片黄色毛片免费观看视频| 精品少妇内射三级| 看免费成人av毛片| 天堂8中文在线网| 两个人免费观看高清视频| 国产精品人妻久久久影院| 亚洲国产精品国产精品| 亚洲伊人久久精品综合| 久久人人97超碰香蕉20202| 国产精品无大码| 99热6这里只有精品| 永久免费av网站大全| 夜夜骑夜夜射夜夜干| av有码第一页| 久久韩国三级中文字幕| 亚洲国产日韩一区二区| 人人妻人人添人人爽欧美一区卜| 国产伦理片在线播放av一区| av天堂久久9| 美女中出高潮动态图| 亚洲av男天堂| 午夜福利视频在线观看免费| 色婷婷av一区二区三区视频| 久久精品国产亚洲av天美| 欧美精品一区二区免费开放| 性色av一级| 久久久久久久久久久久大奶| 男人爽女人下面视频在线观看| av.在线天堂| 王馨瑶露胸无遮挡在线观看| 色网站视频免费| 国产成人a∨麻豆精品| 欧美人与善性xxx| 久久午夜综合久久蜜桃| 色5月婷婷丁香| 成人影院久久| 国产精品人妻久久久久久| 有码 亚洲区| 成人综合一区亚洲| 中文精品一卡2卡3卡4更新| 日日啪夜夜爽| 日韩大片免费观看网站| 午夜影院在线不卡| 国产亚洲欧美精品永久| 亚洲三级黄色毛片| 国产亚洲午夜精品一区二区久久| 在线观看免费视频网站a站| 爱豆传媒免费全集在线观看| 国产xxxxx性猛交| 最近最新中文字幕免费大全7| 国产乱人偷精品视频| 国产欧美日韩综合在线一区二区| 三上悠亚av全集在线观看| 日本午夜av视频| 丝袜在线中文字幕| 欧美日韩精品成人综合77777| 国产成人欧美| 99香蕉大伊视频| 婷婷色av中文字幕| 一区在线观看完整版| 蜜桃国产av成人99| 看非洲黑人一级黄片| 婷婷成人精品国产| 五月伊人婷婷丁香| 韩国精品一区二区三区 | 天天影视国产精品| 亚洲av福利一区| 国产xxxxx性猛交| 亚洲综合色网址| 美女中出高潮动态图| 又粗又硬又长又爽又黄的视频| 在线观看一区二区三区激情| 少妇的逼好多水| 在线观看免费视频网站a站| 国产激情久久老熟女| 日韩视频在线欧美| 欧美少妇被猛烈插入视频| 国产av一区二区精品久久| 有码 亚洲区| 久久精品aⅴ一区二区三区四区 | 黄色毛片三级朝国网站| 国产男女超爽视频在线观看| 国产精品久久久久成人av| 制服丝袜香蕉在线| 亚洲国产精品成人久久小说| 国产永久视频网站| 99热6这里只有精品| 18在线观看网站| 一级黄片播放器| av在线观看视频网站免费| 日韩一区二区三区影片| 免费观看a级毛片全部| 国产乱来视频区| 制服丝袜香蕉在线| 在线观看免费高清a一片| 美女中出高潮动态图| 观看av在线不卡| 精品少妇内射三级| 国产免费又黄又爽又色| 少妇被粗大猛烈的视频| 免费黄色在线免费观看| 欧美日韩综合久久久久久| 高清av免费在线| 亚洲国产成人一精品久久久| 一区在线观看完整版| 午夜精品国产一区二区电影| 国产麻豆69| 久久久国产欧美日韩av| 精品久久国产蜜桃| 欧美老熟妇乱子伦牲交| 国精品久久久久久国模美| videosex国产| 日韩免费高清中文字幕av| 99国产综合亚洲精品| 日韩免费高清中文字幕av| 亚洲国产精品一区二区三区在线| 亚洲欧美日韩另类电影网站| 免费观看a级毛片全部| 国产爽快片一区二区三区| 国产一区二区在线观看日韩| 亚洲精品一区蜜桃| 亚洲精品色激情综合| 亚洲丝袜综合中文字幕| 黑人欧美特级aaaaaa片| 国产白丝娇喘喷水9色精品| 美女内射精品一级片tv| 亚洲国产精品一区二区三区在线| 亚洲五月色婷婷综合| 日本av免费视频播放| 久久久久久久大尺度免费视频| 最新中文字幕久久久久| 少妇精品久久久久久久| 日本猛色少妇xxxxx猛交久久| 少妇被粗大的猛进出69影院 | 丝瓜视频免费看黄片| 女人精品久久久久毛片| 精品一品国产午夜福利视频| 日韩伦理黄色片| 男人舔女人的私密视频| 一区二区av电影网| 成人综合一区亚洲| 国国产精品蜜臀av免费| 尾随美女入室| 嫩草影院入口| 国产福利在线免费观看视频| a级毛色黄片| 看非洲黑人一级黄片| av黄色大香蕉| 多毛熟女@视频| 欧美人与善性xxx| 毛片一级片免费看久久久久| 亚洲国产精品一区二区三区在线| 久久人人爽人人片av| 制服丝袜香蕉在线| 国产日韩欧美视频二区| 日韩不卡一区二区三区视频在线| 国产高清三级在线| 国产成人91sexporn| 晚上一个人看的免费电影| 亚洲欧美精品自产自拍| 久久 成人 亚洲| 看免费成人av毛片| av片东京热男人的天堂| 亚洲国产精品成人久久小说| 国产淫语在线视频| 久久久国产一区二区| 久久久精品免费免费高清| 日本vs欧美在线观看视频| 亚洲av综合色区一区| 极品人妻少妇av视频| 国产亚洲精品第一综合不卡 | 久久99热6这里只有精品| 夫妻性生交免费视频一级片| 国产1区2区3区精品| 热99久久久久精品小说推荐| 中文天堂在线官网| a 毛片基地| 日本av免费视频播放| 亚洲av在线观看美女高潮| 国产亚洲欧美精品永久| 99国产精品免费福利视频| videossex国产| 久久人妻熟女aⅴ| 国产精品熟女久久久久浪| 亚洲激情五月婷婷啪啪| 欧美 日韩 精品 国产| 成人国产av品久久久| 免费黄色在线免费观看| 免费播放大片免费观看视频在线观看| 日韩视频在线欧美| 黄色怎么调成土黄色| 插逼视频在线观看| 欧美成人午夜精品| 日韩不卡一区二区三区视频在线| 男女无遮挡免费网站观看| 综合色丁香网| 九九在线视频观看精品| 亚洲国产精品成人久久小说| 欧美日韩精品成人综合77777| 国产亚洲av片在线观看秒播厂| 日本猛色少妇xxxxx猛交久久| 亚洲成av片中文字幕在线观看 | 春色校园在线视频观看| 亚洲精品视频女| 精品99又大又爽又粗少妇毛片| 国产男人的电影天堂91| 精品少妇黑人巨大在线播放| 一区二区三区乱码不卡18| a级毛片在线看网站| 中文字幕亚洲精品专区| 日日撸夜夜添| 中国美白少妇内射xxxbb| 18禁在线无遮挡免费观看视频| 美女大奶头黄色视频| 999精品在线视频| 爱豆传媒免费全集在线观看| 深夜精品福利| 大香蕉97超碰在线| 欧美日韩视频精品一区| 一个人免费看片子| 成人18禁高潮啪啪吃奶动态图| 欧美最新免费一区二区三区| 久久久久国产精品人妻一区二区| 日本wwww免费看| 午夜视频国产福利| 久久 成人 亚洲| 天天操日日干夜夜撸| 国产精品蜜桃在线观看| 一边亲一边摸免费视频| 亚洲精品美女久久av网站| 中文字幕精品免费在线观看视频 | 男女午夜视频在线观看 | 黑丝袜美女国产一区| 少妇 在线观看| 久久精品久久久久久噜噜老黄| 国产亚洲精品第一综合不卡 | 99精国产麻豆久久婷婷| 亚洲精品第二区| 国产成人91sexporn| 蜜桃在线观看..| 国产免费一区二区三区四区乱码| 日韩av不卡免费在线播放| 日韩制服丝袜自拍偷拍| 午夜激情久久久久久久| 精品少妇黑人巨大在线播放| 亚洲精品日本国产第一区| 美女主播在线视频| 夜夜爽夜夜爽视频| 日日摸夜夜添夜夜爱| 22中文网久久字幕| 亚洲伊人色综图| 久久影院123| 中文字幕人妻丝袜制服| 丝袜在线中文字幕| 国产精品免费大片| 伦理电影大哥的女人| 日韩 亚洲 欧美在线| 日韩伦理黄色片| 两个人免费观看高清视频| 好男人视频免费观看在线| 国产精品国产三级专区第一集| 在线观看人妻少妇| 精品少妇久久久久久888优播| 午夜福利在线观看免费完整高清在| 久久99热6这里只有精品| 黑人高潮一二区| 成年人免费黄色播放视频| 中文乱码字字幕精品一区二区三区| 少妇被粗大猛烈的视频| 国产av一区二区精品久久| 亚洲国产最新在线播放| 国产黄频视频在线观看| 国产色婷婷99| 在线观看免费高清a一片| 午夜日本视频在线| 免费黄频网站在线观看国产| xxxhd国产人妻xxx| 国产成人精品无人区| 内地一区二区视频在线| 国产精品一区二区在线不卡| 久久人妻熟女aⅴ| 亚洲国产成人一精品久久久| 丰满乱子伦码专区| 色网站视频免费| 国产精品久久久久久精品古装| 精品午夜福利在线看| 性高湖久久久久久久久免费观看| 最近2019中文字幕mv第一页| 精品久久久久久电影网| 人人妻人人爽人人添夜夜欢视频| 精品一区二区三区视频在线| 日韩不卡一区二区三区视频在线| 宅男免费午夜| 国产精品嫩草影院av在线观看| 最新中文字幕久久久久| 国产精品不卡视频一区二区| 色婷婷久久久亚洲欧美| 国产精品秋霞免费鲁丝片| 国产伦理片在线播放av一区| 中文乱码字字幕精品一区二区三区| 亚洲美女视频黄频| 欧美激情 高清一区二区三区| av不卡在线播放| 看非洲黑人一级黄片| 久久毛片免费看一区二区三区| 午夜福利影视在线免费观看| 国产一区二区三区av在线| 久久国产精品男人的天堂亚洲 | 永久网站在线| 精品99又大又爽又粗少妇毛片| 亚洲av电影在线观看一区二区三区| 大话2 男鬼变身卡| 亚洲伊人色综图| 看免费av毛片| a级毛片在线看网站| av在线老鸭窝| 午夜福利视频精品| 欧美精品一区二区免费开放| 日韩欧美一区视频在线观看| 日韩免费高清中文字幕av|