
    Resource Scheduling Strategy for Performance Optimization Based on Heterogeneous CPU-GPU Platform

    2022-11-10 02:31:46 Juan Fang, Kuan Zhou, Mengyuan Zhang and Wei Xiang
    Computers, Materials & Continua, 2022, Issue 10

    Juan Fang, Kuan Zhou, Mengyuan Zhang and Wei Xiang

    1 Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China

    2 La Trobe University, Melbourne, VIC, 3086, Australia

    3 James Cook University, Cairns, QLD, 4878, Australia

    Abstract: In recent years, with the development of processor architecture, heterogeneous processors that combine a central processing unit (CPU) and a graphics processing unit (GPU) have become mainstream. However, because of the differences between heterogeneous cores, heterogeneous systems still face many unsolved problems. To address them, this paper focuses on the utilization and efficiency of heterogeneous cores and designs resource scheduling strategies accordingly. To improve system performance, it proposes a combination strategy for a single task and a multi-task scheduling strategy for multiple tasks. The combination strategy consists of two sub-strategies. The first improves the execution efficiency of tasks on the GPU by changing the thread organization structure. The second focuses on the working state of the efficient core and develops more reasonable workload balancing schemes to improve the resource utilization of heterogeneous systems. The multi-task scheduling strategy obtains the execution efficiency of the heterogeneous cores and global task information by processing task samples. Based on this information, an improved ant colony algorithm quickly produces a reasonable task allocation scheme that fully exploits the characteristics of the heterogeneous cores. The experimental results show that the combination strategy reduces task execution time by 29.13% on average. When processing multiple tasks, the multi-task scheduling strategy reduces the execution time by up to 23.38% relative to the combination strategy. Both strategies make better use of the resources of heterogeneous systems and significantly reduce task execution time.

    Keywords: Heterogeneous computing; CPU-GPU; Performance; Workload balance

    1 Introduction

    Nowadays, as GPUs deliver ever more powerful massively parallel computing performance [1], the heterogeneous CPU-GPU architecture has become the mainstream architecture. However, this architecture also faces many challenges [2]. For example, the differences between heterogeneous cores reduce resource utilization, which makes system performance lower than expected. Improving resource utilization is therefore an important research objective in heterogeneous computing [3]. To improve the resource utilization and performance of heterogeneous systems, many researchers have optimized the implementation of a specific application on a specific heterogeneous system. Wozniak et al. [4] measured the execution time of three different implementations of a specific application, Hash Join, and demonstrated the superiority of the CPU-GPU heterogeneous architecture. The idea of classifying GPU applications by kernel structure, proposed by Shen et al. [5], is worth referencing, but it is only effective for applications with a simple kernel structure. Rizvi et al. [6] improved the implementation of deep convolutional neural network applications and achieved remarkable optimization results. Although these schemes can significantly shorten execution time, they apply only to specific applications in specific environments.

    Mittal et al. [7] surveyed studies on heterogeneous computing and classified workload balancing strategies into static and dynamic ones. The related research on these two kinds of strategies is discussed next. A static strategy needs to estimate the execution time of the application. Jung et al. [8] proposed a scheme combining static prediction and dynamic inference; its prediction accuracy is affected by task size, and high accuracy is difficult to achieve. Alavani et al. [9] proposed a method based on code analysis for predicting the execution time of GPU Compute Unified Device Architecture (CUDA) kernels, but the predicted result still deviates from the measured result by more than 20%. Alsubaihi et al. [10] proposed a multi-objective optimization scheme for energy consumption and execution time. With the peak power and core temperature limited, they quantify the benefits in energy consumption and execution time and obtain a comprehensive optimal execution scheme using the particle swarm optimization algorithm. However, this scheme adds overhead, and the optimization effect is greatly reduced for small applications with short execution times. Li et al. [11] proposed an improved heterogeneous earliest finish time (HEFT) algorithm to improve the utilization of system resources, but the tasks it considers are too abstract to take advantage of specific core characteristics. Regarding dynamic workload balancing strategies, Vu et al. [12] found that when workload distribution is applied to irregular applications, the amount of computation cannot be accurately predicted. Belviranli et al. [13] found that processing larger data blocks each time can improve GPU utilization, but the workload between the CPU and GPU may become unbalanced. They therefore traded off GPU efficiency against resource utilization and achieved good results. This paper also focuses on the impact of input data attributes on efficiency, but the block size they refer to is the data size, whereas the block size in this paper affects the data structure. Navarro et al. [14] proposed a scheme that adapts to most applications, but its search phase may last until the end of execution for irregular applications, so it degrades performance in some special applications. Lin et al. [15] proposed a feedback-based workload distribution and frequency adjustment strategy, which does not consider how to reduce the execution time of the application on a specific core.

    However, the effect of these solutions changes with the heterogeneous system environment. Research on task scheduling in heterogeneous systems pays little attention to the execution efficiency of specific cores, so the processing capability of heterogeneous cores is not fully exploited [16]. Therefore, to improve the performance of CPU-GPU heterogeneous systems, this paper focuses on the resource utilization of the heterogeneous system and the execution efficiency of the application on a specific core. According to the different requirements of single-task and multi-task processing scenarios, two resource scheduling strategies are proposed: a single-task combination strategy and a multi-task scheduling strategy. On this basis, an improved ant colony algorithm is used to exploit the characteristics of heterogeneous cores and quickly obtain a reasonable task allocation scheme.

    2 Basic Method

    Heterogeneous systems often need to handle a large number of tasks [17]. In this case, minimizing the execution time of the entire task group requires a reasonable resource scheduling plan, obtained through global analysis once the characteristics of each task in the group are understood.

    The scheduling strategy for a single task generally uses task data as the operation granularity, whereas the scheduling strategy for multiple tasks generally uses the whole task as the operation granularity. The two granularities are quite different, and the latter is more advantageous when dealing with multiple tasks. The reason is shown in Fig. 1. In this example, task A executes more efficiently on the GPU and task B executes more efficiently on the CPU. The length of each rectangle represents the execution time of the task.

    Figure 1:Comparison of special situations when different kinds of strategies deal with multiple tasks

    When a task group containing task A and task B is executed under three different scenarios, the total execution time increases in the following order: the multi-task scheduling strategy, the combination strategy, and the GPU alone. The multi-task scheduling strategy beats the GPU alone because the CPU additionally processes some of the data, so the processing resources of the heterogeneous cores are fully used. It beats the combination strategy because, when the combination strategy executes task A (B), the CPU (GPU) processes its share inefficiently and the processing capability of the cores is not fully utilized. The multi-task scheduling strategy places tasks A and B on the cores that execute them most efficiently, rather than abstracting the heterogeneous cores into a single pool of processing power, and thus makes full use of their characteristics.

    Although the above illustration is a special case, only two preconditions must hold for the multi-task strategy to outperform the combination strategy when dealing with multiple tasks. First, the efficiency difference between the CPU and the GPU for each task cannot be too large. Second, when there is a certain difference in execution efficiency, the majority of tasks in the task group must not all favor the same kind of core. For any task group satisfying these two conditions, the scheduling strategy for multiple tasks achieves better results than the scheduling strategy for a single task. A multi-task scheduling strategy is therefore necessary.
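
    The ordering described for Fig. 1 can be checked with a small numeric sketch. The task times below are hypothetical values chosen for illustration, not data from the paper, and the combination strategy is modelled in an idealized way (each task's data is split so that both cores finish together, with no transfer or sampling overhead).

```python
def split_time(t_cpu, t_gpu):
    """Time to finish one task when its data is divided between CPU and GPU
    so that both cores finish together (idealized, no overhead)."""
    return 1.0 / (1.0 / t_cpu + 1.0 / t_gpu)

# Hypothetical whole-task execution times in seconds: (on CPU, on GPU).
# Task A favors the GPU, task B favors the CPU, as in the Fig. 1 example.
tasks = {"A": (8.0, 4.0), "B": (4.0, 8.0)}

# GPU alone: both tasks run one after the other on the GPU.
gpu_alone = sum(t_gpu for _, t_gpu in tasks.values())

# Combination strategy (modelled): tasks run one after the other,
# each one split across CPU and GPU.
combination = sum(split_time(t_cpu, t_gpu) for t_cpu, t_gpu in tasks.values())

# Multi-task strategy: A runs whole on the GPU while B runs whole on the CPU.
multi_task = max(tasks["A"][1], tasks["B"][0])

print(multi_task, combination, gpu_alone)
```

    Under these assumed numbers the multi-task placement finishes first because each core only ever runs the task it handles best, while the split execution always gives part of every task to its slower core.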

    3 Combination Strategy for Single Task

    3.1 Workload Balancing Strategy

    The innovation of this strategy is the addition of a priority protection measure, which protects the core that handles tasks faster and thus improves resource utilization and shortens task execution time. This paper designs two Workload Balancing (WB) strategies that take advantage of this measure: one uses the measure on its own, and the other combines it with the Block Size (BS) adjustment strategy mentioned earlier. The two strategies share some common processes, which are explained first in this section.

    Assume that the execution times of the application sample on the CPU and GPU are Tcpu and Tgpu, respectively, and that the workload distribution ratio between the GPU and the CPU is R. The protection measure proposed in this paper is as follows. When the CPU execution time of the sample far exceeds that of the GPU, R is increased so that the expected execution time of the CPU is slightly smaller than that of the GPU. When the CPU execution time is less than that of the GPU, R is reduced so that the expected execution time of the CPU is slightly larger than that of the GPU. This ensures that the utilization of the efficient core is not affected by the other core when dealing with applications that suit different cores. The adjustment of the workload distribution ratio is given in Eq. (1):

    In this equation, α is the efficiency gap threshold. When the execution efficiency gap between the heterogeneous cores exceeds this threshold, the workload allocated to the inefficient core is reduced, sacrificing the utilization of the inefficient core to maximize the utilization of the efficient core. β is the workload retention factor of the inefficient core, which determines how long the inefficient core waits. If β is set too large, the efficient core may have to wait briefly. If it is set too small, the utilization of the inefficient core will be too low. The optimal values of these parameters depend not on the hardware but on the application group being processed. Based on many tests, we obtained a usable set of parameter values, and in the subsequent experiments α and β were set to 10 and 0.95, respectively.
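
    Eq. (1) itself is not reproduced in this text, so the following is only a minimal sketch of one rule that matches the verbal description above. The function name `adjust_ratio` and the exact piecewise form are assumptions, not the paper's formula; only the values α = 10 and β = 0.95 come from the text.

```python
ALPHA = 10.0  # efficiency gap threshold (value used in the paper's experiments)
BETA = 0.95   # workload retention factor of the inefficient core

def adjust_ratio(t_cpu, t_gpu, alpha=ALPHA, beta=BETA):
    """Return R = (GPU workload) / (CPU workload).

    Assumed piecewise form, NOT the paper's Eq. (1):
    - balanced ratio is t_cpu / t_gpu (both cores would finish together);
    - if the CPU is slower by more than alpha, R is raised by 1/beta, so the
      CPU's expected time becomes beta times the GPU's (slightly smaller);
    - if the CPU is faster, R is lowered by beta, so the CPU's expected time
      becomes 1/beta times the GPU's (slightly larger).
    """
    if t_cpu <= 0 or t_gpu <= 0:
        raise ValueError("sample times must be positive")
    balanced = t_cpu / t_gpu
    if balanced > alpha:
        return balanced / beta   # protect the GPU (efficient core)
    if balanced < 1.0:
        return balanced * beta   # protect the CPU (efficient core)
    return balanced              # moderate gap: keep the balanced split

# With R in hand, the CPU receives 1/(1+R) of the work and the GPU R/(1+R).
r = adjust_ratio(20.0, 1.0)
cpu_share, gpu_share = 1.0 / (1.0 + r), r / (1.0 + r)
```

    In this sketch the efficient core always receives slightly more than its balanced share, so it is the inefficient core that finishes early and waits.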

    3.2 BS-WB Combination Strategy

    This combination strategy, which takes the block size adjustment strategy as its main body, is also divided into two phases: the search phase and the stable execution phase. The main changes are in the search phase. During the search phase, changing the block size changes the execution efficiency of the GPU, which in turn makes the execution efficiency ratio between the GPU and CPU change constantly. This makes it difficult to maintain workload balance among the heterogeneous cores. Therefore, each iteration in the search phase retests the sample to update the distribution ratio. The distribution ratio R for the stable execution phase is obtained as shown in Algorithm 1.

    Algorithm 1 Obtaining task distribution ratio in BS-WB combination strategy
    Input: Task A
    Output: Task distribution ratio R
    1: Initialization: set blocksize = 128, it = 0, AOC = amount of computation per iteration
    2: While blocksize < 1024 and it < number of iterations do
    3:     Execute AOC*0.01 on the CPU and AOC*0.01 on the GPU
    4:     Record Tcpu and Tgpu, and use Eq. (1) to calculate the task allocation ratio R
    5:     Execute AOC*0.98/(1+R) on the CPU and AOC*0.98*R/(1+R) on the GPU
    6:     Record the blocksize-Tcpu-Tgpu correspondence in the table
    7:     blocksize = blocksize + 64, it = it + 1
    8: End while
    9: Look up the table to get the Tcpu and Tgpu of the iteration round with the shortest task execution time, and use Eq. (1) to calculate the task distribution ratio R
    10: Return task distribution ratio R

    To ensure that the workload balancing strategy does not degrade performance under extreme conditions, the protection measure is applied after the sample test, and the distribution ratio is checked and adjusted again to protect the efficient core. In the search phase, the GPU execution efficiency and workload distribution ratio for each block size are updated before each iteration. Once the search phase is over, no sample testing is needed for subsequent iterations. The remaining data is processed by looking up the previous records and using the best block size and the corresponding workload distribution ratio, with the protection measure applied, until all iterations of the application are completed.
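
    The search phase of Algorithm 1 can be sketched as follows. This is an illustration under stated assumptions: `measure` and `ratio_fn` are hypothetical callables standing in for the 1% sample run and for Eq. (1), the round time is modelled rather than measured, and the demo timings are invented.

```python
def round_time(t_cpu, t_gpu, r):
    """Modelled duration of the 98% bulk phase, where t_cpu and t_gpu are the
    times for a 1% sample: the CPU gets 0.98/(1+r) and the GPU 0.98*r/(1+r)."""
    return 98.0 * max(t_cpu / (1.0 + r), t_gpu * r / (1.0 + r))

def search_distribution_ratio(measure, ratio_fn, iterations,
                              start=128, limit=1024, step=64):
    """Follow the loop of Algorithm 1 and return (best block size, R).

    measure(block_size) -> (t_cpu, t_gpu) for a 1% sample (hypothetical hook).
    ratio_fn(t_cpu, t_gpu) -> R, standing in for Eq. (1).
    """
    table = {}  # block size -> (t_cpu, t_gpu, modelled round time)
    block_size, it = start, 0
    while block_size < limit and it < iterations:
        t_cpu, t_gpu = measure(block_size)
        r = ratio_fn(t_cpu, t_gpu)
        table[block_size] = (t_cpu, t_gpu, round_time(t_cpu, t_gpu, r))
        block_size += step
        it += 1
    # Pick the round with the shortest execution time and recompute R from it.
    best = min(table, key=lambda b: table[b][2])
    t_cpu, t_gpu, _ = table[best]
    return best, ratio_fn(t_cpu, t_gpu)

# Demo with invented timings: the GPU is fastest at block size 512.
def fake_measure(block_size):
    return 4.0, 1.0 + abs(block_size - 512) / 512.0

best_block, ratio = search_distribution_ratio(
    fake_measure, lambda t_cpu, t_gpu: t_cpu / t_gpu, iterations=20)
```

    Passing the ratio rule in as a parameter keeps the search loop independent of the exact form of Eq. (1).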

    4 Multi-Task Scheduling Strategy Based on Ant Colony Algorithm

    4.1 Implementation of Multi-task Scheduling Strategy

    This section focuses on the implementation of the multi-task scheduling strategy proposed in this paper. Multi-task scheduling can be divided into two main steps: the first obtains information about each task, and the second determines the task allocation scheme based on that information. Accordingly, the proposed strategy has two parts. The first part is a sample test, which determines the expected execution time of each task in the task group on the current CPU and GPU and provides the basis for the global assignment of tasks. The second part takes the expected execution times as input and execution time as the optimization target, quickly obtains a task allocation scheme through the ant colony algorithm, and executes each task on the most appropriate core according to the scheme. The implementation steps of these two parts and their design principles are explained in detail below.

    As the number of cores in heterogeneous systems continues to increase, it becomes more difficult to determine the matching between tasks and cores. In this case, it is impractical to insist on finding the optimal task allocation scheme [18]. It has been shown that the ant colony algorithm can quickly produce a task processing scheme suitable for heterogeneous multi-core environments. In addition, to further reduce the pressure that the number of heterogeneous cores puts on task allocation, this paper divides all cores of the CPU/GPU heterogeneous system into a CPU group and a GPU group. Task allocation schemes are formulated for the groups rather than for individual cores.

    The algorithm is divided into two main phases:the initialization phase and the iteration phase.The improvement is in the iteration phase.The initialization phase of the algorithm is explained first.

    Step 1: Initialize the task-related parameters, such as the number of tasks in the task group and the estimated execution time of each task.

    Step 2: Initialize the parameters of the ant colony algorithm, which are set as shown in Tab. 1.

    Table 1:Initial value of parameters in the ant colony algorithm

    After the initialization phase, the iteration phase of the algorithm begins. To adapt to the current environment and obtain stable task allocation schemes, this paper makes two key improvements in this phase. The iteration phase of the improved ant colony algorithm is described in detail below.

    Step 3: The next task is taken from the task group and recorded as task_i, and the probability of selecting each type of core for the task is calculated. The probability p_ix of executing task_i on core_x is affected by execution time and pheromones. The influence of pheromones on the selection, Pp_ix, is calculated as in Eq. (2).

    In which |T| is the collection of currently optional cores, and ρ is the amount of pheromone left on the corresponding path. The influence of execution time on the selection, Pt_ix, is shown in Eq. (3).

    In which t_ix is the estimated execution time of task_i on core_x. To reduce the computational complexity and shorten the execution time of the algorithm, the calculation method of this step has been modified. The probability p_ix of placing task_i on core_x can be calculated by Eq. (4).

    In which g is the pheromone influence factor and Pp_ix is the pheromone influence calculated by Eq. (2).

    Step 4: Arrange the options in cells according to their probabilities, and make a random selection by generating a random number. Increment the task counter, and then check whether it has reached the maximum number of tasks. If not, return to step 3. Otherwise, reset the task counter to zero so that task acquisition in step 3 restarts from the beginning, and proceed to step 5.
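
    The random selection in this step is commonly implemented as roulette-wheel selection. The sketch below shows that standard technique; the function name `roulette_select` is illustrative, and whether the paper arranges its cells in exactly this way is an assumption.

```python
import random

def roulette_select(probabilities, rng=random):
    """Pick an index with probability proportional to probabilities[index].

    The options are laid out as consecutive cells whose widths equal their
    probabilities; a random number then falls into exactly one cell.
    """
    total = sum(probabilities)
    threshold = rng.random() * total
    cumulative = 0.0
    for index, p in enumerate(probabilities):
        cumulative += p
        if threshold < cumulative:
            return index
    return len(probabilities) - 1  # guard against floating-point round-off

# Example: choose between the CPU group (index 0) and the GPU group (index 1).
choice = roulette_select([0.3, 0.7], random.Random(42))
```

    Scaling the random number by the sum of the weights means the inputs do not have to be normalized beforehand.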

    Step 5: Update the local pheromone. Let the pheromone be ρ_ix. After the ant completes the current cycle, update the pheromone of the corresponding path with Eq. (5).

    Then increment the ant counter. If the ant counter is 1, the set of routes selected by the current ant and its estimated execution time are recorded directly as the local optimal solution. Otherwise, the new solution is compared with the recorded local optimal solution, and the one with the shorter execution time becomes the new local optimal solution. Next, if the ant counter equals the number of ants set in Tab. 1, the ant counter is cleared and the algorithm goes to step 6. Otherwise, it returns to step 3 and continues the iteration process.

    Step 6: Update the global pheromone with Eq. (6) and increment the iteration counter. If the iteration counter is 1, the scheme obtained in this cycle is recorded as the global optimal solution and the algorithm returns to step 3. Otherwise, go to step 7.

    Step 7: Calculate the execution time difference between the new solution and the old solution, update the pheromone of the path corresponding to the optimal solution using Eq. (7), and take the solution with the shorter execution time as the new global optimal solution.

    Step 8: Determine whether the iteration counter has reached the set number of iterations N. If not, return to step 3. Otherwise, the solution in the current record is the final result. The improved ant colony algorithm is shown in Algorithm 2.

    Algorithm 2 Improved ant colony algorithm
    Input: Task group to be processed
    Output: Task allocation scheme
    1: Initialize the ant colony algorithm-related parameters according to Tab. 1 (including task information such as the number of tasks in the task group)
    2: For iterationcounter = 0 to number of iterations do
    3:     For antcounter = 0 to number of ants do
    4:         For taskcounter = 0 to number of tasks do
    5:             Get the task and calculate the selection probability based on Eqs. (2)-(4)
    6:             Generate a random number to complete route selection
    7:         End for
    8:         Update local pheromones using Eq. (5)
    9:         Update the local optimal solution
    10:     End for
    11:     Update global pheromones using Eq. (6)
    12:     Adjust the pheromone of the corresponding path according to Eq. (7)
    13:     Update the global optimal solution
    14: End for
    15: Return the global optimal solution as the task allocation scheme

    At the end of these steps, the strategy has obtained the required task allocation scheme through the improved ant colony algorithm. Execution is completed by assigning tasks to the corresponding cores according to this scheme, which optimizes the execution time and improves the performance of the heterogeneous system.
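
    The loop structure of Algorithm 2 can be illustrated with a generic ant colony sketch that assigns each task to the CPU group or the GPU group. Eqs. (2)-(7) are not reproduced in this text, so every formula below (selection weight, local and global pheromone updates) is a textbook-style stand-in and an assumption, as are the function names, parameter values and task times.

```python
import random

def makespan(times, assignment):
    """Finish time when the CPU group (0) and GPU group (1) run in parallel."""
    loads = [0.0, 0.0]
    for (t_cpu, t_gpu), group in zip(times, assignment):
        loads[group] += t_gpu if group == 1 else t_cpu
    return max(loads)

def aco_allocate(times, n_ants=10, n_iters=50, g=1.0, evaporation=0.1, seed=0):
    """times[i] = (estimated CPU time, estimated GPU time) of task i.
    Returns (assignment, makespan); all update rules are assumed forms."""
    rng = random.Random(seed)
    tau0 = 1.0
    pher = [[tau0, tau0] for _ in times]  # pheromone per (task, group)
    best, best_cost = None, float("inf")
    for _ in range(n_iters):
        for _ in range(n_ants):
            assignment = []
            for i, t in enumerate(times):
                # Weight = pheromone^g / estimated time (stand-in for Eqs. 2-4).
                weights = [(pher[i][x] ** g) / t[x] for x in (0, 1)]
                pick = 0 if rng.random() * sum(weights) < weights[0] else 1
                assignment.append(pick)
            # Local update on the visited paths (stand-in for Eq. 5).
            for i, x in enumerate(assignment):
                pher[i][x] = (1.0 - evaporation) * pher[i][x] + evaporation * tau0
            cost = makespan(times, assignment)
            if cost < best_cost:
                best, best_cost = assignment, cost
        # Global update: evaporate, then reinforce the best path (Eqs. 6-7 stand-in).
        for i, x in enumerate(best):
            pher[i][0] *= 1.0 - evaporation
            pher[i][1] *= 1.0 - evaporation
            pher[i][x] += 1.0 / best_cost
    return best, best_cost

# Hypothetical estimated times from the sample test: (CPU, GPU) per task.
demo_times = [(8.0, 4.0), (4.0, 8.0), (6.0, 3.0), (3.0, 6.0)]
scheme, total = aco_allocate(demo_times)
```

    Working with two groups instead of individual cores keeps the pheromone table at two entries per task, which reflects the grouping described earlier in this section.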

    4.2 Differences between Multi-task Scheduling Strategy and BS-WB Combination Strategy

    In terms of optimization objectives, both strategies aim to reduce the execution time of tasks and improve system performance. The difference is that the BS-WB combination strategy focuses on a single task, and its ultimate goal is to complete that task in the shortest time. The multi-task scheduling strategy is designed for task groups composed of multiple tasks. It attends to overall resource management and makes up for the shortcoming of the former, which pays too much attention to the characteristics of individual tasks. Its design goal is to process large numbers of tasks in the shortest time.

    The core idea of the BS-WB combination strategy is to give priority to workload balancing: it tries to keep both the CPU and the GPU working throughout the processing of a task, and thereby ensures resource utilization. The main idea of the multi-task scheduling strategy is to give priority to the work efficiency of the cores, and to consider workload balance only after ensuring the maximum execution efficiency of the heterogeneous cores.

    Finally, the scope of application of both strategies is discussed. Although the combination strategy works as expected for a small number of tasks, factors such as the proportion of serial code and the difference in task execution efficiency between heterogeneous cores affect its effectiveness. The multi-task scheduling strategy plays a significant role in multi-task scenarios, as expected. Its drawback is that the sample test incurs additional overhead, which can severely reduce the benefit of the strategy. In summary, although tasks with large differences in CPU/GPU execution efficiency remain unsuitable, flexible use of these two strategies can effectively reduce the execution time for any number of tasks and thereby improve system performance.

    5 Experimental Results

    5.1 Experiment about Workload Balancing Strategy

    For the workload balancing strategy proposed above, the experiment first runs eight different applications directly on the GPU and records the execution time, then gradually increases the number of CPU cores enabled. As the number of CPU cores participating in task processing increases, the relative efficiency of the GPU and CPU changes gradually. In the five experiments, the number of enabled CPU cores was set to 2, 4, 8, 16, and 32, respectively.

    The effect of this strategy observed in the experiment is shown in Fig. 2. Since Nbody, Barnes-Hut, lud, lavaMD, and leukocyte are much more efficient on the GPU than on the CPU, the effect of workload balancing is not obvious, with reductions of only 0.41%, 5.74%, 3.14%, 0.84%, and 5.46%, respectively. Since the difference in execution efficiency between the GPU and CPU is less than an order of magnitude for Hotspot, the strategy works well there, giving a 14.83% reduction in total execution time. In addition, since the execution times of Kmeans and myocyte on the CPU do not differ significantly from those on the GPU, the strategy works best for these two instances: execution time is reduced by 58.01% and 79.12%, respectively, compared with the default GPU method.

    Figure 2:Effect of workload balancing strategy for different CPU configurations and different instances

    These experiments show that, owing to the protection measure, the effect of the workload balancing strategy is always positive, whatever task the system faces. Overall, the strategy reduced execution time by 20.94% on average. Looking at the reduction for each instance, the strategy is most effective for tasks with a small efficiency difference between the CPU and GPU.
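
    The reported average is consistent with the arithmetic mean of the eight per-application reductions quoted in this section, as the short check below shows. The dictionary simply restates the figures from the text.

```python
# Execution-time reductions (%) of the workload balancing strategy,
# as quoted in Section 5.1.
reductions = {
    "Nbody": 0.41, "Barnes-Hut": 5.74, "lud": 3.14, "lavaMD": 0.84,
    "leukocyte": 5.46, "Hotspot": 14.83, "Kmeans": 58.01, "myocyte": 79.12,
}

average = sum(reductions.values()) / len(reductions)
print(f"{average:.2f}%")  # prints 20.94%
```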

    5.2 Experiment About BS-WB Combination Strategy

    The four implementation strategies used in this experiment are as follows: the original GPU implementation (Original), the GPU implementation with the block size adjustment strategy (Block size adjustment), the CPU-GPU heterogeneous implementation with the workload balancing strategy that prioritizes the protection of efficient cores (Workload balance), and the BS-WB combination strategy (Combination strategy).

    The experimental result is shown in Fig. 3. For Nbody and lavaMD, the effect of the BS-WB combination strategy is not obvious. The workload balancing strategy is not applicable because of the difference in execution efficiency between the CPU and GPU, and since the default block size already suits the current GPU environment, the block size adjustment strategy is not applicable either. As a result, the execution time is reduced by only 1.01% and 3.18%, respectively, compared with the original GPU method. For Barnes-Hut, Kmeans, lud, and myocyte, the combination strategy produced significant optimization results, reducing execution time by 29.38%, 58.50%, 47.25%, and 78.82%, respectively. However, most of these benefits come from one of the two branch strategies, so they do not show the superiority of the combination strategy over its sub-strategies. The experiments on Hotspot and leukocyte best reflect the advantages of the combination strategy over its branch strategies. Both branch strategies achieve good results when used separately, and the final execution time is reduced by 19.98% and 14.97% when using the BS-WB combination strategy, which is significantly better than using either branch strategy alone.

    Figure 3:Effect comparison of different strategies in different applications

    The combination strategy proposed in this paper reduces the execution time of these applications by 29.13% on average compared with the default GPU method and significantly improves the performance of heterogeneous systems.

    5.3 Experiment About Multi-task Scheduling Strategy

    To test the effectiveness of the proposed multi-task scheduling strategy in different scenarios, this paper constructs four task groups that differ in the number of tasks and in the data size of the tasks, and uses them to evaluate the strategy. The BS-WB combination strategy was used to process the same workloads to observe the differences in the scope and effectiveness of the two strategies. The figure corresponding to the experiment contains the following items. MTS-CPU and MTS-GPU are the task execution times on the CPU and on the GPU, respectively, when using the multi-task scheduling strategy; comparing them shows whether the workload distribution is reasonable. MTS is the total time spent using the multi-task scheduling strategy; comparing MTS with MTS-CPU and MTS-GPU gives the additional overhead incurred by obtaining the task processing scheme. BW-WB is the total time spent using the combination strategy; comparing BW-WB with MTS gives the difference in effectiveness between the two strategies on the same tasks. Next, the composition of each task group and the reasons for it are presented, followed by data and diagrams illustrating the actual effect.

    For the first experiment, task group A is designed to test the effectiveness of the multi-task scheduling strategy when dealing with scenarios outside its scope of application. The tasks in this group are better suited to the GPU, so the multi-task scheduling strategy cannot guarantee workload balance. Task group B tests the impact of task group configuration on the optimization effect when workload balance can be ensured. The composition of task group A is shown in Tab. 2, and the composition of task group B is shown in Tab. 3.

    Table 2:Configuration of task group A

    Table 3:Configuration of task group B

    The experimental result is shown in Fig. 4. For task group A, the tasks took 17 s to execute on the CPU but 77.87 s on the GPU, which is a serious waste of resources. The execution time with the multi-task scheduling strategy is 4.46% longer than with the BS-WB combination strategy. Taking the cost of the sample test into account, the total time spent is 41.3% longer than with the BS-WB combination strategy. This shows that not all multi-task scenarios suit the multi-task strategy, so its scope of application must be considered.

    For task group B, the multi-task scheduling strategy forms a stable workload balance. In this case, the task execution time with the multi-task scheduling strategy is 11.12% less than with the combination strategy. However, because of the additional overhead of sample testing, the total time consumption of the multi-task scheduling strategy is still 7.3% more than that of the combination strategy. The main reason is that the CPU spends half of its working time executing tasks better suited to the GPU. For a multi-task scheduling strategy, it is therefore more important for each core to spend most of its time executing tasks that suit it than to pursue load balancing alone.

    For the other two task groups, task group C is designed to observe the effect of the multi-task scheduling strategy versus the BS-WB combination strategy when tasks can be executed on the most appropriate core. Task group D is the best-case design based on the results of the two previous experiments. It has a balanced proportion of CPU and GPU tasks, with 32 tasks in total, and is used to test the performance of the multi-task scheduling strategy relative to the BS-WB combination strategy under optimal conditions. The composition of task group C is shown in Tab. 4, and that of task group D is shown in Tab. 5.

    Figure 4:Effect of multi-task scheduling strategy when processing task group A(a),B(b),C(c),D(d)

    Table 4:Configuration of task group C

    Table 5:Configuration of task group D


    For task group C, the execution time of tasks under the multi-task scheduling strategy is much lower than under the combination strategy, with a relative reduction of 23.05%. Even after accounting for the sample test time before task allocation, the total time spent using this strategy is still 14.33% less than that of the combination strategy. This shows that if workload balance can be maintained while tasks are executed on the appropriate core, the multi-task scheduling strategy performs much better than the combination strategy.

    For task group D, the task execution times on the CPU and GPU differ by only about 1%, giving an excellent workload balance. Task execution time is 34.86% lower than with the BS-WB combination strategy and the sample testing time is relatively short, so the total time spent using the multi-task scheduling strategy is 23.38% lower than with the combination strategy. This shows that the multi-task scheduling strategy has a great advantage over the BS-WB combination strategy when dealing with task groups within its optimal scope of application.

    In summary, when faced with many different types of task groups, the BS-WB combination strategy and the multi-task scheduling strategy both significantly outperform execution on the GPU alone, and each has its own advantages and scope of application. Flexible use of the two strategies can ensure stable and efficient handling of multiple types of task groups by CPU/GPU heterogeneous systems. In particular, when the multi-task scheduling strategy handles scenarios within its scope of application, it can reduce the execution time by up to 23.38% compared with the BS-WB combination strategy, which demonstrates the impact and necessity of this multi-task scheduling strategy.
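
    The percentages reported for the four task groups can be compared side by side with a short Python sketch. It rests on two assumptions that the text suggests but does not state explicitly: both percentages for a group are measured against the same baseline (the time of the BS-WB combination strategy), and the total time is the task execution time plus the sample test time. Under those assumptions, the difference between the two percentages approximates the sample test cost as a share of the baseline. The names `reported` and `implied_sampling_overhead` are hypothetical.

```python
# Reported change relative to the BS-WB combination strategy, in percent.
# Negative means faster, positive means slower.
# Each entry is (task execution time change, total time change).
reported = {
    "A": (+4.46, +41.3),
    "B": (-11.12, +7.3),
    "C": (-23.05, -14.33),
    "D": (-34.86, -23.38),
}


def implied_sampling_overhead(exec_change_pct, total_change_pct):
    """Sample test cost as a percentage of the baseline time.

    Assumes total = execution + sample test, with both changes
    expressed against the same baseline.
    """
    return total_change_pct - exec_change_pct


for group, (exec_change, total_change) in reported.items():
    overhead = implied_sampling_overhead(exec_change, total_change)
    print(f"group {group}: execution {exec_change:+.2f}%, "
          f"total {total_change:+.2f}%, implied sampling cost {overhead:.2f}%")
```

    Read this way, the sample test would cost about 36.84% of the baseline time for group A, 18.42% for group B, 8.72% for group C and 11.48% for group D. These are derived values that hold only under the stated assumptions and are not figures reported by the experiments.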

    6 Conclusion

    This paper proposes several resource scheduling strategies to reduce task execution time on CPU-GPU heterogeneous systems. First, a combined strategy is proposed for single-task processing, which improves the efficiency of task execution on the GPU by adjusting the block size in the linear exploration process. Then, a multi-task scheduling strategy is proposed for processing multiple tasks. This strategy takes the efficiency of each core as the main consideration and uses an improved ant colony algorithm to obtain the task allocation scheme. Finally, we selected eight instances to test the resource scheduling strategies in a specific heterogeneous CPU-GPU environment. The experimental data demonstrate that the scope of application of these strategies is in line with our expectations and that the strategies achieve significant optimization when dealing with tasks within that scope. In future work, on the one hand, we will draw on commonly used energy consumption optimization methods, and on related research that considers energy consumption and execution time together, to further improve the strategy. On the other hand, there is still room to improve the block size adjustment strategy: the processing of irregular applications could be addressed by improving the dynamic parallel method, or the block size search process could be further optimized so that it finds the best solution in one iteration. Improving the block size strategy and adding an energy consumption factor can not only bring significant performance optimization in their respective application fields but also stably improve the performance of heterogeneous CPU-GPU systems when processing tasks, which is of great research significance.

    Acknowledgement: This work is supported by the Beijing Natural Science Foundation (4192007) and the National Natural Science Foundation of China (61202076), along with other government sponsors. The authors would like to thank the reviewers for their efforts and for providing helpful suggestions that have led to several important improvements in our work. We would also like to thank all teachers and students in our laboratory for helpful discussions.

    Funding Statement: This work is supported by the Beijing Natural Science Foundation [4192007], the National Natural Science Foundation of China [61202076], and Beijing University of Technology Project No. 2021C02.

    Conflicts of Interest: The authors declare that they have no conflicts of interest to report regarding the present study.
