    Application of material-mesh algebraic collapsing acceleration technique in method of characteristics–based neutron transport code

    2021-09-03 08:14:06 Ming Dai, Mao-Song Cheng
    Nuclear Science and Techniques, 2021, Issue 8

    Ming Dai · Mao-Song Cheng

    Abstract The algebraic collapsing acceleration (ACA) technique maximizes the use of the geometric flexibility of the method of characteristics (MOC). The spatial grids for low-order ACA are the same as those of the high-order transport calculation, which makes the numerical solution of the ACA equations costly, especially for large problems. To speed up the MOC transport iterations effectively for general geometry, a coarse-mesh ACA method that selectively merges fine-mesh cells with identical materials, called material-mesh ACA (MMACA), is presented. An energy group batching (EGB) strategy in the tracing process is proposed to increase the parallel efficiency for microscopic cross-section problems. Microscopic and macroscopic cross-section benchmark problems are used to validate and analyse the accuracy and efficiency of the MMACA method. The maximum errors in the multiplication factor and pin power distribution occur in the VERA-4B-2D case with silver-indium-cadmium (AIC) control rods inserted and are 104 pcm and 1.97%, respectively. Compared with the single-thread ACA solution, the maximum speed-up ratio reached 25 on 12 CPU cores for the microscopic cross-section VERA-4-2D problem. For the C5G7-2D and LRA-2D benchmarks, the MMACA method reduces the computation time by approximately one half. The present work proposes the MMACA method and demonstrates its ability to effectively accelerate MOC transport iterations.

    Keywords Algebraic collapsing acceleration · Material-mesh ACA · Method of characteristics · OpenMP

    1 Introduction

    Owing to its excellent geometric flexibility, suitability for large-scale problems and good parallelism, the method of characteristics (MOC) is one of the mainstream methods employed for neutron transport calculation. However, the convergence of scattering source iterations is slow, so efficient numerical algorithms that speed up the convergence, together with multicore parallel technologies, are useful. This study focuses on effectively accelerating MOC source iterations for an arbitrary geometry.

    The convergence acceleration algorithm generally introduces an efficient low-order approximate solution of the transport equation to speed up the scattering source iteration, such as the coarse-mesh finite difference (CMFD) [1] and diffusion synthetic acceleration (DSA) [2]. The CMFD method is widely adopted and developed for pressurized water reactors (PWRs). Many improved algorithms have been proposed to enhance the convergence rate or stability, such as partial current-based CMFD (pCMFD) [3], optimally diffusive CMFD (odCMFD) [4] and linear prolongation CMFD (lpCMFD) [5]. The PerMOC code performed CMFD in adjoint mode to accelerate the adjoint MOC kernel in the thermal-up-scattering-like iteration scheme [6]. Recently, an equivalent angular flux nonlinear finite difference (ANFD) equation was established to update the MOC incident angular flux and sources directly [7], based on which a novel acceleration technique is expected to be developed to exceed the CMFD performance. Limited by the finite-difference method, these schemes are only suitable for regular geometries. To expand the geometric adaptability of the CMFD method, unstructured CMFD (uCMFD) [8] was developed for unstructured polygonal meshes and generalized CMFD (gCMFD) [9, 10] for general geometries. Nevertheless, the uCMFD method is unsuitable for arbitrary geometries, and gCMFD depends on a width factor adjusted according to five empirical conditions.

    The algebraic collapsing acceleration (ACA) technique [11–15] is a variant of the DSA method. Its solution grid is the same as the fine-mesh grid of the transport equation. The ACA equations, which have a sparse coefficient matrix, are constructed approximately from the characteristic line equations. The ACA has good properties in terms of its convergence speed and stability and is suitable for arbitrary geometries. Spectral radius studies [11, 12] showed that convergence can be guaranteed for any optical length; the larger the optical length, the smaller the spectral radius. Divergence due to the large optical length in CMFD can be avoided, and this is also an advantage of the DSA method over CMFD [16]. Owing to the complex proximity relationship between nodes in unstructured meshes, the number of non-zero elements in the ACA coefficient matrix is increased, which makes the convergence rate of the solution lower than that of the tridiagonal matrix in CMFD. Using the same grids as the transport calculation increases the number of calculations and memory requirements. The efficiency of solving the ACA equations is sensitive to the fine-mesh size, especially in the case of microscopic cross-section problems. In this case, the solution efficiency decreases [12], and the ACA solution time can even exceed the ray tracing time. Therefore, it is necessary to study a coarse-mesh method to reduce the computational time and memory requirements of the low-order ACA.

    Larsen and Kelley studied the relationship between coarse-mesh DSA (CMDSA) and CMFD [16]. The CMDSA used volume homogenization for the cross-sections in the coarse-mesh cell, and a uniform distribution was used for the flux prolongation of the coarse-mesh cells to the fine-mesh FSRs. For a single track, the ACA equations are equivalent to the characteristic line equations, but the adoption of any homogenization causes them to be inequivalent. Byambaakhuu proposed discontinuous Galerkin DSA (DG-DSA) with coarse-mesh grids, which uses DG discretized coarse-mesh diffusion equations to accelerate the solution of the S_N transport equation with discontinuous finite-element discretization [17]. DG-DSA does not involve cross-section homogenization, but adjusts the mesh size or polynomial order according to the total cross-section of the material. Because the ACA equations are derived from MOC, and the ACA technique has strong geometric adaptability, this idea of broadening the mesh by material mesh is very suitable for ACA. Santandrea studied the DSA acceleration of eigenvalue problems in the MOC [18]. The computational efficiency of the ACA without coarser grids will further deteriorate because the power iteration for solving the ACA equations will take more time.

    Because the ACA method is suitable for general geometry, this paper proposes a coarse-mesh ACA by selectively merging some fine-mesh cells with the same material, called material-mesh ACA (MMACA). In this way, the homogenization operation is avoided. The MMACA uses coarse-mesh grids to solve low-order ACA equations, which reduces the number of ACA grids to improve the efficiency of the solution and decreases the size of its coefficient matrix to meet the memory requirements of parallel computing.

    The remainder of this paper is organized as follows. In Sect. 2, the basic solution process of the MOC is introduced. The derivation of the elemental equations of the MMACA method is presented in Sect. 3, and the energy group batching (EGB) strategy in the parallel process of ray tracing is presented in Sect. 4. Benchmark validation and acceleration performance analyses are presented in Sect. 5, and Sect. 6 concludes the paper.

    2 Method of characteristics

    The steady-state neutron transport equation is written in the following matrix form:

    where L is the neutron leakage and collision coefficient matrix, H is the scattering source coefficient matrix, F is the fission source coefficient matrix, λ is the eigenvalue, and Φ is the neutron flux. Equation (1) can be transformed into an eigenvalue problem as follows:
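
    For orientation, one arrangement consistent with the symbol definitions above is sketched below. It is an assumption reconstructed from those definitions, not a quotation of the numbered equations; in particular, taking A = L − H is a guess motivated by the later remark about inverting A.

```latex
\begin{aligned}
L\,\Phi &= H\,\Phi + \lambda\,F\,\Phi, \\
A\,\Phi &= \lambda\,F\,\Phi, \qquad A = L - H .
\end{aligned}
```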

    For the multi-group equations of practical problems, the direct inversion of A is inefficient; thus, an iterative method is usually used. The problem is transformed into a fixed-source problem to solve

    Equation (4) can be solved by the conventional split iteration method:

    The necessary condition for the convergence of Eq. (5) is that the spectral radius of the iteration matrix P_free = L^{-1}H should be smaller than 1.
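
    The convergence condition can be illustrated on a toy system. The sketch below assumes the split iteration has the form L Φ_new = H Φ_old + q with a diagonal L; the 2×2 matrices are hypothetical numbers chosen only for illustration, and the function names are not from any transport code.

```python
def split_iteration(l_diag, h, q, tol=1e-10, max_iter=10_000):
    """Iterate L*phi_new = H*phi_old + q for a diagonal L (toy system)."""
    n = len(q)
    phi = [0.0] * n
    for iteration in range(1, max_iter + 1):
        new = [
            (sum(h[i][j] * phi[j] for j in range(n)) + q[i]) / l_diag[i]
            for i in range(n)
        ]
        change = max(abs(a - b) for a, b in zip(new, phi))
        phi = new
        if change < tol:
            return phi, iteration
    raise RuntimeError("split iteration did not converge")


def spectral_radius_2x2(m):
    """Largest eigenvalue magnitude of a real 2x2 matrix."""
    trace = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    disc = trace * trace - 4.0 * det
    if disc >= 0.0:
        root = disc ** 0.5
        return max(abs((trace + root) / 2.0), abs((trace - root) / 2.0))
    return abs(det) ** 0.5  # complex-conjugate pair: |lambda|^2 = det


# Hypothetical data: L is the identity, so P_free = L^{-1} H = H.
L_DIAG = [1.0, 1.0]
H = [[0.5, 0.2], [0.3, 0.6]]
Q = [1.0, 1.0]

rho = spectral_radius_2x2(H)
phi, n_iter = split_iteration(L_DIAG, H, Q)
```

    With a spectral radius close to 1 the iteration count grows quickly, which is the slow convergence that the acceleration methods discussed in this paper target.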

    The MOC approach uses Eq. (3) and Eq. (5) to solve the neutron transport equation. Unlike other methods, MOC does not explicitly construct the matrix L or invert it. First, the source term is constructed as

    where α_{m,T}, β_{m,T} and γ_{m,T} are coefficients that are related to the optical length, and l_{m,T} is the length of the m-th segment on track T in the FSR N_m.

    The angular fluxes of all FSRs can be obtained by tracing all the characteristic lines according to Eq. (7). The incoming boundary currents are obtained using the boundary conditions. For a non-vacuum boundary, the boundary currents must also be converged iteratively. The required scalar neutron fluxes are obtained by a weighted sum of all angular fluxes, as follows:

    where p is the index of the angle discretization, and w_p is the weight of the angle p.
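
    The tracing and summation steps can be sketched in a few lines. This sketch assumes a flat source and the exact exponential (step-characteristic) attenuation along each segment, which is a common textbook choice and not necessarily the coefficient form used here; the normalisation of the scalar flux (a weighted average over all segment contributions) is also an assumption, and all names are hypothetical.

```python
import math


def sweep_track(segments, sigma_t, source, psi_in):
    """Trace one track. segments is a list of (fsr_index, length).

    Returns the outgoing angular flux and, per segment, the tuple
    (fsr_index, length, segment-averaged angular flux).
    """
    contributions = []
    psi = psi_in
    for fsr, length in segments:
        tau = sigma_t[fsr] * length              # optical length of the segment
        q_over_sigma = source[fsr] / sigma_t[fsr]
        psi_out = q_over_sigma + (psi - q_over_sigma) * math.exp(-tau)
        psi_avg = q_over_sigma + (psi - psi_out) / tau
        contributions.append((fsr, length, psi_avg))
        psi = psi_out
    return psi, contributions


def scalar_flux(track_results, n_fsr):
    """Weighted average of segment-averaged angular fluxes per FSR.

    track_results is a list of (weight, contributions); the weight is
    assumed to combine the angular weight w_p and the track width.
    """
    numerator = [0.0] * n_fsr
    denominator = [0.0] * n_fsr
    for weight, contributions in track_results:
        for fsr, length, psi_avg in contributions:
            numerator[fsr] += weight * length * psi_avg
            denominator[fsr] += weight * length
    return [n / d if d > 0.0 else 0.0 for n, d in zip(numerator, denominator)]


# Hypothetical two-region track: region 0 is in equilibrium with its source,
# region 1 is a pure absorber.
SIGMA_T = [1.0, 2.0]
SOURCE = [0.5, 0.0]
psi_exit, contribs = sweep_track([(0, 1.0), (1, 0.5)], SIGMA_T, SOURCE, psi_in=0.5)
flux = scalar_flux([(1.0, contribs)], n_fsr=2)
```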

    From the above, it can be seen that ray tracing once is equivalent to left multiplying the source term in Eq. (6) by L^{-1}. For practical problems, the split-iteration convergence of Eq. (5) is slow, and the global energy group rebalancing method and the Livolant method [15] can accelerate the convergence to a certain extent, but the effect is limited. In this paper, the ACA method is studied and a coarse-mesh ACA based on a material mesh is proposed.

    3 Material-mesh algebraic collapsing acceleration

    The ACA method is a variant of DSA. It is derived from the MOC method: a low-order neutron transport description similar to the diffusion equation is obtained by an algebraic collapsing approximation, which applies to arbitrary geometry. Because only adjacent FSRs are coupled, the coefficient matrix has good sparsity. The construction of the coefficient matrix can be accelerated using the tracking merging technique (TMT) [19]. This paper proposes a material-mesh ACA method to improve the efficiency of the ACA method.

    3.1 Algebraic collapsing acceleration

    ACA is a preconditioned Richardson iteration. Equation (5) is left multiplied by L^{-1}, and we obtain

    Equation (10) is the same as Eq. (5). The solution of this free iteration can be obtained using the MOC method. In synthetic acceleration, the new iterative solution is achieved by the additive correction of the free iterative solution:

    Equation (13) is a fixed-source problem, similar to Eq. (5). The difference is that the source term is the scattering source of the flux residual instead of the fission source. Equation (13) is as difficult to solve as Eq. (5). The ACA method constructs a simplified system for Eq. (13) by applying the algebraic collapsing approximation directly to the equations for the even symmetric part of the corrective fluxes. Like diffusion equations, these equations have a strongly sparse coefficient matrix, and they are derived from the basic equations of the MOC method.
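
    The origin of the residual source can be made explicit with a short derivation. The notation below is assumed for this sketch rather than taken from the numbered equations: $\Phi^{(n)}$ is the previous iterate, $\Phi^{(n+1/2)}$ the free-iteration result, $q$ the fixed source and $\delta\Phi$ the exact additive correction.

```latex
\begin{aligned}
L\,\Phi^{(n+1/2)} &= H\,\Phi^{(n)} + q, \\
(L - H)\,\Phi &= q, \qquad \Phi = \Phi^{(n+1/2)} + \delta\Phi, \\
(L - H)\,\delta\Phi &= q - (L - H)\,\Phi^{(n+1/2)}
                     = H\left(\Phi^{(n+1/2)} - \Phi^{(n)}\right).
\end{aligned}
```

    The last line shows why the exact correction problem has the same operator as the original fixed-source problem, and hence why a cheaper approximate operator is needed for the correction to pay off.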

    where -T denotes the direction opposite to the characteristic line T. The angular and volume integrals of the even symmetric part are equal to the corrective scalar flux. The relationship between the segment-averaged angular flux and the segment-boundary angular flux is obtained by eliminating the source term from Eq. (7) for MOC. The same holds for the -T direction. Then, the relationship between the even- and odd-symmetry parts of the segment-averaged angular flux and those of the segment-boundary angular flux can be obtained by combining the definition of Eq. (14) as

    where \hat{q}_{N_m} includes fixed and scattering source terms. A similar expression can be obtained for the -T direction. Then, adding or subtracting the two equations, dividing them by two and combining with Eq. (14) gives

    By combining Eq. (15) and Eq. (17), we can obtain equations that contain only the even symmetric part of the corrective flux. As an example, take the m-th segment, which is away from the boundary surface. If we let the segment in front be j = m - 1 and the segment behind be l = m + 1, we can write

    Equation (18) implies that adjacent regions are connected by characteristic lines, and its coefficient matrix is sparse. Each term of Eq. (18) is integrated as Eq. (8), and the algebraic collapsing approximation is introduced on both sides of the equation. This approximation means that the integral of the product of two functions is approximately the product of their respective integrals, or the assumption of an isotropic even symmetric part of the corrective flux is adopted. For example, for the first term in Eq. (18), we can write

    where I_proj is the projection matrix from the corrective system to the actual neutron transport system. The source term \hat{q}_{N_m} in Eq. (18) includes the corrected flux scattering source term, in addition to the flux residual on the right side of Eq. (20); thus, D is related to the coefficients on both sides of Eq. (18). The biconjugate gradient stabilized method (BICGSTAB) with a left incomplete LU (ILU0) preconditioner can be used to solve Eq. (20). To reduce the non-zero fill-in items in the ILU0 decomposition, the reversed ordering of the breadth-first search (BFS) algorithm can be used to adjust the order of the elements of the corrective flux vector [20].
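
    The reordering idea can be sketched on a small graph. The code below is a plain reversed breadth-first ordering on a hypothetical adjacency list; it omits the degree-based tie-breaking used by Cuthill-McKee-type orderings, and whether the cited algorithm includes such refinements is not stated here. The bandwidth function is only a rough proxy for fill-in.

```python
from collections import deque


def reverse_bfs_order(adjacency, start=0):
    """Return node indices in reversed breadth-first visiting order."""
    n = len(adjacency)
    seen = [False] * n
    order = []
    # Restart from unvisited nodes so disconnected regions are covered too.
    for root in [start] + [i for i in range(n) if i != start]:
        if seen[root]:
            continue
        seen[root] = True
        queue = deque([root])
        while queue:
            node = queue.popleft()
            order.append(node)
            for neighbour in sorted(adjacency[node]):
                if not seen[neighbour]:
                    seen[neighbour] = True
                    queue.append(neighbour)
    order.reverse()
    return order


def bandwidth(adjacency, order):
    """Maximum distance between coupled unknowns under a given ordering."""
    position = {node: k for k, node in enumerate(order)}
    return max(
        (abs(position[i] - position[j])
         for i in range(len(adjacency)) for j in adjacency[i]),
        default=0,
    )


# Hypothetical chain of regions 0-3-1-4-2 with scrambled numbering.
ADJACENCY = [[3], [3, 4], [4], [0, 1], [1, 2]]
new_order = reverse_bfs_order(ADJACENCY)
```

    For this chain, the natural numbering couples unknowns up to three positions apart, while the reordered numbering makes the matrix tridiagonal.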

    By combining Eq. (10), Eq. (11) and Eq. (20), we can write

    3.2 Tracking merging technique

    The TMT technique [19] compresses the characteristic lines that pass through the same sets of FSRs into one merged track, and the contribution of that merged track to the components of the ACA coefficient matrix is then computed. For the m-th segment, the weight of the merged track is the sum of the weights of all the compressed characteristic lines, and the length of the merged track is its weighted average value:

    where TMT represents the merged track. In practical problems, the density of characteristic lines is large, and there is a specific proportion of the characteristic lines that pass through the same sets of FSRs. TMT reduces the number of tracks when solving the coefficient matrix, so it can improve the efficiency with which the coefficient matrix is solved, and it can also reduce the deviation introduced by algebraic collapsing approximation to a certain extent.
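
    The merging rule can be sketched as follows. The track representation (a weight plus a list of (FSR, length) segments) is hypothetical, and "the same sets of FSRs" is interpreted here as the same ordered sequence of FSRs, which is an assumption.

```python
def merge_tracks(tracks):
    """Merge tracks that cross the same ordered sequence of FSRs.

    tracks is a list of (weight, [(fsr_index, length), ...]).
    The merged weight is the sum of the weights; each merged segment
    length is the weight-averaged length of the original segments.
    """
    groups = {}
    for weight, segments in tracks:
        key = tuple(fsr for fsr, _ in segments)
        entry = groups.setdefault(key, [0.0, [0.0] * len(segments)])
        entry[0] += weight
        for m, (_, length) in enumerate(segments):
            entry[1][m] += weight * length
    merged = []
    for key, (weight_sum, weighted_lengths) in groups.items():
        segments = [(fsr, wl / weight_sum) for fsr, wl in zip(key, weighted_lengths)]
        merged.append((weight_sum, segments))
    return merged


# Hypothetical example: the first two tracks cross FSRs 0 then 1 and are merged.
TRACKS = [
    (1.0, [(0, 2.0), (1, 1.0)]),
    (3.0, [(0, 1.0), (1, 2.0)]),
    (2.0, [(0, 1.5)]),
]
merged_tracks = merge_tracks(TRACKS)
```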

    3.3 Material-mesh ACA

    The basis of the ACA derivation is given by Eq. (15) and Eq. (17), which are derived from Eq. (7) and Eq. (16), respectively. Equation (7) is the basic equation of the MOC method and is derived from the integral form of the neutron transport equation. Taking s as the distance from any point on the m-th segment to the incoming boundary of this segment, the angular neutron flux of any point on the m-th segment can be written as

    where s* is the distance between the source point and the target point.

    The premise of Eq. (7), which is derived from Eq. (24), is that the cross-section and source term of the m-th segment remain unchanged. To keep the cross-section unaltered, the most direct method is to select the material mesh; that is, FSRs with the same material can be assigned to the same coarse region. Because the coefficient matrix is computed by ray tracing, all FSRs in the same coarse region can form any geometric shape, including a concave shape or a group of discretely distributed shapes. It is worth noting that it is difficult to construct a homogenization method that makes the system equivalent before and after homogenization. For example, the optical lengths before and after volume homogenization are markedly different. To keep the source term unchanged, let M denote the coarse-mesh segment index on track T and replace the source term in Eq. (24) with the volume-averaged source term of the FSRs in the same coarse region, which is equivalent to using the algebraic collapsing approximation. The result can be expressed as:

    We can deduce the basic Eq. (7) for the coarse-mesh segments from Eq. (25). Another basic Eq. (16) for the coarse-mesh segments can be derived in the same way. Then, ACA equations similar to Eq. (20) can be obtained.

    The coefficient matrix and equation of the MMACA can be solved with only a few changes in the ACA.

    1. Before the coefficient matrix is solved, the track information of the fine-mesh grids is transformed into that of the material mesh. The adjacent segments with identical materials are merged, leading to the modification of the number of elements, region numbering and the length of segments.

    2. When solving the equation, the scalar neutron fluxes of the last iteration and current free iteration are converted into those averaged by the material mesh; thus, the source term computed in each coarse region is the volume-averaged source term of the corresponding FSRs.

    3. The calculated coarse-mesh correction fluxes are used to correct the free iteration fluxes of all FSRs in the same coarse region. The uniform source term is used for each FSR in the same coarse region, and the coarse-mesh correction fluxes are then returned to each FSR homogeneously.
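
    The three modifications listed above can be sketched with small helper functions. The data layout (a list mapping each FSR to a coarse region, and tracks as lists of (FSR, length) segments) and the function names are hypothetical; this is an illustration of the bookkeeping, not the ThorLAT implementation.

```python
def coarsen_track(segments, fsr_to_coarse):
    """Step 1: merge adjacent segments that lie in the same coarse region."""
    coarse = []
    for fsr, length in segments:
        region = fsr_to_coarse[fsr]
        if coarse and coarse[-1][0] == region:
            coarse[-1] = (region, coarse[-1][1] + length)
        else:
            coarse.append((region, length))
    return coarse


def restrict(values, volumes, fsr_to_coarse, n_coarse):
    """Step 2: volume-average fine-mesh values over each coarse region."""
    weighted = [0.0] * n_coarse
    volume = [0.0] * n_coarse
    for fsr, value in enumerate(values):
        region = fsr_to_coarse[fsr]
        weighted[region] += volumes[fsr] * value
        volume[region] += volumes[fsr]
    return [w / v for w, v in zip(weighted, volume)]


def prolong_correction(fine_flux, coarse_correction, fsr_to_coarse):
    """Step 3: add each coarse correction uniformly to its member FSRs."""
    return [
        phi + coarse_correction[fsr_to_coarse[fsr]]
        for fsr, phi in enumerate(fine_flux)
    ]


# Hypothetical mapping: FSRs 0, 1 and 3 share a material (coarse region 0).
# FSR 3 is not adjacent to FSRs 0 and 1 on this track, so it stays a
# separate segment even though it belongs to the same coarse region.
FSR_TO_COARSE = [0, 0, 1, 0]
coarse_track = coarsen_track([(0, 1.0), (1, 0.5), (2, 2.0), (3, 0.25)], FSR_TO_COARSE)
coarse_flux = restrict([1.0, 3.0, 5.0, 2.0], [1.0, 1.0, 2.0, 2.0], FSR_TO_COARSE, 2)
corrected = prolong_correction([1.0, 3.0, 5.0, 2.0], [0.5, -1.0], FSR_TO_COARSE)
```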

    In terms of the matrix-form equations, the interpolation matrix and projection matrix of Eq. (11), Eq. (20) and Eq. (21) are different from those of the original ACA method. In addition to considering the change in the order of elements to reduce the non-zero filling items in ILU0 decomposition, the corresponding mapping between the fine-mesh grids and the coarser material mesh should also be reflected in those two matrices. Owing to the algebraic collapsing approximation, the coarser the grid, the larger will be the deviation. Therefore, the material mesh should be selected properly.

    4 Energy group batching strategy in the parallel process of ray tracing

    At present, the primary method of parallel computing is to use message passing parallelism between nodes and shared-memory parallelism within a single node. Its purpose is to reduce the memory consumption of a single node and the communication time between the processors. OpenMP is a widely used shared-memory programming model that can be controlled by compiler directives, API functions and environment variables. OpenMP is easy to use, but because of the opacity of its interfaces, it is not easy to achieve good parallel scalability. OpenMP has been widely used to parallelize MOC codes. So far, the mainstream Intel multicore CPU adopts a three-level cache architecture in which the last-level cache, L3, is shared. Competition for the shared L3 cache is inevitable in multicore parallel execution, especially for random access to memory data. During ray tracing, the access to FSRs is unordered, which makes the L3 competition important for problems with numerous energy groups. One way to handle this problem is to reduce the working set size of the core code. A simple and convenient way to achieve this goal is to process ray tracing in energy group batches. To reduce the cost of the repeated construction or destruction of the parallel sections, the loops over the energy group batches (EGB) and the characteristic lines are combined by the COLLAPSE clause to allocate tasks. The pseudo-code is expressed as follows:

    !$OMP PARALLEL &
    !$OMP PRIVATE(…)
    !$OMP DO REDUCTION(+:phi) COLLAPSE(2)
    DO ibatch = 1, nbatch      ! loop over the energy group batches
      DO iline = 1, nline      ! loop over the characteristic lines
        Reading_tracks()       ! get the information of the characteristic line
        Tracing_process()      ! ray tracing through the polar angle loop and the energy group loop
      ENDDO
    ENDDO
    !$OMP END DO
    !$OMP END PARALLEL

    The dimension of the scalar flux phi is the number of grids multiplied by the number of energy groups, and its size can reach the capacity of the L3 cache. The REDUCTION clause is used in the pseudo-code above to generate a private copy of each modified variable for each thread, so there is no false-sharing problem. However, if the EGB is not adopted, the working set of each thread will be near the L3 capacity, and there will be competition for the shared L3 among multiple threads when FSRs are accessed in a nearly random manner. The use of EGB is equivalent to refining the task size. The work originally completed by one thread is further subdivided into N batches, which are completed by different threads in parallel. The working set of the parallel computation is then 1/N of the original, leading to a reduction in the competition for the shared L3. Although the cost of repeated construction or destruction of the parallel section can be effectively avoided by a COLLAPSE clause, the addition of an outer loop increases the number of operations, such as repeatedly reading the track information. There is therefore an optimal choice for the number of batches N.
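
    The bookkeeping behind the batching can be sketched in Python. The sketch only illustrates how the groups are split into batches, how the two collapsed loops form one flat task list, and how the per-batch flux working set shrinks; it assumes 8-byte values, uses the 3157 FSRs and 361 groups quoted for the 1/8 VERA-2A model as example sizes, and does not model threads or caches. All function names are hypothetical.

```python
def group_batches(n_groups, groups_per_batch):
    """Split group indices 0..n_groups-1 into consecutive batches."""
    return [
        range(start, min(start + groups_per_batch, n_groups))
        for start in range(0, n_groups, groups_per_batch)
    ]


def collapsed_tasks(n_batches, n_lines):
    """Flattened (batch, line) iteration space, as distributed by COLLAPSE(2)."""
    return [(batch, line) for batch in range(n_batches) for line in range(n_lines)]


def flux_working_set_bytes(n_regions, groups_in_batch, bytes_per_value=8):
    """Memory touched by the scalar flux for one batch of energy groups."""
    return n_regions * groups_in_batch * bytes_per_value


batches = group_batches(n_groups=361, groups_per_batch=10)
tasks = collapsed_tasks(n_batches=len(batches), n_lines=4)  # 4 lines for brevity
full_set = flux_working_set_bytes(3157, 361)
batch_set = flux_working_set_bytes(3157, 10)
```

    Each (batch, line) task re-reads the track information for its line, which is the extra cost that limits how small a batch can usefully be.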

    5 Benchmark validation and acceleration performance analysis

    ThorLAT is a collision probability and MOC-based lattice and burnup code for the analysis of nuclear reactor fuel assemblies. The proposed MMACA method in the ThorLAT code was validated using the VERA-2A, VERA-2F, VERA-4-2D, C5G7-2D and LRA-2D benchmark problems.

    The 2A and 2F cases in the VERA problems are both assembly geometries, and the 4-2D case is an array of 3×3 assemblies. For the detailed model geometry and parameters, please refer to the literature [21]. The influence of the coarse-mesh partition on the convergence and the effect of the EGB strategy on the acceleration ratio were analyzed using the 1/8 VERA-2A model. We validated the problem with burnable poisons by VERA-2F with complete geometry and studied the acceleration effect of the MMACA method for a larger problem. Finally, VERA-4-2D was used to validate the results and performance of the MMACA method for problems with control rods. In the current work, the Draglib format database with the SHEM-361 energy group structure generated from ENDF/B-VII.1 [22] was used. Choosing numerous energy groups reduces the influence of the resonance interference effect because the energy group structure is specifically refined. The resonance calculation adopted the subgroup method based on the physical probability table, considered the resonance interference effect by the Bondarenko iteration and ensured the conservation of the reaction rate before and after the subgroup collapsing and volume homogenization by the SPH method. The MOC was used to solve the subgroup slowing down equation.

    The C5G7-2D [23] benchmark is a mini-core problem with four 17×17 pin-cell assemblies and five reflector blocks. The LRA-2D [24] benchmark is a 2-group, quarter-core transient BWR problem. These macroscopic cross-section benchmark problems can exclude the influence of cross-sections when validating the MMACA method.

    All calculations were carried out on a single server that has 32 GB of memory and uses 12 Intel Xeon Silver 4214 CPU cores with a 2.20-GHz main frequency. Some common calculation parameters are as follows. For VERA problems, the characteristic line spacing was 0.05 cm, and half of that spacing was used for macroscopic cross-section problems; the number of azimuthal angles was 64. The number of polar angles was four, and the convergence accuracy of the eigenvalue was 1.0 × 10^{-5}. The convergence accuracy of the scalar flux was 1.0 × 10^{-5} for VERA problems and 5.0 × 10^{-5} for macroscopic cross-section problems. A transport-modified P0 cross-section was used. Fixed-source iteration uses the global energy group rebalancing method and the Livolant method in addition to the ACA or MMACA techniques. The Livolant method was also used in power iteration. The diamond difference scheme [25] was used to integrate the characteristic form of the transport equation, and the TMT method was used in computing the ACA coefficient matrix. For macroscopic cross-section problems, only one inner iteration was set up to solve the fluxes.

    5.1 1/8 VERA-2A

    5.1.1 MMACA validation and analysis

    Figure 1 shows the geometric modeling of VERA-2A in ThorLAT. The fuel pin-cell was subdivided into 88 FSRs. The number of FSRs is 3157, and the number of unknowns is approximately 1.0 × 10^6. The water gap between the assemblies was explicitly modeled, and the grid of the ACA was the same as that of the FSRs. The different coarse-mesh divisions of the MMACA are shown in Fig. 2. The material mesh of the pin was selected as the coarse mesh in MMACA3, and the number of coarse-mesh cells was changed from 88 to 4. MMACA1 and MMACA2 represent meshes that are further refined from MMACA3, and the corresponding numbers of coarse-mesh cells are 16 and 7, respectively. MMACA4 denotes that the cells of two pins having identical materials are merged into the same coarse-mesh cell, which is the case in which the coarse-mesh cell consists of discretely distributed fine-mesh cells. In this case, the number of coarse-mesh cells changed from 176 to 4.

    Fig. 1 (Color online) Geometric modeling of VERA-2A in ThorLAT

    Fig. 2 (Color online) Different coarse-mesh division of MMACA method

    The calculation results of VERA-2A with different ACA grids are listed in Table 1. This problem was solved using a single thread, and the EGB strategy was not used. The errors in k_eff and the pin power distribution show that MMACA reaches the converged result of ACA, which indicates the correctness of MMACA. With a coarser ACA grid, the number of ray tracing sweeps increases, which is caused by the deviation introduced by the algebraic collapsing approximation. The coarser the grid is, the greater the deviation introduced by this approximation, which makes the approximation of the preconditioning matrix to the inverse of the coefficient matrix less accurate and increases the number of iterations. However, the increase in the number of iterations is limited. From the perspective of the calculation efficiency, the four MMACA grids can effectively reduce the calculation time, and the speed-up ratio can reach more than 1.5 when compared with ACA, which shows the effectiveness of MMACA at improving the computational efficiency. Grid coarsening affects the computational efficiency in two ways. First, coarsening the grid leads to more iterations, which is detrimental to computational efficiency; second, reducing the number of ACA cells can significantly lower the ACA calculation time. Consequently, MMACA2 with a moderate number of coarse-mesh cells had the highest speed-up ratio.

    Table 1 The calculation results of VERA-2A with different ACA grids

    In the original ACA scheme, the time taken to solve the ACA equations of the low-order system accounts for 51% of the total, which indicates that solving the ACA equations is inefficient for problems of medium size or larger. The reasons are as follows: (1) the complex neighborhood relationship between nodes in unstructured grids increases the number of non-zero elements of the coefficient matrix, and its solution converges more slowly than that of a standard tridiagonal matrix in CMFD; (2) for problems of medium size or larger, the use of the ACA equations to solve problems with numerous energy groups is time-consuming because the iterative solution of upward scattering in thermal groups takes a lot of time; (3) the contribution of each track to the ACA coefficient matrix is accumulated by ray tracing based on the algebraic collapsing approximation, and the larger the problem size, the more time-consuming the computation of the coefficient matrix. MMACA can effectively reduce the percentage of the ACA run-time. For example, the percentage of the ACA run-time of MMACA2, which is the optimal coarse-mesh division, can be reduced to less than 10%, which shows the need to introduce MMACA for applying ACA to large-scale problems.

    5.1.2 Performance analysis of energy group batching

    The process of ray tracing can be divided into two parts. First, the information about the track is read; then the boundary or segment-averaged angular flux along the track is calculated. The track information only needs to be read once when the energy groups are not processed in batches. With batching, in contrast, the track information is read once per batch, which increases the number of operations. This is the reason EGB is not generally used, but non-batch processing may not be optimal for problems of different sizes. The calculation in this section adopts the MMACA2 model for VERA-2A.

    Table 2 shows the effect of the number of energy groups per batch (nEGB) on the computational performance, where the case with 361 energy groups in a batch corresponds to non-batch processing. With the increase in nEGB, the run-time first decreased and then increased, and the speed-up ratio declined. Compared with the case with non-batch processing, the minimum-time case, which has 10 energy groups per batch, increases the parallel speed-up ratio from 3.7 to 6.9. The results indicate that the EGB strategy can improve the calculation efficiency and significantly enhance the OpenMP parallel speed-up.

    To analyse the impact of the cache on program performance, we used the "perf" tool to record the "cache-misses" and "cache-references" events in ray tracing. "Cache-misses" reflects misses of the L3 cache, while "cache-references" is the sum of L3 hits and misses, which reflects L2 misses, as shown in Fig. 3. With the increase in nEGB, the number of L2 and L3 misses of reading track information gradually decreases, while the number of misses of calculations in ray tracing increases. When nEGB is small, the number of cache misses of reading is higher than that of tracing because of the repeated reading of track information. When nEGB increases, the number of reads decreases and the working set size of the tracing calculation is enlarged, so that the cache misses are increasingly dominated by the tracing calculation. The total number of cache misses is the combined result of the two processes mentioned above. It first decreases with the decrease in the number of cache misses of reading and then rises with the increase in the number of cache misses of tracing, which is consistent with the trend of the calculation time with nEGB in Table 2. When nEGB is 1 in the single-thread calculation, the time is longer than in the case without batching. This is because every energy group needs to read the track information again, and the number of operations is much larger than in the case without batching. When nEGB is 50, the number of cache misses under a single thread is less than that of the case when nEGB is 10, but its calculation time is longer. The reason is that with the increase in nEGB, the working set size of the tracing calculation exceeds the L2 capacity, resulting in a rapid increase in the number of L2 misses by two orders of magnitude. When 12 threads are used in parallel, the scalar flux is treated by the REDUCTION clause, which means that each thread generates a copy of the scalar flux, leading to a significant increase in the size of the working set. During the parallel calculation, threads compete for the shared L3. When nEGB increases to a certain extent, L3 becomes saturated. If nEGB continues to grow, the number of L3 misses will increase sharply, which will seriously affect the speed-up ratio.

    Fig. 3 Cache misses in ray tracing

    Table 2 The effect of the number of energy groups per batch on the computational performance

    When nEGB is 10 and the ACA coarse-mesh division adopts MMACA2, the strong parallel speed-up ratio is as shown in Fig. 4, where 24 threads are realized by using hyper-threading technology. The maximum speed-up was 8.0 on a 12-core Intel Xeon Silver 4214 CPU. Considering both the MMACA and EGB introduced in this paper, the maximum speed-up is 15.0 compared with the original scheme.

    5.2 VERA-2F with complete geometry

    It can be seen from the above calculations that, as the number of cells for ACA increases, the percentage of the ACA run-time gradually increases. When the ACA uses the same fine-mesh grid as the transport calculation in Table 1, the total ACA time exceeds the ray tracing time in the solution of the transport equation. If the problem size is further expanded, the proportion of the ACA solution time can be expected to increase as well. Hence, the calculation of VERA-2F with complete geometry was carried out. The number of unknowns in this problem was close to 1.0×10^7. The calculation results are listed in Table 3. The nEGB is 10 and 12 threads are used in parallel. VERA-2F is an assembly with 24 Pyrex burnable poison rods. The results show that MMACA can correctly handle the calculation of such an assembly. The ACA run-time reaches 76% of the total, making it the most time-consuming part of the entire calculation. The use of four types of coarse grids for MMACA can also effectively improve calculation efficiency. MMACA2 is again the best coarse-mesh partition scheme: it reduces the percentage of the ACA run-time to 14.1% and attains a speed-up ratio of 3.1 compared with the fine-mesh scheme. The computational efficiency of MMACA3, which is based on the material mesh of the pin-cell, was equivalent to that of MMACA2.

    Table 4 shows the effect of nEGB on the computational efficiency of the complete geometry VERA-2F when using MMACA2 and 12 threads in the calculation. The effect of nEGB was similar to that in the 1/8 VERA-2A calculations. When nEGB is 10, the calculation efficiency is better, and the speed-up ratio is 2.6 compared with non-batch processing. The increase in the ACA total time without batching is mainly due to the increase in the ACA coefficient matrix calculation. The ACA coefficient matrix is obtained by accumulation during ray tracing, in which the EGB strategy can also be implemented. Compared with the scalar flux, the ACA coefficient matrix calculation requires more memory to store the non-zero off-diagonal elements, so the speed-up gained by performing EGB in parallel computing is more pronounced for the ACA coefficient matrix calculation. The percentage of the ACA run-time is lower when EGB is not used because ray tracing is then more time-consuming, which reduces the proportion of ACA.

    5.3 VERA-4-2D

    5.3.1 1/8 symmetric model calculation

    VERA-4-2D is a 3×3 color set. 4A-2D is the case in which the control rods are not inserted, 4B-2D has AIC control rods inserted, and 4C-2D has B4C control rods inserted. These cases can be used to validate the calculation with the burnable poison Pyrex and control rods inserted into the guide tubes. The control rod worth calculated using the coarse-mesh division MMACA3 for 1/8 VERA-4-2D is shown in Table 5; 12 threads are used for the calculation. The maximum deviation of the control rod worth calculated by MMACA3 is -1.12%, which is in good agreement with the reference solution. The error distributions of pin power are shown in Fig. 5. The maximum error was 0.96% for the case with the control rods withdrawn and 1.97% for the case with the rods inserted. The pin power errors are slightly larger than those of VERA-2A, which indicates that treating scattering anisotropy with the transport-corrected P0 cross-section introduces a certain deviation in the presence of a strong absorber. Therefore, a more accurate anisotropic scattering treatment is needed.
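
    For readers who want to reproduce a deviation of this kind, a minimal sketch is given below. The source does not state the formula used for the control rod worth; the reactivity-difference definition here is one common convention and is an assumption, as are the function names and the eigenvalues in the example, which are placeholders rather than values from Table 5.

```python
def rod_worth_pcm(k_withdrawn, k_inserted):
    """Control rod worth as a reactivity difference in pcm.

    Assumes rho = 1 - 1/k, so the worth is (1/k_inserted - 1/k_withdrawn) * 1e5.
    """
    return (1.0 / k_inserted - 1.0 / k_withdrawn) * 1.0e5


def relative_deviation_percent(calculated, reference):
    """Signed deviation of a calculated value from its reference, in percent."""
    return (calculated - reference) / reference * 100.0


# Placeholder eigenvalues, for illustration only.
worth = rod_worth_pcm(k_withdrawn=1.01, k_inserted=0.98)
print(f"rod worth: {worth:.1f} pcm")
```

    A negative relative deviation, such as the -1.12% quoted above, means that the calculated worth is smaller than the reference worth.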

    Fig. 5 (Color online) Error distributions of pin power calculated using MMACA3 for 1/8 VERA-4-2D

    5.3.2 Performance analysis for VERA-4-2D with complete geometry

    The number of unknowns to be solved in VERA-4-2D with complete geometry is approximately 8.0×10^7, and the ACA calculation is time-consuming, as can be inferred from the case of VERA-2F with complete geometry. To analyse the impact of MMACA and EGB, the following five cases are calculated: ‘‘ACA-1-a’’ represents the fine-mesh ACA calculation in which nEGB is ‘‘a’’ and 1 thread is used; ‘‘MMACA3-a-b’’ denotes that the coarse-mesh MMACA3 is used, in which the number of threads is ‘‘a’’ and nEGB is ‘‘b’’. The results are listed in Table 6. In the fine-mesh case, the ACA grid is the same as the grid of FSRs for transport computing, which makes the ACA coefficient matrix occupy a significant amount of memory. Consequently, restricted by the 32 GB memory limit, only single-thread results can be provided for the fine-mesh ACA calculations. The coarse-mesh division MMACA3 effectively decreases the number of non-zero elements of the ACA coefficient matrix and significantly reduces the memory requirement of the ACA calculation, so multi-threaded parallel computing can be carried out with limited memory. For the cases that use one thread, EGB slightly increases the calculation efficiency, and the introduction of MMACA reduces the total time, although the ray tracing time increases to a certain extent. The percentage of the ACA run-time can be reduced to 8.5%, and the speed-up ratio is 3.8. In 12-thread parallel computing, the speed-up ratio is 8.6 without EGB, but it can be increased to 25.5 by setting nEGB to 10, which shows that EGB can significantly enhance the parallel efficiency. In summary, MMACA can effectively improve the computing efficiency and reduce the memory requirements for problems of a certain scale.
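
    The reduction in non-zero elements can be illustrated with a small bookkeeping sketch. It only shows how merging fine cells that share a pin cell and a material shrinks the sparsity pattern; it does not compute the ACA coefficients themselves, and it does not reproduce the specific MMACA1, MMACA2 or MMACA3 partitions. The function names and the single-pin example are hypothetical.

```python
def build_coarse_map(pin_of_cell, material_of_cell):
    """Assign one coarse cell to all fine cells sharing the same (pin, material)."""
    coarse_id = {}
    mapping = []
    for pin, material in zip(pin_of_cell, material_of_cell):
        key = (pin, material)
        if key not in coarse_id:
            coarse_id[key] = len(coarse_id)   # next free coarse index
        mapping.append(coarse_id[key])
    return mapping, len(coarse_id)


def collapse_pattern(fine_nonzeros, mapping):
    """Project fine-mesh non-zero positions (i, j) onto the coarse mesh."""
    return {(mapping[i], mapping[j]) for i, j in fine_nonzeros}


# One pin cell: three fuel rings, one cladding ring, two moderator rings.
pins = [0] * 6
materials = ["fuel", "fuel", "fuel", "clad", "water", "water"]
mapping, n_coarse = build_coarse_map(pins, materials)

# Chain-like coupling between neighbouring rings, plus the diagonal.
fine = {(i, i) for i in range(6)}
fine |= {(i, i + 1) for i in range(5)} | {(i + 1, i) for i in range(5)}
coarse = collapse_pattern(fine, mapping)
print(n_coarse, len(fine), len(coarse))
```

    In this toy case the six fine cells collapse to three coarse cells and the 16 non-zero positions reduce to 7, which is the mechanism behind the lower memory requirement described above.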

    Table 3 Calculation results of complete geometry VERA-2F with different ACA grids and using 12 threads

    Table 4 Effect of nEGB on the computational performance for VERA-2F when using MMACA2 and 12 threads

    Table 5 Control rod worth calculated using MMACA3 for 1/8 VERA-4-2D

    Table 6 Results for VERA-4-2D with complete geometry

    5.4 C5G7-2D

    Similar to the above VERA problems, C5G7-2D was solved using 12 threads in four ways, as shown in Fig. 6. In the fine-mesh case, the grid for the ACA is also the mesh of the FSRs. The number of unknowns is approximately 2.1×10^6, which is much smaller than that of the VERA-4-2D problem. Table 7 gives the results for C5G7-2D, and the error distribution of the pin power calculated using MMACA2 is illustrated in Fig. 7. The reference results were obtained using OpenMOC [26]. The results demonstrate good agreement between the MMACA and the reference OpenMOC solution. By reducing the percentage of the ACA run-time from 61.6% to 5.4%, MMACA2 achieves a speed-up of about 1.9, which makes it the most efficient scheme. Because C5G7-2D has strong heterogeneity, MMACA2, in which the water region of a pin is further divided, can effectively reduce the number of ray tracing processes compared with MMACA3. MMACA1, in which the fuel region is further divided, had little influence on decreasing the ray tracing times, which makes it less effective than MMACA2.
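
    A pin power error distribution such as the one in Fig. 7 can be evaluated along the following lines. This is a sketch under stated assumptions: both distributions are normalized to unit mean before comparison, the error is taken relative to the reference, and the function name and inputs are hypothetical rather than taken from the paper.

```python
def pin_power_errors(calculated, reference):
    """Relative pin power errors in percent after normalizing both sets to unit mean."""
    mean_calc = sum(calculated) / len(calculated)
    mean_ref = sum(reference) / len(reference)
    errors = []
    for c, r in zip(calculated, reference):
        c_norm = c / mean_calc
        r_norm = r / mean_ref
        errors.append((c_norm - r_norm) / r_norm * 100.0)
    return errors


def max_abs_error(errors):
    """Largest error magnitude, the figure usually quoted alongside the distribution."""
    return max(abs(e) for e in errors)
```

    Because of the normalization, a uniform scaling of the calculated powers produces zero error; only differences in the shape of the distribution contribute.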

    Fig. 6 (Color online) Different coarse-mesh divisions of MMACA for C5G7-2D

    Fig. 7 (Color online) The error distribution of pin power calculated using MMACA2 for C5G7-2D

    5.5 LRA-2D

    LRA-2D is a benchmark for diffusion solvers, for which the reference eigenvalue of the initial steady-state problem is keff = 0.99637. In this study, OpenMOC was used to generate the reference eigenvalue for the MOC solver. Each assembly was divided into 32×32 squares with a side length of 0.46875 cm. Full-core geometry with vacuum boundary conditions was used instead of the quarter-core. The coarse grids for an assembly in MMACA1, MMACA2 and MMACA3 are 10×10, 4×4 and 1×1, respectively. In this case, 12 threads were used, and the convergence criterion for solving the ACA equations was set to 1.0×10^-7. As shown in Table 8, the MMACA eigenvalue agrees well with the reference MOC solution. MMACA2 drastically decreases the total ACA time and achieves a speed-up by a factor of 2.3.

    Table 7 Results for C5G7-2D with different ACA grids and using 12 threads

    Table 8 Results for LRA-2D with different ACA grids and using 12 threads

    6 Conclusion

    To maximize the geometric adaptability of the MOC, the convergence acceleration algorithm must also be applicable to any geometry, and the ACA technique is an effective method for meeting this requirement. The low-order equations with a sparse coefficient matrix can be established by using the algebraic collapsing approximation and can be used to accelerate the convergence of the fixed-source iteration in the MOC. Although the ACA equations are very sparse, they still suffer from large memory requirements and inefficiency when solving large-size problems. This is because the ACA is a type of fine-mesh DSA method. This work enables the ACA equations to be solved on a coarser mesh.

    In the current work, the basic solution process of the MOC is first introduced. After the derivation of the ACA, a coarse-mesh MMACA method based on the material mesh is proposed. This method can be realized with a slight modification to the original ACA scheme. Then, the EGB strategy is presented to achieve better parallel efficiency for microscopic cross-section problems. The correctness and effectiveness of the MMACA method under distinct coarse-mesh partitions and with different problem sizes were analyzed numerically. The performance of the EGB strategy was also studied during the numerical validation. The analysis of the VERA-2A problems shows that the cache misses caused by nearly random access to FSRs during the tracing process are the main reason for the decrease in parallel efficiency, and that the EGB strategy can reduce them simply and conveniently by decreasing the working set size. The multiplication factor and pin power distributions agree well with the reference solutions. The maximum errors are 104 pcm and 1.97%, respectively, both of which occurred in the VERA-4B-2D case with AIC control rods inserted. For the microscopic cross-section VERA-4-2D problem, the speed-up ratio relative to a single-thread ACA solution can reach 25 on 12 CPU cores. The MMACA method can reduce the computation time by approximately one half for the macroscopic cross-section C5G7-2D and LRA-2D benchmarks. The results show that the MMACA method can effectively improve the computing efficiency and reduce the memory requirements for problems of a certain scale.

    Because the algebraic collapsing approximation is adopted in the ACA, the larger the mesh size, the greater the deviation caused by the approximation, which leads to a deterioration of the convergence performance. This issue requires further improvement. At present, the proposed MMACA method only accelerates the fixed-source iteration. Therefore, an additional acceleration method may be introduced to accelerate the convergence of the power iteration.

    Author contributions All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by Ming Dai. The first draft of the manuscript was written by Ming Dai, and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
