
    Approximate error correction scheme for three-dimensional surface codes based reinforcement learning

    2023-11-02 08:13:14 Ying-Jie Qu (曲英杰), Zhao Chen (陳釗), Wei-Jie Wang (王偉杰), and Hong-Yang Ma (馬鴻洋)
    Chinese Physics B, 2023, Issue 10

    Ying-Jie Qu (曲英杰), Zhao Chen (陳釗), Wei-Jie Wang (王偉杰), and Hong-Yang Ma (馬鴻洋)

    1 School of Sciences, Qingdao University of Technology, Qingdao 266033, China

    2 School of Information and Control Engineering, Qingdao University of Technology, Qingdao 266033, China

    Keywords: fault-tolerant quantum computing, surface code, approximate error correction, reinforcement learning

    1. Introduction

    Quantum error correction (QEC),[1-4] which has gained popularity in recent years, is now regarded as the procedure in quantum computing that requires the most time and resources. Nevertheless, given current strategies for quantum computing, QEC remains an effective means of achieving reliable quantum computation and storage and of protecting quantum information from loss. The need for QEC arises from the unavoidable coupling of quantum systems to their surroundings, which causes qubits to change state (decoherence).[5,6] To mitigate the effects of decoherence, QEC controls error propagation and maintains a low error rate through active error correction and fault-tolerance mechanisms, thereby achieving good local stability.[7-10] Topological properties of subsystems have become an important resource for building better and more robust quantum error correction codes. Topological quantum codes[11-13] need only local operations to diagnose and correct errors. Surface codes in particular have relatively low overhead and locality requirements, and practical strategies exist for implementing the necessary logic gates. This makes them a particularly promising candidate for large-scale fault-tolerant quantum computing.[14,15]

    Naturally occurring physical systems are typically classified as having approximate or exact symmetries, which can be used to classify matter in equilibrium. Approximations with significant implications also appear in quantum error correction codes. For example, certain energy subspaces are known to form approximate quantum error correction codes in time-translation-invariant many-body systems. Additionally, error correction codes with suitable approximate properties can be used to protect information from noise.[16] In general, a quantum operation is covariant with respect to a group $G$ if it commutes with the group action. When a quantum error correction code is covariant with respect to $G$, its encoding map is called a $G$-covariant operation, and approximate error correction[17-20] can be achieved by studying the properties of the recovery operation. A crucial step in quantum error correction is decoding.[21,22] Decoding identifies and corrects the errors that the environment and noise introduce into the quantum system, so that the encoded state is restored. This process is typically implemented classically. It takes as input a set of stabilizer measurements (syndromes) and returns a correction operator. If the product of the correction and the original error is equivalent to a stabilizer,[23-25] the correction is successful. If decoding takes longer than the time budgeted for error correction, errors accumulate and eventually reach an uncorrectable state. To address the difficulty of rapid decoding, machine learning techniques have been employed in various quantum physics domains, and different types of neural networks[26-29] have been studied for this purpose.

    In this paper, we investigate the accuracy of quantum error correction codes with respect to continuous approximate covariance. Taking the surface code as an example, we study its approximate quantum error correction properties by combining the properties of the code with the approximation of the quantum system. To address the spatial correlation problem of surface codes,[30,31] this paper focuses on realizing the dimensional jump of the surface code from 2D to 3D.[32,33] Error detection is performed through the measurement of stabilizer operators, and the surface code is subsequently reduced back to a 2D lattice for decoding with a reinforcement learning (RL) decoder. This methodology offers high scalability for surface codes in 3D space while reducing complexity and resource costs by saving on stabilizer measurements.

    This paper is organized as follows. In Section 2, we briefly introduce the background of approximate quantum error correction and surface codes. In Section 3, we design an error correction algorithm for approximate surface codes. We outline the decoding strategy in Section 4 and perform training and simulation analysis in Section 5, followed by our conclusions in Section 6.

    2. Background

    We begin by briefly explaining the basics of surface codes and RL. The framework and method proposed in this work are not limited to surface codes and can also be applied to other stabilizer codes. We take the surface code as an example for a simplified demonstration, which is also relevant to experiments. This article mainly concerns the application of the surface code.

    2.1. Surface code

    Topological quantum error correction codes are defined on various lattices, and surface codes are a typical example.[34,35] Surface codes are a class of 2D quantum error correction codes that can have periodic boundaries, while toric codes are the subclass of surface codes defined on a torus with periodic boundaries. Qubits are placed on a 2D square lattice, with each vertex corresponding to a qubit. The code space can then be defined by parity operators applied to the nearest four qubits on the $L\times L$ square lattice. Plaquette and vertex operators generate an abelian group, the stabilizer group. Since qubits can only interact with their nearest neighbors, stabilizer generators with local support are required.

    We first consider a 2D surface code. The surface code, introduced by Kitaev, is defined on a square lattice on a torus with a qubit on each edge. All stabilizers commute and have eigenvalues $\pm 1$. The logical operators are operators that preserve the code space. The logical $X$ ($Z$) operators are expressed as $X_L=\bigotimes_{i\in v}X_i$ and $Z_L=\bigotimes_{i\in f}Z_i$. They are contiguous strings of single-qubit $X$ ($Z$) operators that connect the top and bottom (left and right) boundaries of the lattice. The code is defined as the ground space of the Hamiltonian $H=-\sum_{v}X_v-\sum_{f}Z_f$.

    Here data qubits are associated with the edges of the lattice. There is a vertex operator $X_v$ associated with each vertex $v$ and a plaquette operator $Z_f$ associated with each face $f$. $X_v=\prod_{e\in v}X_e$ is the product of Pauli-$X$ matrices acting on the edges incident to $v$, and $Z_f=\prod_{e\in f}Z_e$ is the product of Pauli-$Z$ matrices acting on all edges of the face $f$. The code space is defined as the simultaneous $+1$ eigenspace of the operators $X_v$ and $Z_f$. These operators and any product of them are called stabilizers[23-25] of the code and form the stabilizer group $S$.
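
    As a minimal sketch of the stabilizer structure described above, the following Python snippet builds the vertex and face operators of an $L\times L$ toric code as sets of edge indices and checks that every $X_v$ commutes with every $Z_f$, which holds because they always overlap on an even number of edges. The edge-indexing convention and the function names are illustrative assumptions, not part of the original scheme.

```python
def toric_code_stabilizers(L):
    """Return (stars, faces) for an L x L toric code.

    Each stabilizer is a set of edge indices. Assumed convention:
    horizontal edge h(r, c) leaves vertex (r, c) to the right,
    vertical edge v(r, c) leaves vertex (r, c) downwards.
    """
    def h(r, c):
        return (r % L) * L + (c % L)

    def v(r, c):
        return L * L + (r % L) * L + (c % L)

    stars, faces = [], []
    for r in range(L):
        for c in range(L):
            # X_v acts on the four edges meeting at vertex (r, c).
            stars.append({h(r, c), h(r, c - 1), v(r, c), v(r - 1, c)})
            # Z_f acts on the four edges around the face whose
            # top-left corner is vertex (r, c).
            faces.append({h(r, c), h(r + 1, c), v(r, c), v(r, c + 1)})
    return stars, faces


def all_commute(stars, faces):
    """X-type and Z-type Pauli products commute iff they share an even number of qubits."""
    return all(len(s & f) % 2 == 0 for s in stars for f in faces)


stars, faces = toric_code_stabilizers(3)
print(len(stars), len(faces), all_commute(stars, faces))
```

    A single-qubit $Z$ error on one edge overlaps exactly two vertex operators, which is why errors show up as pairs of syndrome defects.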

    2.2. Reinforcement learning

    Reinforcement learning (RL) is a framework that is captured by the old adage of learning through experience.[36] In RL, the control problem is defined by an environment, which models the physical system to be controlled, and an agent, which applies the control, as shown in Fig. 1. The agent acts on the environment and performs a series of actions to solve a given problem.[37,38] At each time step $t$, the environment is represented by a state $s_t\in S$, where $S$ is the state space. The feedback loop between the agent and the environment is formalized as a Markov decision process (MDP).[39]

    We describe the agent's policy $\pi$ as a mapping from states to action probabilities: $\pi(a|s)$ is the probability that the agent chooses the action $A_t=a$ when the environment is in the state $S_t=s$. Under a measure of cumulative reward, the value of any given state depends not only on the immediate reward obtained under the policy but also on the expected future reward. The optimal $Q$ value is reached by convergence[36] if the agent is trained an unlimited number of times, and the best strategy is simple to find once the optimal $Q$-function is given: the best action in a given state $s$ is $a=\arg\max_{a'}q_*(s,a')$. Deep $Q$-learning (DQL) uses deep convolutional networks.[40,41] When it encounters an unknown state, DQL compares its global features with those from experience.[42]
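
    To make the greedy rule $a=\arg\max_{a'}q(s,a')$ concrete, here is a small sketch using a lookup table in place of the deep network. The tabular representation, the learning rate `alpha`, the discount `gamma`, and the state and action labels are all assumptions chosen for illustration.

```python
def greedy_action(q_table, state, actions):
    """Pick the action with the largest stored Q value (unseen pairs count as 0)."""
    return max(actions, key=lambda a: q_table.get((state, a), 0.0))


def q_update(q_table, s, a, reward, s_next, actions, alpha=0.5, gamma=0.9):
    """One tabular Q-learning step:
    Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_b Q(s',b) - Q(s,a)).
    """
    best_next = max(q_table.get((s_next, b), 0.0) for b in actions)
    old = q_table.get((s, a), 0.0)
    q_table[(s, a)] = old + alpha * (reward + gamma * best_next - old)
    return q_table[(s, a)]


actions = ["left", "right"]
q = {}
q_update(q, "s0", "right", 1.0, "s1", actions)
print(greedy_action(q, "s0", actions))
```

    In DQL the table is replaced by a neural network, so states that were never visited still receive an estimate through generalization.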

    We use DQL to train the agent to decode uncorrelated bit flips or phase flips caused by ambient noise on approximate 3D surface codes. Training terminates only after a set number of episodes or when the loss function of the convolutional neural network stops decreasing. We also use the training techniques of dueling deep $Q$-learning[43,44] to keep the training process stable.

    Fig. 1. Signals passing between the agent and the environment over successive time steps. The agent's objective is to maximize its total reward over the decision period.

    3. Algorithmic process

    Single-shot decoding is not possible for 2D topological codes, so measurement errors must be counteracted by repeating the stabilizer measurements, with $O(d)$ repetitions for a distance-$d$ code to achieve error tolerance.[32] To improve the robustness of the encoded qubits, we take advantage of the scalability of topological error correction codes and encode qubits into a 3D cubic lattice. Although such an encoding can be a candidate for quantum memory, it does not enable self-correction. The key issue is therefore how to correct errors on 3D codes effectively.

    3.1. Approximate surface code

    As mentioned in the introduction, in time-translation-invariant many-body systems, certain energy subspaces are known to form approximate quantum error correction codes that are preserved under time evolution. More specifically, the encoding map embeds a logical code space in the tensor product of $n$ physical subsystems. A unitary transformation acting on the code space can be realized as a tensor product of $n$ unitary transformations, each acting on one subsystem. We study the approximate error correction of quantum error correction codes with respect to symmetry covariance. When errors occur through the loss of one or more of the $n$ subsystems, we can identify which subsystems are lost, and approximate error correction can be achieved by studying the properties of the recovery operations.

    We define a completely positive trace-preserving map $X$ that assigns each logical state on one or more logical systems $L$ to the corresponding state on a physical system $A$ consisting of $n$ subsystems, $A=A_1\otimes A_2\otimes\cdots\otimes A_n$. We assume that errors occur randomly in some subsystem $A_i$ and that $i$ is known, so that the recovery map may depend on $i$. Consider a code that maps any pure logical state $|x\rangle_L$ to the physical state $|\psi_x\rangle_A$. We call the latter a code word, and all the code words $\{|\psi_x\rangle_A\}$ span the code space. In general, we consider codes that are isometric, but in some special situations a more general encoding map is also taken into account.[18]

    To investigate the approximate error correction performance of the code, we quantify it with distance measures between states and channels. The fidelity and the trace distance between quantum states are used to quantify the proximity of a quantum channel $K$ to the identity channel. The two standard measures are the entanglement fidelity for the maximally entangled input state, $F_e$, and the worst-case entanglement fidelity, $F_{\rm worst}$.

    Here $d$ is the subsystem dimension, for a code that admits universal transversal logical gates. The input state in the definition of $F_e$ is the maximally entangled state of the systems $L$ and $R$, where $R$ and $L$ have the same dimension, denoted by $d_L$. The optimization in the definition of $F_{\rm worst}$ ranges over all bipartite states of $L$ and $R$.
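
    For orientation, the two fidelity measures named above are conventionally defined as follows. This is a sketch in standard notation and not necessarily the exact typeset form used by the authors: the symbol $|\hat{\varphi}\rangle_{LR}$ for the maximally entangled state and the choice of taking the square root (some references define the squared quantity instead) are assumptions.

```latex
\begin{align}
  F_e(K) &= \sqrt{\langle\hat{\varphi}|\,(K\otimes\mathrm{id}_R)
            \bigl(|\hat{\varphi}\rangle\langle\hat{\varphi}|\bigr)\,|\hat{\varphi}\rangle},
  \qquad
  |\hat{\varphi}\rangle_{LR} = \frac{1}{\sqrt{d_L}}\sum_{k=1}^{d_L}|k\rangle_L|k\rangle_R, \\
  F_{\rm worst}(K) &= \min_{|\varphi\rangle_{LR}}
            \sqrt{\langle\varphi|\,(K\otimes\mathrm{id}_R)
            \bigl(|\varphi\rangle\langle\varphi|\bigr)\,|\varphi\rangle}.
\end{align}
```

    Both quantities equal 1 exactly when $K$ acts as the identity channel on $L$, so $1-F$ measures how far the combined noise and recovery process is from perfect error correction.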

    As shown in Fig. 2, to construct the approximate error correction model for surface codes, we further define the approximate surface code through Eqs. (2) and (3) and represent the way in which the approximate surface code interacts with the quantum channel using the following equations:

    The input state defined in $F_e$ is $|\hat{\varphi}\rangle_s=(|X\rangle\otimes|X\rangle+|Z\rangle\otimes|Z\rangle)/2$. After mapping the 2D surface code to the approximate 3D code, we obtain the cubic lattice and further characterize the strong noise immunity of the 3D lattice qubits.

    In a 2D surface code, both data qubits and ancilla qubits are situated on the planar lattice. In the approximate 2D surface code, however, data qubits are placed at the center of the lattice while the ancilla qubits are assigned to its edges. To move a data qubit to the center of the lattice, we first measure the four surrounding stabilizers and then verify each data qubit against its four corresponding stabilizers. Finally, the movement is completed by applying the corresponding Pauli-$Z$ gate operation based on the measurement results. This flat lattice transformation turns the original surface code into an approximate surface code and allows standard surface code operations to be performed on the dual double lattice.

    Fig. 2. Generic components of the surface code. (a) A 2D surface code; the ancilla qubits that measure the stabilizers are shown in orange and green, and data qubits are shown in blue. (b) The approximate 2D surface code; the dotted line represents the dual double lattice behind the flat lattice. (c) The topological operators X and Z each form a chain of topological corrections on the dual lattice (purple and yellow in the figure).

    3.2. Dimension mapping

    The ability to switch between 3D and 2D surface codes (a process called dimensional mapping[32,33]) is at the heart of Brown's scheme. We consider transforming the approximate 2D surface code into the 3D surface code by dimension mapping. Compared with the 2D code, the 3D surface code exhibits greater scalability owing to its three-dimensional lattice structure, which allows error correction to be performed by measuring the parity of stabilizer operators acting on each face and thereby provides a higher degree of redundancy. This increased redundancy allows a higher threshold error rate in the 3D surface code, meaning that it can tolerate more noise before errors become uncorrectable. Moreover, the 3D surface code allows more efficient error correction, as only the stabilizers relevant to the current dimension need to be measured after mapping. This reduces the space required for stabilizer measurements and the number of ancilla qubits needed, and hence the resource overhead of the error correction process.

    Converting approximate 2D surface codes to 3D codes still encodes only one qubit, so we must take additional measures to ensure that the final logical state of the 3D code is the same as the initial logical state of the 2D code. In specific situations, we apply $X$ stabilizer and logical gate operations to the code state at each time step, ensuring that the dimension mapping involves only measurements of $Z$ stabilizers. As $X$ stabilizers may not commute with the applied logical gates, the 2D code is not necessarily in an eigenstate of the $X$ stabilizers while the operations are executed.

    As shown in Fig. 3, to realize the dimension mapping from the approximate 2D surface code to the 3D code, we start with two approximate 2D surface codes and entangle them through the measurement of the intermediate stabilizers to form an approximate 3D surface code (shown in perspective). The stabilizer measurements commute with the logical $Z$ operator, so nothing further is needed to ensure an accurate conversion of logical $Z$ from the 2D code to the 3D code. To transfer the approximate 2D code to the 3D code successfully, we must apply the matching logical $X$ exactly to the $X$ (gray dashed line) of the 3D code.

    Several situations arise in dimension mapping, as shown in Fig. 3: (i) Two approximate 2D surface codes are converted to a 3D code. (ii) A partial $X$ error is present in the front code. The loops connecting to the front and side boundaries constitute the red syndrome, and correcting these loops and moving them to the front code boundary completes the transition of the 2D code state to 3D. (iii) Two errors are detected on the stabilizers of the front code (gray solid line) and the back code (gray dotted line), which makes it impossible to determine the inside and outside of the loop. As a result, the state of the 2D code cannot be transferred accurately to the 3D code space as a whole.

    Fig. 3. (a) Approximate 2D surface code in the presence of topological correction chains. (b) Several situations that arise in the dimension mapping process. (c) Approximate 3D code after dimension mapping.

    A suitable dimension mapping consists of the following steps:

    · Start from a 2D surface code in the state $|\bar{\psi}\rangle$, which is mapped to a 3D surface code $S$ with boundary $\partial S$. The codes must be chosen so that the $Z$ stabilizers of the 2D code commute with the $X$ stabilizers of the 2D and 3D codes.

    · Prepare all qubits that belong to $S\setminus\partial S$ in $|+\rangle$.

    · Measure the $Z$ stabilizer operators of the 3D code, not those of the 2D code.

    · Apply the error correction method appropriate to the type of error that occurs at the code location. For example, $X$ errors in the code are corrected directly. If two errors are detected, the code as a whole cannot be transferred accurately to the 3D code space.

    · Return the measured state to the code space by applying a correction, which must not act on any qubits of the original 2D code.

    The $Z$ stabilizers of the 2D code, together with the qubits prepared in $|+\rangle$, ensure that the state after the measurement falls into one of two cases: either it is a state in the code space of the 3D code, or it is a state that is returned to the code space by a correction inferred from the stabilizer measurements. This correction, which eliminates the distribution of $X$ errors, is what Brown calls a 3D code gauge.[45]

    3.3. Error correction process

    After dimension mapping, the approximate 2D surface code becomes a cubic lattice. By the law of conservation of physical energy, the qubits in its internal lattice are locally stable (not prone to errors), which further characterizes the strong noise immunity of the 3D lattice qubits. Qubit error correction on the 3D lattice only needs to consider the position information of the surrounding surfaces (the check information of the six faces), and it relies on the stability of the plaquette and vertex operators to ensure that error correction is feasible.

    The stabilizer group $S$ is an Abelian subgroup of $\{1,-1\}\times\{I,X,Y,Z\}^{n}$ with $-I^{\otimes n}\notin S$. Assume that $S$ has a set of $n-k$ independent generators. For simplicity, consider generators $S_m\in\{I,X,Y,Z\}^{n}$. The binary $[[n,k,d]]$ stabilizer code defined by $S$ is a $2^{k}$-dimensional subspace of $\mathbb{C}^{2^{n}}$, the parameter $d$ is the minimum distance of the code, and the elements of $S$ are the stabilizers. Any two $n$-qubit Pauli operators either commute or anticommute. All stabilizers commute with one another and have eigenvalues $\pm 1$. If a Pauli error anticommutes with a stabilizer, measuring that stabilizer returns the eigenvalue $-1$, whereas if it commutes, the measurement returns $+1$. The measurement results $\pm 1$ are mapped as $+1\to 0$ and $-1\to 1$, and the resulting binary vector is called the error syndrome (the measurement result of the stabilizers). An error that the stabilizers can detect is represented by a nonzero error syndrome. Because stabilizers act trivially on the code space, errors that are themselves stabilizers need not be accounted for.
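
    The mapping from commutation relations to syndrome bits can be sketched in a few lines. The example below uses the three-qubit repetition code with generators `ZZI` and `IZZ` purely as a small stand-in (it is not the code studied in this paper), and the function names are illustrative.

```python
def anticommute(p, q):
    """True if two equal-length Pauli strings (over I, X, Y, Z) anticommute.

    Two Pauli strings anticommute iff they differ, with both entries
    non-identity, on an odd number of qubits.
    """
    clashes = sum(1 for a, b in zip(p, q) if a != "I" and b != "I" and a != b)
    return clashes % 2 == 1


def syndrome(error, stabilizers):
    """Map each stabilizer outcome to a bit: +1 -> 0 (commutes), -1 -> 1 (anticommutes)."""
    return [1 if anticommute(error, s) else 0 for s in stabilizers]


stabilizers = ["ZZI", "IZZ"]   # assumed toy code, for illustration only
for err in ["III", "XII", "IXI", "ZII"]:
    print(err, syndrome(err, stabilizers))
```

    The error `ZII` returns the all-zero syndrome because it commutes with both generators, which illustrates that a zero syndrome does not by itself guarantee the absence of an error.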

    To improve the modeling of the dimensional jump between the two codes, we simplify the intricate 3D cubic lattice into six separate 2D lattices, each of size $5\times 5$. This approach reduces the number of ancilla qubits necessary for error correction as well as the number of stabilizer measurements required. As a result, the complexity of surface code encoding is significantly reduced, along with the resource overhead of the error correction process. Error qubits are introduced and distinguished by different colors for the Pauli operators $X$, $Y$, and $Z$. The stabilizers are then measured and approximate error correction is performed; the specific error correction process is shown in Fig. 4.

    In complex 3D lattice structures, the space required for stabilizer measurements is reduced by measuring only the stabilizers associated with the current slice rather than all the stabilizers of the entire code. This dimensional mapping approach reduces the overall number of measurements and can significantly reduce the resource overhead of the error correction process. We infer the remaining results and correct them based on the structure of the stabilizers, which guarantees the accuracy of the corrected errors and of the logical state of the transmitted code. The approximate 3D surface code state has even total parity of the measured qubits in the $X$ basis, so either all four qubits are in the same state, or two are in $|+\rangle$ and two in $|-\rangle$. In the first case, we directly map the same logical state of the approximate 2D surface code. In the other case, a correction is needed. The reason is that the $Z$ stabilizers projected to the rear take random values when the $X$ stabilizers are measured, and these stabilizers leave traces that need to be corrected at the initial position of the code. We detect the traces left by the stabilizers in the back code and then apply $Z$ to the qubits in the front code to correct the error.

    In addition, when $Z$ errors exist in the front code, its measurement result alone does not determine whether a correction is needed. We identify the error (the syndrome of the 3D code after dimension mapping) by combining the measurements of the front code with the $X$ stabilizers of the 2D code. For example, if the parity of the front code violates $X_1X_2X_3X_4$, we infer the measurement of the side $X$ stabilizers from the parity of the bottom qubits and the measurement of the back qubits.

    When a qubit error occurs, the corresponding syndrome is generated, and it disappears once a correction is applied at the right position. The procedure that selects a correction operator from the stabilizer measurement results, such that the correction reproduces the syndrome generated by the error, is called the decoder. The job of the decoder is to find the errors on the data qubits within the set of corrections consistent with the syndrome. Since vertex and plaquette operators produce strong spatial correlations, we propose a reinforcement learning decoder based on a DQL algorithm to find the optimal correction chain and to achieve better thresholds by continuously optimizing the conditions.

    Fig. 4. Error correction diagram for the approximate 3D surface code. Taking the bottom code of the three-dimensional cubic lattice as an example, the stabilizers are measured, the different error syndromes are detected, and recovery operations are performed to correct the errors. The red and green circles at the bottom correspond to X and Z errors.

    4. Decoding strategy

    In fault-tolerant quantum computing based on surface codes, known protocols achieve Clifford gate operations through techniques such as lattice surgery, code deformation, and syndrome tracking. Non-Clifford gates, such as the T gate, can be executed fault-tolerantly through magic state distillation and gate teleportation. High-quality magic states are obtained through magic state distillation, which only requires Clifford gates and noisy input magic states.[46,47] The primary objective in decoding idle qubits is to suppress logical error rates effectively by applying error correction when the physical error rate of the qubits is below a certain threshold, which is a crucial measure of fault-tolerant performance. This article introduces and compares two decoders, namely the minimum weight perfect matching (MWPM) decoder[27,42] and the RL decoder.[38,48-50]

    4.1. MWPM decoder

    To enhance the decoding performance of the codes and reduce the qubit overhead, we employ a universal decoder, the MWPM decoder. Reformulating the noise model as a mathematical model, we assume that the corrections selected in the previous time step were successful and that all stabilizers were in the $+1$ eigenstate. The measurement outcomes of the $Z$ stabilizers directly lead to a random distribution of $X$ errors on new qubits, which are connected to the top or side boundaries via a set of cycles. The decoder matches the errors present in certain stabilizer measurements to generate a set of effective syndromes (a set of lattices). To preserve the quantum state information, the correction should have the same error pattern as the error qubits of the stabilizers, so that no additional logical errors occur.

    Noise mainly arises from syndromes and stabilizer measurements. Because the measurement results of the stabilizers are not accurate enough, $d$ rounds of measurement cycles must be repeated. We therefore apply the MWPM decoder together with the Dijkstra algorithm to the approximate matching of the 3D surface code, which increases the probability of successful stabilizer detection. We construct the decoder through the following steps. First, assume that the approximate 3D surface codes and the quantum circuits are noiseless. Second, perform a round of measurement cycles without stabilizer noise. Finally, add a noisy environment and continue measuring under stabilizer noise. The noiseless measurement cycle ensures that the noisy state can be restored to the original code space, which determines whether the error correction is successful.

    When constructing an approximate 3D surface code, we use the MWPM decoding algorithm for error correction:

    · Perform $d$ rounds of stabilizer measurements under noise to approximate the initial state of the surface code, and construct the correction graphs from the stabilizer measurements.

    · Mark the vertices flagged by the previous round of stabilizer measurements. If the number of vertices is even, we mark the top vertex; if the number of vertices is odd, we mark the boundary vertex.

    · Use the Dijkstra algorithm to find the minimum weight matching for the marked $X$-type and $Z$-type vertices, where the vertices are connected along the paths of optimal weight.

    · According to the law of conservation of physical energy, the qubits in the internal lattice are locally stable, so quantum error correction in the 3D lattice only needs to consider the check information of the six faces. We calculate the count for each $X$- and $Z$-type horizontal boundary. If the horizontal boundary is not marked, we perform $X$- and $Z$-type corrections.
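
    The matching step can be illustrated with a deliberately small sketch. It assumes a $d\times d$ lattice with periodic boundaries and unit edge weights, so that the shortest-path length Dijkstra would return is simply the wrapped Manhattan distance, and it replaces a proper blossom-algorithm MWPM solver with an exhaustive search that is only practical for a handful of defects. All names are illustrative.

```python
def torus_distance(a, b, d):
    """Shortest path length between two sites on a d x d periodic lattice."""
    dr = abs(a[0] - b[0])
    dc = abs(a[1] - b[1])
    return min(dr, d - dr) + min(dc, d - dc)


def min_weight_matching(defects, d):
    """Exhaustive minimum-weight perfect matching of syndrome defects.

    Returns (total_weight, list_of_pairs). A brute-force stand-in for
    MWPM, assumed here only to keep the example self-contained.
    """
    if len(defects) % 2 != 0:
        raise ValueError("a perfect matching needs an even number of defects")
    if not defects:
        return 0, []
    first, rest = defects[0], defects[1:]
    best = None
    for i, partner in enumerate(rest):
        # Pair `first` with `partner`, then match the remaining defects.
        weight, pairs = min_weight_matching(rest[:i] + rest[i + 1:], d)
        weight += torus_distance(first, partner, d)
        if best is None or weight < best[0]:
            best = (weight, [(first, partner)] + pairs)
    return best


weight, pairs = min_weight_matching([(0, 0), (0, 1), (3, 3), (4, 3)], d=5)
print(weight, pairs)
```

    Each matched pair corresponds to a correction chain of length equal to the pair's distance, so the total weight is the number of single-qubit corrections applied.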

    The decoding problem of quantum surface codes is a complex combinatorial optimization problem. Compared with MWPM, RL decoders adapt better to the complex three-dimensional cubic lattice structure by learning and optimizing strategies. The training process of RL decoders may also scale to decoding larger quantum surface codes. MWPM decoders, on the other hand, typically rely on predefined rules and heuristic algorithms and are less adaptable and flexible. In theory, RL decoders can handle various types of errors, including bit-flip and phase-flip errors, while MWPM decoders require modification or extension for error types other than flip errors. Moreover, when facing high error rates or complex error models, RL decoders can search for the optimal correction strategy and achieve better decoding performance. RL decoders can also decode more efficiently by learning efficient strategies such as parallel processing or local search. In the following section, we introduce the reinforcement learning decoder to better address the decoding problem of quantum surface codes.

    4.2. Reinforcement learning decoder

    This paper uses a decoder based on a neural network agent, which is optimized through RL to observe the syndrome of the approximate 3D surface code and gradually build a recovery chain for it. The agent employs a deep neural network (DNN) and the $Q$ network to determine the actions and $Q$ values for the syndromes. We consider discrete problems in which, at each time step $t$, the environment is described by a state $S_t\in S$, where $S$ is the state space. Given the environment state, the agent selects an action $A_t\in A$, where $A$ is the action space. After the agent selects an action, the environment updates correspondingly and provides feedback to the agent in the form of a reward $R_{t+1}$ and a new state $S_{t+1}$. Given an initial logical state $|\psi_0\rangle\in\mathcal{H}_{sc}$, the agent's objective is to suppress errors for as long as possible so that logical operations can succeed with high probability. The environment is formalized as a Markov decision process (MDP) with finite state and action spaces.

    The action-value function (also known as the $Q$-function) for the policy $\pi$ is defined as
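
    In standard RL notation, and consistent with the reward $R_{t+1}$ and the discount factor $\gamma$ used in this section, the action-value function is usually written as below. This is a textbook form given for reference; the exact typeset expression of the source is assumed rather than reproduced.

```latex
\begin{equation}
  q_\pi(s,a) = \mathbb{E}_\pi\!\left[\,\sum_{k=0}^{\infty}\gamma^{k}R_{t+k+1}
               \;\middle|\; S_t = s,\ A_t = a\right].
\end{equation}
```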

    Here the action $a$ is taken at time $t$ and the policy $\pi$ is followed thereafter, and $\gamma\le 1$ is the discount factor. The $Q$-function conceptually resembles the state-value function, except that it provides values for state-action pairs. Additionally, we rank policies by the value function, that is, $\pi>\pi'\iff v_\pi(s)>v_{\pi'}(s)\ \forall s\in S$. Meanwhile, we can define the optimal policy $\pi_*$ in reverse:

    Given a state $s$, the optimal policy is easily obtained by selecting the action $a=\arg\max_{a'}q_*(s,a')$. The $Q$ value $q_\pi(s,a)$, parameterized by the neural network, is used as the output, and the network parameters are adjusted through stochastic gradient descent to minimize the error between the optimal $Q$ value and the approximate $Q$ value.
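
    A sketch of the conventional characterization of the optimal policy follows. The symbols $v_*$ and $q_*$ follow textbook usage, and the non-strict ordering $\ge$ is the usual convention; both are assumptions about the intended definition.

```latex
\begin{align}
  \pi_* \ge \pi \quad &\forall\,\pi, \\
  v_*(s) = \max_{\pi} v_\pi(s), \qquad
  q_*(s,a) &= \max_{\pi} q_\pi(s,a) \qquad \forall\, s\in S,\ a\in A.
\end{align}
```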

    The decoder takes the syndrome as input; the syndrome is the system state visible to the agent. The syndrome observed at each time step results from the cumulative effect of the agent's operations on the initial syndrome, which is generated by a random distribution of bit flips. Once the system reaches a terminal state with a null syndrome, an odd number of non-trivial loops represents a failure of error correction. However, information about success and failure serves only as a metric for evaluating the agent's performance during training. Regardless of whether the correction string results in a logical operation, a reward of $r=-1$ is given at every time step until the terminal state is reached. Therefore, compared with the MWPM algorithm, the primary objective of the agent is to eliminate the syndrome in the fewest possible steps. The process of decoding by a well-trained agent is illustrated in Fig. 5.

    Fig. 5. Details of the deep Q decoding agent. The syndrome is encoded into a binary matrix suited to the convolutional layer input, and the fully connected layer completes the mapping from the input feature space to the label set, that is, it performs the classification. The training cost is then reduced by training and optimizing the convolutional neural network structure, and the output feature values are decoded by the convolution operation. Finally, a feed-forward neural network with multi-layer connections outputs the error chains to be corrected.
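
    Why a constant reward of $-1$ per step favours short correction chains can be seen from the return of an episode. The helper below is a hypothetical illustration; the function name and the example values of $\gamma$ are assumptions.

```python
def episode_return(num_steps, gamma=1.0):
    """Discounted return of an episode that pays -1 at every step until termination."""
    return sum(-1.0 * gamma ** k for k in range(num_steps))


# Fewer steps always give a larger (less negative) return.
print(episode_return(3), episode_return(5))
print(episode_return(3, gamma=0.5))
```

    Maximizing this return is therefore the same as minimizing the number of actions needed to reach the null syndrome.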

    The decoding process described here employs neural networks to distill information from the syndromes and uses stepwise decoding to gradually reduce the syndrome to smaller subsets. Specifically, owing to the periodic boundary conditions of the code, the syndrome can be represented centered on any plaquette. The Q network takes as input a d × d matrix corresponding to the positions of vertex and plaquette errors. The agent can move any defect in any direction, corresponding to a bit flip on the associated physical qubit. The output is a triplet of Q values for X, Y, and Z operations on a specific qubit; to obtain the complete set of action values for the syndrome, we shift and rotate the syndrome sequentially so that each qubit in turn occupies that position. The complete Q function of the syndrome is thus obtained by computing the Q function for each individual qubit. After the selected action is performed, a new syndrome is generated, and this process is repeated until no errors remain.
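    The shifting step can be illustrated with a short sketch. Assuming the syndrome is stored as a d × d list of 0/1 entries, a cyclic shift moves any chosen entry to the centre without changing the syndrome, because of the periodic boundary conditions. The helper `center_on` is a hypothetical name, and the rotation step mentioned above is omitted here.

```python
def center_on(syndrome, row, col):
    """Cyclically shift a d x d syndrome so that entry (row, col) lands at the centre.

    Periodic boundaries mean the shifted matrix describes the same syndrome.
    """
    d = len(syndrome)
    c = d // 2
    return [
        [syndrome[(i + row - c) % d][(j + col - c) % d] for j in range(d)]
        for i in range(d)
    ]


# Illustrative 5 x 5 syndrome with a single defect in the top-right corner.
d = 5
syndrome = [[0] * d for _ in range(d)]
syndrome[0][4] = 1
centered = center_on(syndrome, 0, 4)  # the defect now sits at position (2, 2)
```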

    When decoding the approximate 3D surface code using reinforcement learning,the specific steps are as follows:

    · The constituent parts describe the rules of the environment; when the agent takes an action according to these rules, the environment generates a tuple [S_{t+1} = {s_{sv,t}, h_t}, r_t, t_t].

    · The agent can move any defect in any direction (up, down, left, and right), corresponding to a bit flip on the associated physical qubit. The number of agent actions changes continuously with the number of errors.

    · A neural network represents the Q function. When errors exist in the system, they are sent separately to the Q network, and the Q function is parameterized by the weights and biases of the neural network, written as Q(s, a, θ), where θ is the set of network weights and biases.

    · The Q values of the actions for each error are collected, the action and error are selected using a greedy strategy, and the resulting new syndrome is sent back to the algorithm. The process is repeated until there are no errors.
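    The steps above can be sketched as a greedy decoding loop. This is a toy illustration only: the trained Q network is replaced by a hand-written stand-in `toy_q` (minus the distance to the nearest other defect after a move), defects live on a d × d torus, two defects on the same site annihilate, and an even number of defects is assumed. All names are hypothetical and not taken from the paper.

```python
import itertools

MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def torus_dist(a, b, d):
    """Manhattan distance between two sites with periodic boundaries."""
    dr, dc = abs(a[0] - b[0]), abs(a[1] - b[1])
    return min(dr, d - dr) + min(dc, d - dc)


def moved(defect, move, d):
    """Site reached by moving a defect one step on the torus."""
    return ((defect[0] + MOVES[move][0]) % d, (defect[1] + MOVES[move][1]) % d)


def toy_q(defects, defect, move, d):
    """Stand-in for Q(s, a, theta); NOT a trained network."""
    new = moved(defect, move, d)
    return -min(torus_dist(new, other, d) for other in defects - {defect})


def step(defects, defect, move, d):
    """Apply one action; return the new syndrome and the reward r = -1."""
    new = moved(defect, move, d)
    rest = defects - {defect}
    # Two defects meeting on one site annihilate; otherwise the defect moves.
    nxt = rest - {new} if new in rest else rest | {new}
    return frozenset(nxt), -1


def decode(defects, d):
    """Greedily pick the (defect, move) pair with the highest Q until no defects remain."""
    defects = frozenset(defects)
    actions, total_reward = [], 0
    while defects:
        defect, move = max(
            itertools.product(sorted(defects), MOVES),
            key=lambda dm: toy_q(defects, dm[0], dm[1], d),
        )
        defects, reward = step(defects, defect, move, d)
        actions.append((defect, move))
        total_reward += reward
    return actions, total_reward


# Two defects two sites apart on a 5 x 5 torus are removed in two steps.
actions, total = decode({(0, 0), (0, 2)}, 5)
```

    Because every step costs r = −1, the accumulated reward equals minus the number of steps, which is what pushes the agent toward short correction chains.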

    Given the fundamental factors of the environment, we define the action space A to consist of all Pauli X and Z flips on a single data qubit together with special actions, while Pauli Y flips can be implemented by X and Z flips. Including more qubits in the action space would increase the complexity of the training process. Therefore, we set the agent to operate on single qubits; in practice, single-qubit operations accumulate between successive syndrome measurements, so operations on multiple qubits can be applied simultaneously or simply tracked. We also note that for a well-trained agent, the initial syndrome and the unused error string distribution are typically combined, and most of them can be appropriately corrected by the MWPM algorithm, while a small portion cannot be corrected. Since the agent only sees the given syndrome, it does not have the opportunity to learn and recognize other types of errors and is therefore not applicable to other types of training.

    5.Simulation analysis

    5.1.Training

    The decoder agent is trained with the deep Q network (DQN) algorithm. The agent uses experience replay: it stores its continually acquired experiences as transitions in a buffer and then randomly samples small batches of transitions from the buffer to update the Q network. Drawing mini-batches by random sampling reduces the temporal correlation of the data and improves the stability of the neural network training.
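    A minimal sketch of such an experience replay buffer is given below. The class name `ReplayBuffer`, the capacity, and the placeholder transitions are assumptions made for illustration; sampling with replacement is chosen here, and the paper's actual buffer size is not specified in this section.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size store of transitions with uniform random sampling."""

    def __init__(self, capacity):
        # A bounded deque silently discards the oldest transition when full.
        self.buffer = deque(maxlen=capacity)

    def store(self, transition):
        self.buffer.append(transition)

    def sample(self, batch_size, rng=random):
        # Uniform sampling with replacement breaks the temporal ordering.
        return rng.choices(list(self.buffer), k=batch_size)

    def __len__(self):
        return len(self.buffer)


# Store five placeholder transitions in a buffer that holds only three.
buf = ReplayBuffer(capacity=3)
for t in range(5):
    buf.store(("P%d" % t, "X", -1, "O%d" % t))
batch = buf.sample(4)
```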

    To better suit the CNN, we embedded the syndromes collected in the input phase into a binary matrix that encodes the syndrome information. Each layer of the convolutional neural network had 64 output filters, followed by a fully connected layer with 512 neurons. The DQN is constructed by stacking a feed-forward neural network on top of multiple convolutional layers. The final layer of this network consists of activation units, each encoding an action. A fully connected feed-forward layer follows the convolutional layers, with ReLU serving as the activation function for the hidden layer and softmax used as the activation function for the output layer. To accelerate model training, ResNet layers are introduced as the underlying architecture, with 7, 14, and 21 ResNet layers stacked, which allows a large number of layers to be stacked without reducing the learning efficiency of the convolutional layers. Furthermore, we use two structurally identical neural networks: the regular Q network with parameters θ and the target Q network with parameters θ_t. The target Q network is obtained by cloning the active Q network (the network used to select the best action in each state), and it is synchronized with the Q network at a set time interval.

    We train the agent using the deep Q-learning (DQL) algorithm until the parameters of the CNN are stable. The training sequence starts with the action phase, in which the agent selects actions using the Q network with its current parameters. The agent employs an ε-greedy strategy: with probability (1 − ε) it chooses the action with the highest Q value, and otherwise it takes a random action. Executing an action generates a reward and a new observation from the resulting state, and the complete transition tuple T = (P, a, r, O) is stored in a memory buffer.
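    The ε-greedy rule can be sketched in a few lines. The function name `epsilon_greedy` and the Q values are illustrative assumptions; in the actual training the Q values would come from the Q network.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """With probability epsilon explore randomly, otherwise act greedily.

    q_values: dict mapping action -> Q value for the current state.
    """
    if rng.random() < epsilon:
        return rng.choice(list(q_values))       # exploration
    return max(q_values, key=q_values.get)      # exploitation


# Hypothetical Q values for one state.
q_s = {"X": -2.0, "Y": -5.0, "Z": -3.0}
greedy_pick = epsilon_greedy(q_s, epsilon=0.0)  # always "X"
random_pick = epsilon_greedy(q_s, epsilon=1.0)  # any of the three actions
```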

    The training sequence then enters the learning phase, which uses the stochastic gradient descent algorithm. Firstly, given a batch size of N, a random sample of transitions {T_i = (P_i, a_i, r_i, O_i)}, i = 1, …, N, is drawn with replacement from the buffer, and the training target value of the Q network is defined as

    where γ is the discount factor, and the parameters θ are adjusted continuously so that the Q network prediction approaches the cumulative reward predicted by the target network. Secondly, the gradient descent algorithm is used to minimize the loss function, reducing the difference between the target value of the sample and the Q network prediction until the Q network produces an accurate value for the Q function; the network parameters are adjusted according to −∇_θ Σ_i (y_i − Q(P_i, a_i, θ))². Then, a new training sequence is started, and the target network weights θ_t are synchronized with the weights θ of the Q network at a specified rate. Finally, the ResNet architecture is used for multiple iterations and predictions in the convolutional neural network (CNN): data sets with the same error rate are used for predictive training, training stops when it is close to the threshold, and the parameters of the fully connected network and the Q network are synchronized.
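    To make the learning phase concrete, the sketch below computes training targets and the summed squared loss Σ_i (y_i − Q(P_i, a_i, θ))² for a small batch. It assumes the standard DQN target y_i = r_i + γ max_a′ Q(O_i, a′, θ_t), which is the usual choice but is an assumption here, since the target equation is not reproduced in this text. Using `None` to mark a terminal observation and the lookup-table Q functions are likewise illustrative conventions.

```python
def td_targets(batch, q_target, actions, gamma):
    """Assumed standard DQN target: r + gamma * max over a' of Q_target(O, a')."""
    targets = []
    for _p, _a, r, o in batch:
        if o is None:  # assumed convention: None marks a terminal state
            targets.append(r)
        else:
            targets.append(r + gamma * max(q_target(o, a2) for a2 in actions))
    return targets


def squared_loss(batch, targets, q_online):
    """Sum over the batch of (y_i - Q(P_i, a_i))^2."""
    return sum(
        (y - q_online(p, a)) ** 2 for (p, a, _r, _o), y in zip(batch, targets)
    )


# Toy stand-ins for the target and online networks (illustrative values).
target_table = {("s1", "X"): -2.0, ("s1", "Z"): -1.0}
q_target = lambda s, a: target_table[(s, a)]
q_online = lambda s, a: -1.5

batch = [("s0", "X", -1, "s1"), ("s1", "Z", -1, None)]
ys = td_targets(batch, q_target, actions=("X", "Z"), gamma=0.5)
loss = squared_loss(batch, ys, q_online)
```

    In a full implementation the gradient of this loss with respect to θ would be taken by an automatic differentiation library, and θ_t would be overwritten with θ at the chosen synchronization interval.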

    Fig. 6. The training iteration count as a function of both training error rate and accuracy. The horizontal axis denotes the number of training iterations, while the vertical axis represents training accuracy. Orange, red, and blue markers represent ResNet = 7, 14, and 21, respectively. To facilitate the visual inspection of data, zoom plots are employed.

    The training process, shown in Fig. 6, involves increasing the number of convolutional layers and adjusting the number of training iterations, which significantly improves the training accuracy. Increasing the number of network layers yields more precise training results. Before the number of iterations reaches 300, the ResNet network has an accuracy improvement of about 0.3% at each training interval compared to the ResNet = 7 network. However, once the number of iterations reaches 300 or more, the training accuracy starts to oscillate, overfitting appears, and the improvement in training accuracy is small, fluctuating around 9.0%. After the iteration depth reaches 500, the accuracy of all three networks can reach 9.6%, ensuring accurate predictions of the optimal error-correction chains under noisy conditions. After sufficiently deep training, the Q value matrix of the Q network is fully populated, and the performance approaches that of the optimal decoder, significantly improving the decoding efficiency and accuracy.

    5.2.Error correction performance analysis

    To quantify the decoding performance, the relationship between the logical qubit error rate and the physical qubit error rate must be analyzed. The physical error rate at which the decoder achieves approximately the same performance independent of the surface code distance is defined as the decoder threshold, which is often denoted p and is used as a single parameter to quantify the performance of the decoding algorithm. For any physical error rate below the decoding threshold, it is worth investing in larger code distances. Since it is difficult to capture error correction for approximate surface code dimensions after mapping, this paper uses the MWPM and RL decoders to compute the threshold for restoring approximate 2D surface codes and plots the logical error rate over the physical error rate range for different code distances.

    Fig. 7. The error correction performance of the surface code with code distances d = 5, 7, 9, and 11 decoded with MWPM. To facilitate comparison and analysis, different code distances are represented by different colors, namely orange, purple, blue, and green. (a) The error correction performance of MWPM decoding without approximation. (b) The error correction performance of MWPM decoding after approximate quantum error correction at different d values, with a threshold of 0.5%.

    Through MWPM decoding, the error-correcting performance of regular surface codes and approximate surface codes at different code distances was obtained. As shown in Fig. 7(a), the orange line for code distance 5 indicates a slight, slow increase in the logical error rate with increasing physical error rate. The green line for code distance 11 shows a sharp increase in the logical error rate, which reaches the highest threshold limit (the same horizontal coordinate is observed for each code distance). In general, the logical error rates exhibit a slight and slow increase with a threshold of 0.25%. As shown in Fig. 7(b), after approximate error correction, the logical error rate for code distance 11 increases significantly to 0.098. The data show that the error probability for code distance 11 is the lowest, and its performance is relatively superior. However, the threshold achieved through MWPM decoding is still not high enough, at only 0.5%, which is below our expected result.

    To further improve the threshold of the decoder, we use the trained deep Q network model to perform error correction on lattices of different code distances. As shown in Fig. 8(a), the logical error rate is relatively low for the orange line with code distance 5, while the logical error rate for code distance 11 increases significantly to 0.087. In addition, RL decoding after approximate error correction also achieves significant improvements. As shown in Fig. 8(b), there is a noticeable increase in the logical error rate for code distances 5, 7, 9, and 11, and the threshold reaches 0.78%.

    Regarding the threshold results, MWPM did not achieve the error correction performance we expected, and its error correction success rate was poor under the surface code dimension mapping model. RL models take advantage of agent-environment training, and the use of the ResNet architecture yields good error correction performance. Compared to the MWPM decoder, the threshold after approximate error correction has increased by 56% (from 0.5% to 0.78%), which largely addresses the low threshold and poor error-correcting capability of surface codes. Furthermore, it is of great research interest to apply the RL model not only for threshold determination but also for constructing a universally distinguishable linear decoder.
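    The 56% figure follows directly from the two thresholds reported above (0.5% for MWPM and 0.78% for the RL decoder after approximate error correction). A short check, using a hypothetical helper name:

```python
def relative_improvement(new, old):
    """Relative change of `new` with respect to `old`."""
    return (new - old) / old


# Thresholds in percent, as reported for MWPM and RL decoding.
gain = relative_improvement(0.78, 0.50)
print(f"{gain:.0%}")  # prints 56%
```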

    6.Conclusions

    In summary, we have investigated a novel reinforcement learning based error correction scheme for approximate surface codes under dimensional mapping operations. By exploiting the topological properties of error correction codes to map the surface code to three dimensions, the resulting three-dimensional lattice of topological codes offers a higher degree of redundancy, more effective error correction, and excellent scalability. By reducing the space required for stabilizer measurements and the number of ancilla qubits needed for error correction, the approach saves measurement space and reduces resource consumption. In addition, to improve the decoding efficiency, we introduce a deep Q-learning based RL decoder, which achieves a threshold of 0.78%, an improvement of 56% compared to the MWPM decoder. Of course, this work still has shortcomings. For example, this paper only considers approximate error correction schemes for surface codes; other quantum error correction codes, such as color codes, have not been considered. In terms of decoders, generative adversarial networks are now relatively mature and are the focus of our further research, which is also a preparation for further improving fault tolerance.

    Acknowledgment

    Project supported by the Natural Science Foundation of Shandong Province, China (Grant Nos. ZR2021MF049, ZR2022LLZ012, and ZR2021LLZ001).
