• <tr id="yyy80"></tr>
  • <sup id="yyy80"></sup>
  • <tfoot id="yyy80"><noscript id="yyy80"></noscript></tfoot>
  • 99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

    Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforcement Learning

    2022-10-26 12:23:18RuofanWuZhikaiYaoJennieSiandHeHelenHuang
    IEEE/CAA Journal of Automatica Sinica 2022年1期

    Ruofan Wu, Zhikai Yao, Jennie Si,, and He (Helen) Huang,

    Abstract—We address a state-of-the-art reinforcement learning(RL) control approach to automatically configure robotic prosthesis impedance parameters to enable end-to-end, continuous locomotion intended for transfemoral amputee subjects.Specifically, our actor-critic based RL provides tracking control of a robotic knee prosthesis to mimic the intact knee profile. This is a significant advance from our previous RL based automatic tuning of prosthesis control parameters which have centered on regulation control with a designer prescribed robotic knee profile as the target. In addition to presenting the tracking control algorithm based on direct heuristic dynamic programming(dHDP), we provide a control performance guarantee including the case of constrained inputs. We show that our proposed tracking control possesses several important properties, such as weight convergence of the learning networks, Bellman (sub)optimality of the cost-to-go value function and control input, and practical stability of the human-robot system. We further provide a systematic simulation of the proposed tracking control using a realistic human-robot system simulator, the OpenSim, to emulate how the dHDP enables level ground walking, walking on different terrains and at different paces. These results show that our proposed dHDP based tracking control is not only theoretically suitable, but also practically useful.

    I. INTRODUCTION

    POWERED lower limb prosthesis provides great promise for amputees to regain mobility in daily life. Its potential has been demonstrated for transfemoral amputees’ walking ability [1], [2]. Such robotic devices rely on an impedance control framework which is designed based on human biomechanics to mimic the central nervous system controlled human joint movements to provide a natural substitute to the lost limb functions. These devices require customization of the impedance parameters for each individual user. Currently,configuration of the powered devices is performed in clinics by technicians who manually tune a subset of impedance parameters over a number of visits of the patient. This procedure is time and labor intensive for both amputees and clinicians. Therefore, an automatic approach to tuning the powered prosthesis parameters is needed.

    Automatically configuring the impedance parameter settings has been attempted over the past several years. An untested idea aims at estimating the joint impedance based on biomechanical measurements and a model of the unimpaired leg [3],[4]. This idea may not be practically useful as the biomechanics and the joint activities of amputees are fundamentally different from those of the able-bodied population. Another approach is to constrain the knee kinematics via the relationship of the joint control and intrinsic measurements,which in turn requires careful modeling and thus may not be feasible [5], [6]. Such an approach relies on significant domain knowledge and is tuning time. A cyber expert system was proposed [7] to emulate the prosthetists’ tuning decisions of human experts into configuring the control parameters.This approach heavily relies on the expert’s experience and is not expected to scale well to more joints and different users and tasks.

    As those methods all have their fundamental limitations in principled ways, new approaches to configuring the prosthesis control parameters are needed. RL based adaptive optimal control approach is a promising alternative as they have demonstrated their capability of learning from data measurements in an online or offline manner in several realistic application problems including large-scale control problems [8]–[13]. The core of the RL methods is the idea of providing approximate solutions to the Bellman equation of optimal control problems. We have successfully developed several RL algorithms to configure impedance parameter settings, including actor-critic RL [14]–[17] and policy iteration based RL approaches [18]–[21], and systematically tested them in both extensive simulations and in experiments using able-bodied and transfemoral amputee subjects.

    All of our RL control approaches to date require a target knee motion profile which can only be subjectively determined. Nonetheless, those results are important as they provided the necessary understanding of the impedance control parameter configuration problem and if RL control is capable of solving this problem. Even though detailed understanding of human locomotion at neurological and biomechanical levels has long been established, the individual human subject’s locomotion dynamics are still not feasible to model accurately by mathematical descriptions as individuals differ physically, biologically and neurologically. Additionally, different locomotion tasks, such as changing pace [22],sloped walking [23] and walking on uneven terrain [24] all have significant influence on human gait behavior. As such,accurately prescribing each and every locomotion behavior for control purposes is not feasible.

    Tracking the intact knee joint motion by a prosthetic knee is an intuitive idea as the intact knee kinematics is the most natural and realistic target: it contains actual biological joints’inter-relational information, which makes it a good candidate to replace a subjectively defined knee profile. Studies have shown that bilateral coordination between two legs are needed in the regulation of bipedal walking to maintain stability, and that such interlimb cooperation can be accomplished at a spinal level. Since the spinal level locomotor networks are symmetrically organized [25], sensory and muscle activities of both sides are involved in rhythmic walking. Amputees usually display asymmetrical walking by relying heavily on their intact limbs because of the loss of sensory feedback.

    Tracking the intact knee actually has been explored years ago. Grimeset al.developed a mirror control scheme for the stance phase (not a complete gait cycle). It tracked the sound limb’s knee trajectory in the stance phase by multiplying a gain factor to avoid over flexion while a fixed trajectory was applied in swing phase. Bernal-Torreset al.copied the full gait trajectory by the Kalman filter with a biomimetic designed prosthesis but no human experiment or systematic simulations were reported [26]. Joshiet al.developed a control strategy by controlling the swing time to mirror the stride duration of the intact knee while the prosthesis was locked during stance phase [27]. Sahooet al.aimed at mirroring the step length by controlling the push-off force[28]. The above approaches focused on either part of a gait cycle or the outcome measurement such as step length and stance time. None of them has shown feasibility of tracking a completed gait cycle.

    Virtual constraints were proposed to generate coordinated joint motions as target joint motion profiles for the robotic knee to track [29]. Biomimetic virtual constraints described the joints’ geometric relationships and were encoded by hybrid zero dynamics [30]. However, there are a few limitations on this approach. Virtual constraints require a simplified human model to establish the geometric relationship among joints. Such a model is difficult to establish for a human-prosthesis system. In a recent work of prosthesis control design based on the virtual constraints [29], only a proportional gain was derived and applied. The overall human-prosthesis performance during locomotion is yet to be demonstrated.

    Tracking an intact knee motion poses additional challenges beyond regulation control. Individual prosthesis users demonstrate very different gait patterns, which are also very different from one another, and also from healthy subjects.This may be caused by individual’s physical conditions and/or biological factors such as reduced or lost proprioception and condition of socket fitting. It has long been believed that the intact knee motion pattern changes as the user learns to walk in a prosthesis (our data below also provide corroborating evidence about this), tracking a moving target knee motion has never been demonstrated by any controller. Most existing tracking control designs based on mathematical models of the human-prosthesis dynamics and the reference trajectory, both are difficult or actually impossible to obtain. For data-driven adaptive optimal control, reports on data-driven tracking control are much less than those of regulation control. Most approaches are based on reinforcement learning to establish approximate solution to the HJB equation, yet, few of them have demonstrated feasibility in engineering problems. And even fewer results can be found to provide systematic studies on the control design to demonstrate applicability.

    In this paper, we propose an RL tracking control scheme for the robotic knee to mimic the intact knee in different locomotion tasks. In a previous experiment [31], we successfully tested this pilot idea of RL tracking control to automatically configure impedance parameter settings. In this study, we formally formulate the tracking control problem, develop a complete tracking control algorithm based on dHDP, and provide an analytical framework to validate the real time control performance guarantee by using this proposed scheme.The contributions of this work include the following.

    1) We provide the first systematic demonstration of an endto-end, continuous walking enabled by automatic tracking control of a wearable lower limb robotic device beyond our previous regulation control results [14]–[21].

    2) We show that our proposed tracking control possesses several important properties, such as weight convergence of the learning networks, Bellman (sub) optimality of the cost-togo value function and control input, and practical stability of the human-robot systems.

    3) We provide a systematic evaluation of the proposed tracking control using a realistic human-robot system simulator to demonstrate level ground walking, slope walking under various slope angles and walking under different paces. These results show that our proposed dHDP based tracking control is not only theoretically suitable, but also practically useful.

    The remaining of this paper is organized as follows. Section II describes the human-prosthesis system and develops dHDP to solve the tracking problem. Section III gives the Lyapunov stability analysis of system. Section IV presents the implementation using Opensim simulations. Section V presents extensive simulations. Discussions and conclusion are presented in Section VI.

    II. METHOD

    Our proposed RL tracking control is built upon the finite state machine (FSM) impedance controller (IC) framework. It is to mimic the torque-generating capability of biological joints to enable natural movement. Humans reportedly control muscle activity to adjust joint impedance in walking[32]–[34], and the compliant behaviors of legs are fundamental to human locomotion [35], [36]. Within this context,impedance represents the inherent property of a mechanical joint. It describes the relationship between external force and motion produced or in other words, it is a dynamic property that governs human joint-torque relationship. In our study, the prosthesis joint is therefore characterized by the stiffness (K),damping ratio (B), and equilibrium position ( θe). The FSM-IC provides intrinsic control in the form of adjustable control torque influenced by impedance parameters. The settings of the impedance parameters as control inputs have to be adjusted or adapted to meet individuals’ needs including their different physical conditions. Our proposed RL tracking control is to automatically provide such needed impedance parameter settings. In turn, the joint impedance affects the knee kinematics such as peak value of knee angle collaboratively. External force such as ground reaction force and human reactions also affect knee angle or peak value.However, many of these factors are difficult, or nearly impossible, to be modeled. Gait duration is influenced by impedance control parameters [17], as well as human’s movement control of residual hip and intact limb in gait.Meanwhile, the transition between double-stance and singlestance will influence the duration as well since they have different dynamics. Therefore, the relationship between impedance and peak knee angle/duration is not deterministic and difficult to model. These challenges motivate us to consider data-driven RL control.

    A. Finite State Machine (FSM) Impedance Control (IC)

    The FSM-IC is common for prosthesis intrinsic control as studies have shown that humans control the stiffness of leg muscles and therefore joint impedance while walking, and compliant behaviors of legs are instrumental for human walking. The impedance controller generates a torque input to the robotic knee based on current knee kinematics and knee joint impedance settings.

    Refer to Fig. 1(b), a gait cycle is divided into four phases in the FSM-IC: stance flexion (STF,m=1), stance extension(STE,m=2) , swing flexion (SWF,m=3) and swing extension (SWE,m=4). The phase transitions are determined by knee motion and gait events (heel strike and toe-off) that are obtained from vertical ground reaction forces of both legs.In each phase of the FSM, three impedance parameters(stiffnessK, dampingB, and equilibrium position θe) are provided as inputs to the FSM-IC for gait cyclek

    Fig. 1. Two control loops are coordinated to realize automatic control of the prosthetic knee based on an impedance control framework: i) an intrinsic impedance control (left most panel of (a)) that operates at 100 Hz that creates a control torque according to (2) based on the knee kinematics and also the impedance settings, and ii) an RL controller that is updated at each step k. The output is the adjustment of the impedance setting that is to be used in the intrinsic controller. (a) Flow chart of the automatic robotic knee control parameter tuning scheme by dHDP. (b) The dHDP controller to adjust the impedance setting. (c)The structure of the critic network in the RL controller.

    The RL controller will adjust these impedance parameters,i.e.,

    so that the updated impedance parameters are applied to the FSM-IC to generate knee torque according to (2)

    B. Tracking Problem Formulation

    Biomechanical studies have shown that the intact knee joint movements or profiles change as amputees adapt to a prosthetic device [37]. We have observed the same in our pilot study using two human subjects [31]. Fig. 2 is an illustration of the same phenomenon using simulations where the intact knee kinematic trajectories were recorded, and changes in profile features are clearly observed. The goal of the RL controlled robotic knee is therefore to track those time varying intact knee profile features for each and every phase during each and every gait cycle. For a gait cyclek, the robotic knee motion (Fig. 1(b)) featured by the peak knee angle(degrees) and duration(seconds) is measured. Let

    Fig. 2. An illustration of how the intact knee profile changes as the robotic knee control parameters adapt during a level ground walking simulation session. The peak angles (blue) and the phase durations (orange) are shown in all phases.

    Similarly, we measure the peak knee angle and duration of the intact knee, and let

    We consider a human-robot, i.e., an amputee-prosthesis system, a discrete-time nonlinear dynamic system with unknown nonlinearities. For the ease of discussion, we let the dynamics be represented by

    where the nonlinear mappingFis Lipschitz continuous on the domain ofwhere Z and U are compact sets with dimensions ofNZandNu, respectively. In the human-robot system under consideration,Frepresents the kinematics of the robotic knee, which is affected by both the human wear and the RL controller. Because of a human inthe-loop, an explicit mathematical model as (7) is intractable or impossible to obtain.

    Without causing any confusion and for the sake of convenience, we drop the superscriptm(m=1,2,3,4) in the rest of the paper because all four FSM-ICs and their respective RL controllers share the same structure, although the RL controllers for each phase have different parameterizations or in other words, the control policies are different for each phase even though they have the same structure. Then, the tracking error between the intact knee and the prosthetic knee is defined as

    C. dHDP for Tracking Control

    Fig. 1(a) depicts the RL based solution approach to automatically configure the impedance parameters of the robotic knee to track the intact knee joint motion within the FSM-IC framework. Each RL control block corresponds to one of the four FSM phases. As shown in Fig. 1(a), we develop a dHDP based RL tracking control with each of the four dHDP blocks providing impedance parameter settings for each of the four gait phases. Each dHDP block has an action network and a critic network, trained for the given FSM phase only.

    In the RL tracking controller, let the state be denoted bysk,and the control input/action network output asukfor gait cyclek, i.e.,

    We consider the stage cost in a quadratic form

    whereRs∈R2×2andRu∈R3×3are positive definite matrices.Note that, our following results are also applicable to other stage costs such as a reinforcement signal of finite discrete levels or a general bounded sefmi-definite reinforcement signal.

    We consider the tracking problem as one to devise an optimal control law via learning from observed data along the human-robot interacting system dynamics. We define the state-actionQ-function or the total cost-to-go as

    Note that theQ(sk,uk) value is a performance measure when actionukis applied at statesk. SuchQ(sk,uk) formulation implies that we have considered the optimal adaptive tracking control of the robotic knee as a discrete-time, infinite horizon,discounted problem without knowing an explicit mathematical description of the human-robot interacting dynamics.

    For theQ-function in (11), it satisfies the Bellman equation

    Assumption 1:The state trajectoryYkof the intact knee is bounded, and the initial robotics knee stateZ0is bounded.

    1) Critic Network:Fig. 1(c) depicts the structure of the critic network which is realized by an universal approximator with one hidden layer. Therefore, the approximated value is

    The approximation error of the critic network is

    In the following, we use the short-hand notationUk?1forU(sk?1,uk?1) and similarly for others. The weightsandare updated as

    According to the gradient descend rule as in [38], ?Wc1,kand ?Wc2,kcan be written as

    where ?c,kis the output of hidden layers in critic network, andlc,kis the learning rate.

    2) Action Network:The output of the action network is the control input

    Based on the design principle of dHDP [38], the action network is to minimize the total cost-to-goQ(sk,uk). We defined the prediction error of the action network as

    Similarly to (16), the weightsandare updated as

    III. LYAPUNOV STABILITY ANALYSIS

    In this section, we provide a qualitative analysis for the weight convergence of the actor-critic networks, the Bellman(sub) optimality of the control policy, and practical stability of the human-prosthesis system.

    A. Preliminaries

    B. Weight Convergence, (Sub) Optimality and Practical Stability

    To guarantee that the second and the third terms in the last expression are negative, we need to choose learning rates in the following manner:

    Remark 2 (Practical Stability):From Assumption 1,Remarks 1–3, and Theorem 1, we obtain that all the signals(such asuk,Zk,Yk, andsk) are bounded in the human-robot system (7). As shown in Theorem 2, the approximatedQvalue in (13) and the resulted policy (17) achieve (sub)optimality as time stepkincreases. This demonstrates that the stage cost in (10) approaches zero in the sense of uniformly ultimately boundedness. As such, the tracking error (8)approaches zero in the sense of uniformly ultimately boundedness also as time stepkincreases. Therefore, the considered human-robot system is practically stable.

    IV. OPENSIM SIMULATION STUDIES

    Table I is a summary of how our proposed dHDP tracking control is implemented and applied to configure the impedance parameters of the robotic knee. Pertinent information for the implementation steps is provided below.

    TABLE I IMPLEMENTATION OF SIMULATION STUDIES

    A. OpenSim Model Setup

    We investigated this tracking control problem using OpenSim, a well-established simulator in the field of biomechanics [41]. A bipedal walking model, as shown in Fig. 3(a), includes a body of five rigid-segments, linked through a one degree of freedom pin joint and the pelvis was linked to the ground by a free joint to allow free movement.The model settings, such as segment length, body mass and inertial parameters, followed the lower limb OpenSim model[42]. In this study, the left knee was defined as intact while the right as prosthetic to enable locomotion in a single simulation model. The intact knee was controlled by a fixed set of impedance parameters settings while the prosthetic knee was controlled by the RL controller with its impedance parameters updated for each gait cycle.

    Fig. 3. OpenSim model. (a) Five-rigid segment bipedal walking model; (b)The next gait is used to measure the tracking error as the prosthetic knee needs to copy the intact knee that is half gait ahead.

    Simulating a gait cycle in OpenSim requires specifying initial model settings such as walking speedvxandvy, both knee angles θLand θRwhich are set to small numbers near stance position. We added artificial Gaussian noises to the statesand controluto simulate sensor noise and actuator noise, a consideration to make the simulations more realistic.The sensor noise magnitude was set at 20% of the performance tolerance bound and the actuator noise was 1%of the maximum action value.

    B. Simulation Procedure

    All simulations were carried out in trials. A trial is a continuous experiment of 500 gait cycles when RL tracking control is applied to tuning the impedance parameters of the robotic knee under different simulation scenarios. We performed two sets of simulations: training trials and testing trials. During training, we performed 30 training trials each from a randomly initialized controller, i.e., a set of randomly generated initially feasible impedance parameter settings, that allow the simulator to simulate balanced walking without falling. During testing, we randomly selected 10 successful controllers (i.e., 10 sets of impedance parameters after training) and applied those control policies (i.e., the actor network weights) as initial controller parameter settings for tracking new, untrained trajectory profiles. Then we tested each of the 10 controllers (policy network weights) to perform 30 new trials, each of which has a new set of randomly selected initial impedance parameters.

    A trial is considered successful if the tracking error in (8)reached an error tolerance bound (Table II, bottom row).Specifically, for each of the 4 phases (Fig. 1(b)), if the tracking error was within the tolerance bound for 8 out of 10 consecutive gait cycles, tracking process in this phase was considered convergent. If all 4 phases had converged within 500 gait cycles, the trial was a success.

    To ensure subjects safety (not stumble or fall), a safetybound was introduced based on the realistic conditions of balanced walking. Specifically, as shown in Table II (top row), the safety bound was set at 1.5 standard deviations above the knee kinematic peak values observed in each phase[43]. If the tracking errors exceed the safety bound, which means the prosthetic profile may place subjects in unsafe areas, the impedance parameters of the prosthetic knee will be reset to the initial impedance. Note however, the actor and critic network weights are retained for further training until meeting tracking criteria.

    TABLE II SAFETY BOUND AND TOLERANCE BOUND

    In obtaining all results, we set the weighting matrices in the stage cost (10) as:Rs=diag(1,1) andRu=diag(0.1,0.1,0.1).For the critic network, we used 8 hidden layer neurons with hyperbolic activation function and we used a linear output layer. For the actor network, we used 6 hidden layer neurons with hyperbolic activation function and we also used a hyperbolic activation function for the output layer neuron so that the control inputs are constrained. For both networks,learning rate was 0.1. The actor-critic network was updated every gait cycle until reaching trial success.

    C. Simulating Realistic Walking

    To evaluate the efficacy of the proposed dHDP tracking control, we performed a systematic simulation study to evaluate human-robot walking performance under RL tracking control. We emulated three walking conditions: ground walking, walking on different terrains and at different paces.

    Scenario 1 (Level Ground Tracking):The intact knee was operated by a fixed set of impedance parameters at all time while the robotic knee impedance parameters were controlled by RL controller. Note that, fixed impedance parameters still provide realistic control according to (2) as the intact knee was also controlled by FSM-IC. For both the training stage and the testing stage, the RL controller was required to successfully track the intact knee profile within 500 gait cycles, while the OpenSim setting was placed at constant walking pace under level ground condition.

    Scenario 2 (Walking on Different Terrain):To simulate walking on different terrains, we provided different gait profiles for the intact knee which correspond to different impedance control parameters. The changing profiles of the intact knee were then the new moving targets for the prosthetic knee to track. Five randomly initialized impedance parameter sets that could enable balanced walking of the human-robot system were generated. The impedance parameters of the intact knee were randomly selected from this pool of 5 different gait profiles every 20 gait cycles.During training, the controller was required to successfully track the intact knee for 3 consecutive times with respective knee profiles. During testing, a new pool of 5 sets of impedance parameters was generated and used on the intact knee, the respective profiles of which were tracked by the robotic knee. Again, the controller was required to successfully track the intact knee profile for 3 consecutive times.

    Scenario 3 (Changing Pace):We examined RL tracking performance when the pace changes. The simulated pace changes were implemented in a sequence of [100%→112%→100%→88%] of the initial pace. Changing pace took place once the controllers successfully tracked the intact knee by meeting convergence criteria. The controller must complete the full sequence to complete the training stage. In the testing stage, the pace change was placed in a different order of[100%→80%→100%→120%] which also signifies a greater variance than the respective conditions for training.Same as in the training stage, each controller must finish the full sequence to be counted as a success.

    V. RESULTS

    All three scenarios were simulated in both training and testing stages with the same human subject. Table III shows the tracking performance for all three scenarios.

    TABLE III SIMULATION RESULTS OF ALL THREE SCENARIOS

    In Scenario 1, a 1 00% success rate was achieved with 64.6 average steps to fully learn to track the intact knee. The RMS tracking error was reduced from 0.0588 to 0.0131 radian of peak angle and from 1 .96% to 0.69% of phase duration. In the test stage, because a trained actor network was used in initialization, an improvement in tuning speed was observed from 64.6 to 36.3 average steps. The RMS tracking error has a similar performance.

    In Scenario 2, a 97% success rate was achieved with 55.8 average steps for successfully tracking the intact knee each time. The RMS tracking error was reduced from 0.0581 to 0.0124 radian of peak angle and reduced from 1 .82% to 0.66%of phase duration. In the test stage, the trained policy of actor network shows the ability to track the target profile rapidly.An average of 19.16 tracking steps with a 100% success rate shows the ability to track a changing target effectively.Fig. 4(a) shows an example of typical training and testing trials. The RMS tracking error has a similar outcome in both training and testing as they share the same success criteria.

    Scenario 3 focuses on examining tracking performance reflected in gait duration. 80% success rate was achieved with 52.5 average steps for successfully tracking the intact knee at different paces. The RMS tracking error was reduced from 0.0573 to 0.0103 radian of peak angle and reduced from 1.90% to 0 .92% of phase duration. In the test stage, the trained policy of actor network shows a slightly improvement on tracking speed. But it greatly improved the success rate from 0.8 to 0.92. Fig. 4(b) shows an example of typical training and testing trials. The RMS tracking error has a similar outcome in both training and testing as they share the same success criteria.

    Fig. 5 shows RMS tracking errors for all trials in all simulation scenarios. A significant reduction in tracking error was observed under all conditions. The difference between training and testing is minor because they use the same randomization in initial impedance parameters and the same convergence criteria.

    Fig. 4. Typical trials for Scenario 2 (a) and Scenario 3 (b). The left columns show tracking errors during training while the horizontal green dashed lines represent tolerance bounds and the grey vertical dashed lines represent task/pace transitions. The right columns show tracking trajectories during testing.

    Fig. 5. Summary of RMS tracking errors of the peak angle (top) and phase duration (bottom) for all three scenarios. The blue bars represent the initial tracking errors while the red bars represent the final tracking errors.

    VI. CONCLUSIONS AND DISCUSSIONS

    We have introduced a new RL based tracking control scheme for automatic tuning of robotic knee impedance parameter settings of a robotic knee to track the intact knee kinematics. For the first time, we successfully demonstrated stable and continuous walking in simulations of a human wearing a robotic prosthesis which was designed to track the intact knee motion.

    Mirroring the intact knee motion by a prosthetic knee is an intuitive idea which has been proposed for decades, but has not been successfully demonstrated. The robotic knee control to mimic the intact knee joint is a tracking problem in classical control. Even though many control theoretic solutions exist, such as backstepping [44]–[46], observerbased control [47] and nonlinear adaptive/robust control [48],[49], they are inadequate for this problem as they require an accurate mathematical description of the system dynamics,which involve co-adapting human and robot in this case, and which are nearly impossible to obtain. Additionally, those control theoretic approaches focus on the stabilization (in the Lyapunov sense) of the nonlinear dynamic systems without addressing control performance such as the Bellman optimality.

    Recently, some results emerged to tackle these issues using data-driven, learning enabled, nonlinear optimal tracking control designs [50]. Unfortunately, many of the reported results have focused on theoretical analyses, which are usually based on requiring a reference model for the desired movement trajectory and/or control trajectory. They are thus not practically useful.

    In this paper, we have presented a complete tracking control algorithm based on dHDP. Additionally, we have systematically evaluated the performance of the proposed tracking controller. Our simulation results have shown effectiveness of the tracking controller for different walking tasks that emulate level ground walking, walking on different terrains and at different paces. Based on our previous work, we expect these new results to be verified in human experiments at a future time.

    APPENDIX

    Then, by using the cyclic property of matrix trace, the last term in (52) is bounded by

    We obtain (30) by substituting (52), (53) into (50). ■

    在线观看免费视频网站a站| 夜夜躁狠狠躁天天躁| 99国产精品免费福利视频| 亚洲熟女毛片儿| 免费搜索国产男女视频| 天天躁夜夜躁狠狠躁躁| 免费观看精品视频网站| 午夜免费激情av| 亚洲欧美日韩无卡精品| 校园春色视频在线观看| 亚洲九九香蕉| 久久人人爽av亚洲精品天堂| 国产精品成人在线| 韩国精品一区二区三区| 中文字幕最新亚洲高清| 最近最新免费中文字幕在线| 国产xxxxx性猛交| 亚洲色图av天堂| 免费在线观看黄色视频的| 国产无遮挡羞羞视频在线观看| 国产精品偷伦视频观看了| 国产精品影院久久| 高潮久久久久久久久久久不卡| 色精品久久人妻99蜜桃| 一区福利在线观看| 真人做人爱边吃奶动态| 久久精品亚洲av国产电影网| 一a级毛片在线观看| 咕卡用的链子| 美女 人体艺术 gogo| 久久久水蜜桃国产精品网| 亚洲一区中文字幕在线| www.www免费av| 午夜福利欧美成人| 色婷婷久久久亚洲欧美| 免费一级毛片在线播放高清视频 | 19禁男女啪啪无遮挡网站| 不卡一级毛片| 免费观看精品视频网站| 国产黄色免费在线视频| 久久精品国产亚洲av香蕉五月| 欧美日韩黄片免| 在线观看日韩欧美| 另类亚洲欧美激情| 日日夜夜操网爽| 交换朋友夫妻互换小说| 97人妻天天添夜夜摸| 亚洲成a人片在线一区二区| 亚洲欧美一区二区三区黑人| 久久精品国产亚洲av香蕉五月| 精品国产超薄肉色丝袜足j| 久久青草综合色| 三级毛片av免费| 精品免费久久久久久久清纯| 日韩精品免费视频一区二区三区| 不卡av一区二区三区| 色尼玛亚洲综合影院| 在线观看免费日韩欧美大片| 亚洲av成人av| 日韩精品青青久久久久久| 亚洲七黄色美女视频| 精品国产亚洲在线| 亚洲av成人一区二区三| 久久精品国产99精品国产亚洲性色 | 午夜精品国产一区二区电影| 久久青草综合色| 色婷婷av一区二区三区视频| 精品久久久久久久毛片微露脸| av天堂在线播放| 在线观看免费视频日本深夜| 一级毛片精品| 老司机午夜福利在线观看视频| 免费人成视频x8x8入口观看| 一区二区日韩欧美中文字幕| 国产精品一区二区精品视频观看| 9色porny在线观看| 精品一区二区三区四区五区乱码| 交换朋友夫妻互换小说| 99国产精品免费福利视频| 亚洲精品在线观看二区| 欧美激情久久久久久爽电影 | 后天国语完整版免费观看| 久久中文字幕一级| 男女下面进入的视频免费午夜 | 神马国产精品三级电影在线观看 | 一进一出抽搐动态| 亚洲狠狠婷婷综合久久图片| 日本欧美视频一区| 少妇粗大呻吟视频| 精品国产国语对白av| 色婷婷av一区二区三区视频| 高清毛片免费观看视频网站 | 亚洲免费av在线视频| 狠狠狠狠99中文字幕| 男女午夜视频在线观看| 韩国av一区二区三区四区| 69av精品久久久久久| 国产亚洲精品久久久久久毛片| 免费看十八禁软件| 十分钟在线观看高清视频www| 亚洲 欧美 日韩 在线 免费| 99久久国产精品久久久| 男人舔女人下体高潮全视频| 丰满人妻熟妇乱又伦精品不卡| 国产黄a三级三级三级人| 亚洲av成人一区二区三| a级毛片在线看网站| 国产精品国产av在线观看| 久久久久亚洲av毛片大全| 日韩中文字幕欧美一区二区| 美女福利国产在线| 日韩精品中文字幕看吧| 伊人久久大香线蕉亚洲五| 久久伊人香网站| 午夜精品久久久久久毛片777| 一夜夜www| 亚洲精品国产精品久久久不卡| 99re在线观看精品视频| 久久人人97超碰香蕉20202| 国产精品99久久99久久久不卡| 亚洲 欧美一区二区三区| 亚洲全国av大片| x7x7x7水蜜桃| 黄片大片在线免费观看| 成在线人永久免费视频| 五月开心婷婷网| 天堂√8在线中文| 中文字幕人妻丝袜一区二区| 国产精品香港三级国产av潘金莲| 亚洲伊人色综图| 身体一侧抽搐| 无人区码免费观看不卡| 亚洲av成人不卡在线观看播放网| 一级毛片高清免费大全| av网站在线播放免费| 久久人妻熟女aⅴ| 男女床上黄色一级片免费看| 久久精品aⅴ一区二区三区四区| 99国产极品粉嫩在线观看| 老汉色∧v一级毛片| 久久香蕉精品热| 高清欧美精品videossex| 国产精品日韩av在线免费观看 | 日韩一卡2卡3卡4卡2021年| 亚洲精华国产精华精| 国产精品永久免费网站| 最新在线观看一区二区三区| 一进一出抽搐gif免费好疼 | 真人做人爱边吃奶动态| 级片在线观看| 久久久国产一区二区| 精品无人区乱码1区二区| 亚洲人成电影免费在线| 人妻久久中文字幕网| 久久久久久亚洲精品国产蜜桃av| 1024视频免费在线观看| 亚洲黑人精品在线| 精品久久蜜臀av无| 香蕉久久夜色| 亚洲专区字幕在线| 香蕉久久夜色| 又黄又粗又硬又大视频| 国产亚洲精品一区二区www| 国产又色又爽无遮挡免费看| 欧美+亚洲+日韩+国产| 亚洲全国av大片| 女人被躁到高潮嗷嗷叫费观| 97碰自拍视频| 国产亚洲精品久久久久久毛片| 国产免费男女视频| 日日夜夜操网爽| 9色porny在线观看| 久久伊人香网站| 国产精华一区二区三区| 母亲3免费完整高清在线观看| 久久人妻福利社区极品人妻图片| 久久香蕉国产精品| 啦啦啦 在线观看视频| 久久中文看片网| 精品人妻在线不人妻| 欧美+亚洲+日韩+国产| 男人舔女人的私密视频| 在线观看www视频免费| 天堂俺去俺来也www色官网| 在线观看www视频免费| 一二三四在线观看免费中文在| 老司机亚洲免费影院| 91在线观看av| 国产成人免费无遮挡视频| 国产激情欧美一区二区| 我的亚洲天堂| 国产精品一区二区免费欧美| 高清在线国产一区| 在线播放国产精品三级| 成年女人毛片免费观看观看9| 热99re8久久精品国产| 欧美乱妇无乱码| 国产精华一区二区三区| 69精品国产乱码久久久| 一进一出抽搐动态| 亚洲成a人片在线一区二区| 久久久久国产一级毛片高清牌| 欧美日韩亚洲国产一区二区在线观看| 精品午夜福利视频在线观看一区| 国产精品亚洲av一区麻豆| 国产97色在线日韩免费| 日本wwww免费看| 高清毛片免费观看视频网站 | 精品久久久久久电影网| 视频区图区小说| 亚洲人成电影免费在线| 国产国语露脸激情在线看| 99热国产这里只有精品6| 视频区图区小说| 日本一区二区免费在线视频| x7x7x7水蜜桃| 国产国语露脸激情在线看| 99在线视频只有这里精品首页| 欧美成狂野欧美在线观看| 亚洲五月婷婷丁香| 欧美色视频一区免费| 国产色视频综合| 成人黄色视频免费在线看| 在线观看免费午夜福利视频| 精品国产一区二区久久| 久久久久久亚洲精品国产蜜桃av| 十分钟在线观看高清视频www| 自拍欧美九色日韩亚洲蝌蚪91| 精品日产1卡2卡| 麻豆一二三区av精品| 亚洲成人免费电影在线观看| 亚洲激情在线av| 国产欧美日韩精品亚洲av| 美女高潮到喷水免费观看| 嫩草影院精品99| 久久人妻福利社区极品人妻图片| 国产亚洲精品久久久久5区| 一级黄色大片毛片| 变态另类成人亚洲欧美熟女 | 免费av中文字幕在线| 亚洲国产中文字幕在线视频| 在线播放国产精品三级| 欧美久久黑人一区二区| 欧美最黄视频在线播放免费 | 老司机深夜福利视频在线观看| 超碰成人久久| 激情在线观看视频在线高清| 国产亚洲av高清不卡| 999久久久精品免费观看国产| 91成年电影在线观看| 亚洲激情在线av| 美女高潮喷水抽搐中文字幕| 美女高潮到喷水免费观看| 免费观看人在逋| 老鸭窝网址在线观看| 国产精品成人在线| 午夜老司机福利片| 亚洲一区高清亚洲精品| 国产aⅴ精品一区二区三区波| xxxhd国产人妻xxx| 亚洲中文日韩欧美视频| 热99国产精品久久久久久7| 欧美日韩国产mv在线观看视频| 精品高清国产在线一区| 岛国在线观看网站| 欧美成人午夜精品| ponron亚洲| 亚洲成人免费电影在线观看| 欧美+亚洲+日韩+国产| 国产又色又爽无遮挡免费看| 亚洲av成人av| 亚洲国产精品一区二区三区在线| 中文欧美无线码| 国产欧美日韩综合在线一区二区| 欧美中文日本在线观看视频| 免费av中文字幕在线| 99国产精品99久久久久| 老熟妇乱子伦视频在线观看| 亚洲精品中文字幕在线视频| av天堂在线播放| 亚洲第一青青草原| 亚洲精品粉嫩美女一区| 51午夜福利影视在线观看| 久久精品亚洲av国产电影网| 亚洲一区高清亚洲精品| 正在播放国产对白刺激| 久久精品国产综合久久久| 精品少妇一区二区三区视频日本电影| 成人av一区二区三区在线看| 好男人电影高清在线观看| 日韩 欧美 亚洲 中文字幕| 国产黄a三级三级三级人| 久久久久久久精品吃奶| 国产极品粉嫩免费观看在线| 免费久久久久久久精品成人欧美视频| 久久 成人 亚洲| 777久久人妻少妇嫩草av网站| 久久精品人人爽人人爽视色| 国产精品二区激情视频| 91av网站免费观看| 青草久久国产| 成人三级做爰电影| 热re99久久国产66热| 亚洲欧美日韩另类电影网站| 人人妻,人人澡人人爽秒播| 88av欧美| 啪啪无遮挡十八禁网站| 久久这里只有精品19| 法律面前人人平等表现在哪些方面| 看片在线看免费视频| 亚洲人成网站在线播放欧美日韩| 天天躁狠狠躁夜夜躁狠狠躁| 天堂√8在线中文| 1024视频免费在线观看| 首页视频小说图片口味搜索| 真人做人爱边吃奶动态| 777久久人妻少妇嫩草av网站| 麻豆久久精品国产亚洲av | 欧美人与性动交α欧美精品济南到| 色尼玛亚洲综合影院| 国产成人免费无遮挡视频| 正在播放国产对白刺激| 香蕉国产在线看| 日本黄色日本黄色录像| 在线观看一区二区三区激情| 欧美日韩瑟瑟在线播放| 精品人妻1区二区| 色综合站精品国产| 久久精品人人爽人人爽视色| 交换朋友夫妻互换小说| 午夜老司机福利片| 久久国产精品人妻蜜桃| 亚洲一区中文字幕在线| av天堂在线播放| 久久精品影院6| 久久久久久久久久久久大奶| av福利片在线| 成熟少妇高潮喷水视频| 99国产精品99久久久久| 亚洲欧美日韩另类电影网站| 男女午夜视频在线观看| 男女床上黄色一级片免费看| 国产精品 国内视频| 欧美人与性动交α欧美精品济南到| 日本三级黄在线观看| 咕卡用的链子| 一进一出抽搐gif免费好疼 | 国产精品美女特级片免费视频播放器 | 日韩大码丰满熟妇| 成在线人永久免费视频| 人成视频在线观看免费观看| 中文欧美无线码| 久久久国产一区二区| 丝袜美足系列| 黑人巨大精品欧美一区二区蜜桃| 亚洲 国产 在线| 一进一出好大好爽视频| 国产亚洲精品久久久久5区| 视频区图区小说| 在线观看免费高清a一片| 九色亚洲精品在线播放| 巨乳人妻的诱惑在线观看| 成人亚洲精品一区在线观看| 99精品久久久久人妻精品| 99久久综合精品五月天人人| 午夜福利影视在线免费观看| 伊人久久大香线蕉亚洲五| 啦啦啦 在线观看视频| 欧美乱妇无乱码| 制服人妻中文乱码| 亚洲色图av天堂| 精品国产一区二区三区四区第35| 五月开心婷婷网| 69精品国产乱码久久久| 黄色怎么调成土黄色| 欧美亚洲日本最大视频资源| 欧美激情 高清一区二区三区| 一个人观看的视频www高清免费观看 | 日日干狠狠操夜夜爽| 中文字幕色久视频| 免费在线观看视频国产中文字幕亚洲| 91成人精品电影| 国内毛片毛片毛片毛片毛片| 日本一区二区免费在线视频| 亚洲精品av麻豆狂野| 国产欧美日韩一区二区三区在线| 岛国视频午夜一区免费看| www.www免费av| 免费久久久久久久精品成人欧美视频| 国产免费男女视频| 午夜免费成人在线视频| 久久久国产精品麻豆| 国产欧美日韩综合在线一区二区| 国产单亲对白刺激| 丝袜美足系列| 每晚都被弄得嗷嗷叫到高潮| 天堂动漫精品| 后天国语完整版免费观看| 国产成人av教育| 一级毛片精品| 老司机靠b影院| 老汉色av国产亚洲站长工具| 欧美日韩一级在线毛片| 夜夜躁狠狠躁天天躁| 亚洲专区中文字幕在线| 视频区欧美日本亚洲| 热99re8久久精品国产| 好看av亚洲va欧美ⅴa在| 天天躁狠狠躁夜夜躁狠狠躁| 久久人妻福利社区极品人妻图片| 亚洲一区高清亚洲精品| 女性被躁到高潮视频| 国产免费av片在线观看野外av| 99riav亚洲国产免费| 精品一区二区三区av网在线观看| 久久影院123| 手机成人av网站| 他把我摸到了高潮在线观看| 久久精品成人免费网站| 久久国产精品人妻蜜桃| 每晚都被弄得嗷嗷叫到高潮| 露出奶头的视频| 国产亚洲欧美98| 久久久久国产精品人妻aⅴ院| 国产精品香港三级国产av潘金莲| 最近最新免费中文字幕在线| 国产高清videossex| 欧美亚洲日本最大视频资源| 成人免费观看视频高清| 十分钟在线观看高清视频www| 热99国产精品久久久久久7| 久久精品aⅴ一区二区三区四区| 欧美日韩福利视频一区二区| 欧美激情 高清一区二区三区| 身体一侧抽搐| 亚洲精品国产精品久久久不卡| 热99re8久久精品国产| 高清欧美精品videossex| 在线观看www视频免费| 亚洲熟妇中文字幕五十中出 | 热re99久久国产66热| 午夜影院日韩av| 满18在线观看网站| 中文字幕最新亚洲高清| 在线国产一区二区在线| 亚洲精品一区av在线观看| 黄频高清免费视频| 人妻久久中文字幕网| 又紧又爽又黄一区二区| 长腿黑丝高跟| 性少妇av在线| 热99国产精品久久久久久7| 久热这里只有精品99| 一区二区三区精品91| 在线观看免费高清a一片| 久久人人精品亚洲av| a级毛片在线看网站| 久久人人爽av亚洲精品天堂| 麻豆久久精品国产亚洲av | 搡老乐熟女国产| 亚洲中文日韩欧美视频| 成人三级黄色视频| 18美女黄网站色大片免费观看| 免费观看精品视频网站| 亚洲精品一区av在线观看| 啪啪无遮挡十八禁网站| 91老司机精品| 欧美日本亚洲视频在线播放| 久久亚洲真实| 国产99久久九九免费精品| 亚洲美女黄片视频| 亚洲一区二区三区欧美精品| 久久青草综合色| 久久久国产欧美日韩av| 日韩人妻精品一区2区三区| 女人爽到高潮嗷嗷叫在线视频| 欧美在线黄色| 国产欧美日韩一区二区精品| 亚洲一区二区三区不卡视频| 男女做爰动态图高潮gif福利片 | 好男人电影高清在线观看| 日韩欧美免费精品| 最近最新中文字幕大全电影3 | 国产欧美日韩一区二区三| 成人黄色视频免费在线看| 又黄又粗又硬又大视频| 黄片小视频在线播放| 亚洲人成电影免费在线| 女性被躁到高潮视频| 黄片小视频在线播放| 色婷婷av一区二区三区视频| 少妇 在线观看| 亚洲av电影在线进入| 日本三级黄在线观看| 老熟妇仑乱视频hdxx| 99国产精品一区二区三区| 国产精品影院久久| 18禁裸乳无遮挡免费网站照片 | 欧美日韩黄片免| 日本免费a在线| 国产精品野战在线观看 | 国产精品美女特级片免费视频播放器 | 超碰成人久久| 国产91精品成人一区二区三区| 色婷婷av一区二区三区视频| 午夜两性在线视频| 日本免费a在线| 脱女人内裤的视频| 国产又色又爽无遮挡免费看| 久久午夜亚洲精品久久| 在线观看舔阴道视频| 亚洲,欧美精品.| 欧美丝袜亚洲另类 | 久久久国产成人免费| 成人永久免费在线观看视频| 18禁国产床啪视频网站| 亚洲成a人片在线一区二区| 成人特级黄色片久久久久久久| 久久人妻熟女aⅴ| 丁香六月欧美| 80岁老熟妇乱子伦牲交| 欧美黄色淫秽网站| 超碰97精品在线观看| av网站免费在线观看视频| 午夜日韩欧美国产| 欧美老熟妇乱子伦牲交| 欧美黄色淫秽网站| 亚洲一区二区三区色噜噜 | 亚洲欧美精品综合久久99| av天堂在线播放| 十八禁人妻一区二区| 欧美性长视频在线观看| 黄色视频不卡| 搡老乐熟女国产| 欧美av亚洲av综合av国产av| 99香蕉大伊视频| 一级a爱视频在线免费观看| 国内毛片毛片毛片毛片毛片| 91老司机精品| 亚洲,欧美精品.| 欧美一级毛片孕妇| 乱人伦中国视频| 亚洲人成77777在线视频| 久久久久精品国产欧美久久久| 欧美日本亚洲视频在线播放| 国产av又大| 一边摸一边抽搐一进一出视频| 国产精品日韩av在线免费观看 | 色婷婷av一区二区三区视频| 久久久久久免费高清国产稀缺| 夜夜爽天天搞| 欧美乱色亚洲激情| 国产亚洲欧美在线一区二区| 久久热在线av| 日本 av在线| 久久久久国产一级毛片高清牌| 成人国产一区最新在线观看| 国产精品偷伦视频观看了| 国产精品国产高清国产av| 国产精品av久久久久免费| 黄色毛片三级朝国网站| 他把我摸到了高潮在线观看| 精品欧美一区二区三区在线| 18禁观看日本| 国产av又大| 亚洲中文字幕日韩| 大型av网站在线播放| 日韩免费高清中文字幕av| 欧美日韩中文字幕国产精品一区二区三区 | 在线观看日韩欧美| 亚洲五月色婷婷综合| 久久久久久久久久久久大奶| 欧美性长视频在线观看| 99国产综合亚洲精品| 色综合婷婷激情| 国产一区二区在线av高清观看| 窝窝影院91人妻| 亚洲久久久国产精品| 国产高清videossex| 午夜免费成人在线视频| 国产av又大| 岛国在线观看网站| 国产三级黄色录像| 久久精品国产99精品国产亚洲性色 | 9热在线视频观看99| 婷婷六月久久综合丁香| 两人在一起打扑克的视频| 人人妻,人人澡人人爽秒播| av福利片在线| 欧美日韩亚洲高清精品| 如日韩欧美国产精品一区二区三区| 欧美成人午夜精品| 一个人观看的视频www高清免费观看 | 免费在线观看日本一区| 国产精品二区激情视频| 日韩大尺度精品在线看网址 | 午夜免费观看网址| 久久影院123| 久久中文看片网| 亚洲情色 制服丝袜| 他把我摸到了高潮在线观看| 99热国产这里只有精品6| 纯流量卡能插随身wifi吗| av福利片在线| 大型黄色视频在线免费观看| 黄色片一级片一级黄色片| www.自偷自拍.com| 亚洲人成伊人成综合网2020| 国产男靠女视频免费网站| 国产97色在线日韩免费| 波多野结衣av一区二区av| 久久久久国产一级毛片高清牌| 一a级毛片在线观看| 夫妻午夜视频| 91麻豆精品激情在线观看国产 |