Risheng Kng*,Tinwei Zhng,Ho TngWenyong Zho
aEngineering Laboratory on Intelligent Perception for Internet of Things(ELIP),Peking University,Shenzhen Graduate School,China
bNakamura-Takano Lab,Department of Mechano-informatics,The University of Tokyo,113-8685,Japan
Available online 6 October 2016
Adaptive Region Boosting method with biased entropy for path planning in changing environment
Risheng Kanga,*,Tianwei Zhangb,Hao Tanga,Wenyong Zhaoa
aEngineering Laboratory on Intelligent Perception for Internet of Things(ELIP),Peking University,Shenzhen Graduate School,China
bNakamura-Takano Lab,Department of Mechano-informatics,The University of Tokyo,113-8685,Japan
Available online 6 October 2016
Path planning in changing environments with difficult regions,such as narrow passages and obstacle boundaries,creates significant challenges.As the obstacles in W-space move frequently,the crowd degree of C-space changes accordingly.Therefore,in order to dynamically improve the sampling quality,it is appreciated for a planner to rapidly approximate the crowd degree of different parts of the C-space,and boost sample densities with them based on their difficulty levels.In this paper,a novel approach called Adaptive Region Boosting(ARB)is proposed to increase the sampling density for difficult areas with different strategies.What's more,a new criterion,called biased entropy,is proposed to evaluate the difficult degree of a region.The new criterion takes into account both temporal and spatial information of a specific C-space region, in order to make a thorough assessment to a local area.Three groups of experiments are conducted based on a dual-manipulator system with 12 DoFs.Experimental results indicate that ARB effectively improves the success rate and outperforms all the other related methods in various dynamical scenarios.
Motion planning;DRM;Biased entropy classification;Hybrid boosting strategy
As one of the most fundamental researches in roboticfield,path planning has been applied in many other disciplines such as aviation,industrial manufacturing,games/ virtual reality and so on.The generalized motion planning problem has been proved to be PSPACE-hard[1].Therefore, many sampling based methods have been proposed to reduce computational complexity.Probabilistic Roadmap method(PRM)[2]and rapidly exploring Random Tree (RRT)[3]are two of the most successful ones with probabilistic complete guarantee.Though in the past two decades many variants of PRM [4,5]have gained great success in solving path planning problems even in high-dimensional C-space,it still remains hard in changing environments.
Difficultregion creates a significantchallenge for sampling-based methods to pass through,because it is hard to put enough samples in difficult region to construct a sufficient roadmap.In static environments,many non-uniform methods [6-9]are proposed to increase sampling density in difficult regions.Some heuristic measures are also proposed to solve the difficult region problem,such as predictive models[10,11], Approximate Cell Decomposed(ACD)[12],Region-sensitive method[13]and so on.Here,Region-sensitive method identifies difficult regions with probability density estimation to obtain approximate structure of C-space.Although all the methods above perform well in static environments,there are still lots of problems required to be solved under changing environments.
Path planning in changing environments encounters great challenges since the validity of sampling points changesfrequently in C-space.Dynamic Roadmap Method(DRM) [14-16]is an extension of PRM for path planning in changing environments,which is an uniform sampling method up against many troubles from difficult regions.In the past several years,only a few works[17-20]have focused on this conundrum.In changing environments,shapes and positions of regions change frequently according to the motion of obstacles.Particularly,some regions become more and more difficult and some regions become freer and freer based on the moving trend of obstacles.Hereby,the crucial issue is how to efficiently boost sampling quality in different difficult-level regions in order to increase the connection degree of the roadmap.
In this paper,a novel method named Adaptive Region Boosting(ARB)is proposed for path planning in changing environments,which could adaptively boost the sampling quality of regions based on the different crowd-degrees.The motivation of ARB is to find a heuristic measure for the "difficulty-degree"of the region around every valid sampling point.Biased entropy is proposed to evaluate difficult degree of a region.Low biased entropy represents low difficult degree,therefore the higher the biased entropy is,the more difficult the local region is.
By considering the difference between two adjacent frames, not only space information is captured,but also time information is taken into account to evaluate changing degree of a local region in updating phase.Our approach includes two steps.In preprocessing phase,a hierarchical sampling strategy is proposed to construct the roadmap by two layers of samples. In updating phase,more samples will be adaptively activated in difficult regions in order to boost sampling density within these regions.
Our contributions can be concluded as follows:
1)Biased entropy is proposed to evaluate the difficult degree of a specific C-space region by considering both temporal and spatial information.
2)Various boosting strategies are presented to adaptively increase the sampling density within difficult regions based on the various difficult degree.
The rest of this paper is organized as follows.Section 2 describes related works.Details of our method are given in Section 3.Three groups of experiments are conducted and analyzed in Section 4.Finally,conclusions are drawn in Section 5.
2.1.DRM
Dynamic Roadmap method(DRM)[14-16],which is a variant of PRM,has been successfully applied in changing environments.Initially,DRM generates sampling points randomly in C-space with no obstacles.Then the W-space is decomposed into basic cells to construct the mappings between the cells in W-space and the roadmap in C-space.The core of DRM includes two kinds of mappings,node mapping and edge mapping:
Here,G={Gp,Ge}is the roadmap constructed in C-space, where Gpand Geare the nodes and edges set of the roadmap, respectively.Functions Φp(w)and Φe(w)are the mappings of nodes and edges.Nodes and edges of the roadmap are labeled as invalid if the basic cell w of W-space is occupied by obstacles.Ω(q)denotes a subset of basic cells which are occupied by a robot whose configuration is q.
However,it is too complex to compute the nodes mapping Φp(w)and the edges mapping Φe(w).Instead,we generally compute the inverse mappingsand.The computing of[18]is that the robot in the W-space isfirst set to the configuration q in C-space,and then a seed cell is put inside the robot and expanded in each direction until all cells Ω(q)are found by collision detector.In general,vast time is needed to compute edge mapping in order to verify whether edge e is valid or not.Therefore,edge mapping is not used frequently in computing,while node mapping is the core of DRM method.
By quickly updating the roadmap to indicate invalid nodes and edges of the roadmap when obstacles are moving,DRM performs well in changing environments.However,difficult region problem is also a tremendous challenge for DRM planner.
2.2.Dynamic Bridge Builder method
Difficult region problem creates tremendous difficulty for path planning,as there are so few sampling points within these regions that planners can not find a collision-free path.This problem becomes more urgent in changing environments, because the positions and the shapes of narrow passages change frequently.How to increase sampling density within difficult areas is worth to be studied.
Some bridge builder planners are proposed to work out difficult regions in changing environments,such as Dynamic Bridge Builder(DBB)[18].In preprocessing phase,DBB generates sample points and constructs a new roadmap like DRM.Then,DBB computes midpoints of edges in the roadmap and generates incremental points around those midpoints. Afterward,W-C mapping of sampling points is calculated to ensure validity of sampling points and edges.If the midpoint of an edge is valid and two endpoints are invalid,this edge is regarded as a bridge which identifies a local region of a narrow passage.In updating phase,DBB rebuilds new bridges online by traversing all the edges.
When difficult regions are identified by DBB in updating phase,two different strategies are used to boost these regions. The first one is to activate incremental points around those bridges,which means more time will be consumed to compute W-C mapping for those incremental points during preprocessing phase.This strategy is written as DBB-I[18].The other is to predict the validity of incremental points by using Parzen-Window estimation in updating phase and delaying collision detections to query phase.In this way,it can decrease the preprocessing time.This strategy is written as DBB-II [19].
Although DBB has achieved great performance in changing environments with difficult regions,it still has some de ficiency.It is difficult for DBB to identify narrow passages completely as there will be some situations when no bridges could be found in difficult regions.
2.3.Parzen Window density estimation
Various machine learning algorithms[21]have been applied to estimate the probability density of valid sampling points in a local area.In this paper,active learning algorithms are utilized since the learner can classify different kinds of regions from which it learns.Parzen-Window is a well-known density estimation method which has been extensively used in pattern recognition,classification,prediction and so on. Parzen-window estimates the Probability Density Function (PDF)which is derived from a large amount of samples.
Assume that x is a random sample,a window function is used to determine how many observation samples{x1,…,xn} fall within the window around x.Then,the Probability Density Function(PDF)value P(x)is defined as the sum of the contributions from the observations to this window.The Parzenwindow estimates can be defined as follows:
where K(.)is the window function of d-dimensions,and hd(>0)is the window width.If the window is small enough,it will actually enclose zero points within the window.Therefore,that is the drawback of Parzen-Window Estimation. Therefore,in this paper,the K-nearest neighbors strategy is chosen to compute biased entropy,which sets the size of the window depending on the biggest distance between two points.
3.1.Overview of our method
In changing environment,a time period is divided into a series of slices.In each time slice,the environments are viewed as static environments.For convenience,a time slice is called a frame.Obviously,the more slices in a time period,the more accurate we simulate the real environment.
Fig.1.Framework of Adaptive Region Boosting Method.
It is one of the most crucial problems to estimate how difficult a region is in real time.In this paper,the difficult degree of a region can be roughly estimated with information of a few sampling points.Biased entropy of a valid point represents the difficult degree of the local region around this point.The biased entropy difference between two adjacent frames can evaluate the changing degree of a region through time.With this difference,an Adaptive Region Boosting (ARB)method is proposed to boost difficult regions in order tofind a free path effectively and efficiently.
During preprocessing phase,the roadmap is constructed by Hierarchical Sampling Strategy(HSS).After this,W-C nodes mapping of the roadmap is calculated for obtaining validity of these points online.During updating phase,the roadmap is updated through W-C mapping and then the biased entropy value of the valid points is computed to evaluate difficult degree of regions.Our approach computes biased entropy to evaluate how many incremental points need to be activated to boost the difficult regions.The A* algorithm is used to search a safe path.The safety of the path is also weighted by the entropy value.The smaller the entropy value of the sampling point is,the safer the path is. The flow chart of the proposed method is also shown in the Fig.1.
3.2.Preprocessing phase
Hierarchical Sampling Strategy(HSS)is used in this paper to construct the roadmap in preprocessing phase.As shown in Fig.2,the roadmap consists of two level points in C-space, which are main points and incremental points.Initially,the main points are generated in C-space,then,the roadmap is constructed by a straight-line local planner using Manhattan distance[22].Finally,the incremental points are generated around main points.
As shown in Algorithm 1,each node is connected to their centers which are the nodes of P initially.Then each node of B is connected to its k-nearest neighbors.This roadmap is denoted by G={Gp(P,B),Ge},in which P={p1,…,pn}is the set of main points,B={B1,…,Bn}is the set of incremental points and Geis the set of edges in the roadmap.Here, Biis the set of the incremental points around pi.At the beginning,all points of B will be marked as inactive,which means they will be ignored in the course of searching a path. The W-C node mapping for points of P and B should be computed in the preprocessing phase,in order to update their validity fast for each query.In updating phase,when incremental points around the main points need to be activated, these incremental points are used to search a path.
3.3.Biased entropy
In this paper,new statistic information of regions are utilized to classify regions around sampling points.The biased entropy,which is a measure of the disorder of the samples in a region,is used to evaluate difficult degree of the region.The higher the biased entropy is,the more difficult the region is.
In order to calculate the biased entropy value,Parzen Window estimation,which is an activate machine learning method,is adopted to estimate probability density.Since entropy represents the information of a probability distribution, Parzen Window is needed to provide the probability information of sampling points in a region.
Fig.2.Hierarchical Sampling Strategy(HSS)is used to construct the roadmap. Black points are main points.Blue points are incremental points.
As shown in Fig.3,Nq={q1,…,qk}is the set of K-nearest neighbor points of point q.The region Rqis the smallest area which contains Nq∪q.The Pvalidand Pinvaliddenote the set of valid and invalid sampling points,respectively.The observation probability is defined as the probability of valid sampling points.The probability estimation is defined as:
Assume initial point q to be a valid sampling point, then Pvalid∩Nq≠φ and P(q)∈(0,1]. However,is constant and P(q)is a linear function,which needs a large amount of samples to ensure high accuracy.Therefore,simple K-nearest neighbors method is not applied to local regions.
Thus,information entropy is introduced to be a measure of the disorder of the samples in a specific region.The entropy value of the initial point is calculated by P(q).In C-space, P(q)∈(0,1],because all sampling points in R are valid or invalid.Then,the entropy is defined as:
when P=0.5,H(P(q))gains the maximum.However,it is not a monotone function.It is difficult to measure difficult degree of regions,such as,approximately free regions,obstacle boundaries and narrow passages.A new estimating function called biased entropy is defined as:
Function e(q)is a decreasing function,which denotes information entropy of a valid sampling point.The higher e(q) is,the more difficult the region Rqwill be considered.As shown in Fig.4,there are fewer valid points within narrow passages than in regions of obstacle boundaries,the biased entropy of the point within narrow passages is higher thanregions of obstacle boundaries.In conclusion,biased entropy is an effective measure to evaluate difficult degree of regions.
Fig.3.Sampling with K-nearest strategy.Point q is the center point and q1,…,qkare K-nearest neighbor points.
3.4.Region boosting strategy
During updating phase,the roadmap needs to be updated all the time.In every frame,the roadmap is updated by W-C mapping according to which cell in W-space is occupied by obstacles or robots.In our method,the roadmap is also updated with an adaptive boosting region method,which is shown in Algorithm 2.
When obstacles move in W-Space,obstacle regions in C-space will change accordingly.Since the motion of obstacles is unknown,the changing degree of regions in C-space is quite different.For example,if some regions become narrow passages from free regions between two frames,these regions have a more intensity change than the other regions in contrast.Thus,the difference of e(p)can represent the changing degree of region Rp,which is defined as:
where d∈[-1,1].Here,enew(p)and eold(p)are e(p)in previous frame and e(p)in current frame respectively.
Fig.4.This figure illustrates how the biased entropy works.The higher e(q)is, the more difficult the region Rqwill be considered.Narrow passages have high value of biased entropy.Obstacle boundaries have middle value of biased entropy.Approximately free regions have low value of biased entropy.
We assume that caoldand canewrepresent how many incremental points have been activated in previous frame and in current frame,respectively.caoldand caneware initialized with 0,which represent all the incremental points Bithat have not been activated.If d>0,the region Ribecomes more and more difficult.Thereby,the number of activated incremental points needs to be increased.On the contrary,if d<0,the difficult degree of region Ribecomes much smaller.Hereby,the number of activated incremental points needs to be decreased. The absolute value|d|represents changing degree of Ri.The higher|d|is,the more incremental points need to be activated or inactivated for updating the roadmap.The number of incremental points in the new roadmap is represented by:
where|Bi|represents the number of total incremental points around pi.However,when canew>|Bi|or canew<0,we reset canew=|Bi|or canew=0 respectively.In other words,if canew>|Bi|,all incremental points in Biare activated.If canew<0,all incremental points in Biare inactivated.As shown in Fig.5(a),the difficult degree of region Rpbecomes smaller,some activated points are inactivated to constrain scale of the roadmap.As shown in Fig.5(b),the region Rpbecomes more and more difficult,some inactivated points are activated to boost this region.
With the adaptive region boosting method,more points are sampled in more difficult regions.Moreover,changing degree of regions is also taken into account to represent how many incremental points are needed to be activated or inactivated,in order to balance the density of sampling points in local areas with different motion of obstacles.
During query phase,the roadmap is updated by adaptive region boosting method with moving obstacles and moving robots in every frame.After that,A*algorithm is adopted to search a collision-free path,and robot moves to the next configuration.
In order to evaluate the proposed method,extensive experiments are implemented with two manipulators modeled byparameters of practical6-DoF Kawasakimanipulators (FS03N)in 3D W-space.The two manipulators are mounted on two fixed bases which amount to 12-DoFs.12 dimensional C-space is constructed,because the 12-DoFs of manipulators are considered simultaneously.The reachable workspace of two manipulators is decomposed into 406134 grids,and each grid is a cube with the size of 4X4X4.Collision detection in our experiments is implemented by a free 3D Collision Detection Library,ColDet 1.1.All the experiments are carried out on an Intel Dual-core CPU 3.00 GHz with 4 GB memory.
4.1.Scenarios of experiments
Three groups of experiments are arranged to evaluate the proposed method,as shown in Fig.6.The planning tasks are tofinish a hand shaking movements between two manipulators while passing through variant obstacles.The start and goal positions of each scenarios are given in Figs.7-9,respectively.As shown in Fig.6(a),two horizontal small boards are moving up and down in a big board with two rectangular holes,creating four dynamic holes for robots to pass through. Scenario II consists of a square board with a hole moving up and down in the air,the start and goal positions can be found in Figs.6(b)and 8,respectively.
Six bars flying in the air construct a more complex environment which is shown in Fig.6(c).All the bars are horizontal,while four of them are in the periphery of two manipulators and the other two are between two manipulators. The outside 4 bars move up and down and the other 2 bars move up and down in opposite direction.During the run time, two vertical bars first move toward each other.When they are close enough,they drift apart in the opposite direction.The start and goal positions can be found in Figs.6(c)and 9, respectively.The start configuration is constant.In all scenarios,each bar has a move range,once the maximum distance is reached,the corresponding direction is reversed.
4.2.Analysis of parameters
Firstly,the value of a crucial parameter is discussed.In ARB,entropy value is calculated by K-nearest neighbors method.Thereby,parameter K has a significant influence on the performance of the planner.If K is very small,no enough sampling points to estimate accurately makes region classi fication unreliable.On the contrary,if K is very large,the region Rqbecomes too large leading to an increasing error rate.In order to obtain optimal K,100 experiments are carried out with different K in different scenarios.
As shown in Fig.10,success rate of the planner is low with both large and small K.However,when K=15 or K=20,the success rate is higher than others.As to different scenarios,the best choice of parameter K is different.Nevertheless,K=15 is the best choice to consider both computing cost and success rate.Therefore,K=15 is selected in ARB.
The size of region Rqalso depends on parameter K.The diameter of Rqis the furthest distance between two points which are K-nearest neighbors of q.Furthermore,as the center of region Rqis not always point q,all the regions are restricted in the boundaries of C-space.
4.3.Analysis of preprocessing phase
During preprocessing phase,some samples are generated to construct the roadmap at first,called the main points.Then, the incremental points are generated around main points and connected to the roadmap.However,these incremental points are inactivated initially,which means these points and edges will not be searched until they are activated.In this paper, there are eight incremental points around every main point. After that,W-C nodes mapping of all points are computed to obtain validity of these points in updating phase.
Fig.5.Two different situations for narrow passage.The blue region is current obstacle's region and the gray region represents its previous position.The black points are main points.The purple points are incremental points which are activated in previous frame.The blue points is newly activated or inactivated incremental points.(a)The difficult degree of region Rpbecomes smaller,blue points are inactivated to constrain scale of the roadmap.(b)The difficult degree of region Rpbecomes higher,blue points are activated to boost the region.
Fig.6.Experiment scenarios I(a),II(b),III(c)are shown in this figure,each consist of 12 DoFs.
Fig.7.Four goal configurations of scenario I are shown.The planner tends to choose the safer goal area to reach.
Table 1 summarizes the number of sampling points in different methods.Here,column NPis the number of main points which are initial vertex of roadmap.The cardinality of P is crucial in realization.If it is too large,updating phase will be time consuming.If it is too small,roadmap does not contain enough information for C-space construction.Column NMis the number of the middle points which are used in DBB-I and DBB-II to identify narrow passages.NBis the number of the incremental points in ARB,DBB-I and DBB-II.Column NSis the total number of sampling points in C-space.Column Tcis the time of constructing roadmap without W-C mapping.
As shown in Table 1,ARB,DBB-I and DBB-II have the same NPand DRM creates the same number of total sampling points with ARB.In preprocessing phase,much fewer total sampling points are generated in ARB than DBB-I with the same number of main points,because the middle points are not computed and much fewer incremental points are generated in ARB.The NSof DBB-II is smaller than DBB-I, because the incremental points are generated around main points instead of midpoints in DBB-I.Tcof ARB and DBB-I are similar to DRM,because ARB and DRM have similar NS. DBB-I has more sampling points and more time of roadmap construction,because more sampling points are generated and it needs much more time to connect these points.
4.4.Analysis of updating phase
Fig.8.Two goal configurations of Scenario II are shown.
Fig.9.Two goal configurations of Scenario III are shown.
Fig.10.Selection of Parameter K.The optimum value of K differs in different scenarios.However,K=15 performs very well in all the scenarios.
During updating phase,the obstacle region in C-space changes with the obstacles in W-space.The faster the obstacles move,the more intensely the C-space changes.In order to evaluate performance of our method,different speeds of obstacles and different methods are taken into account in experiments.
Tables 2-4 summarize the results of ARB compared with other methods in different motion speeds of obstacles.Each group of experiments is tested for 200 times from a random start configuration to the goal configurations.Here,column SRavgrepresents average success rate of planning.Column Navgand column Nlrepresent the number of average researching times and largest re-searching times,respectively. Column Tpis the time of preprocessing phase and column Tavgis the average planning time.
Table 1The number of sampling points in different methods.
Table 2Results of different methods with different moving speed of obstacles under Scenario I.
Table 3Results of different methods with different moving speed of obstacles under Scenario II.
Table 4Results of different methods with different moving speed of obstacles under Scenario III.
As shown in Table 2,with the same speed,the SRavgof ARB is higher than other methods in all scenarios.The SRavgof ARB achieves much improvement compared with DRM, because there are few sampling points within difficult regions in DRM.DBB-I and DBB-II are proposed to solve difficult region problems in changing environments.However,only a part of difficult regions can be identified in DBB-I and DBBII,because only a few bridges could be built in updating phase and many difficult regions are not boosted in updating phase. ARB computes the entropy value of valid points and boosts the regions around them.In this way,more regions can be boosted than DBB-I and DBB-II.Therefore,the SRavgof ARB gains huge improvements compared with the other methods.
With the same speed,the Navgand Nlof ARB are much smaller than DBB-I's and DBB-II's in all scenarios.It is due to the fact that DBB-I and DBB-II only identifies a part of difficult regions,while more local regions are boosted for increasing density of sampling points in difficult regions. Moreover,the Navgand Nlof DBB-I and DBB-II in scenario II are much larger than those in scenario I and III.Because the goal configuration is within narrow passages in scenario II, scenario II is more difficult than the others.However,the Navgand Nlof ARB in scenario II do not increase much more than the others.It presents that ARB can be used in various environments.
The Tpof DBB-I is larger than the other methods in all scenarios with the same speeds,because it has larger NSand W-C mappings of all these points are calculated.The DBB-II achieves the least Tp,because the validity of the incremental points in DBB-II is predicted in updating phase and W-C mappings are not computed in preprocessing phase.The Tpof ARB is same as DRM's,because the same number of sampling points needs to compute W-C mappings.The Tavgof all methods are similar.ARB and DBB-I cost much time to update a large amount of sampling points.DBB-II costs some time to compute predictive model.In conclusion,ARB outperforms the other methods with the same speed of obstacles.
With different speeds of obstacles,the SRavgof ARB is similar in each scenario.However,the SRavgof the other methods is decreasing with higher speed.It is due to the fact that the changing degree of environments is taken into account in ARB.The number of activated points depends on the changing degree of the region.The ARB has excellent universality for various environments.In a word,the ARB gains better outstanding performance than the other related methods.
In this paper,Adaptive Region Boosting(ARB)method is proposed to boost regions with difficult degree of regions in changing environments.A new criterion called biased entropy is proposed to evaluate the difficult degree of a local region in this paper.Moreover,the absolute value of difference between two biased entropy in two continuous frames can evaluate the changing degree of the region.Our approach works in two phases.In preprocessing phase,main points and incremental points are generated to construct the roadmap.In updating phase,the biased entropy difference is used to determine the number of activated incremental points.Compared to other related works,ARB has higher success rate of planning and less re-planning time.ARB is more suitable for various changing environments than DBB-I and DBB-II,as most local regions are boosted adaptively by our approach.In conclusion, ARB outperforms the other related works,as shown in the experiments.
However,there are a few significative works on excavating information of regions in DRM method.For example,tracking the movement of obstacles and forecasting the changing of difficult regions will be our future work.Some predictive model to capture motion of regions is a much possible method.
This work is supported by National Natural Science Foundation of China(NSFC,No.60875050,60675025,61340046), National High Technology Research and Development Program of China(863 Program,No.2006AA04Z247),Science and Technology Innovation Commission of Shenzhen Municipality(No.201005280682A,No.JCYJ20120614152234873), Specialized Research Fund for the Doctoral Program of Higher Education(No.20130001110011).
[1]H.M.Choset,Principles of Robot Motion:Theory,Algorithms,and Implementation,MIT press,2005.
[2]L.Kavraki,P.Svestka,J.Latombe,M.Overmars,IEEE Trans.Robotics Automation 12(4)(1996)566-580.
[3]R.Geraerts,M.H.Overmars,A Comparative Study of Probabilistic RoadmapPlanners,vol.7,SpringerTractsAdv.Robotics,2004,pp.43-57.
[4]R.Bohlin,L.E.Kavraki,Path planning using lazy prm,in:IEEE International Conference on Robotics and Automation(ICRA),vol.1,2000, pp.521-528.
[5]J.Denny,N.M.Amato,The toggle local planner for sampling-based motion planning,in:International Conference on Robotics and Automation(ICRA),IEEE,2012,pp.1779-1786.
[6]D.Hsu,L.E.Kavraki,J.C.Latombe,R.Motwani,S.Sorkin,On finding narrow passages with probabilistic roadmap planners,in:Proceedings of the Third Workshop on the Algorithmic Foundations of Robotics on Robotics:The Algorithmic Perspective:the Algorithmic Perspective, 2002,pp.141-153.
[7]N.M.Amato,O.B.Bayazit,L.K.Dale,Obprm:An Obstacle-based Prm for 3d Workspaces,1998.
[8]V.Boor,M.H.Overmars,et al.,The gaussian sampling strategy for probabilistic roadmap planners,in:International Conference on Robotics and Automation,vol.2,IEEE,1999,pp.1018-1023.
[9]D.Hsu,T.Jiang,J.Reif,Z.Sun,The bridge test for sampling narrow passageswith probabilistic roadmap planners,in:International Conference on Robotics and Automation(ICRA),vol.3,IEEE,2003, pp.4420-4426.
[10]B.Burns,O.Brock,Sampling-based motion planning using predictive models,in:International Conference on Robotics and Automation (ICRA),IEEE,2005,pp.3120-3125.
[11]B.Burns,O.Brock,Toward optimal configuration space sampling,in: Proc.Robotics:Science and Systems,Cambridge,MA,June 2005.
[12]L.Zhang,Y.J.Kim,D.Manocha,A hybrid approach for complete motion planning,in:International Conference on Intelligent Robots and Systems (IROS),IEEE/RSJ,2007,pp.7-14.
[13]S.Rodriguez,S.Thomas,R.Pearce,N.M.Amato,Resampl:A Regionsensitive Adaptive Motion Planner,Springer Algorithmic Foundation of Robotics VII,2008,pp.285-300.
[14]P.Leven,S.Hutchinson,Toward Real-time Path Planning in Changing Environments,2000.
[15]H.Liu,X.Deng,H.Zha,D.Ding,A path planner in changing environments by using wc nodes mapping coupled with lazy edges evaluation,in:International Conference on Intelligent Robots and Systems (IROS),IEEE/RSJ,2006,pp.4078-4083.
[16]M.Kallman,M.Matari c,Motion planning using dynamic roadmaps,in: International Conference Robotics Automation(ICRA),vol.5,IEEE, 2004,pp.4399-4404.
[17]H.Liu,K.Rao,F.Xiao,Obstacle guided rrt path planner with region classification for changing environments,in:International Conference on Robotics and Biomimetics(ROBIO),IEEE,2013,pp.164-171.
[18]D.Ding,H.Liu,X.Deng,H.Zha,A dynamic bridge builder to identify difficult regions for path planning in changing environments,in:IEEE/ RSJ International Conference on Intelligent Robots and Systems(IROS), IEEE,2007,pp.2925-2931.
[19]H.Liu,D.Ding,W.Wan,H.Zha,Predictive model for path planning by using k-near dynamic bridge builder and inner parzen window,in:International Conference on Intelligent Robots and Systems(IROS),IEEE/ RSJ,2008,pp.2133-2138.
[20]H.Liu,T.Zhang,C.Wang,A"capacitor"bridge builder based safe path planner for difficult regions identification in changing environments,in: International Conference on Intelligent Robots and Systems(IROS), IEEE/RSJ,2012,pp.3179-3186.
[21]R.O.Duda,P.E.Hart,D.G.Stork,Pattern Classification,John Wiley& Sons,2012.
[22]N.M.Amato,O.B.Bayazit,L.K.Dale,C.Jones,D.Vallejo,Choosing good distance metrics and local planners for probabilistic roadmap methods,in:International Conference on Robotics and Automation (ICRA),vol.1,IEEE,1998,pp.630-637.
Risheng Kangreceived the B.E.degree in computer science and technology from Inner Mongolia Agricultural University,inner Mongolia,China,in 2012. He is currently pursuing M.S.degree in computer science of Peking University,China.His research interests in robot motion planning.Recently,he also serves as reviewer for the IEEE International Conference on Robotics and Automation(ICRA).
Hao Tangreceived the B.E.degree in electronics and information engineering in 2013 and is working toward the Master degree at the School of Electronics and Computer Engineering,Peking University,China. His current research interests are image classification, hand gesture recognition,gender recognition,image retrieval,action recognition and deep learning.He has published several articles in ACM Multimedia Conference(MM),IEEE International Conference on Image Processing(ICIP)and International Joint Conference on Artificial Intelligence(IJCAI).
Wen-Yong Zhaogot his Ph.D.degree in pattern recognition and intelligent system from the Institute of Automation,Chinese Academy of Sciences,Beijing, in 2012.He received his B.Eng.and M.Eng.degrees from Tianjin University,China,in 2004 and 2007 respectively.From 2013 to 2016,he worked as a postdoctoral in ShenZhen graduate school of PeKing University.In 2016,he became a lecturer in ShenZhen institute of information technology.His research interests include path planning,image processing, computer vision and computer.
*Corresponding author.
E-mail address:kang@sz.pku.edu.cn(R.Kang).
Peer review under responsibility of Chongqing University of Technology.
http://dx.doi.org/10.1016/j.trit.2016.08.004
2468-2322/Copyright?2016,Chongqing Universityof Technology.Productionandhostingby Elsevier B.V.Thisis an openaccess article under the CCBY-NC-ND license(http://creativecommons.org/licenses/by-nc-nd/4.0/).
Copyright?2016,Chongqing University of Technology.Production and hosting by Elsevier B.V.This is an open access article under the CC BY-NC-ND license(http://creativecommons.org/licenses/by-nc-nd/4.0/).
CAAI Transactions on Intelligence Technology2016年2期