第12卷第4期 智能系统学报 Vol.12 No.4 2017年8月 CAAI Transactions on Intelligent Systems Aug.2017 D0I:10.11992/is.201604006 网s络出版地址:http:/kns.cnki.net/kcms/detail/23.1538.TP.20170630.1420.004.html. A modified consensus algorithm for multi-UAV formations based on pigeon-inspired optimization with a slow diving strategy ZHANG Tianjie,DUAN Haibin (Science and Technology on Aircraft Control Laboratory,School of Automation Science and Electrical Engineering,Beihang University (BUAA),Beijing 100191,China) Abstract:This paper considers the formation control problem for a group of unmanned aerial vehicles (UAVs) employing consensus with different optimizers.A group of UAVs can never accomplish difficult tasks without formation because if disordered they do not work any better than a single vehicle,and a single vehicle is limited by its undeveloped intelligence and insufficient load.Among the many formation methods,consensus has attracted much attention because of its effectiveness and simplicity.However,at the beginning of convergence,overshoot and oscillation are universal because of the limitation of communication and a lack of forecasting,which are inborn shortcomings of consensus.It is natural to modify this method with lots of optimizers.In order to reduce overshoot and smooth trajectories,this paper first adopted particle swarm optimization (PSO),then pigeon-inspired optimization (PIO)to modify the consensus.PSO is a very popular optimizer,while PIO is a new method,both work but still retain disadvantages such as residual oscillation.As a result,it was necessary to modify PIO,and a pigeon-inspired optimization with a slow diving strategy (SD-PIO)is proposed.Convergence analysis was performed on the SD-PIO based on the Banach fixed-point theorem and conditions sufficient for stability were achieved. Finally,a series of comparative simulations were conducted to verify the feasibility and effectiveness of the proposed approach. Keywords:unmanned aerial vehicle (UAV);formation;consensus;pigeon-inspired optimization (PIO);Banach fixed-point theorem 中图分类号:TP18:TP273文献标志码:A文章编号:1673-4785(2017)04-0570-12 英文引用格式:ZHANG Tianjie,,DUAN Haibin.A modified consensus algorithm for multi-UAV formations based on pigeon-inspired optimization with a slow diving strategy[J].CAAI transactions on intelligent systems,2017,12(4):570-581. Formation control of unmanned aerial vehicles leader-follower approach artificial potential field (UAVs)can accomplish a task better than a single and sliding mode controlts]. UAV and has been studied using various methods. Among these approaches,consensus is an UAVs are widely adopted in various situationst-2]. effective and easy method for distributed controlt9. However,the requirements nowadays have already Using several simple consensus equations,UAVs can overtaken the current abilities of a single UAV.In fly along complex trajectories.Consensus originates order to satisfy the requirements,different plans for the from the natural behavior of animals such as fish and cooperation of many UAVs have been proposed by birds.A single bird or fish is small and weak,but a Chaumette3]and Raschet4,and the design of a population can hold a complicated formation.It is method for multi-UAV formation is importantts). consensus that plays an essential role in this Different methods have been proposed such as the phenomenon.Vicsek and Ren-]have achieved many results in this field.This paper adopts this 收稿日期:2017-04-11.网络出版日期:2017-06-30. method to design a controller for a multi-UAV 基金项目:Natural Science Foundation of China under Grant(61333004). 通信作者:ZHANG Tianjie..E-mail:11031148@buaa.edu.cn. formation
第 12 卷第 4 期 智 能 系 统 学 报 Vol.12 №.4 2017 年 8 月 CAAI Transactions on Intelligent Systems Aug. 2017 DOI:10.11992 / tis.201604006 网络出版地址:http: / / kns.cnki.net / kcms/ detail / 23.1538.TP.20170630.1420.004.html. A modified consensus algorithm for multi⁃UAV formations based on pigeon⁃inspired optimization with a slow diving strategy ZHANG Tianjie, DUAN Haibin (Science and Technology on Aircraft Control Laboratory, School of Automation Science and Electrical Engineering,Beihang University (BUAA), Beijing 100191, China) Abstract:This paper considers the formation control problem for a group of unmanned aerial vehicles (UAVs) employing consensus with different optimizers. A group of UAVs can never accomplish difficult tasks without formation because if disordered they do not work any better than a single vehicle, and a single vehicle is limited by its undeveloped intelligence and insufficient load. Among the many formation methods, consensus has attracted much attention because of its effectiveness and simplicity. However, at the beginning of convergence, overshoot and oscillation are universal because of the limitation of communication and a lack of forecasting, which are inborn shortcomings of consensus. It is natural to modify this method with lots of optimizers. In order to reduce overshoot and smooth trajectories, this paper first adopted particle swarm optimization ( PSO), then pigeon⁃inspired optimization (PIO) to modify the consensus. PSO is a very popular optimizer, while PIO is a new method, both work but still retain disadvantages such as residual oscillation. As a result, it was necessary to modify PIO, and a pigeon⁃inspired optimization with a slow diving strategy (SD⁃PIO) is proposed. Convergence analysis was performed on the SD⁃PIO based on the Banach fixed⁃point theorem and conditions sufficient for stability were achieved. Finally, a series of comparative simulations were conducted to verify the feasibility and effectiveness of the proposed approach. Keywords: unmanned aerial vehicle (UAV); formation; consensus; pigeon⁃inspired optimization (PIO); Banach fixed⁃point theorem 收稿日期:2017-04-11. 网络出版日期:2017-06-30. 基金项目:Natural Science Foundation of China under Grant (61333004). 通信作者:ZHANG Tianjie. E⁃mail:11031148@ buaa.edu.cn. 中图分类号:TP18;TP273 文献标志码:A 文章编号:1673-4785(2017)04-0570-12 英文引用格式:ZHANG Tianjie, DUAN Haibin.A modified consensus algorithm for multi⁃UAV formations based on pigeon⁃inspired optimization with a slow diving strategy[J]. CAAI transactions on intelligent systems, 2017, 12(4): 570-581. Formation control of unmanned aerial vehicles (UAVs) can accomplish a task better than a single UAV and has been studied using various methods. UAVs are widely adopted in various situations [1-2] . However, the requirements nowadays have already overtaken the current abilities of a single UAV. In order to satisfy the requirements, different plans for the cooperation of many UAVs have been proposed by Chaumette [3] and Rasche [4] , and the design of a method for multi⁃UAV formation is important [5] . Different methods have been proposed such as the leader⁃follower approach [6] , artificial potential field [7] , and sliding mode control [8] . Among these approaches, consensus is an effective and easy method for distributed control [9] . Using several simple consensus equations, UAVs can fly along complex trajectories. Consensus originates from the natural behavior of animals such as fish and birds. A single bird or fish is small and weak, but a population can hold a complicated formation. It is consensus that plays an essential role in this phenomenon. Vicsek [10] and Ren [9,11-12] have achieved many results in this field. This paper adopts this method to design a controller for a multi⁃UAV formation.
第4期 ZHANG Tianjie,et al.:A modified consensus algorithm for multi-UAV formations based on pigeon-inspired optimization with a slow diving strategy ·571- There are two main parts of the consensus proposed. algorithm[.First,the formation is treated as a The remainder of this paper is organized as virtual rigid body,every UAV has a desired position follows.In Section 2,details of the application of and this acts as a point on the body.Using consensus, consensus in a multi-UAV formation are given.The the UAVs'desired positions concentrate together and construction of SD-PIO and the convergence analysis is create the desired formation.However,a distance error considered in Section 3.In Section 4,the SD-PIO- always exists between the desired position and the real based consensus algorithm is proposed for a multi-UAV position for each UAV,and the second part of the formation,followed by numerical simulation and results algorithm deals with this problem by considering this analysis in Section 5.Finally,several conclusions error as a variable in consensus.By devising a useful concerning the use of optimal algorithms in consensus algorithm,the error closes to zero as time tends to theory are drawn. infinity.The real position for each UAV is the sum of the desired position and the error.In this way,UAVs 1 Controller design based on consensus with random initial locations always concentrate and fly 1.1 Multi-UAV formation controller design along their predesigned routes.These two parts of the It is supposed that each UAV possesses a moving algorithm added to the communication networks of the coordinate system with three axes (x,y,and z).The individual vehicles comprise the whole UAV shape of the UAV is ignored and a mass point model is system.However,consensus can only ensure that used instead.This is because the time delay of an all the variables are close to the given state and does aircraft control loop is much shorter than that of the not know whether the values of variables at each navigation and guidance loops.Therefore,when moment are optimal.It is a good idea to improve dealing with formation control,the UAV model can be consensus with optimal algorithms. simplified as a point,which means there is no need to Pigeon-inspired optimization (PIO)is a novel consider each UAV's attitude.Under this assumption, swarm intelligence optimizer proposed by Duan and all the UAV's x,y,and z axes are parallel. Qiao in 2014015).The method imitates pigeons' Consensus means that multiple UAVs reach an behavior when seeking a way back to their loft.It is agreement on a common state,e.g.,the UAV's believed that pigeons adopt different tools at different position.In detail,every UAV has its initial 3- stages when returning home.In the beginning,compass dimensional position (x,y,z)in an inertial and map factors are adopted,each pigeon's velocity coordinate system,namely the ground coordinate is updated by the linear combination of its neighbors' system.The velocity of the original UAV's moving including itself)best locations and its former velocity. coordinate system converges to the standard state of the When pigeons approach home,some smart ones with virtual leader UAVo by consensus.The UAVo's state good memories can recognize familiar landmarks which consists of a time-varying position and predesigned guide them,the others just need to follow these velocity.In this way,all the UAVs'coordinates leaders.The second part of PIO is designed to become one common moving coordinate system.After concentrate the average location of all the pigeons.It is adding the relative position of each UAV in the moving widely known that each optimizer fits one or several coordinate system,the desired time-varying position is problems,and PIO was first proposed for air robot path achieved for each UAV.When viewed as a mass of plannings],image processing,and orbit moving points,all these UAVs constitute a virtual formations(20].When it comes to the optimization of body. consensus mentioned above,disadvantages appear such The formation method consists of both single-and as the trajectory closes to the desired trace with double-integrator consensus algorithms.The latter aims oscillation to an unsatisfactory degree(This means to concentrate the UAVs with random initial positions that PIO needs to be modified and a pigeon-inspired and ensure that the desired positions of all the UAVs optimization with slow diving strategy (SD-PIO)is converge to points on the predesigned trajectory.The
There are two main parts of the consensus algorithm [13-14] . First, the formation is treated as a virtual rigid body, every UAV has a desired position and this acts as a point on the body. Using consensus, the UAVs’ desired positions concentrate together and create the desired formation. However, a distance error always exists between the desired position and the real position for each UAV, and the second part of the algorithm deals with this problem by considering this error as a variable in consensus. By devising a useful algorithm, the error closes to zero as time tends to infinity. The real position for each UAV is the sum of the desired position and the error. In this way, UAVs with random initial locations always concentrate and fly along their predesigned routes. These two parts of the algorithm added to the communication networks of the individual vehicles comprise the whole UAV system [13-14] . However, consensus can only ensure that all the variables are close to the given state and does not know whether the values of variables at each moment are optimal. It is a good idea to improve consensus with optimal algorithms. Pigeon⁃inspired optimization ( PIO) is a novel swarm intelligence optimizer proposed by Duan and Qiao in 2014 [15] . The method imitates pigeons behavior when seeking a way back to their loft. It is believed that pigeons adopt different tools at different stages when returning home. In the beginning, compass and map factors are adopted [16] , each pigeons velocity is updated by the linear combination of its neighbors’ ( including itself) best locations and its former velocity. When pigeons approach home, some smart ones with good memories can recognize familiar landmarks which guide them [17] , the others just need to follow these leaders. The second part of PIO is designed to concentrate the average location of all the pigeons. It is widely known that each optimizer fits one or several problems, and PIO was first proposed for air robot path planning [15,18] , image processing [17,19] , and orbit formations [20] . When it comes to the optimization of consensus mentioned above, disadvantages appear such as the trajectory closes to the desired trace with oscillation to an unsatisfactory degree [21] . This means that PIO needs to be modified and a pigeon⁃inspired optimization with slow diving strategy ( SD⁃PIO) is proposed. The remainder of this paper is organized as follows. In Section 2, details of the application of consensus in a multi⁃UAV formation are given. The construction of SD⁃PIO and the convergence analysis is considered in Section 3. In Section 4, the SD⁃PIO⁃ based consensus algorithm is proposed for a multi⁃UAV formation, followed by numerical simulation and results analysis in Section 5. Finally, several conclusions concerning the use of optimal algorithms in consensus theory are drawn. 1 Controller design based on consensus 1.1 Multi⁃UAV formation controller design It is supposed that each UAV possesses a moving coordinate system with three axes ( x, y, and z). The shape of the UAV is ignored and a mass point model is used instead. This is because the time delay of an aircraft control loop is much shorter than that of the navigation and guidance loops. Therefore, when dealing with formation control, the UAV model can be simplified as a point, which means there is no need to consider each UAV’s attitude. Under this assumption, all the UAV’s x, y, and z axes are parallel. Consensus means that multiple UAVs reach an agreement on a common state, e. g., the UAV’ s position. In detail, every UAV has its initial 3⁃ dimensional position ( x, y, z ) in an inertial coordinate system, namely the ground coordinate system. The velocity of the original UAVi ’ s moving coordinate system converges to the standard state of the virtual leader UAV10 by consensus. The UAV10 ’ s state consists of a time⁃varying position and predesigned velocity. In this way, all the UAVs ’ coordinates become one common moving coordinate system. After adding the relative position of each UAV in the moving coordinate system, the desired time⁃varying position is achieved for each UAV. When viewed as a mass of moving points, all these UAVs constitute a virtual body. The formation method consists of both single⁃and double⁃integrator consensus algorithms. The latter aims to concentrate the UAVs with random initial positions and ensure that the desired positions of all the UAVs converge to points on the predesigned trajectory. The ·571· 第 4 期 ZHANG Tianjie, et al.:A modified consensus algorithm for multi⁃UAV formations based on pigeon⁃inspired optimization with a slow diving strategy
.572. 智能系统学报 第12卷 former ensures the distance error between the desired that the directed communication topology graph must and the real position converges to 0[4).The real include a spanning tree position of each UAV can be easily acquired and the To attain the desired position for one single UAV UAV's trajectory is drawn. in the moving coordinate system,the error between the Consensus of moving desired and the real position should be treated as an coordinate system extra variable,this can be described by single- State variables of integrator dynamics for simplicity.The math model can Standard moving coordinate Communication be described as state network Errors added with relative er:=u (3) pesition in the formation and the consensus algorithm is in Ref [9]. Consensus of relative position in the moving UAV's u:=- (4) relative position coordinate system -∑ag(e,-er) in the system =rpi+er (5) JA JA JAV,-UAY Where7;is the real position of UAV:,UAV:in the moving coordinate system,er;is the distance error Fig.1 Frame of consensus controller in multi-UAV formation between the real and the desired position of UAVi,rP which is shown in Fig.2,is the desired position of For the formation problem in a multi-UAV UAV;in the moving coordinate system,i1,2,.., system,the UAVs'states should include both position 9.Equations (1~5)above can be transformed into a and velocity,and the control inputs are the collective matrix form, accelerations4.This model is always described by (L☒L)E+(b☒L)E=-K[(L⑧I)E+(b☒L)]- second-order differentiation equations,and as a result, a consensus algorithm for double-integrator dynamics is K[(L☒I3)E+(b☒13)E] required.The dynamic math model of the multi-UAV e=-(L☒I3)e system is given below, r=r+专=p+er+ (6) where Loxo is the Laplace matrix of the communication (1) topology graph,bRreflects which UAV can directly 5=[x] receive information from UAVo,is the desired where专:,i:h,∈R3 represent the position,.the velocity position and also the position of UAVo,which shows and the acceleration of UAV:,respectively and x,y, the desired path for all the UAVs,finally, are the3-dimensional positions,i∈{1,2,…,l0}.The represents the Kronecker product. consensus algorithm can be described as in Ref [14]. K,=diag(K1,K2,…,Kg),K,=diag(Kl,K2, 三,咳-K--&G,-6]+ 1 …,Ky),er=(erer…erg),f=(r7…fg)T,ris := the vector of the real position of each UAV,g is the 心-k-》-k-5】(2 vector of the position of the origin of each UAV's moving coordinate system. Where a;is element (i,j)in the adjacency matrix Equation (6)is the core of the multi-UAV 10 controller,and r provides the necessary positional A66∈R6x6,andk=∑ag,K.,K.isa3×3size information for all the UAVs for use in the next step. symmetric positive definite matrix,i1,2,..,9, After acquiring the target instructions,the UAVs fly in this work this is replaced by a Pascal matrix for toward their destinations. simplicity.The algorithm has been proven effective and 1.2 Multi-UAV formation task can make all the UAVs'both positions and velocities The desired formation shape is designed as converge to the state of the root.This is because the follows: graph satisfies the sufficient and necessary condition
former ensures the distance error between the desired and the real position converges to 0 [14] . The real position of each UAV can be easily acquired and the UAV’s trajectory is drawn. Fig.1 Frame of consensus controller in multi⁃UAV formation For the formation problem in a multi⁃UAV system, the UAVs’ states should include both position and velocity, and the control inputs are the accelerations [14] . This model is always described by second⁃order differentiation equations, and as a result, a consensus algorithm for double⁃integrator dynamics is required. The dynamic math model of the multi⁃UAV system is given below, ξ · i = ζi ζ · i = μi ξi = [xi yi zi] T ì î í ï ïï ï ï (1) where ξi,ζi,μi∈R 3 represent the position, the velocity and the acceleration of UAVi, respectively and xi,yi,zi are the 3⁃dimensional positions, i∈{1,2,…,10}. The consensus algorithm can be described as in Ref [14]. μi = 1 ki ∑ 9 j = 1 aij[ζ · j - Kri(ξi - ξj) - Kvi(ζi - ζj)] + 1 ki ai10 [ζ · r - Kri(ξi - ξ r ) - Kvi(ζi - ζ r )] (2) Where aij is element ( i, j) in the adjacency matrix A6×6 ∈R 6×6 , and ki = ∑ 10 j = 1 aij , Kri,Kvi is a 3 × 3size symmetric positive definite matrix, i ∈ {1,2,…,9} , in this work this is replaced by a Pascal matrix for simplicity. The algorithm has been proven effective and can make all the UAVs’ both positions and velocities converge to the state of the root. This is because the graph satisfies the sufficient and necessary condition that the directed communication topology graph must include a spanning tree [14] . To attain the desired position for one single UAV in the moving coordinate system, the error between the desired and the real position should be treated as an extra variable, this can be described by single⁃ integrator dynamics for simplicity. The math model can be described as e · ri = ui (3) and the consensus algorithm is in Ref [9]. ui = - ∑ 9 j = 1 aij(eri - erj) (4) r ~ i = rpi + eri (5) Where r ~ i is the real position of UAVi, UAVi in the moving coordinate system, eri is the distance error between the real and the desired position of UAVi, rpi which is shown in Fig. 2, is the desired position of UAVi in the moving coordinate system, i∈{1,2,…, 9}. Equations (1~5) above can be transformed into a collective matrix form, (L I3)ξ ¨ + (b I3)ξ ¨ r =- Kr[(L I3)ξ + (b I3)ξ r ]- Kv[(L I3 )ξ · + (b I3 )ξ ·r ] e · = - (L I3 )e r = r ~ + ξ = rp + er + ξ ì î í ï ï ï ï ï ï (6) where L9×9 is the Laplace matrix of the communication topology graph, b ∈ R 9 reflects which UAV can directly receive information from UAV10 , ξ r is the desired position and also the position of UAV10 , which shows the desired path for all the UAVs, finally, represents the Kronecker product. Kr = diag(Kr1 ,Kr2 ,…,Kr9 ),Kv = diag ( Kv1 ,Kv2 , …,Kv9 ),er = (er T 1 er T 2… er T 9 ) T ,r ~ = ( r ~ T 1 r ~ T 2…r ~ T 9 ) T ,r is the vector of the real position of each UAV, ξ is the vector of the position of the origin of each UAV’ s moving coordinate system. Equation ( 6 ) is the core of the multi⁃UAV controller, and r provides the necessary positional information for all the UAVs for use in the next step. After acquiring the target instructions, the UAVs fly toward their destinations. 1.2 Multi⁃UAV formation task The desired formation shape is designed as follows: ·572· 智 能 系 统 学 报 第 12 卷
ZHANG Tianjie,et al.:A modified consensus algorithm for multi-UAV formations based on 第4期 pigeon-inspired optimization with a slow diving strategy .573. Table 1 Positions of 9 UAVs under the desired formation receive information from UAV,.There is also a virtual structure m leader named UAVio,which can emit an arrow to UAV number x another UAV so that the latter knows the predesigned UAV 0 12.60 10.00 trajectory. UAV2 -1.50 10.00 10.00 The communication topology among UAVs is UAV3 1.50 10.00 described as a directed graph in Fig.3.A graph with n 10.00 vertices can be denoted by G(V,E,A),V=(v1,2, UAVa -6.50 0 0 …,tn)is the set of vertices of graph G and:isa UAVs -8.00 -2.60 0 vertex.E=(e,e2,,e)is the edge set in which e UAV. -5.00 -2.60 0 means an edge.A=[a]demonstrates the adjacency UAV] 6.50 0 0 matrix of the graph G.If there is an edge from v:to, UAVs 5.00 -2.60 0 then a>0,otherwise ag=0.Usually,a=1 if a UAVo 8.00 -2.60 0 particular value is not given.The value of a is the weight of communication amount between vi and v;, which can influence the convergence speed between v UAV. 10 UAV. UAV. and v;.In the graph G(V,E,A),if there is a set of edges,(,2),(2,3),…(v.-1,n),then there 0 is a path from v to v.If there is a vertex from which 10 many edges reach all the remaining vertices,then this UAV UAV 4UaV。 vertex is called a root and G(V,E,A)includes a direct ym o 10 spanning tree. -10 x/m Fig.2 Formation structure for UAVs There are nine UAVs flying at two different heights,they are divided into three groups,each group containing three UAVs.In every small group,the UAVs form a triangle and these are the same size,so the UAVs fly symmetrically.The side length of the triangle and the distance between different triangles is 6 much larger than the size of an UAV,so they can fly Fig.3 Communication topology for multiple UAVs safely without danger of collision.The desired heights For consensus,the adjacency matrix and the in Table I are relative values and the height of the Laplace matrix were acquired.The Laplace matrix L origin of the moving coordinate system is their reference [l]was the same size as the adjacency matrix.In the height,so when a UAV flies at zero or even a negative mati,h,=-ay,ifi≠j,otherise=立4,: height,it is still acceptable.The blue star in Fig.2 j=1j≠i According to the sufficient and necessary represents the center of the formation.x/m means condition of multi-UAV consensus,there is a directed distance along the x-axis in meters. spanning tree with its root UAVo,in the When designing the communication plan,the UAVs were numbered from I to 9.To clarify the plan, communication topology graph,and all the state variables converge to the state of the root as time tends some background knowledge on topology and graph to infinity theory is required. A graph with only vertices and arrows can be 2 Pigeon-inspired optimization with slow adopted to show the communication relationship among diving strategy all the UAVs.Each UAV acts as a vertex,and an 2.1 Basic pigeon-inspired optimization algorithm arrow from UAV to UAV2 means that UAV2 can Qiao [is]studied pigeon behavior when seeking
Table 1 Positions of 9 UAVs under the desired formation structure m UAV number x y z UAV1 0 12.60 10.00 UAV2 -1.50 10.00 10.00 UAV3 1.50 10.00 10.00 UAV4 -6.50 0 0 UAV5 -8.00 -2.60 0 UAV6 -5.00 -2.60 0 UAV7 6.50 0 0 UAV8 5.00 -2.60 0 UAV9 8.00 -2.60 0 Fig.2 Formation structure for UAVs There are nine UAVs flying at two different heights, they are divided into three groups, each group containing three UAVs. In every small group, the UAVs form a triangle and these are the same size, so the UAVs fly symmetrically. The side length of the triangle and the distance between different triangles is much larger than the size of an UAV, so they can fly safely without danger of collision. The desired heights in Table 1 are relative values and the height of the origin of the moving coordinate system is their reference height, so when a UAV flies at zero or even a negative height, it is still acceptable. The blue star in Fig. 2 represents the center of the formation. x / m means distance along the x⁃axis in meters. When designing the communication plan, the UAVs were numbered from 1 to 9. To clarify the plan, some background knowledge on topology and graph theory is required. A graph with only vertices and arrows can be adopted to show the communication relationship among all the UAVs. Each UAV acts as a vertex, and an arrow from UAV1 to UAV2 means that UAV2 can receive information from UAV1 . There is also a virtual leader named UAV10 , which can emit an arrow to another UAV so that the latter knows the predesigned trajectory. The communication topology among UAVs is described as a directed graph in Fig. 3. A graph with n vertices can be denoted by G(V,E,A) , V = (v1 ,v2 , …,vn ) is the set of vertices of graph G and vi is a vertex. E = (e1 ,e2 ,…,en ) is the edge set in which ei means an edge. A = [aij] demonstrates the adjacency matrix of the graph G. If there is an edge from vi to vj , then aij > 0, otherwise aij = 0. Usually, aij = 1 if a particular value is not given. The value of aij is the weight of communication amount between vi and vj , which can influence the convergence speed between vi and vj . In the graph G(V,E,A) , if there is a set of edges, (v1 ,v2 ) , (v2 ,v3 ) ,… (vn-1 ,vn ) , then there is a path from v1 to vn . If there is a vertex from which many edges reach all the remaining vertices, then this vertex is called a root and G(V,E,A) includes a direct spanning tree. Fig.3 Communication topology for multiple UAVs For consensus, the adjacency matrix and the Laplace matrix were acquired. The Laplace matrix L = [l ij] was the same size as the adjacency matrix. In the matrix, l ij = - aij , if i ≠ j , otherwise l ii = ∑ n j = 1,j≠i aij . According to the sufficient and necessary condition of multi⁃UAV consensus, there is a directed spanning tree with its root UAV10 , in the communication topology graph, and all the state variables converge to the state of the root as time tends to infinity [11] . 2 Pigeon⁃inspired optimization with slow diving strategy 2.1 Basic pigeon⁃inspired optimization algorithm Qiao [15] studied pigeon behavior when seeking ·573· 第 4 期 ZHANG Tianjie, et al.:A modified consensus algorithm for multi⁃UAV formations based on pigeon⁃inspired optimization with a slow diving strategy
.574. 智能系统学报 第12卷 their way home and proposed the two-stage algorithm. overshoot,but the result is not desirable,and the The first stage of PIO is the map and compass part.X= altitude of a UAV drops to the standard height with [XX…X…X]T is the vector of the pigeons' continual oscillation.To improve the performance of the positions,i is the number of virtual pigeons,and in algorithm,better methods are required. this paper the UAVs fly in 3-dimensional space,X=2.2 Improved method with slow diving strategy [x ]V=[VI V.V]T is the vector of the The problem in the former algorithm occurs mainly pigeons'velocities,V=[]is its element, in the altitude part,the height of the UAV decreases i∈{l,2,…,9}.Both X and V are randomly too fast and it dives sharply.To slow the process,the initialized when the program starts.The update update formula should be changed,not only to consider formulas are: the best local position but also the slowest diving (e:(t+1)=:(t)·exp(-Rt)+rand·(xe-x:) velocity.In the first part of PIO,the formula is not x:(t+1)=x,(t)+,(t+1) changed,but the method of acquiring the best local if t Tlabed (7) position is modified.The criterion is a linear In the formula above,the best local position combination of the distance error between the desired is selected from the distance between the desired and and the real positions and the diving speed.An UAV present positions.In order to find the best local that dives very slowly is imitated by the others.This is the so-called "slow diving strategy".The update position,all the distance errors are calculated and formula is: ranked,and only the position with the smallest e:(t+1)=v:(t)·exp(-Rt)+ distance is defined as the best local position.For the minimum optimization question,the fitness function is: r·[p·(xhea-x:)+(1-p)·] fitness(x (t))=x(t)-xmmdm(t)(8) x:(t+1)=x,(t)+,(t+1) After calculating the distance between the desired if t Tubd (10) and the present positions of all the pigeons,only half where serves as a regulatory factor in the range of(0, of the pigeons with lower fitness are selected for the 1);if approaches 0,the slowest pigeon will iteration.This ensures that the positions of these influence the team;if o approaches 1,best local pigeons converge to the desired position as fast as position will have much more of an effect.r(0,1)is possible. a random number. In the second part,the landmark operator is The second part of PIO should be modified in the employed.Its update formula is: following way:all the UAVs converge to the center of wu+1)=水四 the flock,which is calculated by a combination of the 2 UAV positions and the slowest diving speed.The ∑,()·fx,() update equation can be described as: xc(t+1)= ∑fx,(t) +1)=0 2 x,(t+1)=x,(t)+rand·(xc(t+1)-x:(t)) ∑x,()fx,()+10·x·f代x) ift≥Tlabel (9) xc(t+1)= ∑fx,(t)+10·xa·fx) Apparently,xic is the weighted average of all the x,(t+1)=x,(t)+rand·(xc(t+1)-x:(t)) nearby pigeons in pigeon i's communication distance. (11) It is supposed that the map and compass operator has ift≥Tlabel where x is the position of the UAV with the slowest already converged pigeons to the desired trajectory to a diving speed,this part represents the slowing diving great extent.As a result,the landmark operator only strategy. needs to speed up this process.The PIO algorithm has Fitness function Eq.(10)remains.Each UAV already been proven to be robust and reliable for an aircraft path planning problems However,for the has its own fitness value. optimization of consensus,PIO can improve the
their way home and proposed the two⁃stage algorithm. The first stage of PIO is the map and compass part. X = [X T 1 X T 2 … X T i … X T n ] T is the vector of the pigeons’ positions, i is the number of virtual pigeons, and in this paper the UAVs fly in 3⁃dimensional space, Xi = [xi yi zi] T . V = [V T 1 V T 2 … V T n ] T is the vector of the pigeons’ velocities, Vi = [vx vy vz] T is its element, i ∈{1,2,…,9} . Both X and V are randomly initialized when the program starts. The update formulas are: vi(t + 1) = vi(t)·exp( - Rt) + rand·(xlbest - xi) xi(t + 1) = xi(t) + vi(t + 1) { if t < Tlabel (7) In the formula above, the best local position xlbest is selected from the distance between the desired and present positions. In order to find the best local position, all the distance errors are calculated and ranked, and only the position with the smallest distance is defined as the best local position. For the minimum optimization question, the fitness function is: fitness(xi(t)) = xi(t) - xistandard(t) (8) After calculating the distance between the desired and the present positions of all the pigeons, only half of the pigeons with lower fitness are selected for the iteration. This ensures that the positions of these pigeons converge to the desired position as fast as possible. In the second part, the landmark operator is employed. Its update formula is: Np(t + 1) = Np(t) 2 xiC(t + 1) = ∑xi(t)·f(xi(t)) ∑f(xi(t)) xi(t + 1) = xi(t) + rand·(xiC(t + 1) - xi(t)) ì î í ï ï ïï ï ï ïï if t ≥ Tlabel (9) Apparently, xiC is the weighted average of all the nearby pigeons in pigeon i’ s communication distance. It is supposed that the map and compass operator has already converged pigeons to the desired trajectory to a great extent. As a result, the landmark operator only needs to speed up this process. The PIO algorithm has already been proven to be robust and reliable for an aircraft path planning problem [15] . However, for the optimization of consensus, PIO can improve the overshoot, but the result is not desirable, and the altitude of a UAV drops to the standard height with continual oscillation. To improve the performance of the algorithm, better methods are required. 2.2 Improved method with slow diving strategy The problem in the former algorithm occurs mainly in the altitude part, the height of the UAV decreases too fast and it dives sharply. To slow the process, the update formula should be changed, not only to consider the best local position but also the slowest diving velocity. In the first part of PIO, the formula is not changed, but the method of acquiring the best local position is modified. The criterion is a linear combination of the distance error between the desired and the real positions and the diving speed. An UAV that dives very slowly is imitated by the others. This is the so⁃called “ slow diving strategy ”. The update formula is: vi(t + 1) = vi(t)·exp( - Rt) + r·[φ·(xlbest - xi) + (1 - φ)·vslow ] xi(t + 1) = xi(t) + vi(t + 1) ì î í ï ï ïï if t < Tlabel (10) where φ serves as a regulatory factor in the range of (0, 1) ; if φ approaches 0, the slowest pigeon will influence the team; if φ approaches 1, best local position will have much more of an effect. r ∈ (0,1) is a random number. The second part of PIO should be modified in the following way: all the UAVs converge to the center of the flock, which is calculated by a combination of the UAV positions and the slowest diving speed. The update equation can be described as: Np(t + 1) = Np(t) 2 xiC(t + 1) = ∑xi(t)·f(xi(t)) + 10·xslow·f(xslows) ∑f(xi(t)) + 10·xslow·f(xslow) xi(t + 1) = xi(t) + rand·(xiC(t + 1) - xi(t)) ì î í ï ï ïï ï ï ïï if t ≥ Tlabel (11) where xslow is the position of the UAV with the slowest diving speed, this part represents the slowing diving strategy. Fitness function Eq. ( 10) remains. Each UAV has its own fitness value. ·574· 智 能 系 统 学 报 第 12 卷
第4期 ZHANG Tianjie,et al.:A modified consensus algorithm for multi-UAV formations based on pigeon-inspired optimization with a slow diving strategy ·575. 2.3 Convergence analysis of SD-PIO based on velocities,z(i)is theith element of z(t),i1,2,..., Banach fixed-point theorem 2n} In this section,convergence analysis of SD-PIO is z(i)∈u(z(t)=[xan(d))…xa(t)h(t)…u(t)]' performed and a sufficient condition is proposed for the z()∈z),3K∈Na.st.u(2(t)=K&(). first part of SD-PIO SD-PIO-I).The main method d(T(a),T(a2)=max|T(a)-T(2)|= 1安12山 adopted to prove the stability of iteration is the Banach max Fz+Bu(z)-Fz2 Bu(z2)= 1名i名2m fixed-point theorem,as stated below: ‖Fz1+BK11-Fz2-BK22‖。(13) A map T from a normed space (V,d())into itself There is is a contraction if there exists入∈[0,l)such that for L=F(a-2)+B.[0…xa-x%0…6-%0…0] alx,y∈V,d(T(x),T(y))≤Ad(x,y) (14) In the first part of SD-PIO,each pigeon updates s.t.L=ll Fz +BK2-Fz2 BK222ll,thus the its speed and location according to Eq.(10),which largest element of Fz+BK21-Fz2 BK222 is can be transformed as selected and denoted as supposing it is thej thelement.F.and B..are the whole j th row of F and B, exp(-Rt) respectively.Ifjn,is the j th element of K121 -K2z2 and via-v2 is the (j n)th one, (12) otherwisex-xz is the (j-n)th element and v-v when it considers all the pigeons,(t 1)= is the j th one. Fz(t)+Bu(z(t)), |L=|F.(a-2)+B.[0…k4z0…k40…0]= where z(t)=[x(t)…xn(t),(t)…vn(t)]T, |F(a1-z2)+B.[0…k1…k2…0]4z≤ [1-9 exp(-Rt) ‖F(a1-2)+BK(a1-a2)‖。≤ lF+BK‖m‖a1-22‖。=lF+BKld(z1,a2) 1-ro exp(-Rt) (15) exp(-Rt) Whee4=maxl(i)-z(i)l,k,=xa-xsl/△, 1名i62 |k2=。-2sl/△,k,k2∈[-1,0)U(0,1], -rO exp(-Rt)」 「0 01 1-9 k K= ,,k2 lies in the j th column, 1- k2 B =r 1- Lo 0 The equation holds if and only ifx=v- 1-9 026=Az. u(z(t)=[xeai(t)…xuan(t)un(t)…tahn(t)] The Banach fixed-point theorem requires llF+ If iteration z(t+1)=Fz(t)+Bu(z(t))has a fixed- BKI∈[0,1) pointed',as a result,z'=Fz·+Bu(z),then max1-or exp(-Rt)+rk o +rk2(1-), SD-PIO-I will converge to 2'and each pigeon will gr exp(-Rt)+rk rk2(1-p)<1 reach its final stable location as time tends to infinity. Denoting Z as a real number space R"and distance -仁120e1-o d(y)=yyeZ,Zhas been proven (16) as a complete metric space.There is a map T:Z-Z, Equation (16)is a sufficient condition for the T(z)=Fz+Bu(z),whereF∈R2xn,B∈R2nxn, convergence of SD-PIO-I,once it is satisfied,we have u(z)=K2x2n2,then Vz1,2∈Z,T(z1),T(a2)∈Z, d(T(x),T(y)≤IF+BKld(x,y),IF+BKIe∈ it can be proven that if d(T(x),T(y))sAd(x,y), [0,1),there exists a fixed-point z'satisfying z'= A [0,1),there exists a fixed-point which satisfies Fz·+Bu(z) the condition 2'=T(2)=Fz'Bu(z)and the x=v+x states of SD-PIO-I converge to z'.The following x=x+exp(-Ri)v'+r(imx)+(1-)vi] proves this conclusion. (17) Because the best local location and the slowest m°=0 Equation(17)→ .the states of SD-PIO-I speed are among all the pigeons'locations and
2. 3 Convergence analysis of SD⁃PIO based on Banach fixed⁃point theorem In this section, convergence analysis of SD⁃PIO is performed and a sufficient condition is proposed for the first part of SD⁃PIO ( SD⁃PIO⁃I). The main method adopted to prove the stability of iteration is the Banach fixed⁃point theorem, as stated below: A map T from a normed space (V,d()) into itself is a contraction if there exists λ ∈ [0,1) such that for all x,y ∈ V,d(T(x),T(y)) ≤ λd(x,y) . In the first part of SD⁃PIO, each pigeon updates its speed and location according to Eq. (10), which can be transformed as xi(t + 1) vi(t + 1) é ë ê ê ù û ú ú = 1 - rφ exp( - Rt) - rφ exp( - Rt) é ë ê ê ù û ú ú xi(t) vi(t) é ë ê ê ù û ú ú + r φ 1 - φ φ 1 - φ é ë ê ê ù û ú ú xlbest vslow é ë ê ê ù û ú ú (12) when it considers all the pigeons, z(t + 1) = Fz(t) + Bu(z(t)) , where z(t) = x1(t) … xn(t) v1(t) … v [ n(t) ] T , F = 1 - rφ exp( - Rt) ⋱ ⋱ 1 - rφ exp( - Rt) - rφ exp( - Rt) ⋱ ⋱ - rφ exp( - Rt) é ë ê ê ê ê ê ê ê ù û ú ú ú ú ú ú ú B = r φ 1 - φ ⋱ ⋱ φ 1 - φ φ 1 - φ ⋱ ⋱ φ 1 - φ é ë ê ê ê ê ê ê ê ù û ú ú ú ú ú ú ú u(z(t))= xlbest1(t) … xlbestn(t) vslow1(t) … v [ slown(t)] T . If iteration z(t + 1) = Fz(t) + Bu(z(t)) has a fixed⁃ pointed z ∗ , as a result, z ∗ = Fz ∗ + Bu(z ∗ ) , then SD⁃PIO⁃I will converge to z ∗ and each pigeon will reach its final stable location as time tends to infinity. Denoting Z as a real number space R n and distance d(x,y) = max 1≤i≤2n xi - yi ,x,y ∈ Z,Zhas been proven as a complete metric space. There is a map T:Z → Z, T(z) = Fz + Bu(z) ,where F ∈ R2n×2n , B ∈ R2n×2n , u(z) = K2n×2n z, then ∀z1 ,z2 ∈Z, T(z1 ),T(z2 ) ∈Z, it can be proven that if d(T(x),T(y)) ≤ λd(x,y), λ ∈[0,1), there exists a fixed⁃point which satisfies the condition z ∗ = T(z ∗ ) = Fz ∗ + Bu(z ∗ ) and the states of SD⁃PIO⁃I converge to z ∗ . The following proves this conclusion. Because the best local location and the slowest speed are among all the pigeons ’ locations and velocities, z(i) is theith element of z(t), i ∈ {1,2,…, 2n} ∀z(i)∈u(z(t))= xlbest1(t) … xlbestn(t) vslow1(t) … v [ slown(t)] T z(i) ∈z(t),∃K ∈N2n×2n,s.t. u(z(t))= Kz(t) . d(T(z1 ),T(z2 )) = max 1≤i≤2n T(z1 ) - T(z2 ) = max 1≤i≤2n Fz1 + Bu(z1 ) - Fz2 - Bu(z2 ) = ‖Fz1 + BK1 z1 - Fz2 - BK2 z2‖¥ (13) There is Lj = F·j (z1 -z2) +B·j [0 … x1a - x2b 0 … v1a - v2b 0 … 0] T (14) s.t. Lj =‖ Fz1 + BK1 z1 - Fz2 - BK2 z2‖¥, thus the largest element of Fz1 + BK1 z1 - Fz2 - BK2 z2 is selected and denoted as Lj supposing it is the j thelement. F·j and B·j are the whole j th row of F and B, respectively. If j ≤ n,x1a - x2b is the j th element of K1 z1 -K2 z2 and v1a - v2b is the (j + n) th one, otherwise x1a - x2b is the (j - n) th element and v1a - v2b is the j th one. Lj = F·j (z1 - z2)+ B·j [0 … k1Δz 0 … k2Δz 0 … 0] T = F·j (z1 - z2 ) + B·j [0…k1…k2…0] TΔz ≤ ‖F(z1 - z2 ) + BK(z1 - z2 )‖¥ ≤ ‖F + BK‖¥‖z1 - z2‖¥ = ‖F + BK‖¥d(z1 ,z2 ) (15) Where Δz = max 1≤i≤2n z1(i) - z2(i) , k1 = x1a - x2b / Δz, k2 = v1a - v2b / Δz,k1 ,k2 ∈ [ - 1,0) ∪ (0,1], K = 0 0 ⋱ k1 ︙ k2 0 0 é ë ê ê ê ê ê ê ù û ú ú ú ú ú ú ,k1 ,k2 lies in the j th column, The equation holds if and only if x1a - x2b = v1a - v2b = Δz . The Banach fixed⁃point theorem requires ‖F + BK‖¥ ∈ [0,1) ⇔max{ 1 - φr + exp( - Rt) + rk1φ + rk2(1 - φ) , - φr + exp( - Rt) + rk1φ + rk2(1 - φ) } < 1 ⇔ - 1 < - φr + exp( - Rt) + rk1φ + rk2(1 - φ)<0 - 1 ≤ k1 ,k2 ≤ 1,k1 k2 ≠ 0 { (16) Equation ( 16) is a sufficient condition for the convergence of SD⁃PIO⁃I, once it is satisfied, we have d(T(x),T(y))≤‖F + BK‖¥d(x,y),‖F + BK‖¥ ∈ [0,1), there exists a fixed⁃point z ∗ satisfying z ∗ = Fz ∗ +Bu(z ∗ ) x ∗= v ∗+ x ∗ x ∗= x ∗+ exp(- Rt)v ∗+ r[φ(x ∗ lbest - x ∗ ) + (1 - φ)v ∗ slow] { (17) Equation (17) ⇒ v ∗ = 0 x ∗ = x ∗ lbest { , the states of SD⁃PIO⁃I ·575· 第 4 期 ZHANG Tianjie, et al.:A modified consensus algorithm for multi⁃UAV formations based on pigeon⁃inspired optimization with a slow diving strategy
,576 智能系统学报 第12卷 can only converge to a local optimal solution because where T is the moment when consensus replaces SD- each pigeon can only receive its neighbor's information instead of global information. PI0,±×arctan(R×(t-T)+0.5,the For the second part of SD-PIO,all the pigeons inverse trigonometric function,is utilized to modify the converge to the flock's center.Their trajectories can only output of the SD-PIO and consensus,it works as a contract and approach each other.In formation processes this part serves only as a transition,and as the traces do switch key to smoothly connect the two parts of the not diverge this second part satisfies the demands. trajectory.f((t),(t))is the output velocity of SD- Multi-UAV formations with a SD- PI0 from Eqs.(7)and(9).The implementation PIO-based consensus algorithm procedure of the SD-PIO-based consensus algorithm is given as follows: The first three sections introduced consensus and 1)Input the number of UAVs in the formation SD-PIO,now this section uses them to optimize the trajectory of an UAV.When consensus is only applied task,give the expectation formation and consider the to the planned routes of all the UAVs,there is communication restriction. unavoidable overshoot because of the distributed control 2)Design a suitable communication network plan itself.Every UAV has its own limited topology and acquire its adjacency matrix and the communication relationship,which means it cannot Laplace matrix. always receive the desired trajectory.For example, 3)Randomly initialize the positions and velocities UAV can never acquire the planned trajectory of all the UAVs along with the parameters involved in directly,it relies on UAV,to pass on the information. If UAV3's initial altitude is much too high,it must the equations used in Eq.(18). dive to meet the desired trajectory.Because of inertia 4)Calculate the acceleration input,consensus, and a lack of forecasting,UAV,can only produce and the velocity input using Eq.(18) positive additions to its height when it goes below the 5)Update each UAV's position and velocity using desired trajectory and this is the reason for overshoot. Eq.(18) However,as for UAV,only when its leader UAV3 6)Draw the UAV's trajectory according to its flies below the proposed height does it begin to position in each iteration. recognize that it should fly a bit higher,then its velocity is changed by Eq.(7),followed by its The process is summarized in the following position.This process is much slower than a change in process flow chart: the desired route.As a result,the UAVs'trajectories Design the desired dive sharply and appear to overshoot,then they formation and give the converge to the desired trajectory.SD-PIO provides a communication topology gentle and slowly changing trajectory for each UAV at Acquire the adjacency matrix the start,then,as soon as the UAVs fly down and and the Laplace martix meet the desired trajectory,consensus starts to operate. Calculate the acceleration Initialize the positions and input by the second stage The formula of the combination of these two velocities of all the UAVs formula of SD-PIO,Eq.(18) algorithms can be described as: [x(t+1)=x(t)+T·((t+1)+,(t+1))》 Calculate the acceleration Update the velocity and input by the first stage v(t+1)=v(t)+T·a(t+1) position of each UAV formula of SD-PIO.Eq.(18) a(t+1)=-(L☒3)·x(t+1)+ Ift>Tlabel Update the velocity and N y·(L☒I3)·(t+1))· position of each UAV Y 三×arctan(R×(t-Ths)+0.5) Tlabel Draw the trajectories of UAVs 2(t+1)=f(x(t),v(t))· N ×arctan(R×(t-Te)+0.5) Fig.4 The process of the SD-PIO based consensus (18) algorithm for a multi-UAV formation
can only converge to a local optimal solution because each pigeon can only receive its neighbor’s information instead of global information. For the second part of SD⁃PIO, all the pigeons converge to the flock’s center. Their trajectories can only contract and approach each other. In formation processes this part serves only as a transition, and as the traces do not diverge this second part satisfies the demands. 3 Multi⁃UAV formations with a SD⁃ PIO⁃based consensus algorithm The first three sections introduced consensus and SD⁃PIO, now this section uses them to optimize the trajectory of an UAV. When consensus is only applied to the planned routes of all the UAVs, there is unavoidable overshoot because of the distributed control plan itself. Every UAV has its own limited communication relationship, which means it cannot always receive the desired trajectory. For example, UAV1 can never acquire the planned trajectory directly, it relies on UAV3 to pass on the information. If UAV3 ’ s initial altitude is much too high, it must dive to meet the desired trajectory. Because of inertia and a lack of forecasting, UAV3 can only produce positive additions to its height when it goes below the desired trajectory and this is the reason for overshoot. However, as for UAV1 , only when its leader UAV3 flies below the proposed height does it begin to recognize that it should fly a bit higher, then its velocity is changed by Eq. ( 7 ), followed by its position. This process is much slower than a change in the desired route. As a result, the UAVs’ trajectories dive sharply and appear to overshoot, then they converge to the desired trajectory. SD⁃PIO provides a gentle and slowly changing trajectory for each UAV at the start, then, as soon as the UAVs fly down and meet the desired trajectory, consensus starts to operate. The formula of the combination of these two algorithms can be described as: x(t + 1) = x(t) + T·(v(t + 1) + v2(t + 1)) v(t + 1) = v(t) + T·a(t + 1) a(t + 1) = - ((L I3 )·x(t + 1) + γ·(L I3 )·v(t + 1))· ( 1 π × arctan(R × (t - Tchg)) + 0.5) v2(t + 1) = f(x(t),v(t))· ( - 1 π × arctan(R × (t - Tchg)) + 0.5) ì î í ï ï ï ï ï ï ï ï ï ï ï ï (18) where Tchg is the moment when consensus replaces SD⁃ PIO, ± 1 π × arctan (R × (t - Tchg)) + 0.5, the inverse trigonometric function, is utilized to modify the output of the SD⁃PIO and consensus, it works as a switch key to smoothly connect the two parts of the trajectory. f(x(t),v(t)) is the output velocity of SD⁃ PIO from Eqs. ( 7 ) and ( 9 ). The implementation procedure of the SD⁃PIO⁃based consensus algorithm is given as follows: 1) Input the number of UAVs in the formation task, give the expectation formation and consider the communication restriction. 2 ) Design a suitable communication network topology and acquire its adjacency matrix and the Laplace matrix. 3)Randomly initialize the positions and velocities of all the UAVs along with the parameters involved in the equations used in Eq. (18). 4) Calculate the acceleration input, consensus, and the velocity input using Eq. (18) 5)Update each UAV’s position and velocity using Eq. (18) 6) Draw the UAV’ s trajectory according to its position in each iteration. The process is summarized in the following process flow chart: Fig. 4 The process of the SD⁃PIO based consensus algorithm for a multi⁃UAV formation ·576· 智 能 系 统 学 报 第 12 卷
ZHANG Tianjie,et al.:A modified consensus algorithm for multi-UAV formations based on 第4期 pigeon-inspired optimization with a slow diving strategy .577. 4 Simulation results and analysis from this perspective,however,it is important to clarify the fact that the trajectories do not cross and the To validate the SD-PIO-based consensus, UAVs do not crash in the sky. numerical simulations were conducted using MATLAB 2012a on a PC with an i5-480 m,2.67 GHz CPU and UAV Windows 10 operating system.SD-PIO was compared 40 with consensus itself and consensus with two other 号20 optimizations,particle swarm optimization (PSO)and UAV 6005004003002001000分m UAV PIO.The results prove the effectiveness of SD-PIO. m The main parameters in Egs.(7),(9),and Fig.5 Trajectories of 9 UAVs by consensus (18)are listed in Table 2 below.The time step for the simulation is T=0.05 s. To demonstrate the advantage of SD-PIO-based Table 2 Simulation parameters consensus,PSO-and PIO-based consensus algorithms were adopted to compare their performances with that R Tule/s Tae/s of the modified algorithm.In Fig.6,most trajectories 50 4 6 are apparently better than those in Fig.5,as the Nine UAVs were initialized with random positions trajectories are gentle at the beginning and turn without in a 3-dimensional environment as shown in Table 3. diving sharply.UAV3,UAVs,andUAV,serve as a set Table 3 Initial positions of 9 UAVs m of convincing simulations that support the conclusions UAV number above.In Figs.7 and 8,PIO and SD-PIO work and UAV, 20.804 30.971 30.477 further improve the performance.In Fig.7,the UAV2 12.685 37.633 31.252 overshoot almost disappears,but there is still UAV3 22.260 38.622 24.661 fluctuation in the trajectories from this perspective, UAVa 0.318 27.929 9.703 especially for UAV3,UAV,and UAVs.In Fig.8,the UAVs trajectories become much smoother and the overshoot 21.859 15.846 6.076 UAVo can be ignored compared with Figs.5 to 7.This is 16.942 13.999 6.644 reflected in the lowest parts of the trajectories of UAV7 19.293 20.723 17.662 UAV3,UAV,and UAVs. UAVs 24.806 23.061 9.317 UAVo 37.298 12.688 7.861 UAV These heights,the numbers on the z-axis,are 号20 UAV relative positions corresponding to the height of origin UAV of the common moving coordinate system in Fig.2. 0 UAV 60050040030020010006 Each UAV had the same initial position in the four x/m UAV 3/m simulations,i.e.,consensus,PSO-based consensus, Fig.6 Trajectories of 9 UAVs by PSO-based consensus PIO-based consensus,and SD-PIO-based consensus. As known from Sections 2 and 3,the performance of consensus cannot satisfy the requirements,which is 401 UAV UAV UAV clearly displayed in Fig.5.Because of communication UAV 20 -UAV restrictions and lack of forecasting,all the UAVs tend ≠9UAV。 UAVs to dive sharply and climb quickly.This causes great 6005040300200100六m0AV m UAV overload at the turning point,which may damage the UAV's structure and result in difficulties for flight Fig.7 Trajectories of 9 UAVs by PIO-based consensus control.Their routes seem to tie a knot when viewed
4 Simulation results and analysis To validate the SD⁃PIO⁃based consensus, numerical simulations were conducted using MATLAB 2012a on a PC with an i5⁃480 m, 2.67 GHz CPU and Windows 10 operating system. SD⁃PIO was compared with consensus itself and consensus with two other optimizations, particle swarm optimization (PSO) and PIO. The results prove the effectiveness of SD⁃PIO. The main parameters in Eqs. ( 7 ), ( 9 ), and (18) are listed in Table 2 below. The time step for the simulation is T = 0.05 s. Table 2 Simulation parameters γ R Tlabel / s Tchg / s 5 50 4 6 Nine UAVs were initialized with random positions in a 3⁃dimensional environment as shown in Table 3. Table 3 Initial positions of 9 UAVs m UAV number x y z UAV1 20.804 30.971 30.477 UAV2 12.685 37.633 31.252 UAV3 22.260 38.622 24.661 UAV4 0.318 27.929 9.703 UAV5 21.859 15.846 6.076 UAV6 16.942 13.999 6.644 UAV7 19.293 20.723 17.662 UAV8 24.806 23.061 9.317 UAV9 37.298 12.688 7.861 These heights, the numbers on the z⁃axis, are relative positions corresponding to the height of origin of the common moving coordinate system in Fig. 2. Each UAV had the same initial position in the four simulations, i. e., consensus, PSO⁃based consensus, PIO⁃based consensus, and SD⁃PIO⁃based consensus. As known from Sections 2 and 3, the performance of consensus cannot satisfy the requirements, which is clearly displayed in Fig. 5. Because of communication restrictions and lack of forecasting, all the UAVs tend to dive sharply and climb quickly. This causes great overload at the turning point, which may damage the UAV’ s structure and result in difficulties for flight control. Their routes seem to tie a knot when viewed from this perspective, however, it is important to clarify the fact that the trajectories do not cross and the UAVs do not crash in the sky. Fig.5 Trajectories of 9 UAVs by consensus To demonstrate the advantage of SD⁃PIO⁃based consensus, PSO⁃ and PIO⁃based consensus algorithms were adopted to compare their performances with that of the modified algorithm. In Fig. 6, most trajectories are apparently better than those in Fig. 5, as the trajectories are gentle at the beginning and turn without diving sharply. UAV3 ,UAV5 , andUAV9 serve as a set of convincing simulations that support the conclusions above. In Figs. 7 and 8, PIO and SD⁃PIO work and further improve the performance. In Fig. 7, the overshoot almost disappears, but there is still fluctuation in the trajectories from this perspective, especially for UAV3 ,UAV4 , and UAV5 . In Fig. 8, the trajectories become much smoother and the overshoot can be ignored compared with Figs. 5 to 7. This is reflected in the lowest parts of the trajectories of UAV3 ,UAV4 , and UAV5 . Fig. 6 Trajectories of 9 UAVs by PSO⁃based consensus Fig. 7 Trajectories of 9 UAVs by PIO⁃based consensus ·577· 第 4 期 ZHANG Tianjie, et al.:A modified consensus algorithm for multi⁃UAV formations based on pigeon⁃inspired optimization with a slow diving strategy
.578. 智能系统学报 第12卷 The performance of the x trajectories can be UAV: UAV evaluated using change of direction as the criterion. -UAV After starting,if a UAV flies along a smooth trajectory 号20 —UAVg towards the desired one without turning,it must have 0月 6005004003002001000⊙mUAV4 UAV chosen the best and straightest trajectory.An y/m undesirable trajectory leads the UAV into making frequent turns,which costs too much energy and Fig.8 Trajectories of 9 UAVs by SD-PIO-based consensus increases the probability of collision. The effect of the SD-PIO-based consensus can be The X trajectories in Figs.9 and 10 belong to the analyzed according to the x and z trajectories,but the y undesirable type.In Fig.9,the trajectories of UAV2, trajectory by consensus is smooth enough and does not UAV,and UAV,curve during flight and each of them need improvement.In Figs.9 and 10,oscillation is turns twice or even three times.UAV,flies in the more obvious than the counterparts in Figs.11 and 12. direction in which x becomes larger,however at nearly The undesirable trajectories in Fig.11 partly explain the t=1 s,x begins to decrease,then at nearly t=3.5 s,x shortcomings of PIO. increases again and meets the desired value at t=5 s. 40 30 This conflict process can be explained by its topology. 30 In Fig.3,UAV,is deeply influenced by UAV,and 10 UAV3.Unfortunately,all its neighbors start at positions further along the x-axis,which means that UAV2 has to -10 15 20 catch up with its neighbor by consensus without Fig.9 X trajectory of 9 UAVs by consensus reaching its desired positions.Only when its leader realizes they have deviated from the right position,do they begin to correct it.This is a result of a lack of 304 forecasting.However,if UAV2 can directly receive 票20 information from UAVo perhaps it can perform better. UAV UAV This is the limit of communication.These two UAV; 10 20 shortcomings illustrate the necessity for improvement. s In Fig.10,the situation is not any better.UAV3, Fig.10 X trajectories of 9 UAVs by PSO-based consensus UAV,UAVs,and UAVs have the same problem.For 40 UAV example,UAVs starts at x=24.806 m at t=0,then x= 30 UAV UAV 21.5 m at nearly t=2 s,then x=25.1 m at nearly t=6 UAV: UAV s and decreases to x =18.3 m at t=8 s;the real UAV UAV trajectory goes up and down frequently.This implies UAV UAV PSO cannot satisfy the demand of performance 1 20 t/s improvement. Fig.11 X trajectories of 9 UAVs by PlO-based consensus In Figs.11 and 12,fluctuation weakens and the 40 trajectories become much smoother.Most UAVs can fly UAV UAV 30 UAV directly towards their destinations at t=7 s.In Fig.11, UAV 层20 -UAV the x position of UAVs decreases quickly from the UAV beginning to t=5 s,this fast slide slip spoils the 10 UAV UAV performance as it causes a sliding movement of a flying 15 20 UAV and makes it harder to control. Fig.12 X trajectories of 9 UAVs by SD-PIO-based consensus In Fig.12,most trajectories are pleasant and
Fig. 8 Trajectories of 9 UAVs by SD⁃PIO⁃based consensus The effect of the SD⁃PIO⁃based consensus can be analyzed according to the x and z trajectories, but the y trajectory by consensus is smooth enough and does not need improvement. In Figs. 9 and 10, oscillation is more obvious than the counterparts in Figs. 11 and 12. The undesirable trajectories in Fig.11 partly explain the shortcomings of PIO. Fig. 9 X trajectory of 9 UAVs by consensus Fig. 10 X trajectories of 9 UAVs by PSO⁃based consensus Fig. 11 X trajectories of 9 UAVs by PIO⁃based consensus Fig. 12 X trajectories of 9 UAVs by SD⁃PIO⁃based consensus The performance of the x trajectories can be evaluated using change of direction as the criterion. After starting, if a UAV flies along a smooth trajectory towards the desired one without turning, it must have chosen the best and straightest trajectory. An undesirable trajectory leads the UAV into making frequent turns, which costs too much energy and increases the probability of collision. The X trajectories in Figs. 9 and 10 belong to the undesirable type. In Fig. 9, the trajectories of UAV2 , UAV4 , and UAV7 curve during flight and each of them turns twice or even three times. UAV2 flies in the direction in which x becomes larger, however at nearly t = 1 s, x begins to decrease, then at nearly t = 3.5 s, x increases again and meets the desired value at t = 5 s. This conflict process can be explained by its topology. In Fig. 3, UAV2 is deeply influenced by UAV1 and UAV3 . Unfortunately, all its neighbors start at positions further along the x⁃axis, which means that UAV2 has to catch up with its neighbor by consensus without reaching its desired positions. Only when its leader realizes they have deviated from the right position, do they begin to correct it. This is a result of a lack of forecasting. However, if UAV2 can directly receive information from UAV10 , perhaps it can perform better. This is the limit of communication. These two shortcomings illustrate the necessity for improvement. In Fig. 10, the situation is not any better. UAV3 , UAV4 ,UAV5 , and UAV8 have the same problem. For example, UAV8 starts at x = 24.806 m at t = 0, then x = 21.5 m at nearly t = 2 s, then x = 25.1 m at nearly t = 6 s and decreases to x = 18. 3 m at t = 8 s; the real trajectory goes up and down frequently. This implies PSO cannot satisfy the demand of performance improvement. In Figs. 11 and 12, fluctuation weakens and the trajectories become much smoother. Most UAVs can fly directly towards their destinations at t = 7 s. In Fig. 11, the x position of UAV5 decreases quickly from the beginning to t = 5 s, this fast slide slip spoils the performance as it causes a sliding movement of a flying UAV and makes it harder to control. In Fig. 12, most trajectories are pleasant and ·578· 智 能 系 统 学 报 第 12 卷
第4期 ZHANG Tianjie,et al.:A modified consensus algorithm for multi-UAV formations based on pigeon-inspired optimization with a slow diving strategy ·579 suitable as commands for multi-UAV formations. the designed trajectory. Performance of the z trajectories can be evaluated 40m UAV by the following criterion:diving slowly and never 30外 UAV;UAV: flying up and down.In Fig.13,all the trajectories 20 UAV quickly go down before t=4 s,this means that the 10UAV,UAV.UAV,UAV, UAVs dive sharply in the beginning,as shown in Fig. UAV o 1520 5.The aim of this work is to overcome this shortcoming. Fig.15 Z trajectories of 9 UAVs by PIO-based consensus The trajectories in Fig.14 clarify the advantage of 40 PSO-based consensus because the modified algorithm UAV can effectively reduce oscillation.UAV,and UAVs 20 AV? clarify UAVs this view as they both slowly decrease I0 AV.DAKV UAV their height following the desired trajectory instead of UAV 0 10 1520 diving down swiftly. 修 40- Fig.16 Z trajectories of 9 UAVs by SD-PIO-based consensus 30 UAV The SD-PIO-based consensus algorithm also -UAV 号20 follows the steps above,but the difference occurs in -UAV UAV the first 8 s as the UAVs descend more slowly and 10 smoothly.UAV,UAV2 and UAV3 can support this A UAV, 0 10 15 20 opinion,as they at the same respective height,but at ti t=5 s,UAV2 and UAV3 by SD-PIO are above 20 m Fig.13 Z trajectories of 9 UAVs by consensus while their counterparts are clearly lower.At t=7 s, 40 the height of UAV,by SD-PIO is 16.04 m and the outcome by PIO is only 15.47 m. 30 UAV From the fitness function Eg.(8),the closer the 2 -UAV:UAV UA UAV UAV;approaches its desired location,the smaller the AV 10 一UAV。UAV, fitness value.In Fig.17,as iteration Nc increases, UAV average fitness values decrease and finally approach 0 5 10 15 20 tis zero,and different optimizers possess different convergence rates.In Fig.17(a),PIO shows the Fig.14 Z trajectories of 9 UAVs by PSO-based consensus fastest speed in the beginning,SD-PIO is a bit slower However,reduplicative climb and dive still exist, and PSO shows the weakest performance of the three. which may induce oscillation.The dangerous However,after Ne 80,PIO becomes slow and is trajectories of UAV3 and UAVs at UAVst [0,10]s finally overtaken by SD-PIO at Ne 93.The poor efficiency of PIO is due to the inadequate local in Fig.14 reveal that they are not the desired ones that optimum. descend slowly at the beginning,then converge to the 12 desired trajectory.This is why there is great demand for -SDPIP 10 ·PIO another optimizer. oPSO In Fig.15,PIO is used to apply a gliding procedure.Before 4s,the map and compass operator takes effect and all the UAVs fly towards the best local position.In the time range,4 to 6 s,the landmark operator drives the UAVs nearby to the same height. When all of them fly to the standard height,the PIO 100 200300400 500 part stops working and consensus continues Nc concentrating the UAVs and guiding them to fly along (a)fitness value
suitable as commands for multi⁃UAV formations. Performance of the z trajectories can be evaluated by the following criterion: diving slowly and never flying up and down. In Fig. 13, all the trajectories quickly go down before t = 4 s, this means that the UAVs dive sharply in the beginning, as shown in Fig. 5. The aim of this work is to overcome this shortcoming. The trajectories in Fig. 14 clarify the advantage of PSO⁃based consensus because the modified algorithm can effectively reduce oscillation. UAV2 and UAV8 clarify UAV8 this view as they both slowly decrease their height following the desired trajectory instead of diving down swiftly. Fig. 13 Z trajectories of 9 UAVs by consensus Fig. 14 Z trajectories of 9 UAVs by PSO⁃based consensus However, reduplicative climb and dive still exist, which may induce oscillation. The dangerous trajectories of UAV3 and UAV5 at UAV5 t∈[0, 10] s in Fig.14 reveal that they are not the desired ones that descend slowly at the beginning, then converge to the desired trajectory. This is why there is great demand for another optimizer. In Fig. 15, PIO is used to apply a gliding procedure. Before 4s, the map and compass operator takes effect and all the UAVs fly towards the best local position. In the time range, 4 to 6 s, the landmark operator drives the UAVs nearby to the same height. When all of them fly to the standard height, the PIO part stops working and consensus continues concentrating the UAVs and guiding them to fly along the designed trajectory. Fig. 15 Z trajectories of 9 UAVs by PIO⁃based consensus Fig. 16 Z trajectories of 9 UAVs by SD⁃PIO⁃based consensus The SD⁃PIO⁃based consensus algorithm also follows the steps above, but the difference occurs in the first 8 s as the UAVs descend more slowly and smoothly. UAV1 , UAV2 and UAV3 can support this opinion, as they at the same respective height, but at t = 5 s,UAV2 and UAV3 by SD⁃PIO are above 20 m while their counterparts are clearly lower. At t = 7 s, the height of UAV1 by SD⁃PIO is 16. 04 m and the outcome by PIO is only 15.47 m. From the fitness function Eq. (8), the closer the UAVi approaches its desired location, the smaller the fitness value. In Fig. 17, as iteration Nc increases, average fitness values decrease and finally approach zero, and different optimizers possess different convergence rates. In Fig. 17 ( a ), PIO shows the fastest speed in the beginning, SD⁃PIO is a bit slower and PSO shows the weakest performance of the three. However, after Nc = 80, PIO becomes slow and is finally overtaken by SD⁃PIO at Nc = 93. The poor efficiency of PIO is due to the inadequate local optimum. (a)X fitness value ·579· 第 4 期 ZHANG Tianjie, et al.:A modified consensus algorithm for multi⁃UAV formations based on pigeon⁃inspired optimization with a slow diving strategy