44 X.Zhang et aL Physica A 415 (2014)43-53 is tightly linked to the world economy and to the international trade business cycle;thus it enjoyed a long prosperous period with growing trade at the international level until the financial crisis in 2008.Since then the shipping industry has faced idle capacity,huge losses,and risk of bankruptcy[14].The shipping industry is also dynamic and volatile.The Baltic capsize index(BCl),which measures the volatility in shipping markets,is significantly higher(~79%)than the average volatility in commodity markets(50%)and equity markets(e.g.,S&P50020%)[15].This extremely high risk is not only due to volatility in global economic cycles,but also is highly influenced by intrinsic characteristics of the shipping industry itself.The shipping industry comprises several separate but closely connected markets including the new ship,the second- hand ship,and the freight markets.Each of these markets comprises several tightly integrated sub-sectors according to ship type:oil tanker,dry bulk carrier,and container carrier.Oil tanker is designed for the bulk transport of oil and tankers are generally categorized by size from smallest to largest,e.g..Panamax,Aframax,Suezmax,VLCC and UVLCC.Dry bulk carrier is mainly used to transport dry bulk cargo,such as iron ore,grain and coal.Similar to oil tanker dry bulk ship also can be classified by size into Handysize,Handymax,Panamax,Super-Panamax and VLOC.Dry bulk shipping provides an economical and convenient way to transport three major raw materials to support the world industry.Container shipping provide transportation of containerized goods over sea via regular linear services.According to ship size,container vessel from smallest size to largest one also includes Handymax,Panamax,Post-Panamax and Large Container Vessel. Despite the economic importance of the shipping industry,there are surprisingly few studies about shipping industry risk.Studies of systemic risk in the shipping industry tend to fall into three categories.The first category uses a linear or non-linear stochastic model and focuses on freight rate returns and the volatility of some specific submarkets in the shipping industry[16-18].The second category focuses on asset bubbles caused by the supercycle of the shipping industry and determines how much asset values in the second-hand market deviate from underlying fundamentals [19,20.The third category identifies factors affecting the performance of shipping industry stocks in order to understand the linkage between the real shipping market and financial markets[21,22].Most previous studies focus on individual segments of the shipping industry and not the industry as a whole.Thus these studies ignore the interactions among different market sectors that are likely to compound systemic risk. In this paper we use the correlation-based network and the causality measures to examine the structure and dynamics of the shipping industry.We begin our analysis by using the minimal spanning tree (MST)and the hierarchical tree(HT)to examine the topology of correlation networks among different submarkets and ship types of the shipping industry during the pre-crisis,crisis,and post-crisis periods.Then we use a causality analysis based on Granger-causality and Brownian distance correlation to explore the directional connections between the physical market and the financial market of the shipping industry before,during,and after the financial crisis. 2.Methods 2.1.Network topology Using the minimal spanning tree(MST)and hierarchical tree(HT).we study the structure and dynamics of the shipping industry and explore the hierarchical structure of various time series.Hierarchical structure methods have been introduced in finance to ascertain the structure of asset price influences within a market(23-28],but application of this method is not limited to financial markets,and we extend the method to time series in other economic systems[29-32]. The minimal spanning tree (MST)is a graph of a set of elements in the node arrangement in a given metric space, e.g.,an ultrametric space [23].In the MST the taxonomy displays meaningful clusters,and it reduces the noise in a historical correlation matrix 33. A hierarchical tree is an important tool for data clustering.It partitions a dataset into subsets(clusters)such that the data in each subset share some common traits-often similarity or proximity at some defined distance.In our case,the construction of an ultrametric hierarchical tree structure allows us to determine the hierarchical structure of a network[34]. Both MST and HT require that a metric distance be defined.Because the definition of correlation does not fulfill the three axioms that define a metric,Mantegna[23]introduced a definition of distance, (YY分》-(Y)Y》 P时= (1) V(肾-(》(《Y-(Y》 where (denotes the mean.For each time series vector,we calculate the monthly return,defined as the change of logarithmic price of time series Yi(t)=log(P)-log(P-1)and Pr is the value of a time series at time t.Here we use the absolute value of the Pearson correlation coefficient to define the distance between two time series as[9] d=√2(1-lpl). (2) The distance dij fulfills the three axioms of a metric:(i)dij=0 if and only if i=j.(ii)dij=di.and (iii)dijs dik+dkj [9]. We then use the distance matrix di to determine the minimal spanning tree (MST).An MST is defined as the set of n-1 links that connects a set of elements across the smallest possible total distance.The determination of the hierarchical tree of a subdominant ultrametric is thus completely controlled by the ultrametric distance matrix.44 X. Zhang et al. / Physica A 415 (2014) 43–53 is tightly linked to the world economy and to the international trade business cycle; thus it enjoyed a long prosperous period with growing trade at the international level until the financial crisis in 2008. Since then the shipping industry has faced idle capacity, huge losses, and risk of bankruptcy [14]. The shipping industry is also dynamic and volatile. The Baltic capsize index (BCI), which measures the volatility in shipping markets, is significantly higher (≈79%) than the average volatility in commodity markets (≈50%) and equity markets (e.g., S&P500 ≈ 20%) [15]. This extremely high risk is not only due to volatility in global economic cycles, but also is highly influenced by intrinsic characteristics of the shipping industry itself. The shipping industry comprises several separate but closely connected markets including the new ship, the secondhand ship, and the freight markets. Each of these markets comprises several tightly integrated sub-sectors according to ship type: oil tanker, dry bulk carrier, and container carrier. Oil tanker is designed for the bulk transport of oil and tankers are generally categorized by size from smallest to largest, e.g., Panamax, Aframax, Suezmax, VLCC and UVLCC. Dry bulk carrier is mainly used to transport dry bulk cargo, such as iron ore, grain and coal. Similar to oil tanker dry bulk ship also can be classified by size into Handysize, Handymax, Panamax, Super-Panamax and VLOC. Dry bulk shipping provides an economical and convenient way to transport three major raw materials to support the world industry. Container shipping provide transportation of containerized goods over sea via regular linear services. According to ship size, container vessel from smallest size to largest one also includes Handymax, Panamax, Post-Panamax and Large Container Vessel. Despite the economic importance of the shipping industry, there are surprisingly few studies about shipping industry risk. Studies of systemic risk in the shipping industry tend to fall into three categories. The first category uses a linear or non-linear stochastic model and focuses on freight rate returns and the volatility of some specific submarkets in the shipping industry [16–18]. The second category focuses on asset bubbles caused by the supercycle of the shipping industry and determines how much asset values in the second-hand market deviate from underlying fundamentals [19,20]. The third category identifies factors affecting the performance of shipping industry stocks in order to understand the linkage between the real shipping market and financial markets [21,22]. Most previous studies focus on individual segments of the shipping industry and not the industry as a whole. Thus these studies ignore the interactions among different market sectors that are likely to compound systemic risk. In this paper we use the correlation-based network and the causality measures to examine the structure and dynamics of the shipping industry. We begin our analysis by using the minimal spanning tree (MST) and the hierarchical tree (HT) to examine the topology of correlation networks among different submarkets and ship types of the shipping industry during the pre-crisis, crisis, and post-crisis periods. Then we use a causality analysis based on Granger-causality and Brownian distance correlation to explore the directional connections between the physical market and the financial market of the shipping industry before, during, and after the financial crisis. 2. Methods 2.1. Network topology Using the minimal spanning tree (MST) and hierarchical tree (HT), we study the structure and dynamics of the shipping industry and explore the hierarchical structure of various time series. Hierarchical structure methods have been introduced in finance to ascertain the structure of asset price influences within a market [23–28], but application of this method is not limited to financial markets, and we extend the method to time series in other economic systems [29–32]. The minimal spanning tree (MST) is a graph of a set of elements in the node arrangement in a given metric space, e.g., an ultrametric space [23]. In the MST the taxonomy displays meaningful clusters, and it reduces the noise in a historical correlation matrix [33]. A hierarchical tree is an important tool for data clustering. It partitions a dataset into subsets (clusters) such that the data in each subset share some common traits—often similarity or proximity at some defined distance. In our case, the construction of an ultrametric hierarchical tree structure allows us to determine the hierarchical structure of a network [34]. Both MST and HT require that a metric distance be defined. Because the definition of correlation does not fulfill the three axioms that define a metric, Mantegna [23] introduced a definition of distance, ρij = ⟨YiYj⟩ − ⟨Yi⟩⟨Yj⟩ (⟨Y 2 i − ⟨Y 2 i ⟩⟩)(⟨Y 2 j − ⟨Y 2 j ⟩⟩) , (1) where ⟨· · ·⟩ denotes the mean. For each time series vector, we calculate the monthly return, defined as the change of logarithmic price of time series Yi(t) = log(Pt) − log(Pt−1) and Pt is the value of a time series at time t. Here we use the absolute value of the Pearson correlation coefficient to define the distance between two time series as [9] dij = 2(1 − |ρij|). (2) The distance dij fulfills the three axioms of a metric: (i) dij = 0 if and only if i = j, (ii) dij = dji, and (iii) dij ≤ dik + dkj [9]. We then use the distance matrix dij to determine the minimal spanning tree (MST). An MST is defined as the set of n − 1 links that connects a set of elements across the smallest possible total distance. The determination of the hierarchical tree of a subdominant ultrametric is thus completely controlled by the ultrametric distance matrix