# Chapter 13 The chordDiagram() function

One unique feature of circular layout is the circular visualization of relations between objects by links. See examples in http://circos.ca/intro/tabular_visualization/. The name of such plot is called Chord diagram. In circlize, it is easy to plot Chord diagram in a straightforward or in a highly customized way.

There are two data formats that represent relations, either adjacency matrix or adjacency list. In adjacency matrix, value in $$i^{th}$$ row and $$j^{th}$$ column represents the relation from object in the $$i^{th}$$ row and the object in the $$j^{th}$$ column where the absolute value measures the strength of the relation. In adjacency list, relations are represented as a three-column data frame in which relations come from the first column and to the second column, and the third column represents the strength of the relation.

Following code shows an example of an adjacency matrix.

mat = matrix(1:9, 3)
rownames(mat) = letters[1:3]
colnames(mat) = LETTERS[1:3]
mat
##   A B C
## a 1 4 7
## b 2 5 8
## c 3 6 9

And the code in below is an example of a adjacency list.

df = data.frame(from = letters[1:3], to = LETTERS[1:3], value = 1:3)
df
##   from to value
## 1    a  A     1
## 2    b  B     2
## 3    c  C     3

Actually, it is not difficult to convert between these two formats. There are also R packages and functions do the conversion such as in reshape2 package, melt() converts a matrix to a data frame and dcast() converts the data frame back to the matrix.

Chord diagram shows the information of the relation from several levels. 1. the links are straightforward to show the relations between objects; 2. width of links are proportional to the strength of the relation which is more illustrative than other graphic mappings; 3. colors of links can be another graphic mapping for relations; 4. width of sectors represents total strength for an object which connects to other objects or is connected from other objects. You can find an interesting example of using Chord diagram to visualize leagues system of players clubs by their national team from https://gjabel.wordpress.com/2014/06/05/world-cup-players-representation-by-league-system/ and the adapted code is at http://jokergoo.github.io/circlize/example/wc2014.html.

Since the usage for the two types of inputs are highly similar, in this chapter, we mainly show figures generated from matrix, but still keep the code which uses adjacency list runable.

## 13.1 Basic usage

First let’s generate a random matrix and the corresponding adjacency list.

set.seed(999)
mat = matrix(sample(18, 18), 3, 6)
rownames(mat) = paste0("S", 1:3)
colnames(mat) = paste0("E", 1:6)
mat
##    E1 E2 E3 E4 E5 E6
## S1  8 13 18  6 11 14
## S2 10 12  1  3  5  7
## S3  2 16  4 17  9 15
df = data.frame(from = rep(rownames(mat), times = ncol(mat)),
to = rep(colnames(mat), each = nrow(mat)),
value = as.vector(mat),
stringsAsFactors = FALSE)
df
##    from to value
## 1    S1 E1     8
## 2    S2 E1    10
## 3    S3 E1     2
## 4    S1 E2    13
## 5    S2 E2    12
## 6    S3 E2    16
## 7    S1 E3    18
## 8    S2 E3     1
## 9    S3 E3     4
## 10   S1 E4     6
## 11   S2 E4     3
## 12   S3 E4    17
## 13   S1 E5    11
## 14   S2 E5     5
## 15   S3 E5     9
## 16   S1 E6    14
## 17   S2 E6     7
## 18   S3 E6    15

The most simple usage is just calling chordDiagram() with mat (Figure 13.1).

chordDiagram(mat)
circos.clear()

or call with df:

chordDiagram(df)
circos.clear()

The default Chord Diagram consists of a track of labels, a track of grids (or you call it lines) with axes, and links. Sectors which correspond to rows in the matrix locate at the bottom half of the circle. The order of sectors is the order of union(rownames(mat), colnames(mat)) or union(df[[1]], df[[2]]) if input is a data frame. The order of sectors can be controlled by order argument (Figure 13.2 right). Of course, the length of order vector should be same as the number of sectors.

chordDiagram(mat, order = c("S1", "E1", "E2", "S2", "E3", "E4", "S3", "E5", "E6"))
circos.clear()

Under default settings, the grid colors which represent sectors are randomly generated, and the link colors are same as grid colors which correspond to rows (or the first column if the input is an adjacency list) but with 50% transparency.

Since Chord Diagram is implemented by basic circlize functions, like normal circular plot, the layout can be customized by circos.par().

The gaps between sectors can be set by circos.par(gap.after = ...) (Figure 13.3). It is useful when you want to distinguish sectors between rows and columns. Please note since you change default graphical settings, you need to use circos.clear() in the end of the plot to reset it.

circos.par(gap.after = c(rep(5, nrow(mat)-1), 15, rep(5, ncol(mat)-1), 15))
chordDiagram(mat)
circos.clear()

If the input is a data frame:

circos.par(gap.after = c(rep(5, length(unique(df[[1]]))-1), 15,
rep(5, length(unique(df[[2]]))-1), 15))
chordDiagram(df)
circos.clear()

Similar to a normal circular plot, the first sector (which is the first row in the adjacency matrix or the first row in the adjacency list) starts from right center of the circle and sectors are arranged clock-wisely. The start degree for the first sector can be set by circos.par(start.degree = ...) and the direction can be set by circos.par(clock.wise = ...) (Figure 13.4).

circos.par(start.degree = 90, clock.wise = FALSE)
chordDiagram(mat)
circos.clear()

## 13.3 Colors

### 13.3.1 Set grid colors

Grids have different colors to represent different sectors. Generally, sectors are divided into two groups. One contains sectors defined in rows of the matrix or the first column in the data frame, and the other contains sectors defined in columns of the matrix or the second column in the data frame. Thus, links connect objects in the two groups. By default, link colors are same as the color for the corresponding sectors in the first group.

Changing colors of grids may change the colors of links as well. Colors for grids can be set through grid.col argument. Values of grid.col better be a named vector of which names correspond to sector names. If it is has no name index, the order of grid.col is assumed to have the same order as sectors (Figure 13.5).

grid.col = c(S1 = "red", S2 = "green", S3 = "blue",
E1 = "grey", E2 = "grey", E3 = "grey", E4 = "grey", E5 = "grey", E6 = "grey")
chordDiagram(mat, grid.col = grid.col)
chordDiagram(t(mat), grid.col = grid.col)

## 13.8 Symmetric matrix

When the matrix is symmetric, by setting symmetric = TRUE, only lower triangular matrix without the diagonal will be used (Figure 13.21).

mat3 = matrix(rnorm(25), 5)
colnames(mat3) = letters[1:5]
cor_mat = cor(mat3)
col_fun = colorRamp2(c(-1, 0, 1), c("green", "white", "red"))
chordDiagram(cor_mat, grid.col = 1:5, symmetric = TRUE, col = col_fun)
title("symmetric = TRUE")
chordDiagram(cor_mat, grid.col = 1:5, col = col_fun)
title("symmetric = FALSE")

## 13.9 Directional relations

In some cases, when the input is a matrix, rows and columns represent directions, or when the input is a data frame, the first column and second column represent directions. Argument directional is used to illustrate such direction on the plot. directional with value 1 means the direction is from rows to columns (or from the first column to the second column for the adjacency list) while -1 means the direction is from columns to rows (or from the second column to the first column for the adjacency list). A value of 2 means bi-directional.

By default, the two ends of links have unequal height (Figure 13.22) to represent the directions. The position of starting end of the link is shorter than the other end to give users the feeling that the link is moving out. If this is not what your correct feeling, you can set diffHeight to a negative value.

par(mfrow = c(1, 3))
chordDiagram(mat, grid.col = grid.col, directional = 1)
chordDiagram(mat, grid.col = grid.col, directional = 1, diffHeight = uh(5, "mm"))
chordDiagram(mat, grid.col = grid.col, directional = -1)

Row names and column names in mat can also overlap. In this case, showing direction of the link is important to distinguish them (Figure 13.23).

mat2 = matrix(sample(100, 35), nrow = 5)
rownames(mat2) = letters[1:5]
colnames(mat2) = letters[1:7]
mat2
##    a  b  c  d  e  f  g
## a 55 20 84 16 14 97 57
## b 82 44 78 45 54 63 31
## c 77  3 99 76 86  8 18
## d 70 40  6 43 39 67 60
## e 79 12 25 17 10 93 30
chordDiagram(mat2, grid.col = 1:7, directional = 1, row.col = 1:5)

If you do not need self-link for which two ends of a link are in a same sector, just set corresponding values to 0 in the matrix (Figure 13.24).

mat3 = mat2
for(cn in intersect(rownames(mat3), colnames(mat3))) {
mat3[cn, cn] = 0
}
mat3
##    a  b  c  d  e  f  g
## a  0 20 84 16 14 97 57
## b 82  0 78 45 54 63 31
## c 77  3  0 76 86  8 18
## d 70 40  6  0 39 67 60
## e 79 12 25 17  0 93 30
chordDiagram(mat3, grid.col = 1:7, directional = 1, row.col = 1:5)

Links can have arrows to represent directions (Figure 13.25). When direction.type is set to arrows, Arrows are added at the center of links. Similar as other graphics parameters for links, the parameters for drawing arrows such as arrow color and line type can either be a scalar, a matrix, or a three-column data frame.

If link.arr.col is set as a data frame, only links specified in the data frame will have arrows. Pleast note this is the only way to draw arrows to subset of links.

arr.col = data.frame(c("S1", "S2", "S3"), c("E5", "E6", "E4"),
c("black", "black", "black"))
chordDiagram(mat, grid.col = grid.col, directional = 1, direction.type = "arrows",

If combining both arrows and diffHeight, it will give you better visualization (Figure 13.26).

arr.col = data.frame(c("S1", "S2", "S3"), c("E5", "E6", "E4"),
c("black", "black", "black"))
chordDiagram(mat, grid.col = grid.col, directional = 1,
direction.type = c("diffHeight", "arrows"),

There is another arrow type: big.arrow which is efficient to visualize arrows when there are too many links (Figure 13.27).

matx = matrix(rnorm(64), 8)
chordDiagram(matx, directional = 1, direction.type = c("diffHeight", "arrows"),

If diffHeight is set to a negative value, the start ends are longer than the other ends (Figure 13.28).

chordDiagram(matx, directional = 1, direction.type = c("diffHeight", "arrows"),
link.arr.type = "big.arrow", diffHeight = -uh(2, "mm"))

It is almost the same to visualize directional Chord diagram form a adjacency list.

chordDiagram(df, directional = 1)

## 13.10 Reduce

If a sector in Chord Diagram is too small, it will be removed from the original matrix, since basically it can be ignored visually from the plot. In the following matrix, the second row and third column contain tiny values.

mat = matrix(rnorm(36), 6, 6)
rownames(mat) = paste0("R", 1:6)
colnames(mat) = paste0("C", 1:6)
mat[2, ] = 1e-10
mat[, 3] = 1e-10

In the Chord Diagram, categories corresponding to the second row and the third column will be removed.

chordDiagram(mat)
circos.info()
##  [1] "R1" "R3" "R4" "R5" "R6" "C1" "C2" "C4" "C5" "C6"
##
## [1] 1 2
##
## Your current sector.index is C6
## Your current track.index is 2

The reduce argument controls the size of sectors to be removed. The value is a percent of the size of a sector to the total size of all sectors.

You can also explictly remove sectors by assigning corresponding values to 0.

mat[2, ] = 0
mat[, 3] = 0

All parameters for sectors such as colors or gaps between sectors are also reduced accordingly by the function.