--- title: "Partial Centrality" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{06 partial centrality} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- This vignette describes some methods to analyse partial rankings as obtained from [neighborhood-inclusion](neighborhood_inclusion.html) or, more general, [positional dominance](positional_dominance.html). More on the topic of partial rankings can be found in the following literature. > Schoch, David. (2018). Centrality without Indices: Partial rankings and rank Probabilities in networks. *Social Networks*, **54**, 50-60.([link](https://doi.org/10.1016/j.socnet.2017.12.003)) > Patil, G.P. & Taillie, C. (2004). Multiple Indicators, partially ordered sets, and linear extensions: Multi-criterion ranking and prioritization. *Environmental and Ecological Statistics*, **11**, 199-228 ([link](https://link.springer.com/article/10.1023/B:EEST.0000027209.93218.d9)) ```{r global_options, include=FALSE} knitr::opts_chunk$set(fig.width=5,fig.align = 'center') ``` ________________________________________________________________________________ ## Rank intervals ```{r setup-blind,include=FALSE} library(Matrix) ``` ```{r setup, warning=FALSE,message=FALSE} library(netrankr) library(igraph) library(magrittr) ``` The function `rank_intervals()` is used to calculate the maximal and minimal possible rank for each node in any ranking that is in accordance with a given partial ranking. ```{r pos_dom} data("dbces11") g <- dbces11 #neighborhood inclusion P <- g %>% neighborhood_inclusion(sparse = FALSE) #without %>% operator: # P <- neighborhood_inclusion(g, sparse = FALSE) rank_intervals(P) ``` The package uses the convention, that higher numerical ranks correspond to top ranked position. The lowest possible rank is thus 1. The column `mid_point` should not be confused with the *expected rank* of nodes, which is calculated with the function `exact_rank_prob()`. See [this](probabilistic_cent.html) vignette for more details. \ Rank intervals are useful to assess the ambiguity of ranking nodes. The bigger the intervals are, the more freedom exists, e.g. for centrality indices, to rank nodes differently. \ The intervals can be visualized with its own `plot()` function. The function can take a data frame of centrality scores as an additional parameter `cent_scores`. The ranks of each node for each index are then plotted within each interval. Again, the higher the numerical rank the higher ranked the node is according to the index. ```{r vis_intervals_cent} cent_scores <- data.frame( degree=degree(g), betweenness=round(betweenness(g),4), closeness=round(closeness(g),4), eigenvector=round(eigen_centrality(g)$vector,4)) rk_int <- rank_intervals(P) plot(rk_int,cent_scores = cent_scores) ``` A small jitter effect is added to the points to reduce over-plotting. \ Note that you may encounter situations, where ranks of centralities may fall outside of interval. This can happen in cases of ties in rankings, especially for betweenness centrality. Betweenness is, so far, the only index that does not *strictly* preserve neighborhood-inclusion. That is, while $$ N(u)\subseteq N[v] \text{ and } N(v)\not\subseteq N[u] \implies c(u)% neighborhood_inclusion(sparse = FALSE) #without %>% operator: # P <- neighborhood_inclusion(tg,sparse = FALSE) plot(rank_intervals(P)) ``` The described betweenness inconsistancy is most evident for threshold graphs as shown in the rank intervals below. ```{r tg_ri_cent,out} cent_scores <- data.frame( degree=degree(tg), betweenness=round(betweenness(tg),4), closeness=round(closeness(tg),4), eigenvector=round(eigen_centrality(tg)$vector,4)) plot(rank_intervals(P),cent_scores = cent_scores) ```