Network Medicine Framework for Identifying Drug Repurposing Opportunities for COVID-19
4.3 Network Proximity
$$d(v,t)$$ is the shortest distance. The expected distance between two randomly selected groups of proteins is follwing:
$$dc(V,T) = \frac{1}{|T|} \sum{t \in T} \min_{v \in V} d(v,t)$$
Then we do a normalization, from absolute distance $$dc$$ to the relavtive distance $$Z{d_c}$$
$$Z{d_c} = \frac{d_c - \mu{dc}(V,T)}{\sigma{d_c}(V,T)}$$
4.4 Diffusion State Distance
The diffusion state distance (DSD) uses graph diffusion peoperty to get node pair similarity. $$H_e(A,B)$$ means how many random walks starting from $$A$$ visits $$B$$, and each node is represented by
$$H_e(V_i) = [H_e(V_i, V_1), H_e(V_i, V_2), ..., H_e(V_i, V_n)]$$
Then the similarity between two nodes is L1-norm.
$$DSD(A,B) = | H_e(A) - H_e(B) |$$
Based on this, we try different metrics
- Pipeline D1: $$I{DSD} = \frac{1}{|V|} \sum{t\in T} \min_{v\in V} DSD(t,v)$$
- Pipeline D2: $$I{KL} = \sum{t \in T} \min_{v \in V} KL(t,v)$$
- Pipeline D3: $$I{KL} = \sum{t \in T} \text{median}_{v \in V} KL(t,v)$$
- Pipeline D4: $$I{JS} = \sum{t \in T} \min_{v\in V} JS(t,v)$$
- Pipeline D5: $$I{JS} = \sum{t \in T} \text{median}_{v\in V} JS(t,v)$$
4.5 Graph Convolutional Networks
Use GNN based on the Protein-Protein, Drug-Protein, Disease-Protein, Drug-Disease.
COVID-19 treatment recommendation task Take this as a link prediction problem. 2 components:
- Encoder: From graph G to node embedding
- Decoder: optimizing embeddings for prediction
Frou decoders from Section 2.2.3
- Pipeline A1: cosine similartiy between COVID-19 and all the drugs in the decoded embedding space. The decoding is based on N=10 nearest neighbor nodes with minimum distance of D=0.25.
- Pipeline A2: D=0.8 and N=10.
- Pipeline A3: N=5, D=0.5.
- Pipeline A4: N=10, D=1.