Learning interpretable disease self-representations for drug repositioning

Learning interpretable disease self-representations for drug repositioning

Fabrizio Frasca, Diego Galeano, Guadalupe Gonzalez, Ivan Laponogov, Kirill Veselkov, Alberto Paccanaro, Micheal M. Bronstein

Geometric Self-Expressive Models (GSEM)

Regularisation Framework

self-expressive models(SEM)的目标是将datapoints(也就是disease)表示为其他datapoints的线性集合。提出的算法能同时进行clustering和在low-dimensional subspace上进行high-dimensional data lying,这个模型可以有效的对low-rank matrix completion进行泛化。这里我们将drug repositioning当作一个high-rank matrix completion task,然后将SEM进行沿伸,将datapoints之间的relational inductive prior也计入在内。因为考虑到了datapoints之间的地理结构,所以我们将这个模型讲座Geometric SEM(GSEM)。

有n drugs和m diseases,构成的矩阵,$$X \in \mathbb{R}^{n \times m}$$,每一个列是一个datapoint。GSEM是要学习到一个sparse zero-diagonal self-representation matrix $$C \in \mathbb{R}^{m \times m}$$,构成了disease的参数,使得 $$\hat X \approx X C$$,其中null diagonal是为了避免trivial solution。

最终的cost function为

$$ \min \frac{1}{2} | X - XC |F^2 + \frac{\beta}{2} | C |_F^2 + \lambda | C |_1 + \frac{\alpha}{2} | C |{D, G} + \gamma Tr(C)

$$

$$|C|_{D,G}$$ 是 Dirichlet norm on the graph G。第一项就是self-representation term,目的是为了让CX看起来比较接近原始matrix X。

我们将geometric structure也带入到self-representation matrix C 中,这个geometric structure是来自于disease-disease similarity graph。理论上,在图中比较靠近datapoint有相近的coefficient(在矩阵C中)。这一点可以通过以下得到:

$$ \sum{i,j} G{ij} | ci - c_j |^2 = Tr(CLC^T) = | C |{D,G}^2

$$

where $$L = D - W$$ 是 graph laplacian。

The multiplicative learning algorithm

对应于上述的cost function,我们提出了一个搞笑的multiplicative learning algorithm,并有很好的convergence guarantee。我们的算法是迭代地用下面一个公式:

$$ c{ij} = c{ij} \frac{(X^T X + \alpha C W){ij} }{(X^T X C + \alpha CD + \beta C + \lambda + \gamma I){ij}}

$$

Experimental Results

使用PREDICT数据集,有593个drug和313个disease,以及其相互之间的1933个关系。

Drug是从DrugBank中挖取的,disease是从Online Mendelian Inheritance in Man (OMIM) database中挖取的。

为了构造disease-disease graph $$G$$,我们使用phenotypic disease similarity constructed by [Van Driel et al., 2006]。

results matching ""

    No results matching ""