Abstract
The gene expression profile of a tissue averages the expression profiles of all cells in this tissue. Digital tissue deconvolution addresses the following inverse problem: given the expression profile y of a tissue, what is the cellular composition c of that tissue? If X is a matrix whose columns are reference profiles of individual cell types, the composition c can be computed by minimizing ...
Abstract
The gene expression profile of a tissue averages the expression profiles of all cells in this tissue. Digital tissue deconvolution addresses the following inverse problem: given the expression profile y of a tissue, what is the cellular composition c of that tissue? If X is a matrix whose columns are reference profiles of individual cell types, the composition c can be computed by minimizing Script capital L(y-Xc) for a given loss function Script capital L. Current methods use predefined all-purpose loss functions. They successfully quantify the dominating cells of a tissue, while often falling short in detecting small cell populations. In this study we use training data to learn the loss function Script capital L along with the composition c. This allows us to adapt to application-specific requirements such as focusing on small cell populations or distinguishing phenotypically similar cell populations. Our method quantifies large cell fractions as accurately as existing methods and significantly improves the detection of small cell populations and the distinction of similar cell types.