Jeff A. Bilmes's Publications

Sorted by DateClassified by Publication TypeClassified by Research CategorySorted by First Author Last NameClassified by Author Last Name

PREDICTD: PaRallel Epigenomics Data Imputation With Cloud-based Tensor Decomposition

Timothy J Durham, Maxwell W Libbrecht, J Jeffry Howbert, Jeffrey Bilmes, and William S Noble. PREDICTD: PaRallel Epigenomics Data Imputation With Cloud-based Tensor Decomposition. bioRxiv, Cold Spring Harbor Laboratory, 2017.

Download

[PDF] [gzipped postscript] [postscript] [HTML] 

Abstract

The Encyclopedia of DNA Elements (ENCODE) and the Roadmap Epigenomics Project have produced thousands of data sets mapping the epigenome in hundreds of cell types. However, the number of cell types remains too great to comprehensively map given current time and financial constraints. We present a method, PaRallel Epigenomics Data Imputation with Cloud-based Tensor Decomposition (PREDICTD), to address this issue by computationally imputing missing experiments in collections of epigenomics experiments. PREDICTD leverages an intuitive and natural model called \textquotedbllefttensor decomposition\textquotedblright to impute many experiments simultaneously. Compared with the current state-of-the-art method, ChromImpute, PREDICTD produces lower overall mean squared error, and combining methods yields further improvement. We show that PREDICTD data can be used to investigate enhancer biology at non-coding human accelerated regions. PREDICTD provides reference imputed data sets and open-source software for investigating new cell types, and demonstrates the utility of tensor decomposition and cloud computing, two technologies increasingly applicable in bioinformatics.

BibTeX

@article {Durham123927,
	author = {Durham, Timothy J and Libbrecht, Maxwell W and Howbert, J Jeffry and Bilmes, Jeffrey and Noble, William S},
	title = {PREDICTD: PaRallel Epigenomics Data Imputation With Cloud-based Tensor Decomposition},
	year = {2017},
	doi = {10.1101/123927},
	publisher = {Cold Spring Harbor Laboratory},
	abstract = {The Encyclopedia of DNA Elements (ENCODE) and the Roadmap Epigenomics Project have produced thousands of data sets mapping the epigenome in hundreds of cell types. However, the number of cell types remains too great to comprehensively map given current time and financial constraints. We present a method, PaRallel Epigenomics Data Imputation with Cloud-based Tensor Decomposition (PREDICTD), to address this issue by computationally imputing missing experiments in collections of epigenomics experiments. PREDICTD leverages an intuitive and natural model called {\textquotedblleft}tensor decomposition{\textquotedblright} to impute many experiments simultaneously. Compared with the current state-of-the-art method, ChromImpute, PREDICTD produces lower overall mean squared error, and combining methods yields further improvement. We show that PREDICTD data can be used to investigate enhancer biology at non-coding human accelerated regions. PREDICTD provides reference imputed data sets and open-source software for investigating new cell types, and demonstrates the utility of tensor decomposition and cloud computing, two technologies increasingly applicable in bioinformatics.},
	URL = {https://www.biorxiv.org/content/early/2017/04/04/123927},
	eprint = {https://www.biorxiv.org/content/early/2017/04/04/123927.full.pdf},
	journal = {bioRxiv},
}

Share


Generated by bib2html.pl (written by Patrick Riley ) on Wed Dec 13, 2023 01:42:17