As whole-genome expression profiling has become increasingly affordable,
publicly available repositories of microarray data are expanding rapidly.
Still, a large fraction of the enormous information content in these databases
remains unexplored. In this talk, I will discuss data integration techniques
that allow us to exploit these data to discover new components of metabolic
pathways. Specifically, we have developed a technique that systematically searches
data repositories for genes that are consistently and specifically co-expressed
with a pathway of interest. Using this method, we have identified five
proteins required for functional heme biosynthesis, as well as a novel regulator of
oxidative phosphorylation that acts on mitochondrial mRNA.
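
To make the idea of a "consistent and specific" co-expression search concrete, here is a minimal sketch. It is an illustrative simplification under stated assumptions, not the method presented in the talk: each data set is assumed to be a genes-by-samples NumPy array with the same gene order, specificity is approximated by ranking a gene against all other genes in the same data set, and consistency is the fraction of data sets in which the gene ranks near the top. The function names and the `top_fraction` parameter are hypothetical.

```python
import numpy as np

def coexpression_scores(expr, pathway_idx):
    """Mean Pearson correlation of every gene with the pathway genes.

    expr: genes x samples array for ONE data set (assumed layout).
    pathway_idx: unique row indices of the pathway genes (at least two).
    """
    pathway_idx = np.asarray(pathway_idx)
    # Center each gene and scale to unit length, so dot products are correlations.
    centered = expr - expr.mean(axis=1, keepdims=True)
    norms = np.linalg.norm(centered, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # constant genes get correlation 0 instead of NaN
    z = centered / norms

    corr = z @ z[pathway_idx].T  # genes x pathway genes
    total = corr.sum(axis=1)
    counts = np.full(expr.shape[0], len(pathway_idx), dtype=float)

    # A pathway gene should not be rewarded for correlating with itself.
    total[pathway_idx] -= np.sum(z[pathway_idx] ** 2, axis=1)
    counts[pathway_idx] -= 1.0
    return total / counts

def consistency(datasets, pathway_idx, top_fraction=0.05):
    """Fraction of data sets in which each gene ranks in the top
    `top_fraction` of all genes by pathway co-expression (a crude
    stand-in for specificity; an assumption of this sketch)."""
    hits = []
    for expr in datasets:
        scores = coexpression_scores(expr, pathway_idx)
        cutoff = np.quantile(scores, 1.0 - top_fraction)
        hits.append(scores >= cutoff)
    return np.mean(hits, axis=0)
```

Genes outside the known pathway that score close to 1.0 would be the candidates for follow-up. A real analysis would also need to handle differing gene sets across platforms, data set quality, and a proper statistical model for combining evidence, all of which this sketch leaves out.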