Research Statement Summary

With the advent of high-throughput technologies in biomedical research, vast amounts of high-dimensional biological data have been generated to characterize biological systems and diseases. In particular, next-generation sequencing (NGS) technology has been employed to measure genetic, epigenetic and structural changes in DNA and RNA. Petabytes of data, such as DNA methylation, copy number alteration, mRNA expression and microRNA expression, are publicly available. The availability of these complex, high-throughput datasets requires new computational methods to analyze and integrate them in order to answer high-impact biological questions.

The research goal of my group is to develop open-source computational tools that integrate high-throughput biological datasets i) to reverse-engineer disease-specific gene regulatory networks and ii) to build predictive models for biological processes and clinical outcomes.