The availability of completely sequenced genomes and genome-scale datasets provides us with a wealth of information and the opportunity to investigate fundamental problems of a different magnitude and kind that is complementary to conventional approaches. While much of this information is publicly available, one of the major challenges we face is to be able to mine this enormous amount of data and integrate them in a meaningful way in order to (i) generate experimentally testable hypotheses and (ii) uncover general principles that govern living systems. ||Research in my group aims to understand how regulation is achieved in cellular systems at three levels of complexity (Molecular, Systems and Genome level) and how they influence genome evolution. This area of research is important because many human diseases are a result of dysfunction of regulatory processes and understanding design principles of regulation can be exploited in biotechnology and medicine. At the molecular level, we investigate regulatory proteins that are involved in protein-protein, protein-nucleic acid and protein-small molecule interactions. In particular, we aim to gain an understanding of the proteins involved in the ubiquitin pathway, chromatin modification pathway and GPCR receptor signaling. At the systems level, we aim to gain an understanding of (a) how aggregation prone proteins are tolerated and regulated in cellular systems, (b) how pathogens could possibly exploit host machinery and signaling through molecular mimicry by secreting unstructured proteins that expose host-like peptide motifs and (c) understand the dynamics of regulatory proteins such as protein kinases, metabolic enzymes, etc within the context of their position in the various molecular interaction networks. Finally at the genome level, we investigate (a) the interplay between nucleosome modification and genome organization, (b) mutations and their impact on genome and transcriptome evolution and (c) the genetic basis of natural variation in specific behavioral traits.||To pursue these lines of research, we develop new computational methods that make use of the publicly available large-scale datasets. These include sequence, structure, expression and molecular interaction data from several different model organisms ranging from bacteria to fungi to humans. We also use newly available data on natural variation, cancer genome sequence and gene expression and nucleosome modification data, and information from genetic screens linking genotype with phenotype. The problems that we work on in my group have the ultimate objective of being able to identify new regulatory principles and features of regulatory systems, to exploit our findings and the general principles that we describe in specific applications such as genetic engineering, and to better understand regulatory dysfunctions leading to human diseases.