|
Current PICASso Fellows |
||
| Eric Banks | Computer Science | I am currently developing computational methods for predicting protein-protein interactions. Recently, several genomic-scale methods have been developed for predicting protein interactions; however, most of these do not directly incorporate structural information. I am using structural information about protein interaction domains (e.g. SH3, PDZ, WW, coiled coil, etc.) and mapping them onto proteins that are functionally linked using other computational methods (e.g. phylogenetic profiles, gene fusion, shared regulatory elements, etc.), with the goal being high-confidence predictions about protein interactions. |
| Chris Bristow | Chemical Engineering | We are interested in how cell-signaling pathways pattern cell fates in developing tissues. The eggshell development of the fruit fly Drosophila melanogaster is an excellent model system to study how cell fates are determined. During egg development, the oocyte is surrounded by a layer of ~1000 uniform epithelial cells that differentiate into over 10 types of cells fates. This process relies on the non-uniform activation of two signaling pathways (EGFR and BMP). Many of the signaling components have been identified and a picture of the spatial distribution of signaling activity is emerging. Yet, a complete picture of what targets are downstream of these pathways and how they influence the development of the tissue remains. We use a range of genetic, genomic, and bioinformatic techniques aimed to provide a systems level understanding of how a tissue is patterned by multiple signaling pathways. For this we identify all of the targets of the EGFR and BMP pathways in the tissue with qRT-PCR and Affymetrix Gene Chips, characterize the spatial images of the gene expression patterns and archive them in a public database, and computationally identify and experimentally verify regulatory regions that mediate spatial-temporal patterns of gene expression. In addition, we explore the role of these newly identified targets in eggshell morphogenesis. |
Michael Burns |
Computer Science |
Three dimensional volumetric data sets are generated by complex fluid flow simulations and medical imaging (CT/MRI) scans. Hardware advances have allowed the creation of larger, higher resolution datasets than can be visualized interactively with traditional volumetric visualization methods. My research focuses on creating informative and interactive visualization algorithms for volumetric data. Thus far I've been working on generating line drawings from volumetric data, leveraging their sparse nature to quickly and efficiently generate images from large data sets. |
| Melissa Carroll | Computer Science |
Modern neuroimaging tools, such as Functional Magnetic Resonance Imaging (fMRI), have made unraveling the nature of computation in the mind a foreseeable reality; however making sense of the vast amounts of data remains a daunting task. There has been growing interest in using fMRI for "mind reading," particularly in applying machine learning methods to classifying fMRI brain images based on the subject's instantaneous cognitive state. Machine learning approaches used to date for fMRI classification have treated individual voxels as features and have ignored the spatial correlation between voxels. Presently, I am developing methods that generate features that capture this spatial information and investigating the interaction of feature generation and classification methods. I am also exploring various approaches to interpreting classifiers and exploiting similarity structure in cognitive states to improve classification. |
| Christopher DeCoro | Computer Science |
Emerging computer architectures have abandoned the traditional model of a single sequential stream of computation, in favor of one in which many independent computational units operate simultaneously, achieving dramatic increases in speed through parallelism. This is necessary in order to continue the rate of increase in computational performance that we have seen over the last few decades, as traditional architectures are no longer able to take advantage of the increasing capabilities of hardware production. However, this new poses additional challenges for computer scientists, as many algorithms that are efficient on one architecture are quite inefficient on the other. My research focuses on developing novel algorithms that utilize the potential of these new hardware architectures as applied to problems in scientific visualization and computer graphics. One major application is the extraction of surfaces from volume datasets, such as the surface of a liquid in a fluid simulation, or of organs in a MRI scan. Another application is the parallel processing of massive meshes generated from 3d scanning, such as in manufacturing, or for recording of cultural heritage artifacts, including paintings and sculptures. We have seen performance increases in these applications of orders of magnitude through the use of new parallel computation models, and we look to apply these to additional problems. |
| Matt Hibbs | Computer Science |
The quantity and quality of large scale genomic datasets is undergoing a period of rapid expansion. In particular, the lower cost and higher reliability of gene expression microarrays allow laboratories from all over the world to quickly gather whole-genome data for their area of interest. My research focuses on methods and systems to leverage the large amount of publically available data in a way that is useful for real-world biologists. This includes intelligent algorithms for data processing and manipulation as well as intuitive, useful visualization components to make these algorithms accessible to the broader biological community. |
| David Karig | Electrical Engineering | One of the important challenges within the emerging field of synthetic biology is to design artificial networks that coordinate the behavior of vast numbers of cells. A requirement is to engineer communication systems with multiple non-interacting signals. Using these communication systems, complex genetic networks for pattern formation and synchronized dynamic behavior can be designed. Developing such networks involves close coupling of scientific computing and experimental validation. The construction and experimental study of cell-cell communication systems typically requires significant amounts of time and laboratory resources. Thus, simulations dramatically speed the pace of research by allowing for fast exploration of design parameters, investigation of the effects of noise, and testing of system robustness. Furthermore, computational models provide a means for extrapolating knowledge learned from our synthetic systems and applying our findings to other complex biological systems, both natural and synthetic. |
| Scott McAllister | Chemical Engineering | One of the ultimate goals in computational biology is the identification of the three-dimensional structure of a protein given only an amino acid sequence. My thesis project involves extending current ab initio protein structure prediction approaches to larger proteins through improved parallel programming algorithms, more efficient potential and electrostatic energy calculations, and the development of additional restraints. I will also explore the extension of the ASTRO-FOLD prediction protocol to membrane proteins. In addition, I am developing new optimization-based mathematical models for packing of helices, tertiary contact prediction, membrane proteins, and protein domain recognition. |
| Arron Melvin | MAE | The current generation of high-performance parallel computers has allowed for the significant advancement of aerodynamic design methods. By embedding the flow physics within an optimization process, detailed design work can be performed automatically, resulting in decreased design time and cost, and improved performance. Adjoint-based shape optimization methods have proven to be a computationally efficient implementation of this approach. The majority of the studies on adjoint methods have used structured grids to discretize the computational domain. We are developing an unstructured mesh implementation, which has major advantages when dealing with complex configurations. These types of simulations are necessary in order to obtain the best performing design, as components must be designed as a complete aerodynamic system, rather than individually as is the traditional method. |
| Chad Myers | Computer Science | Recent advances in biological technology have produced large datasets often containing rich but noisy information. Sophisticated computational techniques are necessary in both analysis and, perhaps more importantly, defining direction for further experimentation. My research interests are in developing signal processing and machine learning techniques for extracting and understanding natural structure in such datasets. Specifically, I am developing methods for using transcriptional trends to infer underlying chromosomal structure, particularly in cases where abnormalities are present. Gross chromosomal amplifications and deletions have been associated with several types of cancer, but the mechanisms and conditions under which these occur are not well understood. General methods that can be applied to the growing repositories of gene expression data will undoubtedly be an important step in developing further insight. I am also interested in the general problem of high-throughput heterogeneous data integration for the purposes of gene function prediction and pathway analysis. Computation will certainly play a key role in extending our understanding of biology, but a major obstacle is determining how to reconcile different sources of data with varying degrees of relevance and reliability. |
| Elena Nabieva | Computer Science | I study protein-protein interaction networks and the ways in which they shed light on biological function of proteins. In addition to using physical interactions, I work on constructing integrated functional interaction networks from data of diverse types. |
| Jason Schlessman | Electrical Engineering | My dissertation work focuses on the development and exploration of novel approaches to embedded systems for computer vision and media processing. In addition, I consider optimized implementations of computer vision applications mapped to parallel computing platforms. My consideration of embedded systems is done with a widely cast net. On one end of the spectrum, I consider conventional embedded systems, such as those found in portable electronic devices. As consumer-drive performance demands increase, and process technology scaling ceases or is unable to provide means for meeting these demands, these conventional systems have evolved into Multiprocessor Systems on Chip (MpSoC), with expected parallel computing attributes and considerations. On the other end of the spectrum, I consider conventional high performance computing machines, such as supercomputing clusters. It is my opinion and experience that these machines, from a pragmatic perspective, tend to be tailored towards application-specific implementation, thus fitting the accepted definition of an embedded system. As research pushes past conventional architectures, heterogeneous designs are paramount. Accordingly, my work also considers points between the two aforementioned ends of the spectrum. I have looked at heterogeneous or hybrid embedded vision systems consisting of specialized components. These include FPGA accelerators, GPU-accelerated processing nodes within clusters, commercial off-the-shelf (COTS) processors such as the IBM Cell MpSoC and Texas Instrument's DaVinci DSP. Furthermore, I am in the process of developing a hybrid accelerator consisting of two of these specialized components. My overall dissertation goal is to provide generalized architectural guidelines for embedded vision system designers. |
| Erich Schmidt | Computer Science | All current search engines present some or all of the same major shortcomings: coverage, content refresh rate, efficient and accurate page ranking. The web grows continuously, making it very difficult for search engines to efficiently crawl, index and rank web pages. My current research addresses these problems, placing general keyword search in the context of publish-subscribe systems. I am developing a distributed persistent search system integrating multiple data sources; while this will provide better data coverage and update frequency, it will also enhance keyword-based search on the World Wide Web with persistent query storage and user notification capabilities. |
| Philip Shilane | Computer Science |
There has been a recent growth in 3D shape databases for graphics models, CAD parts, scanned military vehicles, and protein structures. A key research problem is how to perform similarity search within these databases, which is useful for classification, retrieval, and comparative analysis. My research focuses on using distinguishing features of each shape, which has the potential to improve search results while maintaining efficient retrieval time. I have focused on 3D graphics models for most of my research but have recently been applying similar techniques to protein binding site detection and protein matching. |
| Past PICASso Fellows: | ||
| Emily Belli | Princeton Plasma Physics Laboratory | My thesis research involves developing faster algorithms to aid in the development of a transport model based on nonlinear gyrokinetic simulations of plasma turbulence. Specifically, I am investigating 1) fast implicit solvers for the gyrokinetic equation using physics-based preconditioners, as well as order-reduction methods such as 2) trial function-based methods for fast solutions of the linear gyrokinetic equation, and 3) subgrid models of turbulence that can allow for faster nonlinear simulations by reducing resolution requirements. Transport models using these algorithms will aid in a further understanding of the mechanisms leading to improved confinement in tokamak plasmas, particularly focusing on the effects of non-circular flux surfaces, electromagnetic effects, and non-adiabatic electron dynamics. |
| Stacy Janak | Chemical Engineering | My research aims at developing a systematic framework to analyze the effects of uncertainty in the parameters used in the synthesis, design, and operations stages of process model formation. There are several specific objectives stemming from this main theme. The first is to investigate new algorithms for the synthesis, design, and operations of chemical processes under uncertainty that will incorporate the advances in robust optimization and deterministic global optimization. Robust optimization techniques have been applied to the problems of scheduling under uncertainty as well as uncertainty in the network topology of the signaling pathways in yeast. Another objective of my research is to apply robust optimization techniques to important large-scale industrial problems containing uncertainty. |
| Ben Phillips | Geosciences | I have developed a project to better understand continental motion on the Earth's surface. As such motions are integrally related to large-scale convective processes in the Earth's interior, this is a fully three-dimensional (3D) fluid dynamics problem. Solving this problem with sufficient resolutions and integration times is a highly computational endeavor. To this end, I use MPI to facilitate distributed processing on more than 10 million finite elements, amounting to a global grid spacing of tens of kilometers. Using 64 dual-processor nodes of the Geowulf, the Department of Geosciences' Beowulf computer cluster, I can compute time series equivalent to the billions of years necessary to understand problems in Earth history. |
| Edwin Sirko | Astrophysics | A standard cosmological model for the universe has emerged in the last couple years, largely due to the success of the Wilkinson Microwave Anisotropy Probe. Cosmological parameters have been pinned to precise values at the several percent level and will soon be known with even better confidence limits. However, numerical cosmological simulations have not kept up with this precision. While using a larger box size and greater mass resolution will converge to the continuum limit, the resolution needed is currently out of scope of today's computers. Using lower resolution simulations, I aim to quantify the effects of discreteness and resolution on the difference between numerical simulations and the continuum limit. |