Research Topics
Species | Mark B GersteinSummaryAffiliation: Yale University Country: USA Publications
| Collaborators
|
Detail Information
Publications
Classification of human genomic regions based on experimentally determined binding sites of more than 100 transcription-related factorsKevin Y Yip
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
Genome Biol 13:R48. 2012..While this large amount of data creates a valuable resource, it is nonetheless overwhelmingly complex and simultaneously incomplete since it covers only a small fraction of all human transcription factors...
On sports and genesGili Zilberman-Schapira
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
Recent Pat DNA Gene Seq 6:180-8. 2012..We discuss the considerable success, and significant drawbacks, of past research along these lines, and propose interesting directions for future research...
Architecture of the human regulatory network derived from ENCODE dataMark B Gerstein
Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
Nature 489:91-100. 2012..The regulatory information obtained in this study will be crucial for interpreting personal genome sequences and understanding basic principles of human biology and disease...
Predicting protein ligand binding motions with the conformation explorerSamuel C Flores
Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
BMC Bioinformatics 12:417. 2011..Often the 3D structure of the protein is available in some conformation, but binding the ligand of interest may involve a large scale conformational change which is difficult to predict with existing methods...
Of mice and men: phylogenetic footprinting aids the discovery of regulatory elementsZhaolei Zhang
Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06520-8114, USA
J Biol 2:11. 2003..A new study shows how much it improves the prediction of gene-regulatory elements in the human genome...
A statistical framework for modeling gene expression using chromatin features and application to modENCODE datasetsChao Cheng
Department of Molecular Biophysics and Biochemistry, Yale University, 260 Whitney Avenue, New Haven, CT 06520, USA
Genome Biol 12:R15. 2011..Moreover, our framework reveals the positional contribution around genes (upstream or downstream) of distinct chromatin features to the overall prediction of expression levels...
The CRIT framework for identifying cross patterns in systems biology and application to chemogenomicsTara A Gianoulis
Department of Genetics, 77 Ave of Louis Pasteur, Harvard Medical School, Boston, MA 02115, USA
Genome Biol 12:R32. 2011..Here we present an approach (CRIT) to find connections such as these and show how it can be applied in a variety of genomic contexts including chemogenomics data...
Uncovering trends in gene namingMichael R Seringhaus
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
Genome Biol 9:401. 2008..We categorize, for instance, those that involve a naming system transferred from another context (for example, Pavlov's dogs). We hope this analysis provides clues to better steer gene naming in the future...
Network security and data integrity in academia: an assessment and a proposal for large-scale archivingAndrew Smith
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
Genome Biol 6:119. 2005....
Comparing protein abundance and mRNA expression levels on a genomic scaleDov Greenbaum
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520 8114, USA
Genome Biol 4:117. 2003..We also merge many of the available yeast protein-abundance datasets, using the resulting larger 'meta-dataset' to find correlations between protein and mRNA expression, both globally and within smaller categories...
BoCaTFBS: a boosted cascade learner to refine the binding sites suggested by ChIP-chip experimentsLu-yong Wang
Integrated Data Systems Department, Siemens Corporate Research, 755 College Road East, Princeton, New Jersey 08540, USA
Genome Biol 7:R102. 2006..We applied BoCaTFBS within the ENCODE project and showed that it outperforms many traditional binding site identification methods (for instance, profiles)...
The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic propertiesNicholas M Luscombe
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520-8114, USA
Genome Biol 3:RESEARCH0040. 2002..We present this behavior in a unified framework and propose that all these observations are connected to an underlying DNA duplication process as genomes evolved to their current state...
Comprehensive analysis of pseudogenes in prokaryotes: widespread gene decay and failure of putative horizontally transferred genesYang Liu
Department of Molecular Biophysics and Biochemistry, Yale University, PO Box 208114, New Haven, CT 06520 8114, USA
Genome Biol 5:R64. 2004..Pseudogenes often manifest themselves as disabled copies of known genes. In prokaryotes, it was generally believed (with a few well-known exceptions) that they were rare...
Comparative analysis of processed ribosomal protein pseudogenes in four mammalian genomesSuganthi Balasubramanian
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
Genome Biol 10:R2. 2009..The availability of genome sequences of numerous organisms allows comparative study of pseudogenes in syntenic regions. Conservation of pseudogenes suggests that they might have a functional role in some instances...
Genomic analysis of membrane protein families: abundance and conserved motifsYang Liu
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520-8114, USA
Genome Biol 3:research0054. 2002..CONCLUSIONS: We carried out a genome-wide analysis on patterns of the classified polytopic membrane protein families and analyzed the distribution of conserved amino acids and motifs in the transmembrane helix regions in these families...
Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genomeJia Qian Wu
Molecular, Cellular and Developmental Biology Department, KBT918, Yale University, New Haven, Connecticut 06511, USA
Genome Biol 9:R3. 2008..However, there is still much uncertainty regarding precisely what portion of the genome is transcribed, the exact structures of these novel transcripts, and the levels of the transcripts produced...
Identification and analysis of unitary pseudogenes: historic and contemporary gene losses in humans and other primatesZhengdong D Zhang
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
Genome Biol 11:R26. 2010..They constitute only a small fraction of annotated pseudogenes in the human genome. However, as they represent distinct functional losses over time, they shed light on the unique features of humans in primate evolution...
Design principles of molecular networks revealed by global comparisons and composite motifsHaiyuan Yu
Department of Molecular Biophysics and Biochemistry, Whitney Avenue, Yale University, New Haven, CT 06520, USA
Genome Biol 7:R55. 2006..Conversely, the segregation of regulator hubs from other hubs diverges from social intuitions (as managers often are centers of interactions)...
PubNet: a flexible system for visualizing literature derived networksShawn M Douglas
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
Genome Biol 6:R80. 2005..This feature allows one to, for example, examine a literature derived network of genes based on functional similarity...
A computational approach for identifying pseudogenes in the ENCODE regionsDeyou Zheng
Department of Molecular Biophysics and Biochemistry, Yale University, Whitney Avenue, New Haven, CT 06520, USA
Genome Biol 7:S13.1-10. 2006....
mRNA expression profiles show differential regulatory effects of microRNAs between estrogen receptor-positive and estrogen receptor-negative breast cancerChao Cheng
Program in Computational Biology and Bioinformatics, Yale University, George Street, New Haven, CT 06511, USA
Genome Biol 10:R90. 2009..Given this, it is useful to define an overall metric of regulatory effect for a specific microRNA and see how this changes across different conditions...
PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing dataJan O Korbel
Gene Expression Unit, European Molecular Biology Laboratory EMBL, Meyerhofstr, Heidelberg, 69117, Germany
Genome Biol 10:R23. 2009..The simulations demonstrated high structural variant reconstruction efficiency for PEMer's coverage-adjusted multi-cutoff scoring-strategy and showed its relative insensitivity to base-calling errors...
Molecular sampling of prostate cancer: a dilemma for predicting disease progressionAndrea Sboner
Department of Pathology and Laboratory Medicine, Weill Cornell Medical Center, New York, NY, USA
BMC Med Genomics 3:8. 2010..Hence, we sought to develop a molecular panel for prostate cancer progression by reasoning that molecular profiles might further improve current clinical models...
Identification of genomic indels and structural variations using split readsZhengdong D Zhang
Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
BMC Genomics 12:375. 2011..Here we present split-read identification, calibrated (SRiC), a sequence-based method for SV detection...
Comparison and calibration of transcriptome data from RNA-Seq and tiling arraysAshish Agarwal
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
BMC Genomics 11:383. 2010..Understanding the relative merits of these technologies will help researchers select the appropriate technology for their needs...
Relationship between gene co-expression and probe localization on microarray slidesYuval Kluger
Department of Molecular Biophysics and Biochemistry, Yale University, PO Box 208114, New Haven, CT 06520, USA
BMC Genomics 4:49. 2003..This is a potentially useful tool for evaluating co-expression of genes and extraction of useful functional and chromosomal structural information about genes...
The relationship between the evolution of microRNA targets and the length of their UTRsChao Cheng
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
BMC Genomics 10:431. 2009..Their widespread and important role in animals is gauged by estimates that approximately 25% of all genes are miRNA targets...
Mismatch oligonucleotides in human and yeast: guidelines for probe design on tiling microarraysMichael Seringhaus
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
BMC Genomics 9:635. 2008..Here, we present the results of two large-scale microarray experiments on S. cerevisiae and H. sapiens genomic DNA, to explore MM oligonucleotide behavior with real sample mixtures under tiling-array conditions...
Efficient yeast ChIP-Seq using multiplex short-read DNA sequencingPhilippe Lefrançois
Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06520, USA
BMC Genomics 10:37. 2009....
Integration of curated databases to identify genotype-phenotype associationsChern Sing Goh
Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
BMC Genomics 7:257. 2006....
Assessment of whole genome amplification-induced bias through high-throughput, massively parallel whole genome sequencingRobert Pinard
454 Life Sciences, 20 Commercial Street, Branford CT 06405, USA
BMC Genomics 7:216. 2006..The amplification-induced bias of each method was assessed by sequencing both genomes in their entirety using the 454 Sequencing System technology and comparing the results with those obtained from unamplified controls...
Systematic identification of transcription factors associated with patient survival in cancersChao Cheng
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
BMC Genomics 10:225. 2009..However, the association between transcription factors and cancers is largely dependent on the transcription regulatory activities rather than mRNA expression levels...
Identification of specificity determining residues in peptide recognition domains using an information theoretic approach applied to large-scale binding mapsKevin Y Yip
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
BMC Biol 9:53. 2011..They mediate protein-protein interactions by recognizing and binding short motifs in their ligands. Although a great deal is known about PRDs and their interactions, prediction of PRD specificities remains largely an unsolved problem...
Tiling array data analysis: a multiscale approach using waveletsAlexander Karpikov
Diagnostic Radiology, Yale University, New Haven, CT, USA
BMC Bioinformatics 12:57. 2011..In doing this, we used specific wavelet basis functions, Coiflets, since their triangular shape closely resembles the expected profiles of true ChIP-chip peaks...
Using 3D Hidden Markov Models that explicitly represent spatial coordinates to model and compare protein structuresVadim Alexandrov
Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Ave, New Haven, CT 06511, USA
BMC Bioinformatics 5:2. 2004..Thus far, however, they have been confined to representing 1D sequence (or the aspects of structure that could be represented by character strings)...
PARE: a tool for comparing protein abundance and mRNA expression dataEric Z Yu
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
BMC Bioinformatics 8:309. 2007..Since proteins are translated from mRNAs, their expression is expected to be related to their abundance, to some degree...
Detection of copy number variation from array intensity and sequencing read depth using a stepwise Bayesian modelZhengdong D Zhang
Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
BMC Bioinformatics 11:539. 2010....
Information assessment on predicting protein-protein interactionsNan Lin
Department of Mathematics, Washington University in St Louis, St Louis, MO 63130, USA
BMC Bioinformatics 5:154. 2004....
LinkHub: a Semantic Web system that facilitates cross-database queries and information retrieval in proteomicsAndrew K Smith
Department of Computer Science, Yale University, New Haven, Connecticut, 06520 USA
BMC Bioinformatics 8:S5. 2007..g. proteins) and the massive graph of relationships among them. These relationships are sometimes simple (e.g. synonyms) but are often more complex (e.g. one-to-many relationships in protein family membership)...
Publishing perishing? Towards tomorrow's information architectureMichael R Seringhaus
Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06520, USA
BMC Bioinformatics 8:17. 2007..We present our vision of the optimal information architecture for the biosciences, and close with tangible steps to improve our handling of scientific information today while paving the way for an expansive central index in the future...
Multi-level learning: improving the prediction of protein, domain and residue interactions by allowing information flow between levelsKevin Y Yip
Department of Computer Science, Yale University, 51 Prospect Street, New Haven, CT 06511, USA
BMC Bioinformatics 10:241. 2009..The predictions at each level could benefit from using the features at all three levels. However, it is not trivial as the features are provided at different granularity...
Structural genomics: a new era for pharmaceutical researchYang Liu
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520-8114, USA
Genome Biol 3:REPORTS4004. 2002
Integrative database analysis in structural genomicsM Gerstein
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
Nat Struct Biol 7:960-3. 2000..Integrative database analysis allows one to survey the 'finite parts list' of protein folds from many perspectives, highlighting certain folds and structural features that stand out in particular ways...
The current excitement in bioinformatics-analysis of whole-genome expression data: how does it relate to protein structure and function?M Gerstein
Department of Molecular Biophysics and Biochemistry, 266 Whitney Avenue, Yale University, PO Box 208114, New Haven, CT 06520, USA
Curr Opin Struct Biol 10:574-84. 2000..Other attributes of proteins can also be related to expression-in particular, structure and localization-and sometimes show a clearer relationship than function...
Proteomics. Integrating interactomesMark Gerstein
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
Science 295:284-7. 2002
Comparing genomes in terms of protein structure: surveys of a finite parts listM Gerstein
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
FEMS Microbiol Rev 22:277-304. 1998..Continuously updated tables and further information pertinent to this review are available over the web at http://bioinfo.mbb.yale.edu/genome...
A structural census of the current population of protein sequencesM Gerstein
Molecular Biophysics and Biochemistry Department, P O Box 208114, Yale University, New Haven, CT 06520 8114, USA
Proc Natl Acad Sci U S A 94:11911-6. 1997..They also have important implications for database-based methods for fold recognition, suggesting that an unknown sequence from a plant is more likely to have a certain fold (e.g., a TIM barrel) than an unknown sequence from an animal...
An interdepartmental Ph.D. program in computational biology and bioinformatics: the Yale perspectiveMark Gerstein
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
J Biomed Inform 40:73-9. 2007..Further updated information is available from the program's website, www.cbb.yale.edu.)..
Predicting interactions in protein networks by completing defective cliquesHaiyuan Yu
Department of Molecular Biophysics and Biochemistry, 266 Whitney Avenue, Yale University, PO Box 208114, New Haven, CT 06520-8285, USA
Bioinformatics 22:823-9. 2006..More information can be found on our website http://topnet.gersteinlab.org/clique/ CONTACT: SUPPLEMENTARY INFORMATION: Supplementary Materials are available at Bioinformatics online...
How representative are the known structures of the proteins in a complete genome? A comprehensive structural censusM Gerstein
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
Fold Des 3:497-512. 1998..It is also important for improving database-based methods of structure prediction and genome annotation...
Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteinsM Gerstein
Molecular Biophysics and Biochemistry Department, Yale University, New Haven, Connecticut 06520 8114, USA
Protein Sci 7:445-56. 1998..With these improvements and systematic tests, our procedure should be useful for the development of scop and the future classification of protein folds...
A database of macromolecular motionsM Gerstein
Department of Molecular Biophysics and Biochemistry, 266 Whitney Avenue, Yale University, PO Box 208114, New Haven, CT 06520, USA
Nucleic Acids Res 26:4280-90. 1998..These pathways can be viewed in a variety of movie formats, and the database is associated with a server that can automatically generate these movies from submitted coordinates...
What is a gene, post-ENCODE? History and updated definitionMark B Gerstein
Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06511, USA
Genome Res 17:669-81. 2007..It also manifests how integral the concept of biological function is in defining genes...
Exploring the range of protein flexibility, from a structural proteomics perspectiveMark Gerstein
Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Ave, New Haven, CT 06520, USA
Curr Opin Chem Biol 8:14-9. 2004..New data emphasize a breadth of possible structural mechanisms, particularly the ability to drastically alter protein architecture and the native flexibility of many structures...
Patterns of protein-fold usage in eight microbial genomes: a comprehensive structural censusM Gerstein
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
Proteins 33:518-34. 1998..This implies there are no marked preferences for proteins with particular numbers of TM-helices (e.g. 7-TM) in microbial genomes...
Analysis of copy number variants and segmental duplications in the human genome: Evidence for a change in the process of formation in recent evolutionary historyPhilip M Kim
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
Genome Res 18:1865-74. 2008..In addition to a coarse-grained analysis, we performed targeted sequencing of 67 CNVs and then analyzed a combined set of 270 CNVs (540 breakpoints) to verify our conclusions...
Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogsHaiyuan Yu
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
Genome Res 14:1107-18. 2004..We test a number of these in two-hybrid experiments and are able to verify 45 overlaps, which we show to be statistically significant...
Genome-wide analysis of chromatin features identifies histone modification sensitive and insensitive yeast transcription factorsChao Cheng
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
Genome Biol 12:R111. 2011....
Annotating non-coding regions of the genomeRoger P Alexander
Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
Nat Rev Genet 11:559-71. 2010..Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis...
Variation in transcription factor binding among humansMaya Kasowski
Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT 06520, USA
Science 328:232-5. 2010..Our results indicate that many differences in individuals and species occur at the level of TF binding, and they provide insight into the genetic events responsible for these differences...
PeakSeq enables systematic scoring of ChIP-seq experiments relative to controlsJoel Rozowsky
Molecular Biophysics and Biochemistry Dept, Yale University, PO Box 208114, New Haven, Connecticut 06520 8114, USA
Nat Biotechnol 27:66-75. 2009..Our scoring procedure enables us to optimize experimental design by estimating the depth of sequencing required for a desired level of coverage and demonstrating that more than two replicates provides only a marginal gain in information...
Systematic prediction and validation of breakpoints associated with copy-number variants in the human genomeJan O Korbel
Departments of Molecular Biophysics and Biochemistry, Yale University School of Medicine, New Haven, CT 06520, USA
Proc Natl Acad Sci U S A 104:10110-5. 2007..Further, it enabled us to demonstrate a clear Mendelian pattern of inheritance for one of the CNVs...
Prediction and characterization of noncoding RNAs in C. elegans by integrating conservation, secondary structure, and high-throughput sequencing and array dataZhi John Lu
Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA
Genome Res 21:276-85. 2011..Overall, our study identifies many new potential ncRNAs in C. elegans and provides a method that can be adapted to other organisms...
MOTIPS: automated motif analysis for predicting targets of modular protein domainsHugo Y K Lam
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
BMC Bioinformatics 11:243. 2010..However, predicting domain targets by motif sequence alone without considering other genomic and structural information has been shown to be lacking in accuracy...
StoneHinge: hinge prediction by network analysis of individual protein structuresKevin S Keating
Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA
Protein Sci 18:359-71. 2009..By comparison, a popular hinge detection method that requires knowledge of both the open and closed conformations finds 10 of the 13 known hinges, while predicting four additional, false hinges...
Paired-end mapping reveals extensive structural variation in the human genomeJan O Korbel
Molecular Biophysics and Biochemistry Department, Yale University, New Haven, CT 06520, USA
Science 318:420-6. 2007..The breakpoint junction sequences of more than 200 SVs were determined with a novel pooling strategy and computational analysis. Our analysis provided insights into the mechanisms of SV formation in humans...
Integrating sequencing technologies in personal genomics: optimal low cost reconstruction of structural variantsJiang Du
Department of Computer Science, Yale University, New Haven, Connecticut, USA
PLoS Comput Biol 5:e1000432. 2009..Our strategy should facilitate the sequencing of human genomes at maximum accuracy and low cost...
Comprehensive analysis of amino acid and nucleotide composition in eukaryotic genomes, comparing genes and pseudogenesNathaniel Echols
Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, Box 208114, New Haven, CT 06520-8114, USA
Nucleic Acids Res 30:2515-23. 2002..Our compositional analyses with the interactive viewer are available over the web at http://genecensus.org/pseudogene...
Integrative analysis of the Caenorhabditis elegans genome by the modENCODE projectMark B Gerstein
Program in Computational Biology and Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, CT 06520, USA
Science 330:1775-87. 2010..Integrating data types, we built statistical models relating chromatin, transcription factor binding, and gene expression. Overall, our analyses ascribed putative functions to most of the conserved genome...
Analysis of membrane proteins in metagenomics: networks of correlated environmental features and protein familiesPrianka V Patel
Department of Molecular Biophysics, Yale University, New Haven, Connecticut 06520, USA
Genome Res 20:960-71. 2010....
HingeMaster: normal mode hinge prediction approach and integration of complementary predictorsSamuel C Flores
Department of Physics, Yale University, New Haven, Connecticut 06520, USA
Proteins 73:299-319. 2008..We integrated all the methods into a combined predictor using a weighted voting scheme. Finally, we encapsulated all our results in a web tool which can be used to run all the predictors on submitted proteins and visualize the results...
Structural genomics analysis: characteristics of atypical, common, and horizontally transferred foldsHedi Hegyi
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, USA
Proteins 47:126-41. 2002..In particular, we find three possible examples of transfer between archaea and bacteria and six between eukarya and bacteria. We make available our detailed results at http://genecensus.org/20...
The GENCODE pseudogene resourceBaikang Pei
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
Genome Biol 13:R51. 2012..However, recent evidence suggests that many of them might have some form of biological activity, and the possibility of functionality has increased interest in their accurate annotation and integration with functional genomics data...
Pseudofam: the pseudogene families databaseHugo Y K Lam
Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
Nucleic Acids Res 37:D738-43. 2009..The statistics also show how the number of genes and pseudogenes in families correlates across different species. Overall, they highlight the fact that housekeeping families tend to be enriched with a large number of pseudogenes...
Statistical analysis of the genomic distribution and correlation of regulatory elements in the ENCODE regionsZhengdong D Zhang
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
Genome Res 17:787-97. 2007..Data sets associated with histone modifications have particularly strong correlations. Finally, we show how the correlations between factors change when only regulatory elements far from the transcription start sites are considered...
Mapping of transcription factor binding regions in mammalian cells by ChIP: comparison of array- and sequencing-based technologiesGhia M Euskirchen
Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520 8103, USA
Genome Res 17:898-909. 2007..Overall, this study provides information for robust identification, scoring, and validation of TF targets using ChIP-based technologies...
Analysis of genomic variation in non-coding elements using population-scale sequencing data from the 1000 Genomes ProjectXinmeng Jasmine Mu
Program in Computational Biology and Bioinformatics, Department of Molecular Biophysics and Biochemistry, W M Keck Foundation Biotechnology Resource Laboratory, Yale University, New Haven, CT 06520, USA
Nucleic Acids Res 39:7058-76. 2011....
Analysis of combinatorial regulation: scaling of partnerships between regulators with the number of governed targetsNitin Bhardwaj
Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
PLoS Comput Biol 6:e1000755. 2010..Finally, we perform various down-sampling calculations on the observed data to illustrate the robustness of our conclusions...
FlexOracle: predicting flexible hinges by identification of stable domainsSamuel C Flores
Department of Physics, Yale University, Bass 432, New Haven, CT 06520, USA
BMC Bioinformatics 8:215. 2007..Programs such as GNM and FIRST have made global flexibility predictions available at low computational cost, but are not designed specifically for finding hinge points...
TopNet: a tool for comparing biological sub-networks, correlating protein properties with topological statisticsHaiyuan Yu
Department of Molecular Biophysics and Biochemistry, 266 Whitney Avenue, Yale University, PO Box 208114, New Haven, CT 06520, USA
Nucleic Acids Res 32:328-37. 2004..This phenomenon may reflect the incompleteness of the experimentally determined yeast interaction network...
Helix Interaction Tool (HIT): a web-based tool for analysis of helix-helix interactions in proteinsAnne E Counterman Burba
Department of Molecular Biophysics and Biochemistry, Yale University New Haven, CT 06520, USA
Bioinformatics 22:2735-8. 2006..For the latter purpose, a direct interface from entries in the Molecular Motions Database to the HIT site has been provided...
A supervised hidden markov model framework for efficiently segmenting tiling array data in transcriptional and chIP-chip experiments: systematically incorporating validated biological knowledgeJiang Du
Department of Computer Science, Yale University, New Haven, CT 06520, USA
Bioinformatics 22:3016-24. 2006..This latter result has strong implications for the optimum way medium-scale validation experiments should be carried out to verify the results of the genome-scale tiling array experiments...
Nucleotide-resolution analysis of structural variants using BreakSeq and a breakpoint libraryHugo Y K Lam
Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA
Nat Biotechnol 28:47-55. 2010..As new data become available, we expect our BreakSeq approach will become more sensitive and facilitate rapid SV genotyping of personal genomes...
Relating three-dimensional structures to protein networks provides evolutionary insightsPhilip M Kim
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
Science 314:1938-41. 2006....
Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genomeOlof Emanuelsson
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520 8114, USA
Genome Res 17:886-97. 2007..Finally, our experiments reveal a significant amount of novel transcription outside of known genes, and an appreciable sample of this was validated by independent experiments...
Modeling ChIP sequencing in silico with applicationsZhengdong D Zhang
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
PLoS Comput Biol 4:e1000158. 2008..This enables us to identify transcription-factor binding sites in ChIP-seq data in a statistically rigorous fashion...
The transcriptional landscape of the yeast genome defined by RNA sequencingUgrappa Nagalakshmi
Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT 06520, USA
Science 320:1344-9. 2008..We also found unexpected 3'-end heterogeneity and the presence of many overlapping genes. These results indicate that the yeast transcriptome is more complex than previously appreciated...
Analysis of mRNA expression and protein abundance data: an approach for the comparison of the enrichment of features in the cellular population of proteins and transcriptsDov Greenbaum
Department Genetics, 266 Whitney Avenue, Yale University, PO Box 208114, New Haven, CT 06520, USA
Bioinformatics 18:585-96. 2002..SUPPLEMENTARY INFORMATION: http://genecensus.org/expression/translatome..
Analysis of diverse regulatory networks in a hierarchical context shows consistent tendencies for collaboration in the middle levelsNitin Bhardwaj
Program in Computational Biology and Bioinformatics, Yale University, Bass 426, 266 Whitney Avenue, New Haven, CT 06520, USA
Proc Natl Acad Sci U S A 107:6841-6. 2010..There is, however, one notable difference between networks in different species: The amount of collaborative regulation and democratic character increases markedly with overall genomic complexity...
Novel insights through the integration of structural and functional genomics data with protein networksDeclan Clarke
Department of Chemistry, Yale University, New Haven, CT 06520, USA
J Struct Biol 179:320-6. 2012....
The DART classification of unannotated transcription within the ENCODE regions: associating transcription with known and novel lociJoel S Rozowsky
Molecular Biophysics and Biochemistry Department, Yale University, New Haven, Connecticut 06520 8114, USA
Genome Res 17:732-45. 2007..Overall, we find that 18 of the 46 connections tested validate by RT-PCR and four of five sequenced PCR products confirm connectivity unambiguously...
Dynamic and complex transcription factor binding during an inducible response in yeastLi Ni
Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA
Genes Dev 23:1351-63. 2009....
Integrated assessment of genomic correlates of protein evolutionary rateYu Xia
Bioinformatics Program, Boston University, Boston, Massachusetts, USA
PLoS Comput Biol 5:e1000413. 2009..Our integrated assessment framework can be readily extended to other correlational analyses at the genome scale...
Hinge Atlas: relating protein sequence to sites of structural flexibilitySamuel C Flores
Department of Physics, Yale University, New Haven, CT, USA
BMC Bioinformatics 8:167. 2007..Efforts in this field have been hampered by the lack of a proper dataset for studying characteristics of hinges...
Positive selection at the protein network periphery: evaluation in terms of structural constraints and cellular contextPhilip M Kim
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
Proc Natl Acad Sci U S A 104:20274-9. 2007..e., extracellular space or cell membrane). This suggests that the observed positive selection at the network periphery may be due to an increase of adaptive events on the cellular periphery responding to changing environments...
Integration of protein motions with molecular networks reveals different mechanisms for permanent and transient interactionsNitin Bhardwaj
Yale University, New Haven, Connecticut 06520, USA
Protein Sci 20:1745-54. 2011..We provide evidence for this hypothesis through showing that interfaces involved in transient interactions bind fewer classes of domains than those in a control set...
Regulation of gene expression by a metabolic enzymeDavid A Hall
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520 8005, USA
Science 306:482-4. 2004..Deletion of Arg5,6 causes altered transcript levels of both nuclear and mitochondrial target genes. These results indicate that metabolic enzymes can directly regulate eukaryotic gene expression...
A small reservoir of disabled ORFs in the yeast genome and its implications for the dynamics of proteome evolutionPaul Harrison
Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520-8114, USA
J Mol Biol 316:409-19. 2002..See genecensus.org/pseudogene for further information.)..
Divergence of transcription factor binding sites across related yeast speciesAnthony R Borneman
Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
Science 317:815-9. 2007..Transcription factor binding sites have therefore diverged substantially faster than ortholog content. Thus, gene regulation resulting from transcription factor binding is likely to be a major cause of divergence between related species...
Transcription factor binding site identification in yeast: a comparison of high-density oligonucleotide and PCR-based microarray platformsAnthony R Borneman
Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
Funct Integr Genomics 7:335-45. 2007..The HDO array platform provides a far more robust array system by all measures than PCR-based arrays, all of which is directly attributable to the large number of probes available...
Differential binding of calmodulin-related proteins to their targets revealed through high-density Arabidopsis protein microarraysSorina C Popescu
Department of Molecular, Cellular, and Developmental Biology, Yale University, 219 Prospect Street, New Haven, CT 06520 8103, USA
Proc Natl Acad Sci U S A 104:4730-5. 2007..Our results suggest that calcium functions through distinct CaM/CML proteins to regulate a wide range of targets and cellular activities...
