Research Topics
Species | Martin C FrithSummaryAffiliation: Computational Biology Research Center Country: Japan Publications
| Collaborators
|
Detail Information
Publications
A mostly traditional approach improves alignment of bisulfite-converted DNAMartin C Frith
Computational Biology Research Center, National Institute for Advanced Industrial Science and Technology AIST, 2 4 7 Aomi, Koto ku, Tokyo 135 0064, Japan
Nucleic Acids Res 40:e100. 2012..It also illustrates how to adapt general-purpose alignment methods to a special case with distorted base patterns: this should be informative for other special cases such as ancient DNA and AT-rich genomes...
Gentle masking of low-complexity sequences improves homology searchMartin C Frith
Computational Biology Research Center, Institute for Advanced Industrial Science and Technology, Koto ku, Tokyo, Japan
PLoS ONE 6:e28819. 2011..Gentle masking is currently the best way to treat low-complexity tracts during homology search...
A new repeat-masking method enables specific detection of homologous sequencesMartin C Frith
Computational Biology Research Center, Institute for Advanced Industrial Science and Technology, Sequence Analysis Team, 2 4 7 Aomi, Koto ku, Tokyo 135 0064, Japan
Nucleic Acids Res 39:e23. 2011..This method thoroughly eliminates spurious homology predictions for DNA-DNA, protein-protein and DNA-protein comparisons. Moreover, it enables accurate homology search for non-coding DNA with extreme A + T composition...
Parameters for accurate genome alignmentMartin C Frith
Computational Biology Research Center, Institute for Advanced Industrial Science and Technology, Tokyo 135 0064, Japan
BMC Bioinformatics 11:80. 2010..Surprisingly, there has been no large-scale assessment of these choices using real genomic data. Moreover, rigorous procedures to control the rate of spurious alignment have not been employed...
Incorporating sequence quality data into alignment improves DNA read mappingMartin C Frith
Computational Biology Research Center, Institute for Advanced Industrial Science and Technology, Koto ku, Tokyo 135 0064, Japan
Nucleic Acids Res 38:e100. 2010..This approach enables more effective use of DNA reads from organisms that lack reference genomes, are extinct or are highly polymorphic...
Discovering sequence motifs with arbitrary insertions and deletionsMartin C Frith
Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology AIST, Tokyo, Japan
PLoS Comput Biol 4:e1000071. 2008..It may be equally useful for arbitrarily gapped motifs in DNA and RNA, although fewer examples of such motifs are known at present. GLAM2 is public domain software, available for download at http://bioinformatics.org.au/glam2...
A code for transcription initiation in mammalian genomesMartin C Frith
Genome Exploration Research Group Genome Network Project Core Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, 1 7 22 Suehiro cho, Tsurumi ku, Yokohama, Kanagawa, 230 0045, Japan
Genome Res 18:1-12. 2008....
Large-scale clustering of CAGE tag expression dataKazuro Shimokawa
Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Suehiro cho, Tsurumi ku, Yokohama, Kanagawa, Japan
BMC Bioinformatics 8:161. 2007..The standard hierarchical clustering algorithm, which gives us easily understandable graphical tree images, has difficulties in processing such huge amounts of TSS data and a better method to calculate and display the results is needed...
Splicing bypasses 3' end formation signals to allow complex gene architecturesMartin C Frith
Genome Exploration Research Group Genome Network Project Core Group, RIKEN Genomic Sciences Centre GSC, RIKEN Yokohama Institute, 1 7 22 Suehiro cho, Tsurumi ku, Yokohama, Kanagawa, 230 0045, Japan
Gene 403:188-93. 2007..This bypassing mechanism permits nested and interleaved gene architectures, as well as fusion transcripts that combine exons from adjacent genes...
Pseudo-messenger RNA: phantoms of the transcriptomeMartin C Frith
Genome Exploration Research Group (Genome Network Project Core Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
PLoS Genet 2:e23. 2006..Many of these transcripts do not correspond cleanly to any identifiable object in the genome, implying fundamental limits to the goal of annotating all functional elements at the genome sequence level...
Dynamic usage of transcription start sites within core promotersHideya Kawaji
NTT Software Corporation, 209 Yamashita cho Nakak ku, Yokohama, Kanagawa, 231 8551, Japan
Genome Biol 7:R118. 2006..We previously characterized the static structures of transcription start site usage within promoters at the base pair level, based on large-scale sequencing of transcript 5' ends...
Evolutionary turnover of mammalian transcription start sitesMartin C Frith
Genome Exploration Research Group, RIKEN Genomic Sciences Centre (GSC, RIKEN Yokohama Institute, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
Genome Res 16:713-22. 2006..These findings suggest that the signals encoding transcription start sites are highly flexible and evolvable, and have cautionary implications for the use of sequence-level conservation to detect gene regulatory elements...
The abundance of short proteins in the mammalian proteomeMartin C Frith
Genome Exploration Research Group (Genome Network Project Core Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
PLoS Genet 2:e52. 2006..Translation assays confirm that some of these novel proteins can be translated and localised to the secretory pathway...
Discrimination of non-protein-coding transcripts from protein-coding mRNAMartin C Frith
Genome Exploration Research Group Genome Network Project Core Group, RIKEN Genomic Sciences Center GSC, RIKEN Yokohama Institute, Kanagawa, Japan
RNA Biol 3:40-8. 2006..Conversely and surprisingly, our analyses also provide evidence that as much as approximately 10% of entries in the manually curated protein database Swiss-Prot are erroneous translations of actually non-coding transcripts...
Probabilistic alignments with quality scores: an application to short-read mapping toward accurate SNP/indel detectionMichiaki Hamada
Graduate School of Frontier Sciences, University of Tokyo, Kashiwa 277 8562, Japan
Bioinformatics 27:3085-92. 2011..g. SHRiMP2, Stampy, BWA and Novoalign. The study also suggested that our approach yields favorable precision for SNP/indel calling...
Recount: expectation maximization based error correction tool for next generation sequencing dataEdward Wijaya
AIST, Computational Biology Research Center, 2 42 Aomi, Koutou ku, Tokyo 135 0064, Japan
Genome Inform 23:189-201. 2009..RECOUNT is an open-source C++ program available at http://seq.cbrc.jp/recount...
The whole alignment and nothing but the alignment: the problem of spurious alignment flanksMartin C Frith
Computational Biology Research Center, Institute for Advanced Industrial Science and Technology, Tokyo 135 0064, Japan
Nucleic Acids Res 36:5863-71. 2008..Moreover, by explicitly demonstrating the tradeoff between over- and under-alignment, our methods guide the rational choice of scoring schemes for various alignment tasks...
Finding protein-coding genes through human polymorphismsEdward Wijaya
Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Japan
PLoS ONE 8:e54210. 2013..On average 10% of these genes are located in the genomic regions devoid of annotated genes in 12 other catalogs. Our statistical analysis showed that these ORFs are unlikely to occur by chance...
Genome-wide analysis of mammalian promoter architecture and evolutionPiero Carninci
Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
Nat Genet 38:626-35. 2006..These data permit genome-scale identification of tissue-specific promoters and analysis of the cis-acting elements associated with them...
RecountDB: a database of mapped and count corrected transcribed sequencesEdward Wijaya
Graduate School of Frontier Sciences, University of Tokyo, 5 1 5 Kashiwanoha, Kashiwa 277 8562, Japan
Nucleic Acids Res 40:D1089-92. 2012..The database is currently populated with 2265 entries from 45 organisms and continuously growing. RecountDB is publicly available at: http://recountdb.cbrc.jp...
