Research Topics
| Rajarshi GuhaSummaryAffiliation: National Institutes of Health Country: USA Publications
| Collaborators
|
Detail Information
Publications
Open Data, Open Source and Open Standards in chemistry: The Blue Obelisk five years onNoel M O'Boyle
Analytical and Biological Chemistry Research Facility, Cavanagh Pharmacy Building, University College Cork, College Road, Cork, CO, Cork, Ireland
J Cheminform 3:37. 2011..abstract:..
Towards interoperable and reproducible QSAR analyses: Exchange of datasetsOla Spjuth
Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden
J Cheminform 2:5. 2010..This process is hampered by the lack of standards and exchange formats in the field, making it virtually impossible to reproduce and validate analyses and drastically constrain collaborations and re-use of data...
Advances in cheminformatics methodologies and infrastructure to support the data mining of large, heterogeneous chemical datasetsRajarshi Guha
School of Informatics, Indiana University, Bloomington, IN 47408, USA
Curr Comput Aided Drug Des 6:50-67. 2010..Given that the above work is applicable to arbitrary types of cheminformatics problems, we also present some case studies related to virtual screening for anti-malarials and predictions of anti-cancer activity...
Improving usability and accessibility of cheminformatics tools for chemists through cyberinfrastructure and educationRajarshi Guha
Indiana University School of Informatics, Bloomington, IN 47405, USA
In Silico Biol 11:41-60. 2011....
Counting clusters using R-NN curvesRajarshi Guha
School of Informatics, Indiana University, Bloomington, Indiana 47406, USA
J Chem Inf Model 47:1308-18. 2007..Our results indicate that the R-NN curve algorithm is able to determine the natural number of clusters and is in general agreement the average silhouette width in identifying the optimal number of clusters...
Ensemble feature selection: consistent descriptor subsets for multiple QSAR modelsDebojyoti Dutta
School of Informatics, Indiana University, Bloomington, Indiana 47406, USA
J Chem Inf Model 47:989-97. 2007..In addition, interpretations of the individual models developed using this approach indicate that they encode similar structure-activity trends...
Chemical data mining of the NCI human tumor cell line databaseHuijun Wang
Indiana University School of Informatics and Chemical Informatics, and Cyberinfrastructure Collaboratory, 901 East Tenth Street, Bloomington, IN 47408, USA
J Chem Inf Model 47:2063-76. 2007..In this paper we describe a formal knowledge discovery approach to characterizing and data mining this set and report the results of some of our initial experiments in mining the set from a chemoinformatics perspective...
Utilizing high throughput screening data for predictive toxicology models: protocols and application to MLSCN assaysRajarshi Guha
School of Informatics, Indiana University, Bloomington, IN, 47406, USA
J Comput Aided Mol Des 22:367-84. 2008..Finally, we present a visualization technique that allows one to compare a new dataset to the training set of the models to decide whether the new dataset may be reliably predicted...
PubChem as a source of polypharmacologyBin Chen
School of Informatics, Indiana University, Bloomington, Indiana 47408, USA
J Chem Inf Model 49:2044-55. 2009..Our results indicate this approach could be a useful way to investigate polypharmacology in the PubChem bioassay collection...
Assessing how well a modeling protocol captures a structure-activity landscapeRajarshi Guha
School of Informatics, Indiana University, Bloomington, IN 47406, USA
J Chem Inf Model 48:1716-28. 2008..In particular, the integral of these curves, denoted as SCI and being a number ranging from -1.0 to 1.0, approaches a value of 1.0 for two literature models, which are both known to be prospectively useful...
On the interpretation and interpretability of quantitative structure-activity relationship modelsRajarshi Guha
School of Informatics, Indiana University, Bloomington, IN 47408, USA
J Comput Aided Mol Des 22:857-71. 2008..Finally, we discuss a number of case studies where workers have provide some form of interpretation of a QSAR model...
Flexible Web service infrastructure for the development and deployment of predictive modelsRajarshi Guha
School of Informatics, Indiana University, Bloomington, Indiana 47406, USA
J Chem Inf Model 48:456-64. 2008..We highlight the advantages of this infrastructure by developing and subsequently deploying random forest models for two data sets...
Web service infrastructure for chemoinformaticsXiao Dong
Indiana University School of Informatics, Community Grids Laboratory, and Chemical Informatics and Cyberinfrastructure Collaboratory, Indiana University, Bloomington, Indiana 47408, USA
J Chem Inf Model 47:1303-7. 2007..In this paper, we describe this infrastructure, give some examples of its use, and then discuss our plans to use it as a platform for chemoinformatics application development in the future...
