Research Topics
| Hua XuSummaryAffiliation: Vanderbilt University Country: USA Publications
Research Grants
| Collaborators
|
Detail Information
Publications
Facilitating pharmacogenetic studies using electronic health records and natural-language processing: a case study of warfarinHua Xu
Department of Biomedical Informatics, School of Medicine, Vanderbilt University, Nashville, Tennessee 37232, USA
J Am Med Inform Assoc 18:387-91. 2011..This study sought to develop natural-language-processing algorithms to extract drug-dose information from clinical text, and to assess the capabilities of such tools to automate the data-extraction process for pharmacogenetic studies...
A new clustering method for detecting rare senses of abbreviations in clinical notesHua Xu
Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 37203, USA
J Biomed Inform 45:1075-83. 2012..Further analysis demonstrated that the improvement by the TCRS method was mainly from additionally detected rare senses, thus indicating its usefulness for building more complete sense inventories of clinical abbreviations...
Applying semantic-based probabilistic context-free grammar to medical language processing--a preliminary study on parsing medication sentencesHua Xu
Department of Biomedical Informatics, Vanderbilt University, School of Medicine, Nashville, TN 37232, USA
J Biomed Inform 44:1068-75. 2011..Our evaluation using a 10-fold cross validation showed that the PCFG parser dramatically improved parsing performance when compared to the CFG parser...
MedEx: a medication information extraction system for clinical narrativesHua Xu
Department of Biomedical Informatics, Vanderbilt University, School of Medicine, Nashville, Tennessee 37232, USA
J Am Med Inform Assoc 17:19-24. 2010..5%, 93.9%, and 96.0% respectively. We then applied MedEx unchanged to outpatient clinic visit notes. It performed similarly with F-measures over 90% on a set of 25 clinic visit notes...
Methods for building sense inventories of abbreviations in clinical notesHua Xu
Department of Biomedical Informatics, Columbia University, New York, NY, USA
J Am Med Inform Assoc 16:103-8. 2009..To develop methods for building corpus-specific sense inventories of abbreviations occurring in clinical documents...
Detecting abbreviations in discharge summaries using machine learning methodsYonghui Wu
Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
AMIA Annu Symp Proc 2011:1541-9. 2011..8% (precision 98.8% and recall of 91.2%). When a voting scheme was used to combine output from various ML classifiers, the system achieved the highest F-measure of 95.7%...
Extracting timing and status descriptors for colonoscopy testing from electronic medical recordsJoshua C Denny
Department of Biomedical Informatics, Vanderbilt University, Nashville, Tennessee, USA
J Am Med Inform Assoc 17:383-8. 2010..Further investigations must validate extension of NLP approaches for other types of CRC screening applications...
A study of transportability of an existing smoking status detection module across institutionsMei Liu
Department of Biomedical Informatics, School of Medicine, Vanderbilt University, Nashville, TN, USA
AMIA Annu Symp Proc 2012:577-86. 2012..Our results showed that the customized module achieved significantly higher F-measures at all levels of classification (i.e., sentence, document, patient) compared to the direct application of the cTAKES module to the Vanderbilt data...
Gene symbol disambiguation using knowledge-based profilesHua Xu
Department of Biomedical Informatics, Columbia University, New York City, New York, USA
Bioinformatics 23:1015-22. 2007..Existing knowledge sources, such as Entrez Gene and the MEDLINE database, contain information concerning the characteristics of a particular gene that could be used to disambiguate gene symbols...
Extracting and integrating data from entire electronic health records for detecting colorectal cancer casesHua Xu
Department of Biomedical Informatics, Vanderbilt University, School of Medicine, Nashville, TN, USA
AMIA Annu Symp Proc 2011:1564-72. 2011..996 for document level concept identification, and 0.93 for patient level case detection...
Predicting warfarin dosage in European-Americans and African-Americans using DNA samples linked to an electronic health recordAndrea H Ramirez
Department of Medicine, Vanderbilt University in Nashville, TN 37232, USA
Pharmacogenomics 13:407-18. 2012..Electronic health record (EHR) systems linked to biobanks may allow for pharmacogenomic analysis, but they have not yet been used for this purpose...
Using contextual and lexical features to restructure and validate the classification of biomedical conceptsJung Wei Fan
Department of Biomedical Informatics, Columbia University Vanderbilt Clinic, 5th Floor, 622 West 168th Street, New York, NY 10032, USA
BMC Bioinformatics 8:264. 2007..In this paper, we introduce another classification approach based on words of the concept strings and compare it to the contextual syntactic approach...
A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summariesMin Jiang
Department of Biomedical Informatics, Vanderbilt University, School of Medicine, Nashville, Tennessee 37232, USA
J Am Med Inform Assoc 18:601-6. 2011..This project was part of the 2010 Center of Informatics for Integrating Biology and the Bedside/Veterans Affairs (VA) natural-language-processing challenge...
Modeling drug exposure data in electronic medical records: an application to warfarinMei Liu
Department of Biomedical Informatics, Vanderbilt University, School of Medicine, Nashville, TN, USA
AMIA Annu Symp Proc 2011:815-23. 2011..We applied the framework to determine patient warfarin exposure at hospital admissions and achieved 87% precision, 79% recall, and an area under the receiver-operator characteristic curve of 0.93...
Using distributional analysis to semantically classify UMLS conceptsJung Wei Fan
Department of Biomedical Informatics, Columbia University, USA
Stud Health Technol Inform 129:519-23. 2007..54 and recall of 0.654 was achieved by the top prediction; precision of 0.64 and recall of 0.769 was achieved by the top 2 predictions. Error analysis revealed problems in the current method, and provided insight into future improvements...
Integrating existing natural language processing tools for medication extraction from discharge summariesSon Doan
Department of Biomedical Informatics, Vanderbilt University, School of Medicine, Nashville, TN, USA
J Am Med Inform Assoc 17:528-31. 2010..This task required accurate recognition of medication name, dosage, mode, frequency, duration, and reason for drug administration...
A comparative study of current Clinical Natural Language Processing systems on handling abbreviations in discharge summariesYonghui Wu
Department of Biomedical Informatics, School of Medicine, Vanderbilt University, Nashville, TN, USA
AMIA Annu Symp Proc 2012:997-1003. 2012..This study suggested that accurate identification of clinical abbreviations is a challenging task and that more advanced abbreviation recognition modules might improve existing clinical NLP systems...
Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugsMei Liu
Department of Biomedical Informatics, Vanderbilt University, School of Medicine, Nashville, Tennessee 37232, USA
J Am Med Inform Assoc 19:e28-35. 2012..Accurate prediction of potential ADRs is required in the entire life cycle of a drug, including early stages of drug design, different phases of clinical trials, and post-marketing surveillance...
Natural language processing and visualization in the molecular imaging domainP Karina Tulipano
Department of Biomedical Informatics, Columbia University, 622 West 168th Street, Vanderbilt Clinic Floor 5, NY 10032, USA
J Biomed Inform 40:270-81. 2007..74 (95% CI: [.70-.76]) and 0.70 (95% CI [.63-.76]), respectively. We adapt a JAVA viewer known as PGviewer for the simultaneous visualization of images with NLP extracted information...
DTome: a web-based tool for drug-target interactome constructionJingchun Sun
Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
BMC Bioinformatics 13:S7. 2012....
Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issuesHua Xu
Department of Biomedical Informatics, Columbia University, 622 168th St, New York City, New York, USA
BMC Bioinformatics 7:334. 2006..Thus, there is a need to explicitly address the factors and to systematically quantify their effects on performance...
Applying active learning to assertion classification of concepts in clinical textYukun Chen
Department of Biomedical Informatics, Vanderbilt University, School of Medicine, Nashville, TN, USA
J Biomed Inform 45:265-72. 2012..For example, to achieve an AUC of 0.79, the random sampling method used 32 samples, while our best active learning algorithm required only 12 samples, a reduction of 62.5% in manual annotation effort...
Data from clinical notes: a perspective on the tension between structure and flexible documentationS Trent Rosenbloom
Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
J Am Med Inform Assoc 18:181-6. 2011..When reusable data are needed from notes, providers can use structured documentation or rely on post-hoc text processing to produce structured data, as appropriate...
Extracting semantic lexicons from discharge summaries using machine learning and the C-Value methodMin Jiang
Department of Biomedical Informatics, Vanderbilt University, School of Medicine, Nashville, TN, USA
AMIA Annu Symp Proc 2012:409-16. 2012..04%). We conclude that such corpus-based methods are effective for generating semantic lexicons, which may improve named entity recognition tasks and may aid in augmenting synonymy within existing terminologies...
The use of a DNA biobank linked to electronic medical records to characterize pharmacogenomic predictors of tacrolimus dose requirement in kidney transplant recipientsKelly A Birdwell
Division of Nephrology, Vanderbilt University Medical Center, Nashville, Tennesse, USA
Pharmacogenet Genomics 22:32-42. 2012..The importance of other drug absorption, distribution, metabolism, and elimination (ADME) gene variants has not been well characterized...
Development of a natural language processing system to identify timing and status of colonoscopy testing in electronic medical recordsJoshua C Denny
Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
AMIA Annu Symp Proc 2009:141. 2009..The system detected completed colonoscopies with recall and precision of 0.93 and 0.92. The system was superior to a query of colonoscopy billing codes to determine screening status...
Identifying the status of genetic lesions in cancer clinical trial documents using machine learningYonghui Wu
Department of Biomedical Informatics, Vanderbilt University, School of Medicine, 2209 Garland Ave, Nashville, TN 37232, USA
BMC Genomics 13:S21. 2012..To facilitate search and identification of gene-associated clinical trials by potential participants and clinicians, it is important to develop automated methods to identify genetic information from narrative trial documents...
Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviationsHua Xu
Department of Biomedical Informatics, School of Medicine, Vanderbilt University, Nashville, TN, USA
AMIA Annu Symp Proc 2012:1004-13. 2012..875 on the same test set, indicating that integrating sense frequency information with local context is effective for clinical abbreviation disambiguation...
Facilitating cancer research using natural language processing of pathology reportsHua Xu
Department of Biomedical Informatics, College of Physicians and Surgeons, Columbia University, 622 W. 168th Street, VC-5, New York, NY 10032, USA
Medinfo 11:565-72. 2004..The evaluation outcome showed that the extended NLP system had a sensitivity of 90.6% and a precision of 91.6%. Results indicated that this system performed satisfactorily for capturing information for the cancer research project...
Genomics in 2012: challenges and opportunities in the next generation sequencing eraZhongming Zhao
Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
BMC Genomics 13:S1. 2012..It included six sessions, a tutorial - Introduction to Proteome Informatics, a workshop - Next Generation Sequencing, and a poster session. The selected papers in this Supplement issue represent the genomic focus in ICIBM 2012...
Comparing content coverage in medical curriculum to trainee-authored clinical notesJoshua C Denny
Department of Biomedical Informatics, Vanderbilt University, Nashville, TN
AMIA Annu Symp Proc 2010:157-61. 2010..Such methods may prove useful for future curriculum evaluations and revisions...
Research Grants
- An in-silico method for epidemiological studies using Electronic Medical RecordsHua Xu; Fiscal Year: 2009..The priority score reflects the average of all the scores given by the full committee after a thorough discussion. ..
- Real-time Disambiguation of Abbreviations in Clinical NotesHua Xu; Fiscal Year: 2010....
- An in-silico method for epidemiological studies using Electronic Medical RecordsHua Xu; Fiscal Year: 2010..The informatics approach will be validated on EMRs from two major hospitals to demonstrate its generalizability. Epidemiological findings from our study will be compared to reported findings for validation. ..
