ICON Web & News
Virtual Journal
Everything
Resources
Virtual Journal
Analyze Database
GoodNanoGuide
ICON Reports
ICON Backgrounders
Glossary
Policy Reports
Ratings Comment
Guidelines
Links
Quick Search:
Keywords:
Search:
Search Using OECD Database
Details
Return to Previous Page
Addition or Correction
Title:
Analyzing High Dimensional Toxicogenomic Data Using Consensus Clustering
Date:
8/2012
Link to Journal Abstract
Abstract:
Rapid development of high-throughput toxicogenomics technologies has created new approaches to screen environmental samples for mechanistic toxicity assessment. However, challenges remain in the analysis, especially clustering of the resulting high-dimensional data. Because of the lack of commonly accepted validation methods, it is difficult to compare clustering results between studies or to identify the key experimental or data features that impact the clustering results. We applied consensus clustering (CC), an approach that clusters the input data repeatedly through iterative resampling, and identifies frequently occurring high-confidence clusters. We used CC to analyze a set of high dimensional transcriptomics data with temporal resolution, which were generated using our E. coli whole-cell array system for a diverse variety of toxicants at different dose concentrations. The CC analysis allowed us to evaluate the clustering results' robustness and sensitivity against a number of conditions that represent the common variations in high-throughput experiments, including noisy data, subsets of treatments, subsets of reporter genes, and subsets of time points. We demonstrated the value of utilizing rich time-series data and underscored the importance of careful selection of sampling times for a given experimental system. The results also indicated that temporal data compression using our proposed Transcriptional Effect Level Index (TELI) concept followed by CC largely conserved the cluster resolution. We also found that for our cellular stress response ensemble-based high-throughput transcriptomics assay platform, the size and composition of the reporter gene set are critical factors that affect the resulting coherency of clusters. Taken together, these results demonstrated that more robust consensus clustering such as CC may be valuable in analyzing high-dimensional toxicogenomic data sets.
Non-technical Summary:
In this study, the authors applied consensus clustering (CC), an approach that clusters the input data repeatedly through iterative resampling, and identifies frequently occurring high-confidence clusters. They used CC to analyze a set of high dimensional transcriptomics data with temporal resolution, which were generated using an E. coli whole-cell array system for a diverse variety of toxicants at different dose concentrations.
Content Emphasis
Peer Reviewed Journal Article
Exposure Or Hazard Target
Other/Unspecified
Exposure Pathway
Other/Unspecified
Method Of Study
Computational and System Modeling
Paper Type
Hazard
Particle Type
Other/Unspecified
Production Method
Engineered
Risk Exposure Group
Other/Unspecified
Target Audience
Technical Research
Citation:
Environ. Sci. Technol., 2012, 46(15): 8413-8421
Publication:
Environmental Science & Technology
Author:
Gao C, Weisman D, Gou N, Ilyin V, Gu AZ
Volume:
46
Number:
15
Pages:
8413-8421
Last updated on September 26, 2012
Permalink
Join Us
|
About
|
Newsroom
|
Working Groups
|
Projects
|
Resources
|
Virtual Journal
|
Events
|
Logout
This work is supported in part by the Nanoscale Science and Engineering Initiative of the National Science Foundation
under NSF Award Number EEC-0118007.
Why Join Us?
Mission and Strategy
Background
Organization
Sponsors
Participants
Contact Us
ICON Releases
News
Media Alert
RSS
Governance
Knowledge Base
Best Practices
Communications
Virtual Journal
Analyze Database
Good Nano Guide
ICON Reports
ICON Backgrounders
Glossary
Policy Reports
Links
Council Events
Other Events
Virtual Journal
Analyze Database
The GoodNanoGuide
Nano EHS Research Needs
Current Practices Survey