Gene Expression Data Analysis Suite (GEDAS)


Histogram

·        Description or interpretation: Histogram is most popular visualization technique in array data mining, and can be applied to visualization of gene expression data also.  This representation is useful in giving a comparative/quantitative/qualitative visualization of the parameters and is used in the visualization of raw data and pre-processed data.  In a specific case, it has been applied to PCA also, e.g., the following figure shows the histogram of PCA applied on the breast cancer data.

·        Complexity: A histogram can be generated in O(n2)

·        Special considerations/features:  A line connecting the mid point of the bars could be generated to represent line graph.

·        Advantages and drawbacks:  Histogram is very primitive and provides limited statistical information compared to other techniques

 

 

Figure: The histogram is the most fundamental visualization of gene expression data and can be applied to all forms of datasets

 


Home