Pathway-Centric Tools and Technology™

Newsletter

Volume 2, Issue 1, March  2005

Obtaining Optimal Results from Oligo GEArrays
Part One: Focus on cRNA Amount and Image Acquisition

The Oligo GEArray from SABiosciences is designed for performing gene expression profiling that focuses on specific biological pathways, genes related to a particular disease state, or genes that are otherwise functionally similar. The development of the platform has already optimized many aspects of microarray performance. However as with any biological assay system, the end user or researcher needs to optimize a few experimental parameters in order to extract the maximum amount of useful information from their actual experiment. First, a sufficient amount of labeled target must be used in the hybridization, a factor determined not only by how much is added but also the initial synthesis yield. Some arrays require more labeled target than others. Second, the exposure time must also be varied to generate an image with a low background, high positive call, and wide dynamic range. Finally, choosing the appropriate normalization factor based on the observed results is also very important. All three of these factors are interdependent on one another. This article discusses each of these factors in turn with a particular focus on the amount of labeled cRNA target and on image acquisition.

Anticipated Raw Data:

The initial result (or the real raw data) from an Oligo GEArray experiment is a grayscale digital image of the array spots. Generation of the image involves either an integrated imager, such as a CCD camera or a laser fluorescent scanner, or a flatbed scanner digitized image of developed X-ray film. The relative level of gene expression is directly related to the spot intensity. Dark spots represent highly expressed genes, light spots represent genes expressed at a lower level, and blank spots indicate a lack of expression or at least an undetectable level of expression. Figure 1 displays a few hypothetical array images containing only four spots. The spots represent either no, low, medium, or high levels of gene expression. The three simulated array images represent a result with too little cRNA or an under-exposure (Figure 1A), an ideal experimental result (Figure 1B), and a result with too much cRNA or an over-exposure (Figure 1C). The goal of optimizing the microarray experimental parameters is to obtain an image as close to the ideal as possible.

Figure 1: This figure displays three hypothetical microarray images with spots of varying intensities and results using various exposure times or amounts of labeled cRNA target in the hybridization.

Optimizing Labeled cRNA Target Yield:

The first critical parameter for optimal cRNA target yield is the quality of the RNA preparation. Follow all recommendations made by the RNA isolation kit manufacturer very closely. Small errors or mistakes in these protocols drastically affect the quantity and quality of the resulting total RNA. When isolating RNA from transformed cultured cells for use with the Oligo GEArray, we recommend using a spin-column based method, specifically the ArrayGrade™ Total RNA Isolation Kit (GA-013). For RNA isolation from tissues, we recommend using an extraction-based method first (specifically TRIzol from Invitrogen) followed by a clean up step with the above spin-column based method. Also, perform all RNA quality control checks recommended by the microarray labeling method before proceeding with cRNA target synthesis. For example, determine the A260/A280 and A260/A230 ratios to insure a lack of potential contamination by proteins or salts, respectively. Also, characterize the total RNA preparation by agarose gel electrophoresis to insure its integrity. Ethidium-bromide staining should reveal two sharp 28S and 18S ribosomal RNA (rRNA) bands with roughly a 2:1 intensity ratio. Analysis by the Agilent 2100 BioAnalyzer may also substitute for analysis by agarose gel.

First time users in particular should use as much RNA in the cRNA target synthesis as possible while remaining within the specifications set forth in the microarray user manual. For most applications where ample RNA is available (such as cultured cells), use the maximum amount of RNA recommended for the Oligo GEArray, 3 µg. The lower end of the RNA range is meant for applications where large quantities of RNA are difficult to isolate (such as small samples of difficult animal tissues or patient biopsies), and also require more optimization (to be discussed elsewhere), such as the length of the synthesis reaction (from 4 up to 18 hours). Under the ideal circumstances, a four-hour incubation should be sufficient to obtain adequate yields of cRNA (10 to 30 µg) from larger amounts of RNA (3 µg) starting material. Applications using smaller amounts of input RNA (as little as 500 ng) should extend the reaction up to 18 hours (overnight) for yields of up to 8 µg.

Throughout the labeled cRNA target synthesis procedure, use certified RNase-free reagents and lab ware particularly pipette tips and tubes. Prepare dilutions to determine cRNA yield by A260 in RNase-free TE buffer pH 8.0 and not water. The dilution needs to be well buffered because the UV absorbance of nucleic acid changes dramatically with pH. Also, perform all recommended quality control checks of the labeled cRNA target listed in the use manual before proceeding with microarray hybridization including agarose gel characterization and determination of the labeling efficiency.

Determining How Much To Use cRNA During Hybridization:
Low-Density Tetra-Spotted versus High-Density Single-Spotted Oligo GEArrays

The amount of cRNA or target used in a hybridization experiment is one of the most critical factors determining the overall signal level on array image. The more cRNA used, the more intense the signal on the array image. When an insufficient amount of target is used in the hybridization, the overall signal will be low and false negatives will be more likely (Figure 1A). When too much cRNA is used, it may cause a high background and more signal saturation (Figure 1C). However, an optimized amount of cRNA target can yield an ideal image (Figure 1B).

The Oligo GEArrays are available in different densities and printing layouts. The majority of the Oligo GEArrays contain approximately 100 genes (catalog numbers 001 though 400). In this type of array, each gene is represented by four repeated dots arranged in a unique square shape called a tetra-spot. SABiosciences also offers Oligo GEArrays (catalog numbers 400 through 999) that contain up to 440 genes per array. In this type of array, a single spot represents each gene, and the distances between these spots are much smaller than between the tetra-spots. These differences dictate different considerations in the amount of cRNA used for hybridization and subsequent image acquisition. A previous newsletter article (Volume 1, Issue 5, Article 2) also discusses this concept.

In an experiment to further demonstrate these differences as well as the importance of using the correct amount of cRNA, we have generated, for testing purposes only, a single spot version of the normally tetra-spotted Oligo GEArray Mouse Cell Cycle Microarray (OMM-020). Increasing amounts of biotin-labeled cRNA target prepared from a universal source of reference RNA was hybridized to either the tetra-spot or the single spot versions of OMM-020. Figure 2 shows the array images acquired using the same exposure time, and Table 1 summarizes the raw data obtained via densitometry of the array images. The two different versions of the same microarray display different responses to the increasing amounts of cRNA.

Figure 2: The same microarray spotted in two different formats responds differently to increasing amounts of hybridized cRNA. A single spot version of the normally tetra-spotted Oligo GEArray® Mouse Cell Cycle Microarray (OMM-020) was generated. Labeled cRNA target was prepared from 3 μg of XpressRef™ Mouse Universal Reference Total RNA (GA-005) using the TrueLabeling-AMP Kit (GA-010). Increasing amounts of biotin-labeled cRNA target (from 1 to 15 μg) was hybridized to both versions of the microarray. The image of each microarray is displayed. (The actual size of the single spot version of OMM-020 is about one-quarter that of the standard OMM-020. The former array images were enlarged to match the larger tetra-spotted OMM-020). Table 1 summarizes the densitometry results from these arrays.

Table 1: Summary of Spot Intensity Data from OMM-020 Microarray Images:

Increasing the amount of cRNA in the hybridization solution generally increases the signal intensities on both arrays formats. The background and minimum intensity values increase at roughly the same rate on both arrays with increasing amounts of cRNA hybridized. However, even the microarrays using the lowest amount of cRNA contain saturated spots as evidenced by the observation of roughly the same maximum intensity on all microarrays. To a certain extent therefore, increasing amounts of cRNA raise the background level ever closer to the saturation level and thereby reduce the linear dynamic range of the method, making the results look less like Figure 1B and more like Figure 1C.

Using the same amount of cRNA target, the mean and total intensities on the single-spotted microarray are greater than those of the corresponding tetra-spot arrays. These intensities reach saturated levels when roughly 5 µg of cRNA is used in the hybridization with the single-spotted microarray. However for the tetra-spot array, the mean and total signal intensities continue to increase even when 15 μg of cRNA is used in the hybridization. This result indicates that the single-spotted microarrays more easily saturate with lower levels of cRNA than the tetra-spotted microarrays. These results also show a loss of positive calls at higher levels of cRNA on the single-spotted microarray but an increase to a steady number of positive calls even at the highest level of cRNA on the tetra-spotted microarray. Reciprocal changes occur in the absent calls. An increasing background and an increasing incidence of spots that "bleed" into adjacent areas on the single-spotted microarray most likely causes this phenomenon.

Therefore, to avoid turning ideal Oligo GEArray experiments (Figure 1B) into over-exposed images (Figure 1C), a lower amount of cRNA (2-5 µg) is recommended for the higher-density, single-spotted microarrays (catalog numbers 000-400), and a higher amount of cRNA (5-20 µg) is recommended for the lower-density, tetra-spotted microarrays (catalog numbers 401-999).

Each array image in this experiment has been acquired with the same exposure time. As discussed below, exposure time is another control point where adjustments must be made to optimize the array image and results. For example, an image of the single-spotted microarray hybridized with 15 µg of cRNA re-acquired using a shorter exposure time may not be as over-exposed and yield fewer saturated signals. However, microarray experiments are best compared if the data is acquired under the same conditions, including exposure time. Therefore, a balance is usually struck between the cRNA amount used in hybridization and the exposure time.

Determining The Optimal Exposure Time:

Exposure time to the capture medium, whether X-ray film or a CCD camera, is another critical factor defined by the researcher determining the overall signal level on array image. Longer exposure times generate more intense signals on the array image. When the exposure time is too short, the overall signal will be low and false negatives will be more likely (Figure 1A). When the exposure time is too long, the background will be higher and more signals will reach saturation (Figure 1C). Both situations also narrow the available linear dynamic range of the detection medium and the microarray experiment. However, an optimized exposure time can yield the best image possible (Figure 1B).

In Figure 1B, all positive spots are significantly darker than the blank spot and significantly lighter than the most intense spot yielding an obvious intensity difference between all of these positive spots. The four spots span the complete distance between the background and the maximum signal level that the capture medium permits. Therefore, this array image utilizes the full dynamic range available. The under-exposed image (Figure 1A) does not utilize the upper part of the capture medium's dynamic range. All of the positive spots are packed in a narrow range close to the background level, and the low expression spot is not significantly darker than the background causing more false negative signals. In the over-exposed image (Figure 1C), all of the spots are much darker. However, the increased background level shrinks the dynamic range between the low and high intensity spots. The high background also masks the genes expressed at a lower level thereby, much like short exposure times, increasing the number of false negatives. Over-exposure also makes medium expression spots much darker so that they cannot be differentiated from high expression spots.

A good array image should have a minimum background and fully utilize the linear dynamic range offered by the recording medium. A good array image with good intensity distribution can only be obtained by trial and error. A good practice from photography, called bracketing, can be used. Generate a series of exposures with both longer and shorter than the estimated proper exposure time to help guarantee that one of the exposures should be optimal. Therefore when a researcher starts using a microarray for the first time or a new array experimentation condition is used for the first time (e.g. new sample, new labeling or processing method), appropriate testing should be conducted to determine optimal exposure conditions.

The principles discussed here apply to most image acquisition devices, particularly X-ray film or cooled CCD camera imager. Multiple exposure times can be generated in succession without affecting the results. Similar precautions should be observed when setting the laser power or PMT setting of laser fluorescent scanners. Those settings also influence the background and signal saturation of the fluorescent detection method. However, these scans must be obtained correctly the first time due to the photo bleaching caused by the scan itself. It may prove helpful in fluorescent detection to generate a set of replicate test microarray experiments to be used for optimizing detection before real experiments are performed with more precious samples.

Remember that image acquisition and the amount of cRNA used during hybridization work hand-in-hand to define the overall intensity and dynamic range of the array image and results. If the hybridization contains too little labeled cRNA target, it may be difficult to obtain a long enough exposure time for a good array image. Similarly, if the hybridization contains too much labeled cRNA target, it may prove difficult to obtain a short enough exposure time for a good array image.

Normalizing Microarray Results:

Normalizing microarray results is the mathematical operation of dividing background-corrected intensity values by a common factor that has a unique value for each microarray. This factor must have the ability to respond to systematic variation or error in the data from one array to the next. The division corrects the data for that systematic variation or error so that results for each gene can be compared across arrays and therefore across biological experimental conditions.

Microarray researchers use a few different methods for normalization. The most commonly used and recognized method corrects the data based on the relative expression level of an appropriate housekeeping gene. These genes are usually expressed well enough for the microarray to detect and maintain a constant expression despite the occurrence of other biological changes. However, not every housekeeping gene is suitable for every experimental study. Some of these genes may change their expression level during certain biological changes or responses. For example, the expression of ACTB increases throughout development, and GAPD may be a better choice for such studies. Conversely, in certain nutrition models, the level of GAPD changes with nutrient status, and ACTB may be a better choice.

For this very reason, the Oligo GEArrays include several (from 4 to 9) different housekeeping genes so that the researcher may choose the appropriate one for each experiment or study. The chosen housekeeping gene must not change its raw expression level (spot intensity) more than the array-to-array reproducibility of the microarray method (10 percent in the case of the Oligo GEArray). If it changes by more than this amount, its expression may very well be regulated by the experimental conditions under study. Also, the expression level of the chosen gene must not lie too close to the saturation or background level of the microarray. Only if the gene has a moderate level of expression can the microarray reliably detect whether the expression is actually changing or not.

Other factors used for normalizing microarray results, particularly when all of the available housekeeping genes fail to meet both of the above criteria, include the median and interquartile values of the microarray. The median value is the simple statistical parameter of the median of all the intensity values on the microarray. (For example, see Table 1.) This value changes with the systematic variation on the array. Plots of intensity values in rank order for several different arrays results clearly demonstrate this notion. The resulting sigmoid-shaped curve, which looks much like a traditional hybridization "COT" curve, and the median intensity value shifts in concert up or down in this plot from array to array. (Data not shown.)

The interquartile is also a median value, except the calculation only uses the middle 50 percent of all of the intensity values on the array. It ignores one-quarter of the least intense signals and one-quarter of the most intense signals. This method is particularly useful for low-density microarrays such as the Oligo GEArrays. If the microarray results contain a disproportionate number of low intensity (genes expressed at a very low level) or high intensity spots (genes expressed at a very high level), results normalized to a simple median skew the gene expression levels down or up, respectively. The interquartile removes the influence of the more extreme intensity values, uses only the genes theoretically expressed in the linear dynamic range of the microarray experiment, and prevents skewed gene expression levels.

SUMMARY:

The amounts of cRNA used for hybridization and the image acquisition settings are two important experimental parameters that can affect GEArray results. The investigator needs to use enough labeled target to generate good signal intensity. However, more cRNA is not always better if signal saturation and bleeding becomes a problem. The cRNA range used in this example should only be considered as a reference or starting point for your own experiment, since different cRNA preparations may have different qualities. The quantity and quality of the initial total RNA samples also affect the quantity and quality of the labeled cRNA target. The quality of all materials generated for the microarray experiment should always be checked as described. Finally, using the proper factor for normalization also helps optimize the microarray result by correcting for systematic error between microarrays and experimental conditions.

Article: 1     2     3     4