|
Focusing on a smaller set of relevant genes
in your DNA microarray studies provides you with more information in less
time. Publicly accessible resources and pre-designed arrays are available to
help you choose the right sets of genes for your research. This article
discusses the concept and advantages of focusing your microarray or RT-PCR
experiments on a pathway-specific set of genes, the resources available to
define a pathway-specific gene list, and how one company has already grouped
genes into pathways to generate Pathway-Centric Tools and Technology™.
1. Why Define Pathway-Specific Genes:
The advancement of microarray technology greatly aids discovery-based
research by allowing investigators to obtain a snapshot of the expression of
an entire genome. Once the relevant genes are discovered, investigators
return to performing hypothesis-driven research in order to better
characterize the biological system at hand. However, due to the need for
expensive equipment and to the high volume of data generated, the
whole-genome techniques remain out of reach for many researchers. Also, many
researchers only wish to focus on one biological pathway or one gene family
and do not necessarily need to examine the entire complement of expressed
genes all at once. A new, more systematic approach to biological research in
this post-genomic era is required.
Organizing genes into pathway-specific lists provides several unique
advantages. The planning of research projects is simplified. The experiment
focuses the research only on the genes or biological pathways of interest.
The execution of research projects is quicker and more efficient. Examining
only roughly one hundred genes instead of thousands at one time yields less
data to examine that is not nearly as overwhelming to interpret. As a result,
you learn more in less time. Also, the overall cost of the experiment becomes
lower.
Low-density or focused microarray experimentation not only serves as an
alternative to high-density work but also serves as a complement.
High-density microarray experiments usually indicate that hundreds to
thousands of genes are differentially regulated by the experimental
conditions. To aid interpretation, most researchers immediately cluster these
genes into their respective biological pathways using on-line database
resources. A pathway-focused microarray provides an alternative platform to
verify that the identified pathways truly play a role in the biological
response in question. The researcher can then examine the entire pathway in a
focused and systematic fashion and in isolation from the rest of the genome.
Thus, low-density microarrays serve as an intermediate step between
high-density arrays and more tedious gene-by-gene microarray data
verification by real-time or conventional RT-PCR.
Combining the current understanding of important biological pathways with the
microarray technologies generates pathway or application focused or specific
research tools. This knowledge-based design merges the benefits of
hypothesis-driven and discovery-based research and allows researchers to
answer more specific questions in a systematic fashion. The continual
revision of the gene content based on new experimental results helps ensure
that these research tools will accelerate, simplify and improve life science
research.
2. Steps Toward Defining Pathway-Specific Genes:
Defining a pathway or application specific set of genes
involves four basic steps. Literature Surveys on the field of study identify
relevant keywords, pathways, and even individual genes. Database Searches
using those terms quickly provide an initial list of specific genes. Other
databases group these gene lists into biological pathways that contain other
genes potentially important to the original research topic. Finally, Expert
Review and User Feedback fine-tunes and optimizes the pathway or
application specific gene list.
Literature Surveys
Comprehensive literature searches on the field of research
help define a list of keywords and other related terms important to the
topic. The search engine known as PubMed,
developed by the National Center for Biotechnology Information (NCBI),
returns lists of scientific publications containing user-defined keywords and
authors. The titles and abstracts of these papers contain other keywords,
biological pathways, and gene families relevant to the field of research
providing other search parameters for PubMed again or other on-line databases
and resources. This process may also reveal a few relevant individual genes.
Collect all of the keywords, pathways, and genes found through the literature
survey.
Database Searches
The Online Mendelian Inheritance in Man™(OMIM™)
resource, a well-curated database of information for individual genes,
returns lists of genes whose entries contain user-defined search terms or
individual genes such as those defined by the literature search. Each entry
in this database represents one gene and contains a brief description as well
as information about cloning, gene function, nomenclature, biochemical
features, gene structure, mapping, molecular genetics, animal models, allelic
variants, and references. Compile the list of genes found from searches of
both OMIM and PubMed using all of the identified keywords.
Lists of genes involved in a subset of biological pathways
or gene families may also be found by browsing corporate web sites such as Biocarta™,
SABiosciences, and others. Biocarta™ displays illustrations of
their featured pathways indicating the relationships between the relevant
gene products. SABiosciences offers DNA microarrays designed for
specific biological pathways. Each array includes a gene list subdivided into
convenient functional gene groupings. The company also offers RT-PCR kits
designed for specific gene families or biological pathways. The individual
product pages provide pre-designed and customizable genes lists for each
available kit. If these web sites contain the same pathways discovered in the
literature or database searches, add the genes to the growing list of focused
genes.
Other publicly accessible search engines and databases
accept individual genes or groups of genes and define all of the pathways to
which they belong. The National Cancer Institute (NCI)
maintains and provides the most comprehensive source of this information, the
Cancer Genome Anatomy Project (CGAP).
Their web site contains a number of interconnected bioinformatic modules
designed for cancer researchers including one used to search "Pathways".
Upon the submission of a list of genes, this search engine returns a table
annotating each gene with terms that correspond to well-defined biological
pathways as defined by other databases including the Gene Ontology (GO)
Consortium, the KEGG PATHWAY
Database, and the previously mentioned resource developed by Biocarta™
The Gene Ontology Consortium consolidates the work of many
collaborating researchers who strive to develop a consistent and universal
set of terms for the description of gene function independent on the genes'
species of origin. The KEGG PATHWAY Database consolidates the most up-to-date
information regarding gene and protein interactions found in regulatory and
metabolic pathways as well as known multi-protein complexes. The annotation
of the pathways in these databases is not necessarily intuitive; therefore,
using CGAP as an entry point to these databases proves very helpful.
Searching the GO or KEGG databases with the terms returned by CGAP provides a
list of even more genes associated with these pathways and the application of
interest.
Other resources, such as HUGO
and PROSITE, focus on gene
families and provide comprehensive lists of genes belonging to those
families. The HUGO Gene Nomenclature Committee maintains the HGNC Gene
Grouping/Family Nomenclature, a simple alphabetical list of gene families and
groupings with hyperlinks to their respective lists of genes. A quick browse
of this web page very quickly reveals whether the resource includes any of
the interesting pathways. The PROSITE web site groups genes into families
based on conserved sequence homology across species in domains of the protein
product. The search engine driving this database accommodates imperfect
matches making the identification of the pathways of interest as well as
other potentially related pathways very easy. Add the resulting lists of
genes from these resources to the growing specific gene list, or feed them
back into the CGAP database to obtain even more pathways and genes.
Expert Review & User Feedback
Even with the best-organized and eminently searchable databases, nothing
beats the human mind and eye when performing a final review of these
pathway-specific gene lists. The designer, a colleague, or another expert in
the relevant field of research should also double-check the gene list for
duplications, inconsistencies, and omissions. The iterative process of
submitting gene lists from one resource to another resource for more genes
introduces redundancies. The on-line resources contain a large amount of
information that the curators update routinely but perhaps not often enough
to keep up-to-date with the literature. An expert in the field who is more
aware of the latest developments could add or remove genes as necessary. In
case reductions need to be made for technical reasons, priorities should be
defined based on the interests of the original designer. Finally, custom
microarray manufacturers can generate pathway- or application-specific
microarrays using the gene list submitted by the designer.
It should be noted that this article and many of the resources discussed
within focus on the more commonly used human and mouse model systems. Despite
the availability of complete genome sequence information for other model
organisms (such as rat, flies and worms), the genes have not yet been as well
curated into pathways as the human or mouse genes. The available resources,
such as FlyBase and WormBase, for these other organisms rely entirely on
submissions made to the GO database. Other ongoing efforts are attempting to
define pathways in other organisms by identifying conserved protein-protein
interactions.
3. Pre-Designed Array Products:
SABiosciences already performs all of these steps
with the release of each Pathway-Centric™
GEArray™ DNA Microarray or MultiGene-12™
RT-PCR Profiling Kit. The company constantly repeats this same process of
literature surveys, database searches, expert review, and user feedback to
generate each product's comprehensive lists of genes. The current collection
of over 100 microarrays representing genes from human, mouse, and rat covers
many if not most research applications. These pre-designed
application-specific arrays save researchers the time and effort in designing
their own specific set of genes. However, the availability of SABiosciences'
Custom Focused DNA Microarray Service allows researchers to obtain
microarrays containing their previously defined list of genes or a
modification of an existing gene list from a cataloged product. The RT-PCR
profiling kits group smaller sets of genes, such as families, together
accommodating even more focused research questions. Investigators also have
the option of modifying the cataloged kits with other genes from the same
gene family or biological pathway to order a kit that better suits their
needs. With these products and services, researchers can immediately reap the
time and cost saving benefits of performing focused or application-specific
gene expression profiling analyses.
Summary of Available Resources Discussed in the Article:
US Government Organizations: National
Center for Biotechnology Information (NCBI) contains multiple search engines
including:
PubMed: Searches for scientific papers by keyword Online Mendelian Inheritance in Man™(OMIM™): Complete,
well-curated, and searchable summary of literature for many human genes. Cancer
Anatomy Genome Project (CGAP): Annotates gene lists to GO, KEGG, and Biocarta
terms
Private Organizations:
Gene Ontology (GO) Consortium: Well-defined biological pathways
Kyoto Encyclopedia of Genes and Genomes (KEGG): Well-defined biological
pathways
HUGO Gene Nomenclature Committee: Well-defined gene families and groupings
PROSITE: Gene families defined by domain with cross-species sequence homology
Corporations: Biocarta:
Illustrations of biological pathways
SABiosciences: Pathway-Centric Tools and Technology™ Microarrays
and RT-PCR kits with pre-designed gene lists
Related Products from SABiosciences:
GEArray™ DNA Microarrays
-
Pathway-Focused Design:
Contains 100-500 well-characterized genes belonging to relevant biological
pathways.
The design reflects the most current understanding of pathways and gene
families.
-
Robust Performance:
Employs the most sensitive proprietary labeling method.
Produces the most reproducible results.
-
Easy to Use & Cost-Effective:
No special equipment required.
Can be used in any laboratory setting.
Priced from $50 per array (US List Price).
MultiGene-12™ RT-PCR Profiling Kits
Profile the expression of eleven closely related genes
in a single analysis. Use a pre-designed gene list or customize one of our
designs. Kits are designed to profile genes: in the same family, in or
responsive to the same biological pathway, useful as disease biomarkers,
with multiple splice variants. Employs a simple assay format with
pre-dispensed primer sets and ready to use PCR master mix. Just mix
template with master mix and transfer to tubes in same strip already
containing primers. Transfer completed reactions directly to agarose gel
wells. Process 8 experiments simultaneously. The pre-validated assay
utilizes primers sets designed with a proprietary, experimentally verified
computer algorithm that yield single PCR products of the predicted size.
|