Bioinformatics, Computational
Biology and Related Fields
William S, Barnes, Ph.D.
Clarion University of Pennsylvania
Three short definitions:
“Bioinformatics is conceptualizing biology in terms of molecules
(in the sense of Physical Chemistry) and then applying informatics techniques
(derived from disciplnes such as Applied Math, Computer Science and Staitstics)
to understand and organize the information associated with these molecules
on a large scale.”
- Mark Gerstein, Yale University, 1999.
"Bioinformatics is the application of computer technology to the management
and analysis of biological data. The result is that computers are being
used to gather, store, analyse and merge biological data. Bioinformatics
is an interdisciplinary research area that is the interface between the
biological and computational sciences. The ultimate goal of bioinformatics
is to uncover the wealth of biological information hidden in the mass of
data and obtain a clearer insight into the fundamental biology of organisms."
- European Bioinformatics Institute (EBI)
"The mathematical, statistical and computing methods that aim to solve
biological problems using DNA and amino acid sequences and related information."
- Fredj Tekaia, Institut Pasteur
|
A longer discussion.
Information in this section
is largely exerpted and abridged from:
"Bioinformatics
Frequently Asked Questions." by Damian Counsell.
Roughly, bioinformatics describes any use of computers to handle biological
information. In practice, the definition used by most people is narrower;
bioinformatics to them is a synonym for "computational molecular biology"
- the use of computers to characterize the molecular components of living
things.
"Classical" bioinformatics
Most biologists talk about "doing bioinformatics" when they use computers
to store, compare, retrieve, analyze or predict the composition or the
structure of biomolecules. As computers become more powerful you could
probably add simulate to this list of bioinformatics verbs.
It is a mathematically interesting property of most large biological
molecules (macromolecules such as DNA, RNA or protein) that they are polymers
composed of monomers (the five nitrogenous bases or the twenty amino acids).
Because of their length and complexity, macromolecules can have exquisitely
specific informational content and/or chemical properties. The concern
of "classical" bioinformatics, is to understand this information, most
often with the computational techniques of sequence analysis.
"New" bioinformatics
The greatest achievement of bioinformatics methods, the Human Genome
Project, is currently being completed. Because of this the nature and priorities
of bioinformatics research and applications are changing. People often
talk portentously of our living in the " post-genomic" era. My personal
view is that this will affect bioinformatics in several ways:
-
Now we possess multiple whole genomes we can look for differences and similarities
between all the genes of multiple species. From such studies we can draw
particular conclusions about species and general ones about evolution.
This kind of science is often referred to as comparative genomics.
-
With the complete sequencing of many genomes, functional genomics
elucidation of gene functions and associations has become a major
project of biology. New technologies to measure gene expression at different
stages in development or disease or in different tissues require heavy
computational and statistical analysis.
-
There will be a general shift in emphasis toward proteomics
with attempts to catalogue the activities and characterize interactions
between all gene .
-
An mportant sub-discipline of proteomics is structural biology
which elucidates the structure and function of proteins and their ligands.
Mathematical, statistical and computational approaches are integral and
essential tools.
There are other "-omics" and "-informatics"
fields which are related to Bioinformatics or even overlap with it:
Medical Informatics.
Biomedical Informatics is an emerging discipline that has been defined
as the study, invention, and implementation of structures and algorithms
to improve communication, understanding and management of medical information.
Unlike Bioinformatics, Medical informatics seems to be more concerned with
structures and algorithms for the manipulation of medical data, rather
than with the data itself. Medical informatics, for practical reasons,
is more likely to deal with data obtained at "grosser" biological levels---that
is information from super-cellular systems, right up to the population
level---while most bioinformatics is concerned with information about cellular
and biomolecular structures and systems.
Computational Biology.
Computational biology is not so much a "field", as an "approach" involving
the use of computers to study biological processes; hence it is an area
as diverse as biology itself. Whereas Bioinformatics has more to do with
management and the subsequent use of biological information, particularly
genetic information, Computational biology implies modelling and simulation
of dynamic biological systems using mathematical and computational approaches.
Evolutionary, population and theoretical biology has traditionally been
a strong emphasis among Computational Biologists. However “Molecular
Systems Biology”, in which dynamic processes at the level of the
cell or organelle are modelled, is a new field attracting great excitement.
Mathematical Biology.
Mathematical biology is easier to distinguish from bioinformatics than
computational biology. Mathematical biology also tackles biological problems,
but the methods it uses to tackle them need not be numerical and need not
be implemented in software or hardware. Indeed, such methods need not "solve"
anything; in mathematical biology it would be considered reasonable to
publish a result which merely explains how a biological problem can be
quantified and understood. Mathematical Biology "...includes things of
theoretical interest which are not necessarily algorithmic, not necessarily
molecular in nature, and are not necessarily useful in analyzing collected
data."
Pharmacogenomics.
Pharmacogenomics is the application of genomic approaches and technologies
to the identification of drug targets. Examples include trawling entire
genomes for potential receptors by bioinformatics means, or by investigating
patterns of gene expression in both pathogens and hosts during infection,
or by examining the characteristic expression patterns found in tumours
or patients samples for diagnostic purposes (possibly in the pursuit of
potential cancer therapy targets).
Pharmacogenetics is a subset of pharmacogenomics. All
individuals respond differently to drug treatments; some positively, others
with little obvious change in their conditions and yet others with side
effects or allergic reactions. Much of this variation is known to have
a genetic basis. Pharmacogenetics uses genomic/bioinformatic methods to
identify genomic correlates, for example SNPs (Single Nucleotide Polymorphisms),
characteristic of particular patient response profiles and use those markers
to inform the administration and development of therapies. Strikingly,
such approaches have been used to "resurrect" drugs thought previously
to be ineffective, but subsequently found to work with in subset of patients.
They can also be used for optimizing the doses of chemotherapy for particular
patients.
BACK TO Home
Page.
|