Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
BY
SAKSHI PAHUJANI
SAKTI RANJAN ROUT
ASHUTOSH KHANDUAL
In computational biology, gene prediction or gene finding refers to the process
of identifying the regions of genomic DNA that encode genes.
GENE STRUCTURE IN PROKARYOTES
•
Gene finding programs in prokaryotes
▪
Prediction Using Discriminant Analysis. (By LDA and QDA)
● LDA works by plotting a two-dimensional graph of coding signals versus all potential 3’ splice site positions and
drawing a diagonal line that best separates coding signals from noncoding signals based on knowledge learned
from training data sets of known gene structures
FGENES (Find Genes; www.softberry.com/) is a web-based program that uses LDA to determine whether a
signal is an exon.
● QDA draws a curved line based on a quadratic function instead of drawing a straight line to separate coding
and noncoding features.
MZEF (Michael Zhang’s Exon Finder; http://argon.cshl.org/genefinder/) is a web-based program that uses QDA
for exon prediction
Two-variable example in which a quadratic function (solid line) separates the two groups (× and ○) completely, while
the best linear function (dotted line) misclassifies an appreciable number of points.