Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Silvio Tosatto
Aula 90, 5° est, Vallisneri
Email: silvio.tosatto@unipd.it,
Tel: 049-827-6269
Structural Bioinformatics
A.Y. 2017/2018
BioComputing UP,
Dipartimento di Scienze Biomediche,
Università di Padova
URL: http://protein.bio.unipd.it/
Bioinformatics
What is it?
>IGF1R_HUMAN
MKSGSGGGSPTSLWGLLFLSAALSLWPTSGEICGPGIDIRNDYQQLKRLENCTVIEGYLH
ILLISKAEDYRSYRFPKLTVITEYLLLFRVAGLESLGDLFPNLTVIRGWKLFYNYALVIF
EMTNLKDIGLYNLRNITRGAIRIEKNADLCYLSTVDWSLILDAVSNNYIVGNKPPKECGD
LCPGTMEEKPMCEKTTINNEYNYRCWTTNRCQKMCPSTCGKRACTENNECCHPECLGSCS
APDNDTACVACRHYYYAGVCVPACPPNTYRFEGWRCVDRDFCANILSAESSDSEGFVIHD
GECMQECPSGFIRNGSQSMYCIPCEGPCPKVCEEEKKTKTIDSVTSAQMLQGCTIFKGNL
LINIRRGNNIASELENFMGLIEVVTGYVKIRHSHALVSLSFLKNLRLILGEEQLEGNYSF
YVLDNQNLQQLWDWDHRNLTIKAGKMYFAFNPKLCVSEIYRMEEVTGTKGRQSKGDINTR
NNGERASCESDVLHFTSTTTSKNRIIITWHRYRPPDYRDLISFTVYYKEAPFKNVTEYDG
QDACGSNSWNMVDVDLPPNKDVEPGILLHGLKPWTQYAVYVKAVTLTMVENDHIRGAKSE
ILYIRTNASVPSIPLDVLSASNSSSQLIVKWNPPSLPNGNLSYYIVRWQRQPQDGYLYRH
NYCSKDKIPIRKYADGTIDIEEVTENPKTEVCGGEKGPCCACPKTEAEKQAEKEEAEYRK
VFENFLHNSIFVPRPERKRRDVMQVANTTMSSRSRNTTAADTYNITDPEELETEYPFFES
RVDNKERTVISNLRPFTLYRIDIHSCNHEAEKLGCSASNFVFARTMPAEGADDIPGPVTW
EPRPENSIFLKWPEPENPNGLILMYEIKYGSQVEDQRECVSRQEYRKYGGAKLNRLNPGN
YTARIQATSLSGNGSWTDPVFFYVQAKTGYENFIHLIIALPVAVLLIVGGLVIMLYVFHR
KRNNSRLGNGVLYASVNPEYFSAADVYVPDEWEVAREKITMSRELGQGSFGMVYEGVAKG
VVKDEPETRVAIKTVNEAASMRERIEFLNEASVMKEFNCHHVVRLLGVVSQGQPTLVIME
LMTRGDLKSYLRSLRPEMENNPVLAPPSLSKMIQMAGEIADGMAYLNANKFVHRDLAARN
CMVAEDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMSPESLKDGVFTTYSDVWSFGV
VLWEIATLAEQPYQGLSNEQVLRFVMEGGLLDKPDNCPDMLFELMRMCWQYNPKMRPSFL
EIISSIKEEMEPGFREVSFYYSEENKLPEPEELDLEPENMESVPLDPSASSSSLPLPDRH
SGHKAENGPGPGVLVLRASFDERQPYAHMNGGRKNERALPLPQSSTC
IGFR1 structure
>IGF1R_HUMAN
MKSGSGGGSPTSLWGLLFLSAALSLWPTSGEICGPGIDIRNDYQQLKRLENCTVIEGYLH
ILLISKAEDYRSYRFPKLTVITEYLLLFRVAGLESLGDLFPNLTVIRGWKLFYNYALVIF
EMTNLKDIGLYNLRNITRGAIRIEKNADLCYLSTVDWSLILDAVSNNYIVGNKPPKECGD
LCPGTMEEKPMCEKTTINNEYNYRCWTTNRCQKMCPSTCGKRACTENNECCHPECLGSCS
APDNDTACVACRHYYYAGVCVPACPPNTYRFEGWRCVDRDFCANILSAESSDSEGFVIHD
GECMQECPSGFIRNGSQSMYCIPCEGPCPKVCEEEKKTKTIDSVTSAQMLQGCTIFKGNL
LINIRRGNNIASELENFMGLIEVVTGYVKIRHSHALVSLSFLKNLRLILGEEQLEGNYSF
YVLDNQNLQQLWDWDHRNLTIKAGKMYFAFNPKLCVSEIYRMEEVTGTKGRQSKGDINTR
NNGERASCESDVLHFTSTTTSKNRIIITWHRYRPPDYRDLISFTVYYKEAPFKNVTEYDG
QDACGSNSWNMVDVDLPPNKDVEPGILLHGLKPWTQYAVYVKAVTLTMVENDHIRGAKSE
ILYIRTNASVPSIPLDVLSASNSSSQLIVKWNPPSLPNGNLSYYIVRWQRQPQDGYLYRH
NYCSKDKIPIRKYADGTIDIEEVTENPKTEVCGGEKGPCCACPKTEAEKQAEKEEAEYRK
VFENFLHNSIFVPRPERKRRDVMQVANTTMSSRSRNTTAADTYNITDPEELETEYPFFES
RVDNKERTVISNLRPFTLYRIDIHSCNHEAEKLGCSASNFVFARTMPAEGADDIPGPVTW
EPRPENSIFLKWPEPENPNGLILMYEIKYGSQVEDQRECVSRQEYRKYGGAKLNRLNPGN
YTARIQATSLSGNGSWTDPVFFYVQAKTGYENFIHLIIALPVAVLLIVGGLVIMLYVFHR
KRNNSRLGNGVLYASVNPEYFSAADVYVPDEWEVAREKITMSRELGQGSFGMVYEGVAKG
VVKDEPETRVAIKTVNEAASMRERIEFLNEASVMKEFNCHHVVRLLGVVSQGQPTLVIME
LMTRGDLKSYLRSLRPEMENNPVLAPPSLSKMIQMAGEIADGMAYLNANKFVHRDLAARN
CMVAEDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMSPESLKDGVFTTYSDVWSFGV
VLWEIATLAEQPYQGLSNEQVLRFVMEGGLLDKPDNCPDMLFELMRMCWQYNPKMRPSFL
EIISSIKEEMEPGFREVSFYYSEENKLPEPEELDLEPENMESVPLDPSASSSSLPLPDRH
SGHKAENGPGPGVLVLRASFDERQPYAHMNGGRKNERALPLPQSSTC
IGFR1 structure
MERPEPELIRQSWRAVSRSPLEHGTV
LFARLFALEPDLLPLFQYNCRQFSSP
EDCLSSPEFLDHIRKVMLVIDAAVTN
VEDLSSLEEYLASLGRKHRAVGVKLS
SFSTVGESLLYMLEKCLGPAFTPATR
AAWSQLYGAVVQAMSRGWDGE
3D characteristics
Alignments (from structure)
1D predictions
(sequence based)
Bioinformatics to understand disease-associated aspects of
protein structure and function
Interpretation of disease-
associated sequence
variants
p.C152F
BioPython
http://biopython.org/
Useful information
All the course matherial is availble on the E-learning site:
http://elearning.unipd.it/dsb/
Registering to “Bioinformatics & Computational Biology“ course of L.Bio.Mol.
• Slides (previous year).
• Lecture notes (“dispense“).
Practical sessions will take place once per week, from 14.30 to 17.30.
• 4 practicals: sequence, structure (x 2), non-globular proteins
• Online resources
• BioPython examples
“The time will come, I believe, though I shall not live to see it, when we shall
have fairly true genealogical trees of each great kingdom of Nature.”
Charles Darwin
Evolution
Molecular evolution:
What does it mean???
“The time will come, I believe, though I shall not live to see it, when we shall
have fairly true genealogical trees of each great kingdom of Nature.”
Charles Darwin
Evolution
The study of changes occurring in DNA and in its products is the
object of study of Molecular Evolution.
AAAAAAAAA
ACAAAAAAAA
AAAAAADAA
ACAAAAARAA
ADAAAADAA AAAACADAA ACAAATAAAA
AEAAAADAA
ACAAQTAAAA
AEASAADAA ACAAATAAAW
AEAAAADAW
Evolution
The study of changes occurring in DNA and in its products is the
object of study of Molecular Evolution.
Alternative methods
Example: Alternative viewpoints on proteins
• Evolutionary model
– Desciptive
– “Knowledge-based“
• Physical model
– predictive
– Optimization (“Ab initio“)
Alternative methods
• A lot of methods we will cover (e.g. pattern and Neural
Networks) are knowledge-based, meaning that they require
background knowledge on the field of study with training sets
on which build predictions on top of.
– Use of previous knowledge to interpret new case, no new knowledge
generation.
– Not able to predict situations different from the already observed ones.