4. Nucleic acids and proteins in one and more dimensions - second part
It is mandatory to put some order in such a vast wealth of structural knowledge.
Taxonomy (from Ancient Greek: τάξις taxis, "arrangement," and -νομία -nomia,
"method") is the science of defining groups of biological organisms on the basis of
shared characteristics and giving names to those groups. Organisms are grouped
together into taxa (singular: taxon) and given a taxonomic rank; groups of a given rank can be aggregated to form a super group of higher rank and thus create a
taxonomic hierarchy.
Learning from Biology
protein structure taxonomy
red: mainly α
green: mainly β
yellow: αβ
blue: low content of secondary structures
from http://www.proteinstructures.com/Structure/Structure/protein-fold.html
Have most of the different protein folds already been found?
comparing protein structure
1898 proteins representative of the most common fold
3. genome analysis
Structural genomics
Figure: DNA (sequence drawn as a binary string) → Algorithm → Protein Structure
Residue table shown in the figure:
THR 0.0 147.7 172.9
THR 107.2 -125.3 187.4
CYS 123.4 63.6 103.7
PRO 60.3 83.9 -116.7
Experimental methods shown: X-ray diffractometry, NMR, cryo-electron tomography
from structural knowledge to structural predictions
secondary structure prediction
CHOU & FASMAN
Chou, P.Y. & Fasman, G.D. (1974). Biochemistry, 13, 211-222.
# residues in window: 6
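The six-residue window can be illustrated with a minimal sketch of a helix nucleation scan. It assumes the rule as it is commonly summarized for the Chou-Fasman method: a window of six residues is a candidate helix nucleus when at least four of its residues are helix formers. The propensity values are the approximate helix propensities usually tabulated for this method; they, the threshold of 1.0, and the function name helix_nuclei are assumptions of this sketch, not values read from the slide.

```python
# Approximate helix propensities P(alpha) as commonly tabulated for the
# Chou-Fasman method (assumption: check against the original paper before use).
HELIX_PROPENSITY = {
    "E": 1.51, "M": 1.45, "A": 1.42, "L": 1.21, "K": 1.16,
    "F": 1.13, "Q": 1.11, "W": 1.08, "I": 1.08, "V": 1.06,
    "D": 1.01, "H": 1.00, "R": 0.98, "T": 0.83, "S": 0.77,
    "C": 0.70, "Y": 0.69, "N": 0.67, "P": 0.57, "G": 0.57,
}

WINDOW = 6              # residues in window, as on the slide
MIN_FORMERS = 4         # assumed nucleation rule: 4 formers out of 6
FORMER_THRESHOLD = 1.0  # assumed cutoff for calling a residue a helix former


def helix_nuclei(sequence: str, window: int = WINDOW,
                 min_formers: int = MIN_FORMERS) -> list[int]:
    """Return the start index of every window that qualifies as a helix nucleus."""
    nuclei = []
    for start in range(len(sequence) - window + 1):
        segment = sequence[start:start + window]
        # Count residues whose propensity exceeds the former threshold.
        formers = sum(
            1 for residue in segment
            if HELIX_PROPENSITY.get(residue, 0.0) > FORMER_THRESHOLD
        )
        if formers >= min_formers:
            nuclei.append(start)
    return nuclei


if __name__ == "__main__":
    print(helix_nuclei("GPGAEMLK"))
```

The full method also extends each nucleus in both directions and resolves conflicts with strand predictions; the sketch stops at nucleation.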
PSIPRED is a simple and reliable secondary structure prediction method. It incorporates two feed-forward neural networks that analyse the output obtained from PSI-BLAST (Position-Specific Iterated BLAST).
Sequence Alignment
“Two homologous sequences whisper ... a full multiple alignment shouts out loud.” (Hubbard TJ, Lesk AM, Tramontano A. Gathering them in to the fold. Nat Struct Biol. 1996 Apr;3(4):313.)
Letters corresponding to isolated matches are shown in non-bold type. The longest matching regions, shown in boldface, are the first and last names DOROTHY and HODGKIN. Shorter matching regions, such as the OTH of dorOTHy and crowfoOTHodgkin, or the RO of doROthy and cROwfoot, are noise.
Dotplot showing identities between the palindromic sequence MAX I STAY AWAY AT SIX AM and itself. The palindrome reveals itself as a stretch of matches perpendicular to the main diagonal.
Dotplot showing identities between a repetitive sequence (ABRACADABRACADABRA) and itself. The repeats appear on several subsidiary diagonals parallel to the main diagonal.
From Introduction to Bioinformatics by Arthur M. Lesk
Dotplot
The dotplot is a simple picture that gives an overview of the similarities between two sequences. Less obvious is its close relationship to alignments. The dotplot is a table or matrix. The rows correspond to the residues of one sequence and the columns to the residues of the other sequence. In its simplest form, the positions in the dotplot are left blank if the residues are different, and filled if they match. Stretches of similar residues show up as diagonals in the upper left-lower right (Northwest-Southeast) direction.
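The simplest form of the dotplot described above can be sketched in a few lines. This is a minimal illustration, not the code used to produce the figures; the function names dotplot, render and diagonal_matches are chosen here for the example.

```python
def dotplot(seq_a: str, seq_b: str) -> list[list[bool]]:
    """Return a matrix that is True where seq_a[i] matches seq_b[j]."""
    return [[a == b for b in seq_b] for a in seq_a]


def render(seq_a: str, seq_b: str) -> str:
    """Draw the dotplot as text: rows follow seq_a, columns follow seq_b."""
    lines = ["  " + seq_b]
    for residue, row in zip(seq_a, dotplot(seq_a, seq_b)):
        cells = "".join("*" if hit else " " for hit in row)
        lines.append(residue + " " + cells)
    return "\n".join(lines)


def diagonal_matches(seq_a: str, seq_b: str, offset: int) -> int:
    """Count filled cells on the diagonal where column - row == offset."""
    return sum(
        1
        for i in range(len(seq_a))
        if 0 <= i + offset < len(seq_b) and seq_a[i] == seq_b[i + offset]
    )


if __name__ == "__main__":
    repeat = "ABRACADABRACADABRA"
    print(render(repeat, repeat))
    # The main diagonal is fully filled in a self-comparison.
    print(diagonal_matches(repeat, repeat, 0))
    # The repeat unit ABRACAD has length 7, so the diagonal shifted by 7
    # is also filled along its whole length.
    print(diagonal_matches(repeat, repeat, 7))
```

Running it on the repetitive sequence from the caption shows the subsidiary diagonals parallel to the main one.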
The BLOSUM62 matrix performs well at detecting similarities between distant sequences, and it is the default matrix in many alignment applications, such as BLAST.
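To show where a substitution matrix enters an alignment, here is a minimal sketch of global alignment scoring by dynamic programming (the Needleman-Wunsch recurrence). The scoring function toy_score (+1 match, -1 mismatch) and the gap penalty of -1 are placeholders assumed for the example; a real application would look up each residue pair in a matrix such as BLOSUM62 instead.

```python
from typing import Callable


def toy_score(a: str, b: str) -> int:
    """Placeholder scoring (assumption): +1 for identity, -1 otherwise.
    A substitution matrix lookup would replace this function."""
    return 1 if a == b else -1


def global_alignment_score(seq_a: str, seq_b: str,
                           score: Callable[[str, str], int] = toy_score,
                           gap: int = -1) -> int:
    """Best global alignment score of seq_a and seq_b with a linear gap penalty."""
    rows, cols = len(seq_a) + 1, len(seq_b) + 1
    table = [[0] * cols for _ in range(rows)]
    # Aligning a prefix against nothing costs one gap per residue.
    for i in range(1, rows):
        table[i][0] = i * gap
    for j in range(1, cols):
        table[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            table[i][j] = max(
                table[i - 1][j - 1] + score(seq_a[i - 1], seq_b[j - 1]),  # pair residues
                table[i - 1][j] + gap,  # gap in seq_b
                table[i][j - 1] + gap,  # gap in seq_a
            )
    return table[-1][-1]


if __name__ == "__main__":
    print(global_alignment_score("AC", "AGC"))
```

Passing the scoring function as a parameter keeps the recurrence independent of the matrix, so different matrices can be compared on the same pair of sequences.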
Mutation probability matrix for the evolutionary distance of 250 PAMs
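In the Dayhoff model, the mutation probability matrix for a distance of n PAMs is the PAM1 matrix multiplied by itself n times. The sketch below illustrates only that extrapolation step on a toy three-letter alphabet; the matrix TOY_PAM1 and its 1% change rate are invented for the example and are not Dayhoff's values.

```python
Matrix = list[list[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Multiply two square matrices of the same size."""
    n = len(a)
    return [
        [sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


def matrix_power(m: Matrix, exponent: int) -> Matrix:
    """Raise a square matrix to a non-negative integer power."""
    size = len(m)
    result = [[1.0 if i == j else 0.0 for j in range(size)] for i in range(size)]
    for _ in range(exponent):
        result = matmul(result, m)
    return result


# Hypothetical one-step matrix: each letter stays unchanged with
# probability 0.99 and changes to either other letter with 0.005.
TOY_PAM1: Matrix = [
    [0.990, 0.005, 0.005],
    [0.005, 0.990, 0.005],
    [0.005, 0.005, 0.990],
]

if __name__ == "__main__":
    toy_pam250 = matrix_power(TOY_PAM1, 250)
    for row in toy_pam250:
        print(["%.3f" % p for p in row])
```

Each row of the result still sums to 1, and the diagonal entries shrink as the distance grows, since a residue is less likely to be unchanged after many steps.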
tertiary structure prediction
terminates with >
Amino Acid Code Meaning
A Alanine
B Aspartic acid or Asparagine
C Cysteine
D Aspartic acid
E Glutamic acid
F Phenylalanine
G Glycine
H Histidine
I Isoleucine
K Lysine
L Leucine
M Methionine
N Asparagine
O Pyrrolysine
P Proline
Q Glutamine
R Arginine
S Serine
T Threonine
U Selenocysteine
V Valine
W Tryptophan
Y Tyrosine
Z Glutamic acid or Glutamine
X any
* translation stop
- gap of indeterminate length
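The code table above maps directly onto a lookup structure. The following minimal sketch checks a sequence against the table and spells out each symbol; the function name describe is chosen here for illustration, and the dictionary contains only the codes listed in the table.

```python
AMINO_ACID_CODES = {
    "A": "Alanine", "B": "Aspartic acid or Asparagine", "C": "Cysteine",
    "D": "Aspartic acid", "E": "Glutamic acid", "F": "Phenylalanine",
    "G": "Glycine", "H": "Histidine", "I": "Isoleucine", "K": "Lysine",
    "L": "Leucine", "M": "Methionine", "N": "Asparagine", "O": "Pyrrolysine",
    "P": "Proline", "Q": "Glutamine", "R": "Arginine", "S": "Serine",
    "T": "Threonine", "U": "Selenocysteine", "V": "Valine", "W": "Tryptophan",
    "Y": "Tyrosine", "Z": "Glutamic acid or Glutamine", "X": "any",
    "*": "translation stop", "-": "gap of indeterminate length",
}


def describe(sequence: str) -> list[str]:
    """Return the meaning of each symbol, or raise if a symbol is not in the table."""
    symbols = sequence.upper()
    unknown = sorted({s for s in symbols if s not in AMINO_ACID_CODES})
    if unknown:
        raise ValueError("symbols not in the code table: " + ", ".join(unknown))
    return [AMINO_ACID_CODES[s] for s in symbols]


if __name__ == "__main__":
    print(describe("MSSP"))
```

Rejecting unknown symbols early is a design choice for the example: it catches typing errors before a sequence is passed on to an alignment or prediction tool.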
Protein folding
ab initio calculations of protein structure
MSSPQAPEDGQGCGDRGDPPGDLRSVLVTTV
ROSETTA
Fragments of 9 aa
Selects the structures of the 25 closest sequences
Optimization and assembly (knowledge-based potential)
Fragment assembly method: dividing the sequence into fragments
Rosetta Fragment Libraries
25-200 fragments for each 3 and 9 residue sequence window
Selected from a database of known structures with resolution better than 2.5 Å and less than 50% sequence identity
Ranked by sequence similarity and similarity of predicted and known secondary structure
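The first step of fragment assembly, dividing the target sequence into overlapping 3- and 9-residue windows, can be sketched as follows. This covers only the window enumeration; selecting and ranking the 25-200 candidate fragments per window from a structure database is not shown. The function name windows is hypothetical and is not part of Rosetta.

```python
def windows(sequence: str, size: int) -> list[tuple[int, str]]:
    """Return (start index, subsequence) for every overlapping window of a given size."""
    return [
        (start, sequence[start:start + size])
        for start in range(len(sequence) - size + 1)
    ]


if __name__ == "__main__":
    target = "MSSPQAPEDGQGCGDRGDPPGDLRSVLVTTV"  # sequence shown on the ROSETTA slide
    for size in (3, 9):
        found = windows(target, size)
        print(size, len(found), found[0], found[-1])
```

A sequence of length n yields n - size + 1 windows, so every residue except those near the termini is covered by several overlapping fragments.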
RNA structure prediction
Secondary structure of a telomerase RNA
Primary structure of RNA
Tertiary structure of RNA