mandatory to put some order in such a vast wealth of structural knowledge 1 4. nucleic acids and...

57
mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

Upload: javier-lancaster

Post on 01-Apr-2015

216 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

mandatory to put some order in such a vast wealth of structural

knowledge 14. Nucleic acids and proteins in one and more dimensions - second part

Page 2: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

Taxonomy (from Ancient Greek: τάξις taxis, "arrangement," and -νομία -nomia,

"method") is the science of defining groups of biological organisms on the basis of

shared characteristics and giving names to those groups. Organisms are grouped

together into taxa (singular: taxon) and given a taxonomic rank; groups of a given rank can be aggregated to form a super group of higher rank and thus create a

taxonomic hierarchy.

24. Nucleic acids and proteins in one and more dimensions - second part

Learning from Biology

Page 3: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

34. Nucleic acids and proteins in one and more dimensions - second part

Learning from Biology

Page 4: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

44. Nucleic acids and proteins in one and more dimensions - second part

protein structure taxonomy

Page 5: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

54. Nucleic acids and proteins in one and more dimensions - second part

protein structure taxonomy

Page 6: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

64. Nucleic acids and proteins in one and more dimensions - second part

protein structure taxonomy

Page 7: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

74. Nucleic acids and proteins in one and more dimensions - second part

protein structure taxonomy

Page 8: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

84. Nucleic acids and proteins in one and more dimensions - second part

protein structure taxonomy

Page 9: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

94. Nucleic acids and proteins in one and more dimensions - second part

protein structure taxonomy

Page 10: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

104. Nucleic acids and proteins in one and more dimensions - second part

protein structure taxonomy

Page 11: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

red: mainly αgreen: mainly βyellow: αβblue: low content of secondary structures

114. Nucleic acids and proteins in one and more dimensions - second part

protein structure taxonomy

Page 12: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

124. Nucleic acids and proteins in one and more dimensions - second part

protein structure taxonomy

from http://www.proteinstructures.com/Structure/Structure/protein-fold.html

most of different protein

folds have been

already found?

Page 13: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

134. Nucleic acids and proteins in one and more dimensions - second part

comparing protein structure

Page 14: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

144. Nucleic acids and proteins in one and more dimensions - second part

comparing protein structure

Page 15: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

154. Nucleic acids and proteins in one and more dimensions - second part

comparing protein structure

Page 16: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

164. Nucleic acids and proteins in one and more dimensions - second part

1898 proteins representative of the most common fold

Page 17: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

174. Nucleic acids and proteins in one and more dimensions - second part

Page 18: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

3. genome analysis

Structural genomics

0101#01001010#10111010#01010001#10010#1001#10110010#100100100101011#0

DNA

Algorithm

Residue

THR 0.0 147.7 172.9

THR 107.2 -125.3 187.4

CYS 123.4 63.6 103.7

PRO 60.3 83.9 -116.7

Protein Structure

X Ray

diff

ract

ometr

y

NMR

cryo-electron tomography

18

Page 19: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

194. Nucleic acids and proteins in one and more dimensions - second part

from structural knowledge to structural predictions

Page 20: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

secondary structure prediction

204. Nucleic acids and proteins in one and more dimensions - second part

Page 21: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

secondary structure prediction

214. Nucleic acids and proteins in one and more dimensions - second part

Page 22: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

CHOU & FASMANChou, P.Y. & Fasman, G.D. (1974). Biochemistry, 13, 211-222.

secondary structure prediction

224. Nucleic acids and proteins in one and more dimensions - second part

Page 23: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

secondary structure prediction

# residues in window: 6

234. Nucleic acids and proteins in one and more dimensions - second part

Page 24: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

secondary structure prediction

244. Nucleic acids and proteins in one and more dimensions - second part

Page 25: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

secondary structure prediction

254. Nucleic acids and proteins in one and more dimensions - second part

Page 26: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

secondary structure prediction

264. Nucleic acids and proteins in one and more dimensions - second part

Page 27: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

secondary structure prediction

274. Nucleic acids and proteins in one and more dimensions - second part

Page 28: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

secondary structure prediction

PSIPRED is a simple and reliable secondary structure prediction method, incorporating two feed-forward neural networks which perform an analysis on output obtained from PSI-BLAST (Position Specific Iterated - BLAST).

284. Nucleic acids and proteins in one and more dimensions - second part

Page 29: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

secondary structure prediction

294. Nucleic acids and proteins in one and more dimensions - second part

Page 30: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

secondary structure prediction

304. Nucleic acids and proteins in one and more dimensions - second part

Page 31: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

4. Nucleic acids and proteins in one and more dimensions - second part

31

Page 32: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

Sequence Alignment

“Two homologous sequences whisper ... a full multiple alignment shouts out loud.” in Hubbard TJ, Lesk AM, Tramontano A. Gathering them in to the fold. Nat Struct Biol. 1996 Apr;3(4):313.)

324. Nucleic acids and proteins in one and more dimensions - second part

Page 33: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

334. Nucleic acids and proteins in one and more dimensions - second part

Page 34: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

344. Nucleic acids and proteins in one and more dimensions - second part

Page 35: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

Letters corresponding to isolated matches are shown in non-bold type. The longest matching regions, shown in boldface, are the first and last names DOROTHY and HODGKIN. Shorter matching regions, such as the OTH of dorOTHy and crowfoOTHodgkin, or the RO of doROthy and cROwfoot, are noise.

Dotplot showing identities between the palindromic sequence MAX I STAY AWAY AT SIX AM and itself. The palindrome reveals itself as a stretch of matches perpendicular to the main diagonal.

Dotplot showing identities between a repetitive sequence (ABRACADABRACADABRA) and itself. The repeats appear on several subsidiary diagonals parallel to the main diagonal.

From Introduction to Bioinformaticsby Arthur M. Lesk

dotplotThe dotplot is a simple picture that gives an overview of the similarities between two sequences. Less obvious is its close relationship to alignments.The dotplot is a table or matrix. The rows correspond to the residues of one sequence and the columns to the residues of the other sequence. In its simplest form, the positions in the dotplot are left blank if the residues are different, and filled if they match. Stretches of similar residues show up as diagonals in the upper left-lower right (Northwest-Southeast) direction.

354. Nucleic acids and proteins in one and more dimensions - second part

Page 36: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

364. Nucleic acids and proteins in one and more dimensions - second part

Page 37: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

374. Nucleic acids and proteins in one and more dimensions - second part

Page 38: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

BLOSUM62 matrix does an excellent job detecting similarities in distant sequences, and this is the matrix used by default in most recent alignment applications such as BLAST

384. Nucleic acids and proteins in one and more dimensions - second part

Page 39: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

Mutation probability matrix for the evolutionary distance of 250 PAMs

394. Nucleic acids and proteins in one and more dimensions - second part

Page 40: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

404. Nucleic acids and proteins in one and more dimensions - second part

Page 41: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

414. Nucleic acids and proteins in one and more dimensions - second part

Page 42: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

tertiary structure prediction

424. Nucleic acids and proteins in one and more dimensions - second part

Page 43: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

tertiary structure prediction

434. Nucleic acids and proteins in one and more dimensions - second part

Page 44: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

tertiary structure prediction

444. Nucleic acids and proteins in one and more dimensions - second part

Page 45: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

teminates with >

Amino Acid Code Meaning

A Alanine

B Aspartic acid or Asparagine

C CysteineD Aspartic acidE Glutamic acidF PhenylalanineG GlycineH HistidineI IsoleucineK LysineL LeucineM MethionineN AsparagineO PyrrolysineP ProlineQ GlutamineR ArginineS SerineT ThreonineU SelenocysteineV ValineW TryptophanY Tyrosine

Z Glutamic acid or Glutamine

X any* translation stop

- gap of indeterminate length

454. Nucleic acids and proteins in one and more dimensions - second part

Page 46: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

464. Nucleic acids and proteins in one and more dimensions - second part

Page 47: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

474. Nucleic acids and proteins in one and more dimensions - second part

Page 48: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

tertiary structure prediction

484. Nucleic acids and proteins in one and more dimensions - second part

Page 49: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

tertiary structure prediction

494. Nucleic acids and proteins in one and more dimensions - second part

Page 50: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

tertiary structure prediction

504. Nucleic acids and proteins in one and more dimensions - second part

Page 51: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

Protein folding

ab initio calculations of protein structure

514. Nucleic acids and proteins in one and more dimensions - second part

Page 52: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

MSSPQAPEDGQGCGDRGDPPGDLRSVLVTTV

ROSETTAFrammenti di 9 aa

Sceglie le strutture delle 25 sequenze più vicine

Ottimizzazione e Assemblaggio(Knowledge-based potential)

Metodo Assemblaggio di frammenti: Dividendo la sequenza in frammenti

524. Nucleic acids and proteins in one and more dimensions - second part

Page 53: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

Rosetta Fragment Libraries

25-200 fragments for each 3 and 9 residue sequence window

Selected from database of known structures > 2.5Å resolution < 50% sequence identity

Ranked by sequence similarity and similarity of predicted and known secondary structure

534. Nucleic acids and proteins in one and more dimensions - second part

Page 54: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

544. Nucleic acids and proteins in one and more dimensions - second part

Page 55: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

554. Nucleic acids and proteins in one and more dimensions - second part

RNA structure prediction

Page 56: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

564. Nucleic acids and proteins in one and more dimensions - second part

RNA structure prediction

Secondary structure of a telomerase RNAPrimary structure of RNA

Tertiary structure of RNA

Page 57: Mandatory to put some order in such a vast wealth of structural knowledge 1 4. Nucleic acids and proteins in one and more dimensions - second part

574. Nucleic acids and proteins in one and more dimensions - second part

RNA structure prediction