copyright (c) 2004 by limsoon wong assessing reliability of protein-protein interaction experiments...

54
Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Post on 15-Jan-2016

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Cop

yrig

ht (

c) 2

004

by L

imso

on W

ong

Assessing Reliability

of Protein-Protein Interaction

Experiments

Limsoon WongInstitute for Infocomm

Research

Page 2: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Outline

• Reliability of experimental protein-protein interaction data

• A couple of existing reliability indices– interaction generality– interaction generality 2

• A new reliability index– interaction pathway reliability

• Further refinements– interaction reliability by belief inference– false negative detection

Page 3: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Cop

yrig

ht (

c) 2

004

by L

imso

on W

ong

How Reliable are Experimental Protein-Protein Interaction Data?

Figure credit: Jeong et al. 2001

Page 4: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

• Complete genomes are now available

• Knowing the genes is not enough to understand how biology functions

• Proteins, not genes, are responsible for many cellular activities

• Proteins function by interacting w/ other proteins and biomolecules

GENOME PROTEOME

“INTERACTOME”

Why Protein Interactions?

Copyright (c) 2004 by Limsoon Wong

Page 5: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

FACT: Generating large amounts of

experimental data about protein-protein

interactions can be done with ease.

FACT: Generating large amounts of

experimental data about protein-protein

interactions can be done with ease.

High-Tech Expt PPI Detection Methods• Yeast two-hybrid assays• Mass spec of purified complexes (e.g.,

TAP)• Correlated mRNA expression• Genetic interactions (e.g., synthetic

lethality)• ...

Page 6: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

High-throughput approach sacrifice quality for quantity:

(a) limited or biased coverage: false negatives, &

(b) high error rates : false positives

Key Bottleneck

• Many high-throughput expt detection methods for protein-protein interactions have been devised. But ...

Copyright (c) 2004 by Limsoon Wong

Page 7: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Large disagreement betw methods

Some Protein Interaction Data SetsSprinzak et al., JMB, 327:919-923, 2003

• GY2H: genome-scale Y2H• 2M, 3M, 4M: intersection of 2, 3, 4 methods

Copyright (c) 2004 by Limsoon Wong

Page 8: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Ditto wrt co-cellular-role

Expected proportion of co-localized pairs among non true interacting pairs

Expected proportion of co-localized pairs among true interacting pairs

Quantitative EstimatesSprinzak et al, JMB, 327:919-923, 2003

Page 9: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

% of TP based on co-localization% of TP based on shared cellular role (I = 1)% of TP based on shared cellular role (I = .95)

TP = ~50%

Reliability of Protein Interaction DataSprinzak et al, JMB, 327:919-923, 2003

Copyright (c) 2004 by Limsoon Wong

Page 10: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

50-70% of interactions are spurious

10-30% of interactions catalogued

Protein interaction data

90% of spots are good data

80-90% of transcripts represented

mRNA profiling

99.9% correct99% of genome sequence

DNA genome sequence

Data qualityCoverage

Are We There Yet?

Copyright (c) 2004 by Limsoon Wong

Page 11: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Objective

• Some high-throughput protein interaction expts have as much as 50% false positives

• Can we find a way to rank candidate interaction pairs according to their reliability?

• How do we do this?– Would knowing their neighbours help?– Would knowing their local topology help?– Would knowing their global topology help?

Page 12: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Cop

yrig

ht (

c) 2

004

by L

imso

on W

ong

Would knowing their neighbours help?

The story of interaction generality

Page 13: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

a b

An Observation

• It seems that configuration a is less likely than b in protein interaction networks

• Can we exploit this?

Copyright (c) 2004 by Limsoon Wong

Page 14: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Interaction GeneralitySaito et al., NAR, 30:1163-1168, 2002

ig(YDR412WGLC7)= 1 + # of yellow nodes

The number of proteinsthat “interact” with justX or Y, and nobody else

Copyright (c) 2004 by Limsoon Wong

Page 15: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

a b

Assessing Reliability Using Interaction Generality

• Recall configuration a is less likely than b in protein interaction networks

• The smaller the “ig” value of a candidate interaction pair is, the more likely that interaction is

Copyright (c) 2004 by Limsoon Wong

Page 16: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

There are 229 pairsin Ito having ig = 1.Of these, 66 (or 34%)are also reported by Uetz

Evaluation wrt Intersection of Ito et al. & Uetz et al.

• Interacting pairs c’mon to Ito et al. & Uetz et al. are more reliable

• Also have smaller “ig”

“ig” seems to work

Copyright (c) 2004 by Limsoon Wong

Page 17: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

~60% of pairs inin Ito having ig=1 are known to havecommon localization

Evaluation wrt Co-localization

• Interaction pairs having common cellular localization are more likely

• Also have lower “ig” “ig” seems to work

Copyright (c) 2004 by Limsoon Wong

Page 18: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

A: before restrict to pairs with “ig = 1”B: after restrict to pairs with “ig = 1”

reduced x-talk

Evaluation wrt Co-cellular Role

• Interaction pairs having common cellular role are more likely

• Also have lower “ig” “ig” seems to work

Copyright (c) 2004 by Limsoon Wong

Page 19: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Cop

yrig

ht (

c) 2

004

by L

imso

on W

ong

Would knowing their local topology help?

The story of interaction generality 2

Page 20: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Observed 70 times in S. cerevisiae Observed ~11 times in random data

Existence of Network MotifsMilo et al., Science, 298:824-827, 2002

• A network motif is just a local topological configuration of the network

• “Detected” in gene regulation networks, WWW links, etc.

Copyright (c) 2004 by Limsoon Wong

Page 21: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

1 2 3

4 5

5 Possible Network Motifs

• Classify a protein C that directly interacts with the pair AB according to these 5 topological configurations

Copyright (c) 2004 by Limsoon Wong

Page 22: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

A New Interaction GeneralitySaito et al., Bioinformatics, 19:756--763, 2002

'

Page 23: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

~90% of pairs in intersectionof Ito & Uetz have ig2 < 0.

~60% of pairs not in intersection of Ito & Uetz have ig2 < 0

Evaluation wrt Reproducible Interactions

• “ig2” correlates to “reproducible” interactions

“ig2” seems to work

Page 24: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

~95% of pairs havingig2 = –6 have commoncellular roles

Evaluation wrt Common Cellular Role, etc.

• “ig2” correlates well to common cellular roles, localization, & expression

• “ig2” seems to work better than “ig”

Copyright (c) 2004 by Limsoon Wong

Page 25: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Cop

yrig

ht (

c) 2

004

by L

imso

on W

ong

Would knowing their global topology help?

The story of interaction pathway reliability

Page 26: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Some “Reasonable” Speculations• A true interacting pair is often connected

by at least one alternative path (reason: a biological function is performed by a highly interconnected network of interactions)

• The shorter the alternative path, the more likely the interaction (reason: evolution of life is through “add-on” interactions of other or newer folds onto existing ones)

Page 27: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Therefore...

Conjecture:

“An interaction that is associated with an alternate path of

reliable interactions is likely to be

reliable.”

Idea:Use alternative

interaction paths as a measure to

indicate functional linkage between the two proteins

Page 28: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Interaction Pathway Reliability

IPR is also called IRAP, “Interaction Reliability by Alternate Pathways”

Page 29: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Non-reducible Paths

• Non-reducible paths are– AFE– ABE

• Reducible paths are– ABCDE– ABCE

A

DCB

E

F

Page 30: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Evaluation wrt Reproducible Interactions

• “ipr” correlates well to “reproducible” interactions

“ipr” seems to work

The number of pairs not in theintersection of Ito & Uetz is notchanged much wrt the ipr valueof the pairs

The number of pairs in theintersection of Ito & Uetzincreases wrt the ipr valueof the pairs

Page 31: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

At the ipr thresholdthat eliminated 80%of pairs, ~85% of theof the remaining pairshave common cellularroles

Evaluation wrt Common Cellular Role, etc

• “ipr” correlates well to common cellular roles, localization, & expression

“ipr” seems to work better than “ig2”

Copyright (c) 2004 by Limsoon Wong

Page 32: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Part of the network of physical interactions reported byIto et al., PNAS, 2001

Stability in Protein NetworksMaslov & Sneppen, Science, 296:910-913, 2002

• According to Maslov & Sneppen– Links betw high-connected proteins are suppressed– Links betw high- & low-connected proteins are favoured

• This decreases cross talks & increases robustness

Copyright (c) 2004 by Limsoon Wong

Page 33: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Evaluation wrt “Many-few” Interactions

• Number of “Many-few” interactions increases when more “reliable” IPR threshold is used to filter interactions

• Consistent with the Maslov-Sneppen prediction

Page 34: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Evaluation wrt “Cross-Talkers”• A MIPS functional cat:

– | 02 | ENERGY – | 02.01 | glycolysis and gluconeogenesis – | 02.01.01 | glycolysis methylglyoxal bypass – | 02.01.03 | regulation of glycolysis & gluconeogenesis

• First 2 digits is top cat• Other digits add more granularity to the

cat Compare high- & low- IPR pairs that are

not co-localised to determine number of pairs that fall into same cat. If more high-IPR pairs are in same cat, then IPR works

Page 35: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Evaluation wrt “Cross-Talkers”• For top cat

– 148/257 high-IPR pairs are in same cat– 65/260 low-IPR pairs are in same cat

• For fine-granularity cat– 135/257 high-IPR pairs are in same cat.

37/260 low-IPR pairs are in same cat

IPR works IPR pairs that are not co-localized are

real cross-talkers!

Page 36: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon Wong

Example Cross Talkers

Page 37: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Cop

yrig

ht (

c) 2

004

by L

imso

on W

ong

Would knowing their global topology help?

The story of interaction reliability by belief inference

Page 38: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Shortcoming of IPR

• IPR considers only the strongest alternative path

may overlook other interactions close to the target pairs that may provide significant supporting evidence for their reliability

need an improve measure that considers all the protein interactions close to the target pair.

Page 39: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Interaction Reliability by Belief Inference

Page 40: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

IPR

Effectiveness of IRBI

• irbi is more effective than ipr, ig1, ig2

Page 41: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Cop

yrig

ht (

c) 2

004

by L

imso

on W

ong

How a bout discovering false negatives?

The story of detecting missing information

Page 42: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

False Negatives

• A “false negative” is a failure to detect a real protein-protein interaction

Page 43: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

IPR (& IRBI) Detects False Negatives

• To find out if there is a “missing” interaction between X and Y, we do:– compute ipr value of XY in G{XY}– predict if XY as false negative if ipr is high

Page 44: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

How do we test if this works?

• To test this, we mimic false negatives by random removal of 5% to 15% of known interactions. Then we check:– how many removed interactions are

rediscovered?– is there diff in rediscovery rates of false

negative vs random links?– Is there support in terms of gene expression

correlation, common cellular roles, & common cellular locations?

Page 45: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Rediscover Hidden Interactions

IPR > 0.95 IPR < 0.01

Page 46: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

False Negatives vs Random Links

Ave

IPR

Copyright (c) 2004 by Limsoon Wong

Page 47: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Indirect Evidence

• Randomly select 2 nodes• “Pretend” they interact in G• Compute their IPR• Check if they have gene expression correlation,

common cellular roles, common cellular location

Page 48: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Case Study 1

• Dataset – 10,199 non-redundant interactions between

4,336 yeast proteins from MIPS with date Jan. 18, 2005

– 833 “verified true” interactions based on Ito et al.’s core set

• Apply IPR on dataset with the 833 true interactions hidden from the program

• IPR re-discovered 730 interactions

Page 49: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Distribution of rediscovered protein pairs

0

200

400

600

800

>0.95 >0.8

IPR value threshold

No

. of P

rote

in In

tera

ctio

ns

71%

45%

Copyright (c) 2004 by Limsoon Wong

Case Study 1: Re-Discovered True Interactions Have High IPR Values

Page 50: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Only values greater than 0.95 are shown in the table

Case Study 1: Comparison With Random Connections

Copyright (c) 2004 by Limsoon Wong

Example true x-talkers according to MIPS

Example likely interaction predicted

Page 51: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Case Study 2

• Yeast 26S Proteosome Complex– 4 sub-structures found

in PDB covering 13 proteins

– 23 physical interactions using the (10, 5)-rule

• Out of 23 physical interactions in this complex, only 1 was detected in MIPS expt dataset

• IPR detected 42 interactions (out of 78 possible interactions) betw the 13 proteins:– 14 are physical

interactions in complex (c.f. 1 detected in MIPS)

– 9 are expt detected interactions in MIPS

– 19 new interactions are also predicted

MIPSLots!

1

IPR

19

KnownComplexes

9

139

Page 52: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Conclusions • There are latent local &

global network “motifs” that indicate likelihood of protein interactions

• These network “motifs” can be exploited in computational elimination of false positives & false negatives from high-throughput Y2H expt & possibly other highly erroneous interaction data

• IPR is so far the most effective topologically-based computational measure for assessing the reliability (false positives) of protein-protein interactions detected by high-throughput methods

• IPR can also discover new interactions (false negatives) not detected in the expt PPI network

Page 53: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

Acknowledgements

NUS SOC• Jin Chen, Wynne Hsu, Mong Li Lee

I2R• See-Kiong Ng

GIS• Prasanna Kolatkar, Jer-Min Chia

Page 54: Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright (c) 2004 by Limsoon WongCopyright (c) 2004 by Limsoon Wong

References

• J. Chen et al, “Systematic assessment of high-throughput protein interaction data using alternative path approach”, Proc. ICTAI 2004, pages 368--372

• R. Saito et al, “Interaction generality, a measurement to assess the reliability of a protein-protein interaction”, NAR 30:1163--1168, 2002

• R. Saito et al, “Construction of reliable protein-protein interaction networks with a new interaction generality measure”, Bioinformatics 19:756--763, 2002

• S. Maslov & K. Sneppen, “Specificity and stability in topology of protein networks”, Science 296:910--913, 2002

• E. Sprinzak et al, “How reliable are experimental protein-protein interaction data?”, JMB 327:919--923, 2003