optimizing conditions for recombinant soluble protein ... · optimizing conditions for recombinant...

31
Optimizing conditions for recombinant soluble protein production in E.coli Keshav Vasanthavada, MSc., MS May 8 th , 2014

Upload: doandiep

Post on 24-Mar-2019

215 views

Category:

Documents


0 download

TRANSCRIPT

Optimizing conditions for recombinant soluble

protein production in E.coli

Keshav Vasanthavada, MSc., MS

May 8th, 2014

Make Research Easy

1

2

3

4

5

6

Presentation overview

Introduction

Before embarking on a protein expression project

E.coli expression

GenScript’s solution for expression optimization

Factors critical to solubility

Conclusion

2

Make Research Easy

About GenScript

3

GenScript

Products & Services Make Research Easy

Gene

Peptide

Protein

Antibody

Discovery

Biology

Cell Line

Introduction

Make Research Easy

Historical perspective

4

1940 1950 1960 1970 1980 1990 2000 2010 2020

1953 – elucidation of

the helical nature of

DNA

(Franklin & Wilkins)

1941 – One Gene One

Enzyme Hypothesis

(Beadle & Tatum)

Solving the 3-D

structure of DNA

(Watson & Crick)

1957 - Central Dogma

(Francis Crick)

1970 – Discovery of

Restriction Enzymes

(Smith & Nathans)

1973 – Birth of

Recombinant DNA

Technology

(Cohen & Boyer)

1977 - First

recombinant protein

(somatostatin) made

in E. coli

1977 – DNA

sequencing

(Maxam & Gilbert)

1980 – PCR was

developed

(Kary Mullis)

1987 – Refolding of

Inclusion bodies from E. coli

(Buchner, Kelley , Rudolph)

1992 – Baculovirus

Expession Vector System

(Summers, 1983; King &

Posse, 1992)

1993 – Affinity purification

of His- & GST-tagged

proteins

(Hochuli 1988; Jones 1993)

Mammalian cell expression

(Hauser 1988; Page 1991;

Geisse 1996)

Full length synthetic

Proteins?

Protein Amplification

Technology?

Introduction

Make Research Easy

• Recombinant protein production (RPP) is essential to

functional characterization and structure determination

• Variety of hosts available for RPP

• Scientists faced with overwhelming choices

• Escherichia coli [E. coli] is “work horse”

• Successful implementation of bacterial protein production project dependent on several factors

• Observations & Recommendations for optimizing recombinant protein production in E. coli

• We describe learnings from our collective experience

• GenScript Production Team has shipped over 3,000 batches of recombinant proteins

Introduction to topic

5 Introduction

Make Research Easy

Common questions

6

My Protein

My Own

My Precious

But me why needs it?

How is me making it?

Activity

Before embarking on a protein expression project

Make Research Easy

Host considerations

7

Bacteria

E. coli

1. “Work horse”

2. Well established

3. High expression

4. Simple genetics

5. Easy scale up

6. Speed

7. Costs

8. Equipment

Insect Sf9, Sf21, S2, High-5

1. PTMs

2. Soluble proteins

3. High expressers

Mammalian CHO, HEK, COS

1. PTMs

2. Soluble proteins

3. Low expresser

4. Expensive

Yeast S. cerevisiae

P. pastoris

1. PTMs

2. Soluble proteins

3. High expresser

Cell Free In vitro

1. Expensive

2. Not reproducible

3. Scalability issues

Before embarking on a protein expression project

Make Research Easy

92%

3.8% 2.06% 1.03% 0.29% 0%

20%

40%

60%

80%

100%

E.coli Insect Yeast Mammalian Cell Free

PDB entries reflects dominance of E. coli expression

E. coli is the top choice

8

Number of entries in the PDB by expression system as a percentage of total number of chains with an

identifiable expression system, as of April 15, 2014. All values are approximate. Reference - http://www.rcsb.org/pdb/home/home.do

E.coli expression

Make Research Easy

Advantages Disadvantages

Simple genetics Lack of PTM

Easy to manipulate Codon usage

Inexpensive to culture Inclusion bodies

Easy to Scale up Low yield of many eukaryotic proteins

Fast expression Poor secretion

High yields High MW proteins difficult

E. coli expression – why & why not?

9

Low/No

Expression

Insoluble

Expression

E.coli expression

Make Research Easy

What is the problem? what are the solutions?

10

E.coli

Refold

Optimize

No Target Engineering

• Temperature

• Strains

• Media

• Buffers

• Chaperones

Engineer Target

• Truncate

• Mutate

• Fusion Partner

Inclusion

Bodies

57%

43%

~24,000 Targets NESG Complete

Soluble

Insoluble

E.coli expression

Make Research Easy

Variables Desirables

Goal of a protein production campaign

11

Generation of high yields of soluble protein in a cost-effective and expeditious manner is the single most important

determinant of a successful protein production campaign

E.coli expression

Make Research Easy

Project initiation

Literature Review

Target Analysis &

Engineering

Host Consideration

Cloning scheme/Gene

synthesis

Small scale expression

optimization

Scale up and Purification

Protein expression campaign workflow

12

What has been done already? Homologs, truncations, mutations?

Full-length or fragment? Tag or tag-less?

Which tag? N or C-terminus?

E.coli expression

Make Research Easy

Target Protein Expression in E. coli

No

Discard

Re-evaluate

Optimize

[PROTentialTM]

Insoluble

Optimize

[PROTentialTM]

Refold

[FoldArtTM]

Soluble

Scale up & Purify

Archive

Flow chart of a bacterial expression project

13

WITHOUT target engineering

• Codon optimization

• Promoters

• Vectors

• Temperature & Culture

• Strains

• Media

• Buffers

• Chaperone co-expression

WITH target engineering

• Truncations

• Mutants

• Fusion partners

Good idea to run small scale expression and test purification

E.coli expression

Make Research Easy

Eliminate the guesswork from your Protein production work

What is PROTentialTM?

• PROTentialTM is a new protein expression, evaluation and optimization service

offered by GenScript.

What are the applications of PROTentialTM?

• Evaluate whether your target protein expresses in your chosen system.

• Identify the best expression system for your target protein.

• Identify the construct and conditions that give you the most robust soluble expression

of your target protein.

Evaluate before scale-up protein production, to avoid wasting your time & resources

PROTentialTM - expression evaluation & optimization

14

One stop service at GenScript: gene synthesis Subcloning PROTentialTM Scale up protein production

GenScript’s solution for expression optimization

Make Research Easy

PROTentialTM Standard Package (starting from $280)

• Available for 3 expression systems: bacterial, insect & mammalian

• Protein expression evaluation & solubility test

PROTentialTM Silver Package (starting from $1440)

• Currently for bacterial expression system only

• Test 8 conditions by combining temperature, media components & inducer concentration.

• Allow to identify the best expression condition with your chosen vector & bacterial strain.

PROTentialTM Gold Package (starting from $4800)

• Currently for bacterial expression system only

• Test 48 conditions, by combining parameters including not only those 3 covered in silver package,

but also promoter, host strain, and fusion partner.

• The most robust and high throughput protein expression & solubility optimization matrix.

Add-on item: 1L bacterial expression with 1 step purification at $600/condition (except for Flag-tagged

protein, the cost is $800/condition).

PROTentialTM portfolio

15 GenScript’s solution for expression optimization

Make Research Easy 16

PROTentialTM standard packages

Expression system Price Timeline Deliverables

Bacterial Starting from $280 1-3 weeks

• SDS & WB result • Expression data report

Baculovirus/ insect

Starting from $400 3-4 weeks

Mammalian Starting from $500 3-4 weeks

All 3 systems Starting from $950 3-4 weeks

Each package represents one gene, one vector and one expression system. These standard

packages offer expression and solubility test but no optimization

GenScript’s solution for expression optimization

Make Research Easy

Options for protein tag, growth media & vectors

17

Mammalian

Protein Tag

His (default)

Fc

Flag

V5

HA

C-Myc

GST

Insect

Protein Tag

His (default)

Fc

Flag

GST

Strep II

MBP

V5

HA

C-Myc

E. Coli

Protein Tag

His (default)

GST

Trx

MBP

SUMO

Flag

Growth Media

LB (default)

TB

2xYT

Vectors

pET system

pGEX system

pCold system

Gene synthesis customers have listed

options with the standard PROTentialTM

packages.

If a modification or tag is not on this list,

PROTentialTM silver or gold package is

required in order to accommodate such

request.

GenScript’s solution for expression optimization

Make Research Easy 18

Other optimization parameters (Gold package only)

Promoters

T5, T7, trc, tac, CSPA, AraBAD, Lac, Lac UV5, Trp, pL, T7-lac, T5-lac, Lac-ara1, Ltet0-1,

PhoA, Psyn, recA, proU, cst-1, tetA, cadA, nar, T3-lac, T4 gene 32, nprM-lac,

lpp, lpp-lac, VHb, or others.

Bacterial strains

BL21, BL21(DE3), BL21*(DE3), BL21(DE3) pLysS (E), BL21-SI, BL21-AI, BL21trxB, BL21

CodonPlus, Origami, Origami B, Origami B pLysS, C41(DE3), C43 (DE3), Lemo21(DE3),

Turner, Tuner pLysS, AE, Rosetta, Rosetta pLysS, Rosetta-gami-pLysS, Shuffle, DH5a,

TOP10, JM109, TG1, Stbl2, XL10-Gold, AD494, HMS174, NovaBlue(DE3), BLR , HB2151,

W3110 or others.

Induction conditions

Chosen based on our expertise; Can accommodate customer’s specific request.

Medium components

Chosen based on our expertise; can accommodate customer’s specific request.

Temperature

6C, 10C, 15C, 16C, 20C, 25C, 30C, 37C, or as specified by customer.

GenScript’s solution for expression optimization

Make Research Easy

• Vectors & Promoters

• Induction time

• Chaperones & Foldases

• Growth Media • Inducer

I. Codon Optimization

II. Strain

IV. Fusion Partner

III. Temperature

Factors critical to solubility

19

SOLUBLE

PROTEIN

Factors critical to solubility

Make Research Easy

I. OptimumGeneTM codon optimization

20

GenScript has optimized over 50,000 sequences in all

major expression systems. The expression level of

target proteins can be significantly improved by our

OptimumGeneTM Codon Optimization Technology.

Factors critical to solubility

Make Research Easy 21

Case study: codon optimization

Fig. 1: Expression Result of Protein α after Codon Optimization. The expression level of Protein α using GenScript’s OptimumGeneTM Codon Optimization is 3 times more than that of competitor’s.

Fig. 2: Expression Result of Protein β after Codon Optimization. The expression level of Protein β using GenScript’s OptimumGeneTM Codon Optimization is 13 times more than that of competitor’s.

Factors critical to solubility

Make Research Easy

# Expression Strain Rationale

1 BL21(DE3) Work-horse of recombinant protein expression. Lamba DE3 lysogen. Basal exp high. Non-toxic preferred

2 BL21(DE3) pLysS T7 lysozyme to repress basal level expression

3 BL21(DE3) pLysE T7 lysozyme to repress basal level expression. Higher level of repression

4 BL21 Star (DE3) RNaseE (rne131) mutant. Reduced mRNA degradation. Higher stability of transcripts

5 BL21 Star (DE3) pLysS As above, coupled with higher level of repression

6 BL21 (DE3) Codon Plus RIPL Contain extra copies of amino acids R,I,P,L tRNA genes. Supplements for codon deficiencies and bias

7 Origami™ 2(DE3) pLacI trxB and gor reductase mutants. Enhance disulfide bonds in cytoplasm. Rare tRNA genes also

8 Origami™ 2(DE3) pLysS trxB and gor reductase mutants. Enhance disulfide bonds in cytoplasm. Lower basal repression

9 Origami™ B(DE3) pLacI Enhance disulfide bonds in cytoplasm and control IPTG entry into cells

10 Origami™ B(DE3) pLysS As above, coupled with higher level of repression

11 OverExpress C41(DE3)pLysS Uncharacterized mutation that allows higher production of "difficult" proteins

12 OverExpress C43(DE3) pLysS Uncharacterized mutation that allows higher production of "difficult" proteins

13 BL21-Gold (DE3)pLysS Lon and OmpT protease deficient for greater recombinant protein stability. Higher plasmid stability

14 SHuffle™ T7 Express lysY Enhances disulfide bond formation in cytoplasm, DsbC promotes correction of misoxidized proteins

15 Arctic Express™ Co-expression with chaperonins Cpn60 & Cpn10 for increased yield of soluble proteins

16 Rosetta™ 2(DE3)pLysS tRNA genes for 7 rare amino acids plus reduced basal expression

17 Rosetta-gami™ B(DE3)pLysS Above plus increased disulfide bond formation in cytoplasm plus reduced basal expression

18 Tuner™(DE3)pLacI Lac permease lacY mutant allows uniform entry of IPTG into cells, enhances solubility

19 Tuner™(DE3)pLysS Above function plus reduced basal expression

20 Lemo21(DE3) Tunable expression by varying lysozyme concentration, regulated by addition of Rhamnose

II. Strain

22 Factors critical to solubility

Make Research Easy

Case study: Impact of E.coli strains on

soluble expression

23

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

17 18 19 20 21 22 23 24

17 18 19 20 21 22 23 24

M

M

M

0.5mM IPTG induced/15C Overnight, Ni-IMAC purified

Lemo21 (DE3) with varying

concentrations of Rhamnose

30kDa

A

20kDa

B

M

0.5mM IPTG induced/15C Overnight, Ni-IMAC purified

Lemo21 (DE3) with varying

concentrations of Rhamnose

STRAIN MAKES A DIFFERENCE

Factors critical to solubility

Make Research Easy

III. Temperature: case study

24

15C RT 37C

65kDa

WB

Lower temperatures result in more soluble protein

Total

Soluble

TEMPERATURE MAKES A DIFFERENCE

Factors critical to solubility

Make Research Easy

IV. Fusion partner/affinity tag

25

Tag ~MW [kDa] Pros Cons

6His ≤1

Universal, small, works

with native and denaturing

conditions

Usually requires more than

1-step elution. Not entirely

innocuous

CBP 4 High specificity Purity differs greatly

Flag 1 High purity

Introduces Enterokinase

Expensive and elution can

be a problem occasionally

GST 26 Elution easy, functions as

solubilizing tag

Dimerizing nature

Leaches occasionally

MBP 40 Solubility enhancer Need longer contact times,

Relatively large tag

Strep ≤1 High specificity, Mild

elution conditions Need longer contact times

NusA 55 Solubility enhancer Very large tag

Trx 12 Solubility enhancer Does not work well with

larger MW target proteins

Factors critical to solubility

Make Research Easy

Case study: Importance of fusion partner

26

1 2 3 4

30kDa

C

15kDa

D

FUSION PARTNERS MAKE A DIFFERENCE

Factors critical to solubility

Make Research Easy

Case study: Duration of induction

27

1. Protein D [0.1mM IPTG, 15C, 3 hours]

2. Protein D [0.1mM IPTG, 15C, Overnight]

3. Protein D [0.1mM IPTG, RT, 3 hours]

4. Protein D [0.1mM IPTG, RT, Overnight]

0 15 30 45 60 75 90 105 120 135 150 165 180

1 2 3 4 1 2

20kDa

E

Minutes

Detectable recombinant protein expression begins as early as 15-30 minutes

50kDa

F

Longer need not always be better

1. Protein E [0.1mM IPTG, 15C, 3 hours]

2. Protein E [0.1mM IPTG, 15C, Overnight]

60kDa

G

Longer can occasionally be bad

Factors critical to solubility

Make Research Easy

Cloning Strategy

LIC or recombination-based cloning strategy for high throughput & parallel

construct generation

If possible, create expression vectors with N-terminal fusion/solubility

partners for easy cloning scheme

Expression

Express both N- and C-terminally affinity tagged constructs [especially 6His]

In many cases at least one version will express

If checking for enzyme activity, one version might show higher activity than

the other

Purification

Note that C-terminally affinity tagged constructs allow you to purify full

length proteins

You could try double-tagging approach by using one affinity tag [e.g., GST,

MBP at the N-ter and 6His at the C-ter to solubilize and purify full length

protein at the same time

Factors to consider

28 Conclusion

Make Research Easy

No simple, global solution to address insolubility issues in E.coli expression

Codon optimization, strain, induction temperature and duration, fusion partners make a difference

Small scale expression study should be predictive for early triage of soluble constructs

Small scale purification to assess protein behavior and avoid hiccups during scale up

Parallel approaches increase the chance of success

Suggested approach –

• Combinatorial tagging at the N-terminus [6His + solubilizing tag]

• Protease Cleavage site for tag removal after 1st round purification

Expression levels, activity, purity, homogeneity, stability are important factors that must be considered

Summary & takeaways

29

Solubilizing Tag 6His ATG Protease

Tag Target

Promoter Protease

Linker

Conclusion

Make Research Easy

GenScript’s experience in protein expression & purification

30

GenScript has delivered over 3,000 proteins in four expression systems with 92% success rate for all protein projects. GenScript has successfully delivered a variety of difficult proteins, including trans-membrane proteins, co-expression proteins, proteases, kinases, cytokines, antibodies and several other proteins to customers

worldwide.

Conclusion

Make Research Easy

Thank you for your participation

We wish you all success in your Research

[email protected]

31

Register for other webinars in the GenScript Webinar Series @ http://www.genscript.com/webinars.html

May 15, 2014/ 2PM EST

Avoiding peptide assay failure: hidden problems and solutions - Tiffany Gupton Campolongo, Ph.D.

May 21, 2014/ 11:00 am EST

Gene variant libraries: design, construction, and research applications - Rachel Speer, Ph.D.

May 28, 2014/1:00 pm EST

Protein or peptide antigen: choosing the optimal immunogen for antibody production - Jessica Kaplunov, Ph.D.

June 5, 2014/ 2:00 pm EST

Stem cell culture: choosing optimal conditions for expansion and differentiation - Matthew Riolo, Ph.D.

June 11, 2014/ 1:00 pm EST

Recombinant protein expression & purification: challenges and solutions - Liyan Pang, Ph.D.

June 18, 2014/ 2:00 pm EST

Can CRISPR/Cas9 off-target genomic editing be avoided? Ways to improve target specificity - Maxine Chen, Ph.D

June 25, 2014/ 2:00 pm EST

Building a Synthetic Eukaryotic Genome – Sc2.0 - Leslie Mitchell, Ph.D., NYU Langone Medical Center

Conclusion