Genome Engineering Using CRISPR-Cas9 System
Fig. 1 Principle for genome engineering using designer nucleases or nickases. DNA cleavage induced by designer nucleases or DNA nicks created by designer nickases can result in a double-strand break (DSB), a single-strand nick (SSN), or double nicks (DNs) (top). These site-specific DNA cleavages can be repaired via two different pathways (bottom). In the error-prone NHEJ pathway for DSB or DNs, the ends of the breaks are processed by endogenous DNA repair machinery and rejoined, which can result in random indels at the site of junction. The other pathway is the HDR pathway for DSB, SSN, and DNs, where a repair template is supplied, allowing precise editing of the endogenous genome sequences. Parts of this figure are adapted
Fig. 2 Schematic of the type II CRISPR-mediated DNA double-strand break and the design of chimeric sgRNA.
Place the mixture in a thermocycler using the following parameters to phosphorylate and anneal the oligos.
Add 2 μl of annealed oligos into 398 μl of water to dilute the products 200-fold.
Incubate the ligation reaction in a thermocycler following the parameters below.
3. (Optional but highly recommended) Plasmid-Safe treatment. Add Plasmid-Safe buffer, ATP, and the ATP-dependent Plasmid-Safe exonuclease as listed below to treat the ligation reaction and the negative control. This increases the efficiency of cloning by degrading unligated DNA fragments in the product mixture.
Incubate the reaction mixture in a thermocycler following the parameters below.
4. Transformation.
Add 2 μl of each reaction from the previous step (including the negative control ligation reaction) into the chosen competent cells, such as Stbl3, and transform following the manufacturer’s protocol. Plate the transformed cells on ampicillin-selection LB agar plates, or on another type of plate depending on the selection marker of the backbone vector.
5. Plasmid preparation.
Check the plates for bacterial colonies after overnight incubation. Usually there should be no colonies, or fewer than five, on the negative control plate, whereas tens to hundreds of colonies will grow on the cloning plate, indicating high cloning efficiency. Pick two to three colonies from each agar plate and set up a small-volume (typically 5 ml) LB culture for miniprep. Incubate with shaking at 37 °C overnight, then prepare plasmid DNA using a spin miniprep kit according to the manufacturer’s protocol.
6. (Optional) Digest the plasmid DNA to verify the insertion of the oligos.
Digest the miniprep DNA with the diagnostic restriction enzymes BbsI and AgeI if using the vector backbone pX330/pX335. Run the digested product on a 1 % agarose gel to visualize the band pattern.
When many constructs are cloned at the same time, correct insertion of the target sequence oligos can be screened by this digestion, because a successful insertion destroys the BbsI sites. After double digestion, clones carrying the annealed sgRNA oligos will show only linearized plasmid, whereas clones without the insert will yield two fragments, of ~980 and ~7,520 bp in the case of pX330, when visualized on an agarose gel.
7. Sequencing verification of positive clones. Sequence the clones with a forward primer that binds the human U6 promoter to verify the successful insertion of guide sequences. The verified clones can then be prepared using a Midi or Maxi plasmid extraction kit for downstream experiments.
3.2 Mammalian Cell Culture and Transfection
As a general protocol, the steps below use HEK 293FT cells as an exemplar system to demonstrate CRISPR-Cas9 genome engineering. Additional suggestions can be found in Subheading 4 (see Note 5).
1. HEK 293FT cell maintenance.
Maintain the HEK cells in DMEM medium supplemented with 10 % FBS (D10 medium) in an incubator at 37 °C with 5 % CO2, as recommended by the manufacturer. Feed the cells every other day, and passage them so that they never exceed 75 % confluence.
2. Preparing cells for transfection.
At 16–24 h before transfection, dislodge and dissociate the cells by trypsinization. Plate cells into 24-well plates containing 500 μl of antibiotic-free medium at a density of 250,000 cells per ml of culture medium, or 125,000 cells per well. Scale the culture volume up or down proportionally to the plate format, based on the number of cells needed in the experiment. At the time of transfection, the cells should be at around 75–85 % confluence.
3. Transfection.
Mix a total of up to 600 ng of DNA containing the targeting constructs in a microcentrifuge tube for each well to be transfected on a 24-well plate. For transfecting a plasmid derived from pX330 or pX335, where a guide sequence has been cloned into the backbone vector, directly use 600 ng of the prepared plasmid in the transfection mixture. When transfecting more than one construct for alternative designs, e.g., the double nickase method, or when multiplex genome cleavage is desired, mix all constructs at an equal molar ratio to a total of 600 ng in the transfection mixture (see Notes 2 and 4). Add 50 μl of OptiMEM medium to the DNA mixture.
Mix well and spin down.
Dilute Lipofectamine 2000 transfection reagent by adding 1.5 μl of the reagent into 50 μl of OptiMEM. Mix well and incubate at room temperature for 5 min. Within 15 min of diluting the transfection reagent, add all of the diluted Lipofectamine 2000 reagent into the DNA-OptiMEM mixture prepared earlier. Mix well and spin down. Incubate the mixture for another 20 min to allow the formation of the DNA–Lipofectamine complex. Add the final complex directly onto the culture medium of each well on the 24-well plate. (Not required but highly recommended) Include a transfection control to monitor transfection efficiency. Use a control plasmid expressing a fluorescent protein, such as pCMV-EGFP, and transfect this plasmid into the cells following the same protocol as above. Transfect another well of cells with the SpCas9 backbone vector, e.g., pX330 or pX335, as a negative control for downstream processing and validation of the assay.
Replace the medium with warmed fresh medium around 12–24 h post transfection. Maintain the cells for another 48–72 h to allow sufficient time for genome engineering mediated by the CRISPR-Cas9 system.
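The seeding density and the equal-molar DNA mix described in this subheading reduce to simple arithmetic, sketched below in Python. This is an illustrative helper, not part of the published protocol: the function names are hypothetical, and the construct length of ~8,500 bp is an assumption taken from the sum of the two pX330 digest fragments quoted earlier (~980 + ~7,520 bp); substitute the actual sizes of your own constructs.

```python
def cells_per_well(medium_ul, density_per_ml=250_000):
    """Number of cells to seed in one well holding `medium_ul` of medium."""
    return round(density_per_ml * medium_ul / 1000)


def equimolar_masses_ng(lengths_bp, total_ng=600.0):
    """Split `total_ng` so every construct contributes the same number of moles.

    For double-stranded DNA, moles are proportional to mass / length,
    so equal moles means each mass is proportional to construct length.
    """
    total_length = sum(lengths_bp.values())
    return {name: total_ng * bp / total_length for name, bp in lengths_bp.items()}


if __name__ == "__main__":
    # 500 ul per well at 250,000 cells per ml, as in the 24-well format above.
    print(cells_per_well(500))

    # Hypothetical pair of nickase constructs; lengths are assumed values.
    constructs = {"pX335_guide_A": 8500, "pX335_guide_B": 8500}
    for name, ng in equimolar_masses_ng(constructs).items():
        print(f"{name}: {ng:.0f} ng")
```

When all constructs share the same backbone, the split is simply an even division of the 600 ng; the length weighting only matters when constructs of different sizes are combined.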
3.3 Genome Cleavage Analysis
1. Genomic DNA extraction.
Extract genomic DNA from transfected cells using the QuickExtract DNA extraction kit following the manufacturer’s recommended protocol. Briefly, dissociate the cells from the plate and harvest them by spinning down the cell suspension at 250 × g for 5 min. Wash the cell pellet with 500 μl of PBS and then resuspend it in QuickExtract solution. We typically use 50 μl for one well of a 24-well plate (scale up or down accordingly). Vortex the suspension and incubate at 65 °C for 15 min, 68 °C for 15 min, and 98 °C for 10 min.
2. SURVEYOR assay to detect genomic cleavage.
Use the SURVEYOR assay to detect genomic cleavage after extraction of genomic DNA (see Note 6), following the stepwise instructions provided in the SURVEYOR Mutation Detection Kit manual. A brief description of the steps is given below.
(a) Amplify the extracted genomic DNA with a pair of primers designed for the target region of interest, using a high-fidelity enzyme such as Herculase II fusion polymerase or Kapa HiFi HotStart polymerase. Typically an amplicon size of less than 1,000 bp is preferred, as shorter amplicons give more specific amplification products.
(b) Visualize the PCR product on an agarose gel to check the specificity of the amplification. Highly specific amplification of the genomic region is important for accurate SURVEYOR assay results, as nonspecific bands will interfere with the interpretation of the gel electrophoresis analysis.
(c) Purify and quantify the PCR products, then set up a denaturing/re-annealing reaction by mixing up to 400 ng of PCR product with water and re-annealing buffer (we typically use the PCR reaction buffer, added to a final concentration of 1×; refer to the SURVEYOR assay manual for more information).
(d) Run the reaction in a thermocycler with the following parameters.
(e) Digest the re-annealed products with the SURVEYOR enzyme kit at 42 °C for 1 h, as recommended by the manufacturer’s protocol.
(f) Visualize the digested product using gel electrophoresis.
For visualization of SURVEYOR assay results, we recommend separating the Surveyor Nuclease digestion products by polyacrylamide gel electrophoresis (PAGE), as it gives better resolution than agarose gels.
Quantification of the assay results, and the method to convert them to an estimate of the frequency of indels generated by the CRISPR-Cas system in the population of cells, are described in Subheading 4 (see Note 7).
3.4 Implementation of Homologous Recombination (HR) Using CRISPR-Cas System
The guidelines and considerations for designing an HR experiment that uses the CRISPR-Cas system to precisely modify a genomic sequence of interest by inserting, deleting, or replacing part of the genome are described in Subheading 4 (see Note 8). Briefly, after designing and cloning the HR template, perform the HR experiment following the steps below.
1. HEK 293FT cell maintenance and the preparation of cells for transfection.
This part is the same as the corresponding steps in Subheading 3.2.
Briefly, plate cells into 24-well plates and make sure that the cells are at around 75–85 % confluence at the time of transfection.
2. Transfection.
Mix a total of up to 800 ng of DNA containing the targeting constructs and the HR template vector (or single-stranded DNA oligos, see Note 9) in a microcentrifuge tube for each well to be transfected on a 24-well plate. Generally, apply a molar ratio of 1:3–5:1 for the targeting vector and HR template vector.
(Optional but recommended) Titrate different molar ratios between the targeting vector and the HR template vector to find the optimal condition for the HR experiment. Add 50 μl of OptiMEM medium to the DNA mixture. Mix well and spin down.
Dilute Lipofectamine 2000 transfection reagent by adding 2 μl of the reagent into 50 μl of OptiMEM. Mix well and incubate at room temperature for 5 min. Within 15 min of diluting the transfection reagent, add all of the diluted Lipofectamine 2000 reagent into the DNA-OptiMEM mixture prepared earlier.
Mix well and spin down. Incubate the mixture for another 20 min to allow the formation of the DNA–Lipofectamine complex.
Add the final complex directly onto the culture medium of each well on the 24-well plate.
Transfect another well of cells with the SpCas9 backbone vector, e.g., pX330 or pX335, together with the same amount of HR template vector as a negative control.
Replace the medium with warmed fresh medium around 12–24 h post transfection. Maintain the cells for another 72 h to allow sufficient time for HR mediated by the CRISPR-Cas9 system and the template.
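For the optional titration of targeting vector against HR template, the DNA masses for each molar ratio can be tabulated before setting up the wells. The Python sketch below is a hypothetical helper that assumes both components are double-stranded plasmids of known length; the 8,500 bp and 4,500 bp lengths and the list of ratios are illustrative assumptions, not values prescribed by the protocol.

```python
def masses_for_molar_ratio(targeting_bp, template_bp, ratio, total_ng=800.0):
    """Return (targeting_ng, template_ng) for a targeting:template molar `ratio`.

    Mass of each plasmid is proportional to (molar share x length in bp),
    then rescaled so the two masses sum to `total_ng`.
    """
    r_targeting, r_template = ratio
    weight_targeting = r_targeting * targeting_bp
    weight_template = r_template * template_bp
    scale = total_ng / (weight_targeting + weight_template)
    return weight_targeting * scale, weight_template * scale


if __name__ == "__main__":
    # Assumed plasmid lengths; replace with the sizes of your own constructs.
    TARGETING_BP = 8500
    TEMPLATE_BP = 4500
    for ratio in [(1, 3), (1, 1), (3, 1), (5, 1)]:
        targeting_ng, template_ng = masses_for_molar_ratio(TARGETING_BP, TEMPLATE_BP, ratio)
        print(f"{ratio[0]}:{ratio[1]}  targeting {targeting_ng:.0f} ng, template {template_ng:.0f} ng")
```

Single-stranded oligo templates would need a different mass-to-mole conversion, so this sketch should not be applied to them unchanged.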
3.5 Verification and Quantification of HR Efficiency
To verify homologous recombination between the HR template and the endogenous genome, a restriction fragment length polymorphism (RFLP) assay for HR can be applied.
1. Genomic DNA extraction.
Extract the genomic DNA with the QuickExtract DNA extraction kit, using the same protocol as for the SURVEYOR assay (Subheading 3.3).
2. Target region amplification.
Amplify the genomic region of interest with an HR testing primer set in which the two primers bind outside the homology region, to avoid false-positive results caused by amplification of residual HR template.
3. Perform RFLP digestion.
Run the resulting PCR product on an agarose gel to check the specificity of amplification, as nonspecific PCR products will interfere with the assay and prevent accurate quantification of HR efficiency. In many cases, several HR testing primer sets should be screened to obtain robust, specific amplicons.
Purify the PCR amplification product by standard PCR purification or, where a clean PCR product cannot be obtained, gel-extract the desired amplicon after separating the PCR product on an agarose gel.
Digest the purified products with the appropriate enzyme corresponding to the design of the HR template (see Note 8), and visualize on an agarose gel or PAGE gel. The latter usually gives better resolution and is highly recommended. The efficiency of HR in the population of cells assayed can be estimated by the following formula:
HR percentage (%) = (m + n)/(m + n + p) × 100
Here “m” and “n” indicate the relative quantities of the bands from digested genomic PCR products, whereas “p” equals the relative quantity of undigested product.
4. Perform additional Sanger or next-generation DNA sequencing to verify the presence of the desired engineered sequences within the genome. Briefly, clone the genomic PCR product into a sequencing vector using TOPO-TA or a blunt-end cloning method, and perform Sanger sequencing to detect recombined genomic amplicons. Alternatively, for higher throughput, subject the genomic PCR products to next-generation sequencing.
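The HR percentage formula from the RFLP digestion step can be applied to band intensities as in the short Python sketch below. The function name and the example intensities are hypothetical; the band quantities are assumed to be background-corrected relative intensities from the same gel lane.

```python
def hr_percentage(m, n, p):
    """HR percentage = (m + n) / (m + n + p) x 100.

    m, n: relative quantities of the two digested bands.
    p:    relative quantity of the undigested band.
    """
    total = m + n + p
    if total <= 0:
        raise ValueError("band intensities must sum to a positive value")
    return (m + n) / total * 100


if __name__ == "__main__":
    # Made-up intensities for illustration only.
    print(f"{hr_percentage(10, 15, 75):.1f} %")
```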
4 Notes
1. Identification and selection of target genomic site. The two primary rules for identifying a target site for the SpCas9 system are: (1) finding the “NGG” PAM sequence, which is required for SpCas9 targeting, and (2) picking the 20 bp sequence immediately upstream (5′) of the PAM as the guide sequence. Following these two guidelines, multiple potential target sites can usually be found within the genomic region of interest (Fig. 2). Additionally, when using the U6 promoter to express the sgRNA (as in pX330 and pX335), we suggest adding a G to the 5′ end of the guide (not replacing a base, but adding one more), because the human U6 promoter requires a “G” at the transcription start site for the highest level of expression (Fig. 2). Although the sgRNA will sometimes still work without the extra “G”, it is generally better to include this additional base. Where the guide sequence already starts with a “G”, the addition can be omitted. On our open-source online resources website ( ), we provide the most up-to-date information on using the CRISPR-Cas9 system for genome engineering, focusing on the SpCas9 system.
We have also developed an online tool for selecting SpCas9 targets in different organisms, including human, mouse, zebrafish, C. elegans, etc. This tool can greatly facilitate and simplify target selection in batch ( ). Because the efficiency of different targets can vary considerably depending on the guide sequence, we highly recommend testing multiple target sites for each gene or region of interest and selecting the most effective target (see Note 3). For the double nickase design, we recommend testing each target individually with the wild-type SpCas9 system to assess the cleavage efficiency of individual guides, and then combining the most efficient pair of guides with opposite directionality and appropriate spacing for the genome engineering application.
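The two target-selection rules in this note (an “NGG” PAM, and the 20 bp immediately 5′ of it as the guide) can be illustrated with a minimal Python scan of both strands. This is only a sketch: the function names are hypothetical, it is not the online tool mentioned above, and it does no off-target or efficiency scoring.

```python
import re

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_spcas9_targets(seq, guide_len=20):
    """List candidate SpCas9 sites: `guide_len` bases followed by an NGG PAM.

    Both strands are scanned. `start` is the 0-based guide position on the
    scanned strand (for "-", that is the reverse-complement coordinate).
    """
    seq = seq.upper()
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % guide_len)
    hits = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        # The lookahead lets overlapping candidate sites all be reported.
        for match in pattern.finditer(s):
            guide = match.group(1)
            hits.append({
                "strand": strand,
                "start": match.start(),
                "guide": guide,
                "pam": match.group(2),
                # Per the note: add a 5' G for U6 unless the guide starts with G.
                "add_5prime_G": not guide.startswith("G"),
            })
    return hits


if __name__ == "__main__":
    region = "AAAAAGATTACAGATTACAGATTACTGGAAAA"  # toy sequence, not a real locus
    for hit in find_spcas9_targets(region):
        print(hit)
```

Candidates found this way still need to be screened experimentally, since cleavage efficiency varies between guides.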
2. Design of oligos for inserting guide sequences into backbone vectors. The cloning vectors we use for typical SpCas9 genome targeting are pX330 for wild-type SpCas9 and pX335 for the nickase version, SpCas9n. Both are mammalian dual-expression vectors, which enable the co-expression of SpCas9 protein driven by the potent constitutive promoter CBh and sgRNA driven by the RNA Pol III human U6 promoter in mammalian cells (Fig. 3). The CBh promoter is a hybrid promoter derived from the CAG promoter, which has been validated to support strong transgene expression in multiple cell types and lines, including HEK 293FT, mouse Neuro-2a, mouse Hepa1-6, HepG2, HeLa, human ESCs, and mouse ESCs. To clone custom guide sequences into these backbone vectors, a pair of oligos encoding the guide sequence can be ordered with the appropriate overhangs (Fig. 3), then annealed to form a clone-ready duplex DNA fragment. The vector can then be digested with BbsI, and the pair of annealed oligos can be cloned into the backbone to express the corresponding sgRNA (Fig. 3). The oligos are designed based on the target site sequence selected as described in Note 1. A common mistake when cloning the guide into the backbone vector is to include the “NGG” PAM sequence in the guide sequence.
Hence, it is important to check that only the 20 bp sequence 5′ of the PAM is used when designing the oligo sequences to order. Alternatively, oligos can be designed to directly amplify a PCR fragment that contains the U6 promoter driving an sgRNA. This approach simplifies testing of guide sequences by avoiding the need for cloning, but might be less efficient than using the cloned vector plasmids (see Note 4, [41]).
Fig. 3 Schematics for the cloning backbone vectors pX330/pX335 with oligo design for inserting guide sequences. The pX330 and pX335 vectors contain dual-expression cassettes for both the SpCas9 protein and the sgRNA. Digestion of the backbone at the BbsI Type II restriction sites (blue) generates cloning overhangs complementary to those of the annealed oligos (purple boxed). Note that a G–C base pair is added at the 5′ end of the guide sequence for optimal U6 transcription. The oligos contain overhangs for ligation into the overhangs of the BbsI sites. The top and bottom strand orientations are identical to those of the genomic target but exclude the “NGG” PAM. Parts of this figure are adapted from [19, 41]
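As an illustration of the oligo design described in Note 2 and Fig. 3, the Python sketch below builds a top and bottom oligo from a 20-nt guide. The text does not spell out the overhang sequences; the 5′-CACC (top) and 5′-AAAC (bottom) overhangs used here are an assumption based on the commonly published BbsI cloning scheme for pX330-type vectors and should be checked against Fig. 3 and the map of the vector actually in use. The function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def design_guide_oligos(guide, top_overhang="CACC", bottom_overhang="AAAC"):
    """Return (top_oligo, bottom_oligo), both written 5'->3'.

    `guide` must be the 20-nt sequence 5' of the PAM, WITHOUT the NGG PAM.
    A G is prepended for U6 transcription unless the guide already starts
    with G. The overhang defaults are assumed values (see lead-in).
    """
    guide = guide.upper()
    if len(guide) != 20 or set(guide) - set("ACGT"):
        raise ValueError("guide must be exactly 20 nt of A/C/G/T (do not include the PAM)")
    insert = guide if guide.startswith("G") else "G" + guide
    top = top_overhang + insert
    bottom = bottom_overhang + reverse_complement(insert)
    return top, bottom


if __name__ == "__main__":
    top, bottom = design_guide_oligos("ATTACAGATTACAGATTACA")  # toy guide
    print("top:   ", top)
    print("bottom:", bottom)
```

Rejecting any input that is not exactly 20 nt is a deliberate guard against the common mistake of pasting the guide together with its PAM.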
3. Screening of multiple guides. For most applications, we screen at least three guide sequences within the target genomic region in an effort to find the most efficient ones. Although the CRISPR-Cas9 system works very efficiently, the actual cleavage efficiency can be affected by the sequence of the guide, the accessibility of local chromatin, the activity of the endogenous DNA repair pathways, and other guide-specific or cell-type-specific factors. Hence, this screening process is highly recommended to ensure that a valid guide sequence is obtained. By the same logic, a guide sequence that has been verified in one cell type will not necessarily work with the same efficiency in another cell type or condition. Additional optimization or re-screening of new guide sequences might therefore be required when moving from one experimental system to another. The same applies to HR experiments, where the HDR efficiency can vary considerably among different types of cells or tissues.
4. Additional strategy for screening guides and backbones for different applications. We have also developed another way of quickly screening guide sequences with amplified PCR products.
In this design, two primers are used to amplify the U6 RNA-expression promoter: the forward primer binds to the 5′ beginning of the U6 promoter, and the reverse primer binds to its 3′ end. Because the reverse primer also contains a long extension that adds the guide sequence and the chimeric sgRNA scaffold, the amplified PCR product contains all the elements necessary for expressing an sgRNA containing the guide specified in the reverse primer. Hence, guide sequences can be screened by co-transfecting this PCR product with a backbone vector expressing the SpCas9 protein. Because many applications of CRISPR-Cas9 genome engineering involve cell lines that might be difficult to work with, e.g., cell lines that are hard to transfect, we developed additional backbone vectors to facilitate selection and screening of transfected cells. These vectors contain the fluorescent marker protein GFP, or the selectable puromycin resistance gene, linked to the expression of SpCas9 via a 2A peptide linker. These constructs enable fluorescence-activated cell sorting (FACS) or selection of the transfected population, which can further improve the overall efficiency of genome engineering, particularly in HR applications.
Additional details on these designs and backbones can be found in our recent publication [41].
5. Cell line choice for validation of guide design. Functional validation of targeting constructs bearing the designed guides can be carried out in relevant cell lines, e.g., HEK 293FT, K562, or HeLa for human genome engineering, or Neuro-2a or Hepa1-6 for mouse. This takes advantage of some favorable experimental properties of these lines, such as robust and easy maintenance and efficient transfection, before embarking on complicated procedures in other mammalian systems.
Nonetheless, achieving the best results for each experiment might require additional optimization (see Note 3). Moreover, due to the genetic and epigenetic differences between cell types or subjects of study, results obtained from one cell type might not necessarily correspond to those from another cell type of the same species (see Note 3).
6. Mechanism of SURVEYOR nuclease assay. Following the delivery of SpCas9 and the sgRNAs into mammalian cells, the induced genomic cleavage can be assayed by the SURVEYOR assay, which detects modification of genomic DNA within a population of cells. A fraction of the cells will be modified by SpCas9, so that their genomic sequence at the target site differs from that of the unmodified population.
Hence, in the assay, the region of interest is amplified from genomic DNA by PCR and then taken through a denaturing and re-annealing process to form mismatched DNA. This mismatched DNA can then be recognized by the SURVEYOR nuclease and cleaved for visualization on analytical gels. To quantify the efficiency of genomic cleavage, one can then assess the percentage of cleaved products as a surrogate for the percentage of indels generated within the target genomic region.
7. Analysis of SURVEYOR assay results. To calculate the genome cleavage efficiency of a tested target, quantify the band intensities of the SURVEYOR assay products visualized by PAGE and apply the following formula:
Indel percentage (%) = (1 − √(1 − x)) × 100, where x = (a + b)/(a + b + c)
In this formula, “a” and “b” represent the relative quantities of the cleaved bands, while “c” equals the relative quantity of the uncut full-length PCR product.
Other methods of detecting genomic cleavage can also be applied. One such method is to clone the SURVEYOR PCR products into a sequencing vector, e.g., pUC19, and transform them into E. coli. Individual clones can then be analyzed by Sanger sequencing to reveal the identity of the genome modifications. The percentage of modified clones can also be used as a measure of the efficiency of genome engineering. Alternatively, the PCR products can be sequenced in a more high-throughput way with next-generation sequencing.
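The indel formula in Note 7 can be computed directly from band intensities, as in the Python sketch below (hypothetical function name, made-up example intensities). A brief rationale for the square root, offered as an interpretation rather than a statement from the protocol: if a fraction f of amplicons carries indels and strands re-anneal at random, roughly (1 − f)² of duplexes are perfectly matched and escape cleavage, so the cleaved fraction is x = 1 − (1 − f)², which rearranges to f = 1 − √(1 − x).

```python
import math


def indel_percentage(a, b, c):
    """Indel % = (1 - sqrt(1 - x)) x 100, with x = (a + b) / (a + b + c).

    a, b: relative quantities of the two cleaved bands.
    c:    relative quantity of the uncut full-length PCR product.
    """
    total = a + b + c
    if total <= 0:
        raise ValueError("band intensities must sum to a positive value")
    x = (a + b) / total
    return (1 - math.sqrt(1 - x)) * 100


if __name__ == "__main__":
    # Made-up intensities: 36 % of the signal is in the cleaved bands.
    print(f"{indel_percentage(18, 18, 64):.1f} %")
```

Note that the estimated indel percentage is lower than the raw cleaved fraction x, which is why x should not be reported directly as the editing efficiency.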
8. Design and synthesis of repair template for HR experiment. To introduce a precise modification into the genome, the HDR pathway can be employed. This is achieved by co-transfecting SpCas9 constructs (derived from pX330 or pX335) bearing guide sequences together with an HR template into the target cell line. After recombination, modifications such as point mutations, small and large insertions/deletions, or other types of chromosomal changes can be engineered into the endogenous genome. A few considerations for the choice of guide:
(a) Typically, a screen for the most efficient guide sequence is performed first. We recommend picking several (three to six) targets within the genomic region of interest following the protocols listed earlier. Tests are then performed to assay the cleavage efficiency of each of these guides. The actual HR experiments can then be carried out with the most efficient guides (also see additional considerations in Notes 3 and 4).
(b) To maximize the efficiency of HR, the cleavage site of the guide should be as close as possible to the junction of the homology arms, i.e., the site at which genome modifications are introduced. Usually this distance should be less than 100 bp, ideally less than 10 bp.
(c) To minimize off-target cleavage, the double nickase design can be used. In this case, multiple guide sequences can first be tested individually. Typically, combining the highest-cutting guides with appropriate directionality will yield the highest cleavage when used in the paired fashion, thus giving the best results in the HR experiment.
9. The HR template is essentially the desired sequence that needs to be present in the engineered genome, flanked by two homology arms bearing the same sequence as the reference genome. Below are considerations for the choice of HR template:
(a) It is usually advisable to insert a testable marker in the HR template to facilitate the assay for successful HR events.
For example, a restriction site can be inserted to allow an RFLP assay. Alternatively, fluorescent proteins or selectable drug-resistance genes, such as a puromycin-resistance cassette, can be inserted.
(b) For introducing a single-point mutation, the best HR template for transfection is usually a single-stranded DNA (ssDNA) oligo. For ssDNA oligo design, we typically use homology arms of around 50–90 bp on each side and introduce the mutation/modification between the two arms. When ordering long oligos, Ultramer oligos (IDT) are recommended.
(c) For introducing larger genomic modifications, a plasmid DNA vector can be used because of the length limit of ssDNA oligos. When designing a plasmid-based HR template, homology arms of at least 800 bp on each side are recommended.
(d) An intact “protospacer + PAM” sequence within the HR template can lead to the HR template being degraded by Cas9. Hence, it is recommended to make silent mutations that destroy the sgRNA-binding site, or to avoid including the full target site in the HR template by choosing target sites that span the site of modification. For silent mutations, one good option is to mutate the PAM “NGG” within the HR template, as the PAM is required for cleavage.
For example, changing the “NGG” to “NGT” or “NGC”, in addition to mutations in the spacer itself, can usually prevent degradation of the donor plasmid.
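The check described in consideration (d) can be sketched in Python as a search for an intact “protospacer + PAM” on either strand of the HR template. The function names are hypothetical, and the sketch only detects exact matches of the guide followed by NGG; because Cas9 can tolerate some mismatches, an empty result is a useful sanity check rather than proof that the template is protected.

```python
import re

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def intact_target_sites(template, guide):
    """Return (strand, start) for every exact guide + NGG match in `template`.

    `start` is 0-based on the scanned strand; for "-" it refers to the
    reverse complement of the template.
    """
    template, guide = template.upper(), guide.upper()
    pattern = re.compile("(?=%s[ACGT]GG)" % re.escape(guide))
    hits = []
    for strand, s in (("+", template), ("-", reverse_complement(template))):
        hits.extend((strand, match.start()) for match in pattern.finditer(s))
    return hits


if __name__ == "__main__":
    guide = "GATTACAGATTACAGATTAC"                 # toy guide
    original = "TTT" + guide + "AGG" + "TTT"       # PAM intact
    pam_mutated = "TTT" + guide + "AGT" + "TTT"    # NGG -> NGT
    print(intact_target_sites(original, guide))
    print(intact_target_sites(pam_mutated, guide))
```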
