WO2017050722A1 - Ompg variants - Google Patents

Ompg variants Download PDF

Info

Publication number
WO2017050722A1
WO2017050722A1 PCT/EP2016/072224 EP2016072224W WO2017050722A1 WO 2017050722 A1 WO2017050722 A1 WO 2017050722A1 EP 2016072224 W EP2016072224 W EP 2016072224W WO 2017050722 A1 WO2017050722 A1 WO 2017050722A1
Authority
WO
WIPO (PCT)
Prior art keywords
ompg
variant
nanopore
seq
nucleotide
Prior art date
Application number
PCT/EP2016/072224
Other languages
French (fr)
Inventor
Cynthia CECH
Tim Craig
Christos TZITZILONIS
Alexander Yang
Liv JENSEN
Charlotte YANG
Corissa HARRIS
Original Assignee
Genia Technologies, Inc.
F. Hoffmann-La Roche Ag
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Genia Technologies, Inc., F. Hoffmann-La Roche Ag filed Critical Genia Technologies, Inc.
Priority to CA2998970A priority Critical patent/CA2998970C/en
Priority to JP2018514905A priority patent/JP6956709B2/en
Priority to AU2016326867A priority patent/AU2016326867B2/en
Priority to US15/762,092 priority patent/US10752658B2/en
Priority to EP16775518.0A priority patent/EP3353194B1/en
Priority to CN201680054910.5A priority patent/CN108449941B/en
Publication of WO2017050722A1 publication Critical patent/WO2017050722A1/en
Priority to US16/925,848 priority patent/US11767348B2/en
Priority to US18/449,904 priority patent/US20240010688A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/24Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
    • C07K14/245Escherichia (G)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1241Nucleotidyltransferases (2.7.7)
    • C12N9/1252DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • C12Q1/6874Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y207/00Transferases transferring phosphorus-containing groups (2.7)
    • C12Y207/07Nucleotidyltransferases (2.7.7)
    • C12Y207/07007DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide

Definitions

  • Protein nanopores have become powerful single-molecule analytical tools that enable the study of fundamental problems in chemistry and biology.
  • nanopores have attracted considerable attention because of their potential applications in the detection and analysis of single biomolecules, such as DNA, RNA, and proteins.
  • Molecular detection using a single nanopore is achieved by observing modulations in ionic current flowing through, or the voltage across the pore during an applied potential.
  • a nanopore that spans an impermeable membrane is placed between two chambers that contain an electrolyte, and voltage is applied across the membrane using electrodes. These conditions lead to ionic flux through the pore.
  • Nucleic acid or protein molecules can be driven through the pore, and structural features of the biomolecules are observed as measurable changes in the trans-membrane current or voltage.
  • a challenge of nanopore sequencing is resolving nucleotide sequences at a single base level.
  • One of the factors that hinders the discrimination of individual nucleotide bases is the fluctuation in the ionic current flow through the nanopore that is inherent to the structure of the nanopore.
  • the present disclosure provides variant outer membrane protein G (OmpG) polypeptides, compositions comprising the OmpG variant polypeptides, and methods for using the variant OmpG polypeptides as nanopores for nucleic acid ⁇ e.g., DNA, RNA) and/or polymeric (e.g., protein) sequencing and counting.
  • the variant OmpG nanopores reduce the ionic current noise of the parental OmpG polypeptide from which they are derived.
  • the disclosure provides variant OmpG polypeptides.
  • an isolated variant of a parental OmpG of SEQ ID NO:2 or homolog thereof wherein the variant comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31 , and wherein said variant retains the ability to form a nanopore.
  • the OmpG variant has at least 70% identity to the OmpG of SEQ ID NO:2.
  • the variant OmpG comprises the linker-His-SpyTag construct of SEQ ID NO: 16.
  • an isolated variant of a parental OmpG of SEQ ID NO:2 or homolog thereof wherein the variant comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and a deletion of amino acid D215, and retains the ability to form a pore.
  • the isolated variant can further comprise a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31 of SEQ ID NO:2.
  • the OmpG variant has at least 70% identity to the OmpG of SEQ ID NO:2.
  • the variant OmpG comprises the linker-His-SpyTag construct of SEQ ID NO:16.
  • the isolated OmpG variant comprises a deletion of one or more of amino acids 216-227 and a substitution Y50K. In other embodiments, the OmpG variant further comprises a deletion of D215.
  • the isolated OmpG variant comprises a "circular permutation" in which one or more C-terminal ⁇ -strand(s) of the parental OmpG are moved to the N-terminus of the protein sequence.
  • the C- terminal ⁇ -strand is moved to the N-terminus, retaining the penultimate ⁇ -strand of the parental OmpG as the new C-terminus of the protein.
  • two or more ⁇ -strands are moved to the N-terminus.
  • the variant includes a tag sequence (e.g., comprising a "SpyTag” or “His-SpyTag: sequence, optionally further comprising one or more linker sequences (e.g., SEQ ID NO:16) at the C-terminus, downstream from the penultimate ⁇ -strand of the parental OmpG.
  • the variant includes a linker sequence, e.g., GSG, between the new N-terminal ⁇ -strand that was moved from the C-terminus of the parental OmpG and the ⁇ -strand that was previously at the N-terminus of the parental OmpG.
  • the variant is a variant of the E.
  • the variant comprises movement of amino acid residues 267-280 of SEQ ID NO:2 from the C-terminus to the N-terminus of SEQ ID NO:2, optionally with a linker, e.g., GSG, between previous residue 280 and the N-terminus of SEQ ID NO:2, and optionally with a methionine (M) residue at the N-terminus of the variant, prior to previous residue 267, and optionally with the amino acid sequence depicted in SEQ ID NO:16 at the C-terminus of the variant.
  • the variant has the amino acid sequence depicted in SEQ ID NO: 17.
  • the variant OmpG retains the ability to form a nanopore in a lipid or polymer layer.
  • the OmpG variant displays a reduced ionic current noise when an applied voltage is applied across the lipid bilayer.
  • the variant OmpG has reduced ionic current noise as compared to the parental E. Coli OmpG having the amino acid sequence of SEQ ID NO:2.
  • the variant OmpG can further comprise a genetic polymerase fusion, e.g., the isolated OmpG variant comprises a
  • polymerase that is operably linked to said variant OmpG (still functional after linkage).
  • the variant OmpG enables detection of the incorporation of nucleotides by said polymerase into a growing nucleic acid strand with single nucleotide resolution.
  • the disclosure provides isolated nucleic acids that encode the variant OmpG polypeptides.
  • an isolated nucleic acid comprising a polynucleotide sequence encoding a variant of the parental OmpG of SEQ ID NO:2, wherein said variant OmpG comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and (i) a deletion of amino acid D215; and/or (ii) a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31.
  • the polynucleotide sequence encodes a variant having at least 70% identity to the OmpG of SEQ ID NO:2.
  • the polynucleotide sequence encodes an OmpG circular
  • permutation variant e.g., a circular permutation variant of SEQ ID NO:2 or a homolog thereof, as described above, e.g., SEQ ID NO:17.
  • an expression vector that comprises an isolated nucleic acid that encodes a variant OmpG polypeptide as disclosed herein.
  • the expression vector comprises a nucleic acid comprising a polynucleotide sequence encoding a variant of the parental OmpG of SEQ ID NO:2, wherein said variant OmpG comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and (i) a deletion of amino acid D215, i.e., del215; and/or (ii) a mutation of one or more of amino acids R21 1 ,E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31.
  • the expression vector comprises a nucleic acid encoding a polynucleotide sequence that encodes an OmpG circular permutation variant, e.g., a circular permutation variant of SEQ ID NO:2 or a homolog thereof, as described above, e.g., SEQ ID NO:17.
  • a host microorganism that comprises an expression vector that expresses an OmpG variant described herein.
  • the host microorganism comprises an expression vector comprising a polynucleotide sequence encoding a variant of the parental OmpG of SEQ ID NO:2, wherein said variant OmpG comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and (i) a deletion of amino acid D215; and/or (ii) a mutation of one or more of amino acids R21 1 ,E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31.
  • a polynucleotide sequence encoding a variant of the parental OmpG of SEQ ID NO:2
  • said variant OmpG comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and (i) a deletion of amino acid D215; and/or (
  • the host microorganism comprises an expression vector comprising a polynucleotide sequence encoding an OmpG circular permutation variant, e.g., a circular permutation variant of SEQ ID NO:2 or a homolog thereof, as described above, e.g., SEQ ID NO:17.
  • an OmpG circular permutation variant e.g., a circular permutation variant of SEQ ID NO:2 or a homolog thereof, as described above, e.g., SEQ ID NO:17.
  • a method for producing a variant OmpG in a host cell comprises a) transforming a host cell with an expression vector comprising a nucleic acid encoding a variant of the parental OmpG of SEQ ID NO:2, wherein said variant OmpG comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and (i) a deletion of amino acid D215; and/or (ii) a mutation of one or more of amino acids R21 1.E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31 ; and b) culturing the host cell under conditions suitable for the production of the variant OmpG.
  • the method comprises a) transforming a host cell with an expression vector comprising a polynucleotide sequence that encodes an OmpG circular permutation variant, e.g., a circular permutation variant of SEQ ID NO:2 or a homolog thereof, as described above, e.g., SEQ ID NO:17; and b) culturing the host cell under conditions suitable for the production of the variant OmpG.
  • the method further comprises recovering the produced variant.
  • a method for sequencing a nucleic acid sample with the aid of a variant OmpG nanopore comprises: (a) providing tagged nucleotides into a reaction chamber comprising the variant OmpG nanopore, wherein an individual tagged nucleotide of the tagged nucleotides contains a tag coupled to a nucleotide, which tag is detectable with the aid of said nanopore; (b) carrying out a polymerization reaction with the aid of a single polymerase coupled to said variant OmpG nanopore, thereby incorporating an individual tagged nucleotide of the tagged nucleotides into a growing strand complementary to a single stranded nucleic acid molecule from the nucleic acid sample; and (c) detecting, with the aid of the variant OmpG nanopore, a tag associated with the individual tagged nucleotide during incorporation of the individual tagged nucleotide, wherein the tag is detected with the aid of
  • the chip for sequencing a nucleic acid sample.
  • the chip comprises a plurality of the variant OmpG nanopores disclosed herein, an OmpG nanopore of the plurality being disposed adjacent or in proximity to an electrode, wherein said nanopore is individually addressable and has a single polymerase attached to the nanopore; and wherein an individual nanopore detects the tag associated with the tagged nucleotide during
  • nucleotide incorporation of the nucleotide into a growing nucleic acid chain by the polymerase.
  • composition comprises a plurality of polymerase enzymes, each complexed with a template nucleic acid, each polymerase enzyme attached to a variant OmpG nanopore as disclosed herein or attached proximal to the variant OmpG nanopore, and nucleic acid sequencing reagents including at least one tagged nucleotide or nucleotide analog.
  • Figures 1A and 1 B depict the architecture of the wild-type OmpG pore from E. coli as a ribbon structure ( Figure 1 A) and as a surface representation ( Figure 1 B). The constriction zone of the OmpG nanopore is shown.
  • Figure 2 is a schematic diagram of an embodiment of a circuit used in a nanopore device for controlling an electrical stimulus and for detecting electrical signatures of an analyte molecule.
  • Figures 3A-3B show a schematic diagram of an embodiment of a chip that includes a nanopore device array.
  • a perspective view is shown in Figure 3A.
  • a cross-sectional view of the chip is shown in Figure 3B.
  • Figures 4A-4E depict single channel current traces obtained at an applied constant voltage for OmpG variants comprising a deletion of amino acids 216-227, and amino acid substitution E229A ( ⁇ .6/ ⁇ 229 ⁇ ) (as shown in (SEQ ID NO:5)), and the amino acid substitution Y50K (SEQ ID NO:6; ( Figure 4A)), R68N (SEQ ID NO:7; ( Figure 4B)), R21 1 N (SEQ ID NO:8; ( Figure 4C)), E17K (SEQ ID NO:9: ( Figure 4D)), and the amino acid deletion del215 (SEQ ID NO:10; ( Figure 4E)).
  • Figures 5A-5B depict the mean open channel (OC) current (filled bar) and the percentage of events greater than 1 standard deviation of the mean open channel in both higher and lower directions (Figure 5A) and the mean open channel current in black (filled bar) and the percentage of events greater than 1 std deviation of the mean open channel in only the lower direction (downward current only) ( Figure 5B) determined for each of the OmpG variants of ⁇ .6/ ⁇ 229 ⁇ (SEQ ID NO:5), and the amino acid substitution ⁇ .6/ ⁇ 229 ⁇ - ⁇ 50 ⁇ (SEQ ID NO:6), AL6/E229A-R68N (SEQ ID NO:7), AL6/E229A-R21 1 N (SEQ ID NO:8),
  • AL6/E229A-E17K (SEQ ID NO:9), and the amino acid deletion AL6/E229A-del215 (SEQ ID NO:10); (del215)); AL6/E229A-del215-Y50K (SEQ ID NO:1 1 ).
  • Figure 6 depicts the single nucleotide resolution of a mixture of four different tagged nucleotides shown as changes in baseline open channel direct current as each of the tagged nucleotides is detected by the variant OmpG nanopore AL6/E229A-del215 (SEQ ID NO:10). Measurements were made with the application of a direct current (DC).
  • DC direct current
  • Figures 7A-7D depict the identification of each of the tagged nucleotides detected in Figure 6 by the variant OmpG nanopore AL6/E229A-del215 (SEQ ID NO:10) as separate changes in baseline open channel current for each of the four tagged nucleotides. Measurements were made with the application of a direct current (DC).
  • DC direct current
  • Figure 8 depicts an expanded view of the single nucleotide resolution of the mixture of four different tagged nucleotides shown in Figure 6 as detected by the variant OmpG nanopore AL6/E229A-del215 (SEQ ID NO:10). Measurements were made with the application of a direct current (DC).
  • DC direct current
  • Figures 9A -9E depicts a protein alignment of bacterial membrane protein homologs of the OmpG from E. coli (SEQ ID NOS 1 and 12-15, respectively, in order of appearance).
  • Figures 10A - 10B schematically depict OmpG antiparallel ⁇ -strands ( Figure 10A) and an embodiment of a circular permutation variant thereof ( Figure 10B).
  • Figure 11 shows the arrangement of the C-terminus of OmpG in the wild- type parental OmpG versus the circular permutation variant as described herein, relative to the constriction site of the nanopore protein.
  • outer membrane (OM) of Gram-negative bacteria contains a large number of channel proteins that mediate the uptake of ions and nutrients necessary for growth and functioning of the cell.
  • outer membrane protein G (OmpG) from Escherichia coli (E. coli) functions as a monomer.
  • the crystal structure of E. coli K12 OmpG has been determined (Subbarao and van den Berg, J Mol Biol, 360:750-759 [2006]). The structure shows that the OmpG barrel consists of 14 ⁇ -strands connected by seven flexible loops on the extracellular side and seven short turns on the periplasmic side ( Figure 1A).
  • the OmpG channel has its largest diameter (20-22 A) at the periplasmic exit and tapers to a constriction located close to the extracellular side (Figure 1 B).
  • the constriction is formed by the side chains of inward pointing residues of the barrel wall, and not by surface loops folding inwards.
  • This architecture gives rise to a relatively large central pore with a circular shape and a diameter of about 13 A.
  • the nanopore spontaneously transitions between open and closed states during an applied potential, which gives rise to flickering single channel currents.
  • the longest of the extracellular loop of OmpG, loop 6, has been recognized as the main gating loop that closes the pore at low pH and opens it at high pH.
  • the present disclosure provides variant OmpG polypeptides, compositions comprising the variant OmpG polypeptides, and methods for using the variant OmpG polypeptides as nanopores for determining the sequence of single stranded nucleic acids.
  • the variant OmpG nanopores reduce the ionic current noise of the parental OmpG polypeptide from which they are derived and thereby enable sequencing of polynucleotides with single nucleotide resolution.
  • the reduced ionic current noise also provides for the use of these OmpG nanopore variants in other single molecule sensing applications, e.g., protein sequencing.
  • variant refers to an OmpG derived from another (i.e., parental) OmpG and contains one or more amino acid mutations ⁇ e.g., amino acid deletion, insertion or substitution) as compared to the parental OmpG.
  • isolated refers to a molecule, e.g., a nucleic acid molecule, that is separated from at least one other molecule with which it is ordinarily associated, for example, in its natural environment.
  • An isolated nucleic acid molecule includes a nucleic acid molecule contained in cells that ordinarily express the nucleic acid molecule, but the nucleic acid molecule is present extrachromosomally or at a chromosomal location that is different from its natural chromosomal location.
  • mutation refers to a change introduced into a parental sequence, including, but not limited to, substitutions, insertions, deletions
  • the consequences of a mutation include, but are not limited to, the creation of a new character, property, function, phenotype or trait not found in the protein encoded by the parental sequence.
  • wild-type refers to a gene or gene product, which has the characteristics of that gene or gene product when isolated from a naturally- occurring source.
  • nucleotide herein refers to a monomeric unit of DNA or RNA consisting of a sugar moiety (pentose), a phosphate, and a nitrogenous
  • heterocyclic base The base is linked to the sugar moiety via the glycosidic carbon (1 ' carbon of the pentose) and that combination of base and sugar is a nucleoside.
  • nucleoside contains a phosphate group bonded to the 3' or 5' position of the pentose it is referred to as a nucleotide.
  • a sequence of operatively linked nucleotides is typically referred to herein as a "base sequence” or “nucleotide sequence,” and is represented herein by a formula whose left to right orientation is in the conventional direction of 5'-terminus to 3'-terminus.
  • DNA deoxyribonucleic acid
  • RNA ribonucleic acid
  • polymerase herein refers to an enzyme that catalyzes the polymerization of nucleotides (i.e., the polymerase activity).
  • the term polymerase encompasses DNA polymerases, RNA polymerases, and reverse transcriptases.
  • a "DNA polymerase” catalyzes the polymerization of deoxyribonucleotides.
  • An “RNA polymerase” catalyzes the polymerization of ribonucleotides.
  • a “reverse transcriptase” catalyzes the polymerization of deoxyribonucleotides that are complementary to an RNA template.
  • template DNA molecule refers to a strand of a nucleic acid from which a complementary nucleic acid strand is synthesized by a DNA polymerase, for example, in a primer extension reaction.
  • template-dependent manner refers to a process that involves the template dependent extension of a primer molecule (e.g., DNA synthesis by DNA polymerase).
  • template-dependent manner typically refers to polynucleotide synthesis of RNA or DNA wherein the sequence of the newly synthesized strand of polynucleotide is dictated by the well-known rules of complementary base pairing (see, for example, Watson, J. D. et al., In: Molecular Biology of the Gene, 4th Ed., W. A. Benjamin, Inc., Menlo Park, Calif. (1987)).
  • tag refers to a detectable moiety that may be one or more atom(s) or molecule(s), or a collection of atoms and molecules.
  • a tag may provide an optical, electrochemical, magnetic, or electrostatic (e.g., inductive, capacitive) signature.
  • a tag may block the flow of current through a nanopore.
  • nanopore refers to a pore, channel or passage formed or otherwise provided in a membrane.
  • a membrane may be an organic membrane, such as a lipid bilayer, or a synthetic membrane, such as a membrane formed of a polymeric material.
  • the nanopore may be disposed adjacent or in proximity to a sensing circuit or an electrode coupled to a sensing circuit, such as, for example, a complementary metal oxide semiconductor (CMOS) or field effect transistor (FET) circuit.
  • CMOS complementary metal oxide semiconductor
  • FET field effect transistor
  • a nanopore has a characteristic width or diameter on the order of 0.1 nm to about 1000 nm.
  • Some nanopores are proteins. OmpG is an example of a protein nanopore.
  • spontaneous gating refers to changes in ion current related to the channel's inherent structural changes. For example, OmpG in planar lipid bilayers undergoes pH-dependent rapid fluctuations between open and closed states of the pore, which manifest themselves as intense "flickering" in current recordings and contributes to the overall noise of the channel.
  • the tertiary make-up of the nanopore barrel can comprise more than one recognition site for the analyte that is being sensed by the nanopore thereby inducing additional signals that contribute to the overall noise of the channel.
  • upward noise refers to fluctuations of ionic current to levels greater than mean open channel current.
  • downstream noise refers to fluctuations of ionic current to levels lower than mean open channel current.
  • positive current refers to a current in which a positive charge, e.g., K + , moves through the pore from the trans to the c/ ' s side, or negative charge, e.g., CI " , moves from the c/ ' s to the trans side.
  • a positive charge e.g., K +
  • negative charge e.g., CI "
  • constriction amino acids refers to the amino acids that determine the size of the OmpG pore at the constriction zone.
  • the constriction zone may be the same as the constriction zone of the wild-type OmpG or it may be a constriction zone introduced via protein engineering, or by the introduction of a molecular adapter.
  • parental refers to an OmpG to which modifications, e.g., substitution(s), insertion(s), deletion(s), and/or truncation(s), are made to produce the OmpG variants disclosed herein.
  • This term also refers to the polypeptide with which a variant is compared and aligned.
  • the parent may be a naturally occurring (wild type) polypeptide, or it may be a variant thereof, prepared by any suitable means.
  • "parental" proteins are homologs of one another.
  • purified refers to a polypeptide, e.g., a variant OmpG polypeptide, that is present in a sample at a concentration of at least 95% by weight, or at least 98% by weight of the sample in which it is contained.
  • nucleoside triphosphates e.g., (S)-Glycerol nucleoside triphosphates (gNTPs) of the common nucleobases: adenine, cytosine, guanine, uracil, and thymidine (Horhota et al., Organic Letters, 8:5345-5347 [2006]).
  • nucleoside tetraphosphate nucleoside pentaphosphates and nucleoside hexaphosphates.
  • tagged nucleotide refers to a nucleotide that includes a tag (or tag species) that is coupled to any location of the nucleotide including, but not limited to a phosphate (e.g., terminal phosphate), sugar or nitrogenous base moiety of the nucleotide.
  • Tags may be one or more atom(s) or molecule(s), or a collection of atoms and molecules.
  • a tag may provide an optical, electrochemical, magnetic, or electrostatic (e.g., inductive, capacitive) signature, which signature may be detected with the aid of a nanopore (US2014/013616).
  • a tag can also be attached to a polyphosphate as is shown in Figure 13 of US2014/013616.
  • the disclosure provides variant OmpG polypeptides.
  • the variant OmpG polypeptides can be derived from a parental OmpG of E. coli, for example, the parental OmpG depicted in SEQ ID NO:2.
  • a parental OmpG can be a homolog of the parental OmpG from E. coli.
  • E. coli sp. strain K12 OmpG (SEQ ID NO: 2) is used as a starting point for discussing variant OmpGs herein, it will be appreciated that other gram- negative bacterial OmpGs having a high degree of homology to the E. coli sp. strain K12 OmpG may serve as a parental OmpG within the scope of the compositions and methods disclosed herein. This is particularly true of other naturally-occurring bacterial OmpGs that include only minor sequence differences in comparison to E. coli sp. strain K12 OmpG, not including the substitutions, deletions, and/or insertions that are the subject of the present disclosure. For example, OmpG homologs expressed in Salmonella sp., Shigella sp., and
  • Pseudomonas sp. can be used as parental OmpG polypeptides from which variant forms can be derived.
  • the nanopore is a pore from a mitochondrial membrane.
  • Homologs of the parental OmpG from E. coli can share sequence identity with the OmpG from E. coli (SEQ ID NO:1 of at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%.
  • a variant OmpG can be derived from a homolog of the E. coli OmpG that is at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identical to the parental OmpG from E. coli.
  • the parental OmpG is the OmpG from the E. coli sp. strain K12. The polypeptide sequence of the full length E.
  • SEQ ID NO:1 is the mature form of the full-length E. coli OmpG polypeptide depicted in SEQ ID NO:1 .
  • the parental polypeptide is a wild-type OmpG polypeptide.
  • the parental polypeptide is an OmpG variant to which additional mutations can be introduced to improve the ability of the OmpG polypeptide to reduce ionic current noise.
  • the variant OmpG retains the ability to form a nanopore.
  • the parental OmpG polypeptide is the wild- type E. coli OmpG polypeptide of SEQ ID NO:1 , or the mature form thereof (SEQ ID NO:2). It is understood that the variant OmpG polypeptides can be expressed having the N-terminal Met.
  • the parental OmpG polypeptide is a variant OmpG polypeptide from which the amino acids that comprise loop 6 are deleted.
  • the parental OmpG is the OmpG of SEQ ID NO:2 from which the amino acids that comprise loop 6 have been deleted.
  • the OmpG of SEQ ID NO:3 is the mature form of the wild-type OmpG (SEQ ID NO:2) from which amino acids 216- 227 have been deleted and amino acid 229 is replaced by an Ala, i.e., ⁇ 216- 227/E229A.
  • SEQ ID NO:3 comprises a sequence of amino acids at the C- terminus that denotes the Nnker-His6-linker-SpyTag sequences ("His6" disclosed as SEQ ID NO: 18) as described elsewhere herein.
  • Variant OmpGs comprising a deletion of loop 6 and substitution of Ala at amino acid 229, i.e., ⁇ 216-227/ ⁇ 229 ⁇ are interchangeably denoted by ⁇ .6/ ⁇ 229 ⁇ .
  • truncation of loop 6 can be made by deleting one or more of amino acids 216-227 of SEQ ID NO:2.
  • amino acids 216-227, inclusive are deleted.
  • the numbering of the amino acids refers to the amino acid positions of SEQ ID NO: 2.
  • the variant OmpG is a variant of the parental OmpG of SEQ ID NO:2 that comprises a deletion of amino acids 216-227, i.e., ⁇ 216-227.
  • the variant OmpG comprises E229A, i.e., ⁇ 216-227/E229A.
  • the variant OmpG comprises a deletion of D215, i.e., ⁇ 215-227/ ⁇ 229 ⁇ .
  • constriction zone amino acids at the constriction zone of the OmpG pore are identified as contributing to the symmetry of the lining and/or the length of the constriction of OmpG.
  • the constriction zone amino acids can be mutated to shorten the length of the constriction and/or even the width of the internal diameter of the constriction. Mutagenesis of the constriction amino acids can be designed to create a unique constriction zone. Mutations of the constriction zones reduce the ion current noise of the variant OmpG when compared to the parental OmpG from which the variant is derived.
  • the variant OmpG polypeptide provided comprises one or more mutations of amino acids that are positioned at the constriction zone at the extracellular side of the OmpG nanopore.
  • the variant OmpG polypeptide can be further mutated to bind molecular adaptors, which while resident in the pore, slow the movement of analytes, e.g., nucleotide bases, through the pore and consequently improve the accuracy of the identification of the analyte (Astier et al., J Am Chem Soc 10.1021/ja057123+, published online on December 30, 2005).
  • the mutation in the constriction zone is selected from amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and/or E31 .
  • a mutation of the amino acids at the constriction zone can be one or more of a substitution, a deletion or an insertion, for example, a substitution of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and/or E31 .
  • At least one amino acid mutation is located in the constriction zone of OmpG. In other embodiments, at least two, at least three, at least four, at least five, or at least six amino acids of the constriction zone are mutated. In some embodiments, the at least one amino acid mutation at the constriction zone is the substitution Y50K. In some other embodiments, the at least one amino acid mutation at the constriction zone is the substitution Y50N. The at least one amino acid mutation at the constriction zone can be combined with the deletion of one or more of the amino acids of loop 6.
  • the variant OmpG is derived from a parental OmpG, e.g., the OmpG depicted in SEQ ID NO:2, and comprises a deletion of amino acids 216- 227 and substation of Ala at amino acid 229,, i.e., ⁇ 216-227/ ⁇ 229 ⁇ , and a mutation of at least one amino acid of the constriction zone of the wild-type OmpG, e.g., a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and/or E31.
  • the variant OmpG comprises a deletion of loop 6, a mutation of one or more amino acids at the constriction zone, and the deletion D215.
  • the variants comprises a deletion of loop 6, a mutation of one or more amino acids at the constriction zone, and the deletion D215.
  • OmpG is a variant of a parental OmpG of SEQ ID NO:2, and comprises a deletion of amino acids 216-227 and substitution of Ala at amino acid 229, i.e., ⁇ 216- 227/E229A, a mutation of at least one of amino acids of the constriction zone, e.g., a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and/or E31 , and del215.
  • the variant OmpG is a variant of a parental OmpG of SEQ ID NO:2, and comprises a deletion of amino acids 216-227, i.e., ⁇ 216-227, substitution E229A, deletion of D215, and amino acid substitution Y50K.
  • a "circular permutation variant of OmpG wherein the C-terminal ⁇ -strand of the parental OmpG is moved to the N-terminus of the protein sequence, retaining the penultimate ⁇ -strand of the parental OmpG as the new C-terminus of the protein.
  • This is depicted schematically in Figures 10 and 1 1.
  • the result of movement of the C-terminal ⁇ -strand in this manner is that the new C-terminus of the variant is closer to the constriction site of the nanopore than in the parental OmpG from which the variant was derived (see Figure 1 1 ).
  • the proximity to the constriction point is advantageous because it allows for improved capture of molecules for analysis by the nanopore.
  • This improved capture may be due to a reduction in the energy barrier of NanoTag threading.
  • the placement of the N-terminus and C-terminus on opposite sides of the lipid or polymer layer also allows for the attachment of two nucleic acid modifying enzymes, doubling the throughput of the nanopore-based instrument.
  • a circular permutation variant as described herein includes a tag sequence ⁇ e.g., comprising a "SpyTag” or "His-SpyTag” sequence, optionally further comprising one or more linker sequences (e.g., SEQ ID NO:16) at the C- terminus, downstream from the penultimate ⁇ -strand of the parental OmpG.
  • the variant includes a linker sequence, e.g., GSG, between the new N- terminal ⁇ -strand that was moved from the C-terminus of the parental OmpG and the ⁇ -strand that was previously at the N-terminus of the parental OmpG.
  • the variant is a variant of the E.
  • the variant comprises movement of amino acid residues 267-280 of SEQ ID NO:2 to the N-terminus of SEQ ID NO:2, optionally with a linker, e.g., GSG, between previous residue 280 and the N- terminus of SEQ ID NO:2, and optionally with a methionine (M) residue at the N- terminus of the variant, prior to previous residue 267, and optionally with the amino acid sequence depicted in SEQ ID NO:16 at the C-terminus of the variant.
  • the variant has the amino acid sequence depicted in SEQ ID NO: 17.
  • DNA sequences encoding a parent OmpG may be isolated from any cell or microorganism producing the OmpG in question, using various methods well known in the art.
  • a genomic DNA and/or cDNA library can be constructed using chromosomal DNA or messenger RNA from the organism that produces the OmpG to be studied.
  • homologous, labeled oligonucleotide probes may be synthesized and used to identify OmpG-encoding clones from a genomic library prepared from the organism in question.
  • a labeled oligonucleotide probe containing sequences homologous to a known OmpG gene can be used as a probe to identify OmpG-encoding clones, using hybridization and washing conditions of lower stringency.
  • the DNA sequence encoding the OmpG may be prepared synthetically by established standard methods, e.g., the phosphoroamidite method described by S. L. Beaucage and M. H. Caruthers (1981 ) Tetrahedron Letters 22:1859-1862 or the method described by Matthes et al. (1984) EMBO J.
  • oligonucleotides are synthesized, e.g., in an automatic DNA synthesizer, purified, annealed, ligated and cloned in appropriate vectors.
  • the DNA sequence may be of mixed genomic and synthetic origin, mixed synthetic and cDNA origin or mixed genomic and cDNA origin, prepared by ligating fragments of synthetic, genomic or cDNA origin (as appropriate, the fragments corresponding to various parts of the entire DNA sequence), in accordance with standard techniques.
  • the DNA sequence may also be prepared by polymerase chain reaction (PCR) using specific primers, for instance as described in U.S. Pat. No. 4,683,202 or R. K. Saiki et al. (1988) Science
  • mutations may be introduced using synthetic oligonucleotides. These oligonucleotides contain nucleotide sequences flanking the desired mutation sites; mutant nucleotides are inserted during oligonucleotide synthesis.
  • a single-stranded gap of DNA, bridging the OmpG-encoding sequence, or portion thereof, is created in a vector carrying the OmpG gene.
  • the synthetic nucleotide, bearing the desired mutation is annealed to a homologous portion of the single-stranded DNA.
  • Alternative methods for providing variants include gene shuffling, e.g., as described in WO 95/22625 (from Affymax Technologies N.V.) or in WO 96/00343 (from Novo Nordisk A/S), or other corresponding techniques resulting in a hybrid enzyme comprising the mutation(s), e.g., substitution(s) and/or deletion(s), in question.
  • a DNA sequence encoding an OmpG variant can be used to express a variant OmpG, using an expression vector, which typically includes control sequences encoding a promoter, an operator, a ribosome binding site, a translation initiation signal, and, optionally, a repressor gene or various activator genes.
  • expression vectors typically includes control sequences encoding a promoter, an operator, a ribosome binding site, a translation initiation signal, and, optionally, a repressor gene or various activator genes.
  • vectors that can be used for expressing variant OmpGs include the vectors of the pET expression system (Novagen).
  • a recombinant expression vector carrying DNA sequences encoding an OmpG variant may be any vector, which may conveniently be subjected to recombinant DNA procedures, and the choice of vector will often depend on the host cell into which it is to be introduced. Thus, the vector may be an
  • autonomously replicating vector i.e., a vector which exists as an
  • extrachromosomal entity the replication of which is independent of chromosomal replication, e.g., a plasmid, a bacteriophage or an extrachromosomal element, a minichromosome or an artificial chromosome.
  • the vector may be one which, when introduced into a host cell, is integrated into the host cell genome and replicates together with the chromosome(s) into which it has been integrated.
  • An OmpG variant can be produced in a cell that may be of a higher organism such as a mammal or an insect, but is preferably a microbial cell, e.g., a bacterial or a fungal (including yeast) cell.
  • suitable bacteria are gram- negative bacteria such as E. coli, or gram-positive bacteria such as Bacillus sp., e.g., Bacillus subtilis, Bacillus licheniformis, Bacillus lentus, Bacillus brevis,
  • Geobacillus stearothermophilus Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus coagulans, Bacillus circulars, Bacillus lautus, Bacillus megaterium, or Bacillus thuringiensis, or Streptomyces sp., e.g., Streptomyces lividans or Streptomyces murinus.
  • a yeast organism may be selected from a species of Saccharomyces or Schizosaccharomyces, e.g., Saccharomyces cerevisiae, or from a filamentous fungus, such as Aspergillus sp., e.g., Aspergillus oyzae or Aspergillus niger.
  • the host cell is typically bacterial and preferably E. coli.
  • a method of producing an OmpG variant comprises cultivating a host cell as described above under conditions conducive to the production of the variant and recovering the variant from the cells and/or culture medium.
  • the medium used to cultivate the cells may be any conventional medium suitable for growing the host cell in question and obtaining expression of the OmpG variant. Suitable media are available from commercial suppliers or may be prepared according to published recipes (e.g., as described in catalogues of the American Type Culture Collection).
  • the OmpG variant secreted from the host cells may conveniently be recovered from the culture medium by well-known procedures, including separating the cells from the medium by centrifugation or filtration, and
  • purification of the variant OmpG may be obtained by affinity chromatography of OmpG polypeptides linked to an affinity tag.
  • affinity or epitope tags that can be used in the purification of the OmpG variants include hexahistidine tag (SEQ ID NO: 18), FLAG tag, Strep II tag, streptavidin-binding peptide (SBP) tag, calmodulin-binding peptide (CBP), glutathione S-transferase (GST), maltose-binding protein (MBP), S-tag, HA tag, and c-Myc tag.
  • SBP streptavidin-binding peptide
  • CBP calmodulin-binding peptide
  • GST glutathione S-transferase
  • MBP maltose-binding protein
  • S-tag HA tag
  • c-Myc tag a hexahistidine tag
  • linkers contemplated as useful in linking the nanopore to a polymerase include (GGGGS)i -3 (SEQ ID NO: 19), EKEKEKGS (SEQ ID NO: 20), His6-GSGGK (SEQ ID NO: 21 ), and AH I VMVDAYKPTK (SEQ ID NO: 22) (SpyTag).
  • the protein linkers can be encoded by the nucleic acid that comprises the sequence encoding the variant OmpG, and may be expressed as a fusion protein.
  • the variant OmpG can be expressed as OmpG-(EK) 3 - HiSe-GSGG-SpyTag (EK EKEKGSHHHH HHGSGGAHIV MVDAYKPTK (SEQ ID NO:16)) expressed for example, as amino acids 269-299 of SEQ ID NO:3.
  • the His 6 tag (SEQ ID NO: 18) is expressed N-terminal to the variant OmpG polypeptide.
  • Characterization of the variant OmpG can include determining any property of the molecule that causes a variance in a measurable electrical signature. For example, reduction in gating frequency may be derived from measuring a decrease in upward and/or downward gating through the nanopore as a constant voltage is applied across the variant OmpG nanopore. Additionally,
  • characterization of the variant OmpG can include identifying tags of individual tagged nucleotides which are complementary to a DNA or RNA strand by measuring a variance in ionic current flow through the nanopore as the tags of individual nucleotides are detected in proximity or in passing through the OmpG nanopore.
  • the base sequence of a segment of a DNA or RNA molecule can be determined by comparing and correlating the measured electrical signature(s) of tags of tagged nucleotides, as the growing nucleic acid strand is synthesized.
  • measurements of ionic current flow through the OmpG nanopore are made across nanopores that have been reconstituted into a lipid membrane.
  • the OmpG nanopore is inserted in the membrane (e.g., by electroporation).
  • the nanopore can be inserted by a stimulus signal such as electrical stimulus, pressure stimulus, liquid flow stimulus, gas bubble stimulus, sonication, sound, vibration, or any combination thereof.
  • the membrane is formed with aid of a bubble and the nanopore is inserted in the membrane with aid of an electrical stimulus.
  • FIG. 2 is a schematic diagram of a nanopore device 100 that can be used to characterize a polynucleotide or a polypeptide.
  • the nanopore device 100 includes a lipid bilayer 102 formed on a lipid bilayer compatible surface 104 of a conductive solid substrate 106, where the lipid bilayer compatible surface 104 may be isolated by lipid bilayer incompatible surfaces 105 and the conductive solid substrate 106 may be electrically isolated by insulating materials 107, and where the lipid bilayer 102 may be surrounded by amorphous lipid 103 formed on the lipid bilayer incompatible surface 105.
  • the lipid bilayer comprising the nanopore can be disposed over a well, where a sensor forms part of the surface of the well.
  • the lipid bilayer 102 is embedded with a single nanopore structure 108 having a nanopore 1 10 large enough for passing of at least a portion of the molecule 1 12 being characterized and/or small ions (e.g., Na + , K + , Ca 2+ , CI " ) between the two sides of the lipid bilayer 102.
  • a layer of water molecules 1 14 may be adsorbed on the lipid bilayer compatible surface 104 and sandwiched between the lipid bilayer 102 and the lipid bilayer compatible surface 104.
  • the aqueous film 1 14 adsorbed on the hydrophilic lipid bilayer compatible surface 104 may promote the ordering of lipid molecules and facilitate the formation of lipid bilayer on the lipid bilayer compatible surface 104.
  • a sample chamber 1 16 containing a solution of the molecule 1 12 may be provided over the lipid bilayer 102 for introducing the molecule 1 12 for characterization.
  • the solution may be an aqueous solution containing electrolytes and buffered to an optimum ion concentration and maintained at an optimum pH to keep the nanopore 1 10 open.
  • the device includes a pair of electrodes 1 18 (including a negative node 1 18a and a positive node 1 18b) coupled to a variable voltage source 120 for providing electrical stimulus (e.g., voltage bias) across the lipid bilayer and for sensing electrical characteristics of the lipid bilayer (e.g., resistance, capacitance, and ionic current flow).
  • the surface of the positive electrode 1 18b is or forms a part of the lipid bilayer compatible surface 104.
  • the conductive solid substrate 106 may be coupled to or forms a part of one of the electrodes 1 18.
  • the device 100 may also include an electrical circuit 122 for controlling electrical stimulation and for processing the signal detected.
  • the variable voltage source 120 is included as a part of the electrical circuit 122.
  • the electrical circuitry 122 may include amplifier, integrator, noise filter, feedback control logic, and/or various other components.
  • the electrical circuitry 122 may be integrated electrical circuitry integrated within a silicon substrate 128 and may be further coupled to a computer processor 124 coupled to a memory 126.
  • the nanopore device 100 of Figure 2 is an OmpG nanopore device having a single OmpG protein 108, e.g., a variant OmpG as described herein, embedded in a lipid bilayer 102 formed over a lipid bilayer compatible silver-gold alloy surface 104 coated on a copper material 106.
  • the lipid bilayer compatible silver-gold alloy surface 104 is isolated by lipid bilayer incompatible silicon nitride surfaces 105, and the copper material 106 is electrically insulated by silicon nitride materials 107.
  • the copper 106 is coupled to electrical circuitry 122 that is integrated in a silicon substrate 128.
  • the lipid bilayer may comprise or consist of phospholipid, for example, selected from diphytanoyl-phosphatidylcholine (DPhPC), 1 ,2-diphytanoyl-sn- glycero-3phosphocholine, 1 ,2-Di-0-Phytanyl-sn-Glycero-3-phosphocholine
  • DPhPC diphytanoyl-phosphatidylcholine
  • 1 ,2-diphytanoyl-sn- glycero-3phosphocholine 1 ,2-Di-0-Phytanyl-sn-Glycero-3-phosphocholine
  • DoPhPC palmitoyl-oleoyl-phosphatidylcholine
  • POPC palmitoyl-oleoyl-phosphatidylcholine
  • DOPME dioleoyl-phosphatidyl- methylester
  • DPPC dipalmitoylphosphatidylcholine
  • the nanopores can form an array.
  • the disclosure provides an array of nanopore detectors (or sensors).
  • Figure 3A is a top view of a schematic diagram of an embodiment of a nanopore chip 300 having an array 302 of individually addressable nanopore devices 100 having a lipid bilayer compatible surface 104 isolated by lipid bilayer incompatible surfaces 105.
  • Each nanopore device 100 is complete with a control circuit 122 integrated on a silicon substrate 128.
  • side walls 136 may be included to separate groups of nanopore devices 100 so that each group may receive a different sample for characterization.
  • the nanopore chip 300 may include a cover plate 128.
  • the nanopore chip 300 may also include a plurality of pins 304 for interfacing with a computer processor.
  • the nanopore chip 300 may be coupled to (e.g., docked to) a nanopore workstation 306, which may include various components for carrying out (e.g., automatically carrying out) the various embodiments of the processes of the present invention, including for example, analyte delivery mechanisms such as pipettes for delivering lipid suspension, analyte solution, and/or other liquids, suspension or solids, robotic arms, computer processor, and/or memory.
  • Figure 3B is a cross sectional view of the nanopore chip 300.
  • a plurality of polynucleotides may be detected on an array of nanopore detectors.
  • each nanopore location comprises a nanopore, in some cases attached to a polymerase enzyme as described elsewhere herein.
  • Each of the nanopores can be individually
  • the methods of the invention involve the measuring of a current passing through the pore during interaction with a nucleotide.
  • sequencing a nucleic acid molecule can require applying a direct current (e.g., so that the direction at which the molecule moves through the nanopore is not reversed).
  • a direct current e.g., so that the direction at which the molecule moves through the nanopore is not reversed.
  • AC alternating current
  • Suitable conditions for measuring ionic currents through transmembrane protein pores are known in the art and examples are provided herein in the Experimental section.
  • the method is carried out with a voltage applied across the membrane and pore.
  • the voltage used is typically from -400 mV to +400 mV.
  • the voltage used is preferably in a range having a lower limit selected from -400 mV, - 300 mV, -200 mV, -150 mV, -100 mV, -50 mV, -20 mV and 0 mV and an upper limit independently selected from +10 mV, +20 mV, +50 mV, +100 mV, +150 mV, +200 mV, +300 mV and +400 mV.
  • the voltage used is more preferably in the range 100 mV to 240 mV and most preferably in the range of 160 mV to 240 mV. It is possible to increase discrimination between different nucleotides by a pore of the invention by using an increased applied potential. Sequencing nucleic acids using AC waveforms and tagged nucleotides is described in US Patent Publication US2014/0134616 entitled "Nucleic Acid Sequencing Using Tags", filed on
  • sequencing can be performed using nucleotide analogs that lack a sugar or acyclic moiety, e.g., (S)-Glycerol nucleoside triphosphates (gNTPs) of the five common nucleobases: adenine, cytosine, guanine, uracil, and thymidine (Horhota et al. Organic Letters, 8:5345-5347 [2006]).
  • S S-Glycerol nucleoside triphosphates
  • a polymerase e.g., DNA polymerase
  • the polymerase can be attached to the nanopore before or after the nanopore is incorporated into the membrane.
  • the polymerase is attached to the OmpG protein monomer and then the nanopore polymerase complex can then be inserted into the membrane.
  • An exemplary method for attaching a polymerase to a nanopore involves attaching a linker molecule to an OmpG monomer or mutating the OmpG to have an attachment site or attachment linker, and then attaching a polymerase to the attachment site or attachment linker (e.g., in bulk, before inserting into the membrane).
  • the polymerase can also be attached to the attachment site or attachment linker after the nanopore is formed in the membrane.
  • a plurality of nanopore-polymerase pairs is inserted into a plurality of membranes (e.g., disposed over the wells and/or electrodes) of a biochip, thereby forming a nanopore chip as described herein.
  • the attachment of the polymerase to the nanopore to form a nanopore-polymerase complex occurs on the biochip above each electrode.
  • the polymerase can be attached to the nanopore with any suitable chemistry (e.g., covalent bond and/or linker).
  • the polymerase is expressed as a fusion protein that comprises a SpyCatcher polypeptide, which can be covalently bound to an OmpG nanopore that comprises a SpyTag peptide.
  • the polymerase is attached to the nanopore with molecular staples.
  • molecular staples comprise three amino acid sequences (denoted linkers A, B and C).
  • Linker A can extend from an OmpG polypeptide
  • Linker B can extend from the polymerase
  • Linker C then can bind Linkers A and B (e.g., by wrapping around both Linkers A and B) and thus bind the polymerase to the nanopore.
  • Linker C can also be constructed to be part of Linker A or Linker B, thus reducing the number of linker molecules.
  • the polymerase is linked to the nanopore using
  • SolulinkTM chemistry can be a reaction between HyNic (6-hydrazino- nicotinic acid, an aromatic hydrazine) and 4FB (4-formylbenzoate, an aromatic aldehyde).
  • the polymerase is linked to the nanopore using Click chemistry (available from LifeTechnologies for example).
  • Click chemistry available from LifeTechnologies for example.
  • zinc finger mutations are introduced into the OmpG molecule and then a molecule is used (e.g., a DNA intermediate molecule) to link the polymerase to the zinc finger sites on the OmpG.
  • Specific linkers contemplated as useful herein are (GGGGS)i -3 (SEQ ID NO: 19), K-tag (RSKLG (SEQ ID NO: 23)) on N-terminus, ATEV site (12-25), ATEV site + N- terminus of SpyCatcher (12-49).
  • the polymerase may be coupled to the nanopore by any suitable means. See, for example, PCT/US2013/068967 (published as WO2014/074727; Genia Technologies, Inc.), PCT/US2005/009702 (published as WO2006/028508;
  • the nanopore and polymerase are produced as a fusion protein (i.e., single polypeptide chain), and are incorporated into the membrane as such.
  • the polymerase can be mutated to reduce the rate at which the
  • the polymerase incorporates a nucleotide into a nucleic acid strand (e.g., a growing nucleic acid strand).
  • a nucleic acid strand e.g., a growing nucleic acid strand.
  • the rate at which a nucleotide is incorporated into a nucleic acid strand can be reduced by functionalizing the nucleotide and/or template strand to provide steric hindrance, such as, for example, through methylation of the template nucleic acid strand.
  • the rate is reduced by incorporating methylated nucleotides.
  • the molecules being characterized using the variant OmpG polypeptides described herein can be of various types, including charged or polar molecules such as charged or polar polymeric molecules. Specific examples include ribonucleic acid (RNA) and deoxyribonucleic acid (DNA) molecules.
  • RNA ribonucleic acid
  • DNA deoxyribonucleic acid
  • the DNA can be a single-strand DNA (ssDNA) or a double-strand DNA (dsDNA) molecule.
  • the OmpG variants provided in the present disclosure can be used for determining the sequence of nucleic acids according to other nanopore sequencing platforms known in the art.
  • the OmpG variant provided in this disclosure may be suitable for sequencing nucleic acids according to the exonuclease-based method of Oxford Nanopore (Oxford, UK), the nanopore-based sequencing-by-hybridization of NABsys (Providence, Rl), the fluorescence-based optical nanopore sequencing of NobleGen Biosciences (Concord, MA), lllumina (San Diego, CA), and the nanopore sequencing-by- expansion of Stratos Genomics (Seattle, WA).
  • sequencing of nucleic acids using the OmpG variants can be performed using tagged nucleotides as is described in PCT/US2013/068967 (entitled “Nucleic Acid Sequencing Using Tags” filed on November 7, 2013, which is herein incorporated by reference in its entirety).
  • a variant OmpG nanopore that is situated in a membrane ⁇ e.g., a lipid bilayer) adjacent to or in sensing proximity to one or more sensing electrodes can detect the incorporation of a tagged nucleotide by a polymerase as the nucleotide base is incorporated into a polynucleotide strand and the tag of the nucleotide is detected by the nanopore.
  • the polymerase can be associated with the nanopore as described above.
  • Tags of the tagged nucleotides can include chemical groups or molecules that are capable of being detected by a nanopore. Examples of tags used to provide tagged nucleotides are described at least at paragraphs [0414] to [0452] of PCT/US2013/068967. Nucleotides may be incorporated from a mixture of different nucleotides, e.g., a mixture of tagged dNTPs where N is adenosine (A), cytidine (C), thymidine (T), guanosine (G) or uracil (U). Alternatively, nucleotides can be incorporated from alternating solutions of individual tagged dNTPs, i.e., tagged dATP followed by tagged dCTP, followed by tagged dGTP, etc.
  • Determination of a polynucleotide sequence can occur as the nanopore detects the tags as they flow through or are adjacent to the nanopore, as the tags reside in the nanopore and/or as the tags are presented to the nanopore.
  • the tag of each tagged nucleotide can be couple to the nucleotide base at any position including, but not limited to a phosphate (e.g., gamma phosphate), sugar or nitrogenous base moiety of the nucleotide.
  • tags are detected while tags are associated with a polymerase during the incorporation of nucleotide tags. The tag may continue to be detected until the tag translocates through the nanopore after nucleotide incorporation and subsequent cleavage and/or release of the tag.
  • nucleotide incorporation events release tags from the tagged nucleotides, and the tags pass through a nanopore and are detected.
  • the tag can be released by the polymerase, or cleaved/released in any suitable manner including without limitation cleavage by an enzyme located near the polymerase.
  • the incorporated base may be identified ⁇ i.e., A, C, G, T or U) because a unique tag is released from each type of nucleotide ⁇ i.e., adenine, cytosine, guanine, thymine or uracil).
  • nucleotide incorporation events do not release tags.
  • a tag coupled to an incorporated nucleotide is detected with the aid of a nanopore.
  • the tag can move through or in proximity to the nanopore and may be detected with the aid of the nanopore.
  • tagged nucleotides that are not incorporated pass through the nanopore.
  • the method can distinguish between tags associated with un- incorporated nucleotides and tags associated with incorporated nucleotides based on the length of time the tagged nucleotide is detected by the nanopore.
  • an un-incorporated nucleotide is detected by the nanopore for less than about 1 millisecond and an incorporated nucleotide is detected by the nanopore for at least about 1 millisecond.
  • the disclosure provides for a method for sequencing a nucleic acid with the aid of a variant OmpG nanopore.
  • a method is provided for sequencing a nucleic acid with the aid of a variant OmpG nanopore adjacent to a sensing electrode by (a) providing tagged nucleotides into a reaction chamber comprising the nanopore, wherein an individual tagged nucleotide of the tagged nucleotides contains a tag coupled to a nucleotide, which tag is detectable with the aid of the nanopore; (b) carrying out a polymerization reaction, with the aid of a polymerase, thereby incorporating an individual tagged nucleotide of the tagged nucleotides into a growing strand complementary to a single stranded nucleic acid molecule from the nucleic acid sample; and (c) detecting, with the aid of the nanopore, a tag associated with the individual tagged nucleotide during and/or upon incorporation of the
  • a DNA encoding a form of the mature OmpG protein (residues 22-301 ; Uniprot entry P76045) lacking loop 6 and having substitution E229A (OmpG-EXT) was synthesized (Genscript, NJ) based on the ⁇ .6 ⁇ construct described by
  • the synthetic DNA encodes an OmpG construct having a deletion of loop 6 and a C- terminal sequence: linkerl -His tag-linker2-Spytag (SEQ ID NO:3).
  • OmpG-AL6/E229A was obtained by IPTG-induced transcription of the OmpG- AL6/E229A DNA in BI21 DE3 E. coli cells growing in MagicMediaTM (Invitrogen, Carlsbad, CA) for approximately 24 hours. The cells were centrifuged then resuspended in 50mM Tris, PH8.0 (5ml buffer to 1 g cell pellet). Next, the cells were sonicated, and the lysate centrifuged (10,000 x g/20 min/4 °C). The pellet was washed twice by centrifugation and resuspension, and the final pellet was resuspended in 50 mM Tris pH8.0 at a concentration of 200 mg/ml. The inclusion bodies were aliquoted and stored at - 80°C.
  • the inclusion bodies were solubilized in 50 mM Tris pH8.0, 6 M urea, and 2.4mM TCEP at 60 °C for 10 minutes. Unsolubilized inclusion bodies were removed by centrifugation, and the solubilized protein present in the supernatant was diluted to 1 mg/ml in refolding buffer (25mM Tris, pH8.0, 1.8M urea, 1 mM TCEP, and 3% ⁇ -OG).
  • the OmpG-AL6/E229A protein was refolded for 16 hours at 37 °C, then diluted using 50 mM Tris, pH8.0, 200mM NaCI, 5mM imidazole 1 % ⁇ - OG to obtain a final concentration of 1 % ⁇ -OG and 0.8M urea.
  • the refolded protein was purified by affinity chromatography using TALON ® (Clontech,
  • TALON is an immobilized metal affinity chromatography (IMAC) resin charged with cobalt, which binds to his-tagged proteins with higher specificity than nickel-charged resins.
  • IMAC immobilized metal affinity chromatography
  • Variants of the OmpG-AL6/E229A protein were obtained by site-directed mutagenesis on the basis of the DNA encoding the OmpG-AL6/E229A construct in the pET-26b vector. All OmpG-AL6/E229A variants were expressed and purified as described for OmpG-AL6 as described above.
  • variant OmpG polypeptides were reconstituted as pores in a lipid bilayer over a well on a semiconductor sensor chip (a), and single channel recordings obtained for each of the OmpG variant pores (b).
  • Chambers were filled with 20 mM HEPES, pH 8.0, 300 mM NaCI, 3mM CaCI 2 unless otherwise noted.
  • the current was measured using in-house built GeniaChipTM DNA sequencers.
  • OmpG- AL6/E229A -R68N B
  • OmpG-AL6/E229A -R21 1 N C
  • OmpG-AL6/E229A-E17K D
  • display substantial flickering i.e., upward and downward currents.
  • OmpG- AL6/E229A-Y50K A
  • displays less flickering and a lower open channel current level of about 30 pA.
  • OmpG-AL6/E229A-del215 E shows the least level of spontaneous gating, and maintains an open channel level of about 35pA.
  • Figure 5A shows a histogram that relates the mean and S.D. of the open channel (OC) current noise (upward and downward current) to the percent of the same measurements that occurred outside 1 S.D. from the mean measurements.
  • the histogram shows that OmpG-AL6/E229A-del215 (del215) ("AL6-del215" in Figs.
  • OmpG-AL6/E229A-Y50K, OmpG-AL6/E229A-del215, and OmpG-AL6/E229A- del215-Y50K showed the smallest amount of downward current when compared, for example, to the parental OmpG-AL6/E229A .
  • Example 3
  • a DNA sequence that encodes a His-tagged polymerase, pol6, was purchased from a commercial source (DNA 2.0, Menlo Park, California), and then engineered to comprise a SpyCatcher domain at its C-terminus. (Li et al., J Mol S/ ' o/ 23:426(2):309-317 [2014]). The Pol6 was ligated into the pD441 vector
  • Kanamycin in 96-deep well plates. The plates were incubated with shaking at 250-300rpm for 36-40 hrs at 28°C.
  • reagent 20 ⁇ was then added from a 10x stock to a final concentration of 100 ⁇ g/ml DNase, 5 mM MgCI2, 100 ⁇ g/ml RNase I, and incubated on ice for 5-10 min to produce a lysate.
  • the lysate was supplemented with 200 ⁇ of 1 M Potassium Phosphate, pH 7.5 (final concentration was about 0.5M Potassium phosphate in 400 ⁇ lysate) and filtered through Pall filter plates (Part# 5053, 3 micron filters) via centrifugation at approximately 1500 rpm at 4°C for 10 minutes.
  • the clarified lysates were then applied to equilibrated 96-well His-Pur Cobalt plates (Pierce Part# 90095) and bound for 15-30 min.
  • the flow through (FT) was collected by centrifugation at 500 x g for 3 min. The FT was then washed 3 times with 400 ⁇ of wash buffer 1 (0.5M Potassium Phosphate pH 7.5, 1 M NaCI 5mM TCEP, 20mM Imidazole, and 0.5%Tween20).
  • wash buffer 1 0.5M Potassium Phosphate pH 7.5, 1 M NaCI 5mM TCEP, 20mM Imidazole, and 0.5%Tween20.
  • the FT was then washed twice in 400 ⁇ wash buffer 2 (50mM Tris pH 7.4, 200mM KCI, 5mM TCEP, 0.5% Tween20, 20mM lmidazole).
  • the Pol6 was eluted using 200 ⁇ elution buffer (50mM Tris Ph7.4, 200mM KCI, 5mM TCEP, 0.5% Tween20, 300mM Imidazole, 25%Glycerol) and collected after 1-2 min incubation. Eluate was reapplied to the same His-Pur plate 2-3 times to obtain concentrated Pol6.
  • the purified polymerase was >95% pure as evaluated by SDS-PAGE.
  • the protein concentration was ⁇ 3uM (0.35mg/ml) with a 260/280 ratio of 0.6 as evaluated by NanoDrop ® .
  • Polymerase activity was checked by fluorescence displacement assay.
  • the Pol6-His-SpyCatcher protein was incubated overnight at 4 °C in 3 mM SrCI 2 with the OmpG-EXT-His-SpyTag (SEQ ID NO:3) to allow for the covalent attachment of the SpyCatcher with the SpyTag, thereby forming an OmpG-polymerase complex.
  • the OmpG-polymerase complex was purified using affinity chromatography, and tested for its ability to capture and identify tagged nucleotides as described in Example 4.
  • OmpG-EXT-del215 (SEQ ID NO:10), i.e., OmpG- AL6/E229A-del215, to identify nucleotides captured by a polymerase was assessed using OmpG-EXT-del215 complexed with polymerase Pol6 in the presence of DNA template JAM1A in 300 mM NaCI, 3 mM CaCI2, 20 mM HEPES, pH 7.5.
  • Template JAM1A is a DNA template that provides an adenine nucleotide base that is complementary to the tagged thymidine nucleotide used in the assay (Synthesized by Roche Penzberg, Germany) and that would be captured by the polymerase.
  • FIG. 6 shows an example of a trace that demonstrates that the OmpG-EXT-del215 - polymerase complex identifies four different tagged nucleotides that were captured by the polymerase: T-T30, T-dSp30, T-Tmp6, and T-dSp5. Capture of each of the four nucleotides is reflected by four different changes in the current flowing thorough OmpG-EXT-del215 nanopore as the corresponding nucleotide tags are detected by the nanopore ( Figure 6).
  • the arrows indicate the four tags of the tagged nucleotides diminish channel current to four different levels. Each of the four nucleotides were identified in similar measurements during which current was measured as the nucleotides were individually added to the nanopore.
  • Figure 7 shows the identification of the nucleotides by the individual effects of the corresponding tags on reducing the open channel current as they were detected by the nanopore.
  • Figure 8 shows an expanded view of the capture of the four nucleotides detected under DC conditions and indicated by the arrows in Figure 6.
  • SEQ ID N0:1 Wild-type OmpG; >sp
  • OMPG_ECOLI Outer membrane protein G; OS Escherichia coli (strain K12)) KKLLPCTAL V CAGMACAQ AEERNDWHFN IGAMYEIENV EGYGEDMDGL 50
  • SEQ ID NO:2 (Mature wild-type OmpG from E.coli (strain K12);
  • SEQ ID NO:3 Synthetic OmpG-AL6 fusion protein - HisTag-SpyTag as expressed in E.coli
  • SEQ ID NO:4 Wild-Type OmpG; Escheridia coli porin (ompG) gene GM806593
  • SEQ ID NO:12 membrane protein (Shigella flexneri); KGY82041)
  • SEQ ID NO:13 outer membrane protein G (Salmonella enterica);
  • SEQ ID NO:14 outer membrane protein G (Salmonella enterica);

Abstract

The present disclosure provides variant OmpG polypeptides, compositions comprising the OmpG variant polypeptides, and methods for using the variant OmpG polypeptides as nanopores for determining the sequence of single stranded nucleic acids. The variant OmpG nanopores reduce the ionic current noise versus the parental OmpG polypeptide from which they are derived and thereby enable sequencing of polynucleotides with single nucleotide resolution. The reduced ionic current noise also provides for the use of these OmpG nanopore variants in other single molecule sensing applications, e.g., protein sequencing.

Description

OmpG Variants
TECHNICAL FIELD
[001] Engineered variants of monomeric nanopores are provided for use in determining the sequence of nucleic acids and proteins. BACKGROUND
[002] Protein nanopores have become powerful single-molecule analytical tools that enable the study of fundamental problems in chemistry and biology. In particular, nanopores have attracted considerable attention because of their potential applications in the detection and analysis of single biomolecules, such as DNA, RNA, and proteins.
[003] Molecular detection using a single nanopore is achieved by observing modulations in ionic current flowing through, or the voltage across the pore during an applied potential. Typically, a nanopore that spans an impermeable membrane is placed between two chambers that contain an electrolyte, and voltage is applied across the membrane using electrodes. These conditions lead to ionic flux through the pore. Nucleic acid or protein molecules can be driven through the pore, and structural features of the biomolecules are observed as measurable changes in the trans-membrane current or voltage.
[004] A challenge of nanopore sequencing is resolving nucleotide sequences at a single base level. One of the factors that hinders the discrimination of individual nucleotide bases is the fluctuation in the ionic current flow through the nanopore that is inherent to the structure of the nanopore. SUMMARY OF THE INVENTION
[005] The present disclosure provides variant outer membrane protein G (OmpG) polypeptides, compositions comprising the OmpG variant polypeptides, and methods for using the variant OmpG polypeptides as nanopores for nucleic acid {e.g., DNA, RNA) and/or polymeric (e.g., protein) sequencing and counting. The variant OmpG nanopores reduce the ionic current noise of the parental OmpG polypeptide from which they are derived.
[006] In one aspect, the disclosure provides variant OmpG polypeptides. In one embodiment, provided is an isolated variant of a parental OmpG of SEQ ID NO:2 or homolog thereof, wherein the variant comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31 , and wherein said variant retains the ability to form a nanopore. In some embodiments, the OmpG variant has at least 70% identity to the OmpG of SEQ ID NO:2. In other embodiments, the variant OmpG comprises the linker-His-SpyTag construct of SEQ ID NO: 16.
[007] In another embodiment, provided is an isolated variant of a parental OmpG of SEQ ID NO:2 or homolog thereof, wherein the variant comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and a deletion of amino acid D215, and retains the ability to form a pore. The isolated variant can further comprise a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31 of SEQ ID NO:2. In some embodiments, the OmpG variant has at least 70% identity to the OmpG of SEQ ID NO:2. In other embodiments, the variant OmpG comprises the linker-His-SpyTag construct of SEQ ID NO:16.
[008] In some embodiments the isolated OmpG variant comprises a deletion of one or more of amino acids 216-227 and a substitution Y50K. In other embodiments, the OmpG variant further comprises a deletion of D215.
[009] In another embodiment, the isolated OmpG variant comprises a "circular permutation" in which one or more C-terminal β-strand(s) of the parental OmpG are moved to the N-terminus of the protein sequence. In one embodiment, the C- terminal β-strand is moved to the N-terminus, retaining the penultimate β-strand of the parental OmpG as the new C-terminus of the protein. In other embodiments, two or more β-strands are moved to the N-terminus. Optionally, the variant includes a tag sequence (e.g., comprising a "SpyTag" or "His-SpyTag: sequence, optionally further comprising one or more linker sequences (e.g., SEQ ID NO:16) at the C-terminus, downstream from the penultimate β-strand of the parental OmpG. Optionally, the variant includes a linker sequence, e.g., GSG, between the new N-terminal β-strand that was moved from the C-terminus of the parental OmpG and the β-strand that was previously at the N-terminus of the parental OmpG. In one embodiment, the variant is a variant of the E. coli OmpG depicted in SEQ ID NO:2, or a homolog thereof. In one embodiment, the variant comprises movement of amino acid residues 267-280 of SEQ ID NO:2 from the C-terminus to the N-terminus of SEQ ID NO:2, optionally with a linker, e.g., GSG, between previous residue 280 and the N-terminus of SEQ ID NO:2, and optionally with a methionine (M) residue at the N-terminus of the variant, prior to previous residue 267, and optionally with the amino acid sequence depicted in SEQ ID NO:16 at the C-terminus of the variant. In one embodiment, the variant has the amino acid sequence depicted in SEQ ID NO: 17.
[0010] In some embodiments, the variant OmpG retains the ability to form a nanopore in a lipid or polymer layer. In other embodiments, the OmpG variant displays a reduced ionic current noise when an applied voltage is applied across the lipid bilayer. In other embodiments, the variant OmpG has reduced ionic current noise as compared to the parental E. Coli OmpG having the amino acid sequence of SEQ ID NO:2. Additionally, the variant OmpG can further comprise a genetic polymerase fusion, e.g., the isolated OmpG variant comprises a
polymerase that is operably linked to said variant OmpG (still functional after linkage).
[0011] In yet other embodiments, the variant OmpG enables detection of the incorporation of nucleotides by said polymerase into a growing nucleic acid strand with single nucleotide resolution.
[0012] In another aspect, the disclosure provides isolated nucleic acids that encode the variant OmpG polypeptides. In one embodiment, provided is an isolated nucleic acid comprising a polynucleotide sequence encoding a variant of the parental OmpG of SEQ ID NO:2, wherein said variant OmpG comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and (i) a deletion of amino acid D215; and/or (ii) a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31. In other embodiments, the polynucleotide sequence encodes a variant having at least 70% identity to the OmpG of SEQ ID NO:2. In other embodiments, the polynucleotide sequence encodes an OmpG circular
permutation variant, e.g., a circular permutation variant of SEQ ID NO:2 or a homolog thereof, as described above, e.g., SEQ ID NO:17.
[0013] In another aspect, provided is an expression vector that comprises an isolated nucleic acid that encodes a variant OmpG polypeptide as disclosed herein. In one embodiment, the expression vector comprises a nucleic acid comprising a polynucleotide sequence encoding a variant of the parental OmpG of SEQ ID NO:2, wherein said variant OmpG comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and (i) a deletion of amino acid D215, i.e., del215; and/or (ii) a mutation of one or more of amino acids R21 1 ,E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31. In another embodiment the expression vector comprises a nucleic acid encoding a polynucleotide sequence that encodes an OmpG circular permutation variant, e.g., a circular permutation variant of SEQ ID NO:2 or a homolog thereof, as described above, e.g., SEQ ID NO:17.
[0014] In another aspect, provided is a host microorganism that comprises an expression vector that expresses an OmpG variant described herein. In one embodiment, the host microorganism comprises an expression vector comprising a polynucleotide sequence encoding a variant of the parental OmpG of SEQ ID NO:2, wherein said variant OmpG comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and (i) a deletion of amino acid D215; and/or (ii) a mutation of one or more of amino acids R21 1 ,E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31. In another
embodiment, the host microorganism comprises an expression vector comprising a polynucleotide sequence encoding an OmpG circular permutation variant, e.g., a circular permutation variant of SEQ ID NO:2 or a homolog thereof, as described above, e.g., SEQ ID NO:17.
[0015] In another aspect, a method for producing a variant OmpG in a host cell is provided. In one embodiment, the method comprises a) transforming a host cell with an expression vector comprising a nucleic acid encoding a variant of the parental OmpG of SEQ ID NO:2, wherein said variant OmpG comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and (i) a deletion of amino acid D215; and/or (ii) a mutation of one or more of amino acids R21 1.E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31 ; and b) culturing the host cell under conditions suitable for the production of the variant OmpG. In another embodiment, the method comprises a) transforming a host cell with an expression vector comprising a polynucleotide sequence that encodes an OmpG circular permutation variant, e.g., a circular permutation variant of SEQ ID NO:2 or a homolog thereof, as described above, e.g., SEQ ID NO:17; and b) culturing the host cell under conditions suitable for the production of the variant OmpG. In other embodiments, the method further comprises recovering the produced variant.
[0016] In another aspect, a method is provided for sequencing a nucleic acid sample with the aid of a variant OmpG nanopore. In one embodiment, the method comprises: (a) providing tagged nucleotides into a reaction chamber comprising the variant OmpG nanopore, wherein an individual tagged nucleotide of the tagged nucleotides contains a tag coupled to a nucleotide, which tag is detectable with the aid of said nanopore; (b) carrying out a polymerization reaction with the aid of a single polymerase coupled to said variant OmpG nanopore, thereby incorporating an individual tagged nucleotide of the tagged nucleotides into a growing strand complementary to a single stranded nucleic acid molecule from the nucleic acid sample; and (c) detecting, with the aid of the variant OmpG nanopore, a tag associated with the individual tagged nucleotide during incorporation of the individual tagged nucleotide, wherein the tag is detected with the aid of the variant OmpG nanopore while the nucleotide is associated with the polymerase.
[0017] In another aspect, provided is a chip for sequencing a nucleic acid sample. In one embodiment, the chip comprises a plurality of the variant OmpG nanopores disclosed herein, an OmpG nanopore of the plurality being disposed adjacent or in proximity to an electrode, wherein said nanopore is individually addressable and has a single polymerase attached to the nanopore; and wherein an individual nanopore detects the tag associated with the tagged nucleotide during
incorporation of the nucleotide into a growing nucleic acid chain by the polymerase.
[0018] In another aspect, a composition is provided. In one embodiment, the composition comprises a plurality of polymerase enzymes, each complexed with a template nucleic acid, each polymerase enzyme attached to a variant OmpG nanopore as disclosed herein or attached proximal to the variant OmpG nanopore, and nucleic acid sequencing reagents including at least one tagged nucleotide or nucleotide analog. BRIEF DESCRIPTION OF THE DRAWINGS
[0019] Figures 1A and 1 B depict the architecture of the wild-type OmpG pore from E. coli as a ribbon structure (Figure 1 A) and as a surface representation (Figure 1 B). The constriction zone of the OmpG nanopore is shown.
[0020] Figure 2 is a schematic diagram of an embodiment of a circuit used in a nanopore device for controlling an electrical stimulus and for detecting electrical signatures of an analyte molecule.
[0021] Figures 3A-3B show a schematic diagram of an embodiment of a chip that includes a nanopore device array. A perspective view is shown in Figure 3A. A cross-sectional view of the chip is shown in Figure 3B. [0022] Figures 4A-4E depict single channel current traces obtained at an applied constant voltage for OmpG variants comprising a deletion of amino acids 216-227, and amino acid substitution E229A (ΔΙ.6/Ε229Α) (as shown in (SEQ ID NO:5)), and the amino acid substitution Y50K (SEQ ID NO:6; (Figure 4A)), R68N (SEQ ID NO:7; (Figure 4B)), R21 1 N (SEQ ID NO:8; (Figure 4C)), E17K (SEQ ID NO:9: (Figure 4D)), and the amino acid deletion del215 (SEQ ID NO:10; (Figure 4E)).
[0023] Figures 5A-5B depict the mean open channel (OC) current (filled bar) and the percentage of events greater than 1 standard deviation of the mean open channel in both higher and lower directions (Figure 5A) and the mean open channel current in black (filled bar) and the percentage of events greater than 1 std deviation of the mean open channel in only the lower direction (downward current only) (Figure 5B) determined for each of the OmpG variants of ΔΙ.6/Ε229Α (SEQ ID NO:5), and the amino acid substitution ΔΙ.6/Ε229Α-Υ50Κ (SEQ ID NO:6), AL6/E229A-R68N (SEQ ID NO:7), AL6/E229A-R21 1 N (SEQ ID NO:8),
AL6/E229A-E17K (SEQ ID NO:9), and the amino acid deletion AL6/E229A-del215 (SEQ ID NO:10); (del215)); AL6/E229A-del215-Y50K (SEQ ID NO:1 1 ).
[0024] Figure 6 depicts the single nucleotide resolution of a mixture of four different tagged nucleotides shown as changes in baseline open channel direct current as each of the tagged nucleotides is detected by the variant OmpG nanopore AL6/E229A-del215 (SEQ ID NO:10). Measurements were made with the application of a direct current (DC).
[0025] Figures 7A-7D depict the identification of each of the tagged nucleotides detected in Figure 6 by the variant OmpG nanopore AL6/E229A-del215 (SEQ ID NO:10) as separate changes in baseline open channel current for each of the four tagged nucleotides. Measurements were made with the application of a direct current (DC).
[0026] Figure 8 depicts an expanded view of the single nucleotide resolution of the mixture of four different tagged nucleotides shown in Figure 6 as detected by the variant OmpG nanopore AL6/E229A-del215 (SEQ ID NO:10). Measurements were made with the application of a direct current (DC).
[0027] Figures 9A -9E depicts a protein alignment of bacterial membrane protein homologs of the OmpG from E. coli (SEQ ID NOS 1 and 12-15, respectively, in order of appearance). [0028] Figures 10A - 10B schematically depict OmpG antiparallel β-strands (Figure 10A) and an embodiment of a circular permutation variant thereof (Figure 10B).
[0029] Figure 11 shows the arrangement of the C-terminus of OmpG in the wild- type parental OmpG versus the circular permutation variant as described herein, relative to the constriction site of the nanopore protein.
DETAILED DESCRIPTION
[0030] A detailed description of one or more embodiments of the invention is provided below along with accompanying figures that illustrate the principles of the invention. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims, and the invention encompasses numerous alternatives, modifications and equivalents. Numerous specific details are set forth in the following description in order to provide a thorough understanding of the invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details. For the purpose of clarity, technical material that is known in the technical fields related to the invention has not been described in detail so that the invention is not unnecessarily obscured.
[0031] The outer membrane (OM) of Gram-negative bacteria contains a large number of channel proteins that mediate the uptake of ions and nutrients necessary for growth and functioning of the cell. In contrast with other multimeric proteinaceous nanopores such as a-hemolysin and ClyA, outer membrane protein G (OmpG) from Escherichia coli (E. coli) functions as a monomer. The crystal structure of E. coli K12 OmpG has been determined (Subbarao and van den Berg, J Mol Biol, 360:750-759 [2006]). The structure shows that the OmpG barrel consists of 14 β-strands connected by seven flexible loops on the extracellular side and seven short turns on the periplasmic side (Figure 1A). The OmpG channel has its largest diameter (20-22 A) at the periplasmic exit and tapers to a constriction located close to the extracellular side (Figure 1 B). The constriction is formed by the side chains of inward pointing residues of the barrel wall, and not by surface loops folding inwards. This architecture gives rise to a relatively large central pore with a circular shape and a diameter of about 13 A. [0032] When current is measured across a wild-type OmpG nanopore, the nanopore spontaneously transitions between open and closed states during an applied potential, which gives rise to flickering single channel currents. The longest of the extracellular loop of OmpG, loop 6, has been recognized as the main gating loop that closes the pore at low pH and opens it at high pH.
[0033] The present disclosure provides variant OmpG polypeptides, compositions comprising the variant OmpG polypeptides, and methods for using the variant OmpG polypeptides as nanopores for determining the sequence of single stranded nucleic acids. The variant OmpG nanopores reduce the ionic current noise of the parental OmpG polypeptide from which they are derived and thereby enable sequencing of polynucleotides with single nucleotide resolution. The reduced ionic current noise also provides for the use of these OmpG nanopore variants in other single molecule sensing applications, e.g., protein sequencing.
[0034] Definitions
[0035] The term "variant" herein refers to an OmpG derived from another (i.e., parental) OmpG and contains one or more amino acid mutations {e.g., amino acid deletion, insertion or substitution) as compared to the parental OmpG.
[0036] The term "isolated" herein refers to a molecule, e.g., a nucleic acid molecule, that is separated from at least one other molecule with which it is ordinarily associated, for example, in its natural environment. An isolated nucleic acid molecule includes a nucleic acid molecule contained in cells that ordinarily express the nucleic acid molecule, but the nucleic acid molecule is present extrachromosomally or at a chromosomal location that is different from its natural chromosomal location.
[0037] The term "mutation" herein refers to a change introduced into a parental sequence, including, but not limited to, substitutions, insertions, deletions
(including truncations). The consequences of a mutation include, but are not limited to, the creation of a new character, property, function, phenotype or trait not found in the protein encoded by the parental sequence.
[0038] The term "wild-type" herein refers to a gene or gene product, which has the characteristics of that gene or gene product when isolated from a naturally- occurring source.
[0039] The term "nucleotide" herein refers to a monomeric unit of DNA or RNA consisting of a sugar moiety (pentose), a phosphate, and a nitrogenous
heterocyclic base. The base is linked to the sugar moiety via the glycosidic carbon (1 ' carbon of the pentose) and that combination of base and sugar is a nucleoside. When the nucleoside contains a phosphate group bonded to the 3' or 5' position of the pentose it is referred to as a nucleotide. A sequence of operatively linked nucleotides is typically referred to herein as a "base sequence" or "nucleotide sequence," and is represented herein by a formula whose left to right orientation is in the conventional direction of 5'-terminus to 3'-terminus.
[0040] The terms "polynucleotide" and "nucleic acid" are herein used
interchangeably to refer to a polymeric molecule composed of nucleotide monomers covalently bonded in a chain. DNA (deoxyribonucleic acid) and RNA (ribonucleic acid) are examples of polynucleotides.
[0041] The term "polymerase" herein refers to an enzyme that catalyzes the polymerization of nucleotides (i.e., the polymerase activity). The term polymerase encompasses DNA polymerases, RNA polymerases, and reverse transcriptases. A "DNA polymerase" catalyzes the polymerization of deoxyribonucleotides. An "RNA polymerase" catalyzes the polymerization of ribonucleotides. A "reverse transcriptase" catalyzes the polymerization of deoxyribonucleotides that are complementary to an RNA template.
[0042] The term "template DNA molecule" herein refers to a strand of a nucleic acid from which a complementary nucleic acid strand is synthesized by a DNA polymerase, for example, in a primer extension reaction.
[0043] The term "template-dependent manner" refers to a process that involves the template dependent extension of a primer molecule (e.g., DNA synthesis by DNA polymerase). The term "template-dependent manner" typically refers to polynucleotide synthesis of RNA or DNA wherein the sequence of the newly synthesized strand of polynucleotide is dictated by the well-known rules of complementary base pairing (see, for example, Watson, J. D. et al., In: Molecular Biology of the Gene, 4th Ed., W. A. Benjamin, Inc., Menlo Park, Calif. (1987)).
[0044] The term "tag" refers to a detectable moiety that may be one or more atom(s) or molecule(s), or a collection of atoms and molecules. A tag may provide an optical, electrochemical, magnetic, or electrostatic (e.g., inductive, capacitive) signature. A tag may block the flow of current through a nanopore.
[0045] The term "nanopore" herein refers to a pore, channel or passage formed or otherwise provided in a membrane. A membrane may be an organic membrane, such as a lipid bilayer, or a synthetic membrane, such as a membrane formed of a polymeric material. The nanopore may be disposed adjacent or in proximity to a sensing circuit or an electrode coupled to a sensing circuit, such as, for example, a complementary metal oxide semiconductor (CMOS) or field effect transistor (FET) circuit. In some examples, a nanopore has a characteristic width or diameter on the order of 0.1 nm to about 1000 nm. Some nanopores are proteins. OmpG is an example of a protein nanopore.
[0046] The term "spontaneous gating" refers to changes in ion current related to the channel's inherent structural changes. For example, OmpG in planar lipid bilayers undergoes pH-dependent rapid fluctuations between open and closed states of the pore, which manifest themselves as intense "flickering" in current recordings and contributes to the overall noise of the channel.
[0047] The terms "noise" and "ionic current noise" are herein used
interchangeably and refer to the random fluctuations of electrical signal, which include current fluctuations contributed by spontaneous gating and current fluctuations contributed by the inherent architecture of the nanopore barrel. For example, the tertiary make-up of the nanopore barrel can comprise more than one recognition site for the analyte that is being sensed by the nanopore thereby inducing additional signals that contribute to the overall noise of the channel.
[0048] The term "upward noise" herein refers to fluctuations of ionic current to levels greater than mean open channel current.
[0049] The term "downward noise" herein refers to fluctuations of ionic current to levels lower than mean open channel current.
[0050] The term "positive current" herein refers to a current in which a positive charge, e.g., K+, moves through the pore from the trans to the c/'s side, or negative charge, e.g., CI", moves from the c/'s to the trans side. For example, with reference to Figure 2, c/'s corresponds to 106 and trans corresponds to 116.
[0051] The term "constriction amino acids" herein refers to the amino acids that determine the size of the OmpG pore at the constriction zone. The constriction zone may be the same as the constriction zone of the wild-type OmpG or it may be a constriction zone introduced via protein engineering, or by the introduction of a molecular adapter.
[0052] The term "parental" or "parent" herein refers to an OmpG to which modifications, e.g., substitution(s), insertion(s), deletion(s), and/or truncation(s), are made to produce the OmpG variants disclosed herein. This term also refers to the polypeptide with which a variant is compared and aligned. The parent may be a naturally occurring (wild type) polypeptide, or it may be a variant thereof, prepared by any suitable means. In preferred embodiments, "parental" proteins are homologs of one another.
[0053] The terms "purified" herein refers to a polypeptide, e.g., a variant OmpG polypeptide, that is present in a sample at a concentration of at least 95% by weight, or at least 98% by weight of the sample in which it is contained.
[0054] The term "nucleotide analog" herein refers to analogs of nucleoside triphosphates, e.g., (S)-Glycerol nucleoside triphosphates (gNTPs) of the common nucleobases: adenine, cytosine, guanine, uracil, and thymidine (Horhota et al., Organic Letters, 8:5345-5347 [2006]). Also encompassed are nucleoside tetraphosphate, nucleoside pentaphosphates and nucleoside hexaphosphates.
[0055] The term "tagged nucleotide" herein refers to a nucleotide that includes a tag (or tag species) that is coupled to any location of the nucleotide including, but not limited to a phosphate (e.g., terminal phosphate), sugar or nitrogenous base moiety of the nucleotide. Tags may be one or more atom(s) or molecule(s), or a collection of atoms and molecules. A tag may provide an optical, electrochemical, magnetic, or electrostatic (e.g., inductive, capacitive) signature, which signature may be detected with the aid of a nanopore (US2014/013616). A tag can also be attached to a polyphosphate as is shown in Figure 13 of US2014/013616. Variant OmpG polypeptides
[0056] In one aspect, the disclosure provides variant OmpG polypeptides. The variant OmpG polypeptides can be derived from a parental OmpG of E. coli, for example, the parental OmpG depicted in SEQ ID NO:2. A parental OmpG can be a homolog of the parental OmpG from E. coli.
[0057] Although E. coli sp. strain K12 OmpG (SEQ ID NO: 2) is used as a starting point for discussing variant OmpGs herein, it will be appreciated that other gram- negative bacterial OmpGs having a high degree of homology to the E. coli sp. strain K12 OmpG may serve as a parental OmpG within the scope of the compositions and methods disclosed herein. This is particularly true of other naturally-occurring bacterial OmpGs that include only minor sequence differences in comparison to E. coli sp. strain K12 OmpG, not including the substitutions, deletions, and/or insertions that are the subject of the present disclosure. For example, OmpG homologs expressed in Salmonella sp., Shigella sp., and
Pseudomonas sp. can be used as parental OmpG polypeptides from which variant forms can be derived. In some embodiments, the nanopore is a pore from a mitochondrial membrane.
[0058] Homologs of the parental OmpG from E. coli can share sequence identity with the OmpG from E. coli (SEQ ID NO:1 of at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%. For example, a variant OmpG can be derived from a homolog of the E. coli OmpG that is at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identical to the parental OmpG from E. coli. In some embodiments, the parental OmpG is the OmpG from the E. coli sp. strain K12. The polypeptide sequence of the full length E. coli OmpG (SEQ ID NO:1 ) and examples of homologs from Shigella flexneri (SEQ ID NO: 12), Salmonella enterica (SEQ ID NOs:13 and 14), and Citrobacter farmeri (SEQ ID NO:15) are provided in Figure 9. SEQ ID NO:2 is the mature form of the full-length E. coli OmpG polypeptide depicted in SEQ ID NO:1 .
[0059] In some embodiments, the parental polypeptide is a wild-type OmpG polypeptide. In other embodiments, the parental polypeptide is an OmpG variant to which additional mutations can be introduced to improve the ability of the OmpG polypeptide to reduce ionic current noise. The variant OmpG retains the ability to form a nanopore. In one embodiment, the parental OmpG polypeptide is the wild- type E. coli OmpG polypeptide of SEQ ID NO:1 , or the mature form thereof (SEQ ID NO:2). It is understood that the variant OmpG polypeptides can be expressed having the N-terminal Met.
[0060] In another embodiment, the parental OmpG polypeptide is a variant OmpG polypeptide from which the amino acids that comprise loop 6 are deleted. For example, the parental OmpG is the OmpG of SEQ ID NO:2 from which the amino acids that comprise loop 6 have been deleted. The OmpG of SEQ ID NO:3 is the mature form of the wild-type OmpG (SEQ ID NO:2) from which amino acids 216- 227 have been deleted and amino acid 229 is replaced by an Ala, i.e., Δ216- 227/E229A. SEQ ID NO:3 comprises a sequence of amino acids at the C- terminus that denotes the Nnker-His6-linker-SpyTag sequences ("His6" disclosed as SEQ ID NO: 18) as described elsewhere herein. Variant OmpGs comprising a deletion of loop 6 and substitution of Ala at amino acid 229, i.e., Δ216-227/Ε229Α are interchangeably denoted by ΔΙ.6/Ε229Α. In some embodiments, truncation of loop 6 can be made by deleting one or more of amino acids 216-227 of SEQ ID NO:2. In other embodiments, amino acids 216-227, inclusive, are deleted. The numbering of the amino acids refers to the amino acid positions of SEQ ID NO: 2.
[0061] In one embodiment, the variant OmpG is a variant of the parental OmpG of SEQ ID NO:2 that comprises a deletion of amino acids 216-227, i.e., Δ216-227. In a further embodiment the variant OmpG comprises E229A, i.e., Δ216-227/E229A. In yet a further embodiment, the variant OmpG comprises a deletion of D215, i.e., Δ215-227/Ε229Α.
[0062] Amino acids at the constriction zone of the OmpG pore (the smallest "choke point" of the nanopore) at the extracellular surface are identified as contributing to the symmetry of the lining and/or the length of the constriction of OmpG. In some embodiments, the constriction zone amino acids can be mutated to shorten the length of the constriction and/or even the width of the internal diameter of the constriction. Mutagenesis of the constriction amino acids can be designed to create a unique constriction zone. Mutations of the constriction zones reduce the ion current noise of the variant OmpG when compared to the parental OmpG from which the variant is derived. Accordingly, in some embodiments, the variant OmpG polypeptide provided comprises one or more mutations of amino acids that are positioned at the constriction zone at the extracellular side of the OmpG nanopore. In other embodiments, the variant OmpG polypeptide can be further mutated to bind molecular adaptors, which while resident in the pore, slow the movement of analytes, e.g., nucleotide bases, through the pore and consequently improve the accuracy of the identification of the analyte (Astier et al., J Am Chem Soc 10.1021/ja057123+, published online on December 30, 2005).
[0063] In some embodiments, the mutation in the constriction zone, e.g., mutation of the OmpG depicted in SEQ ID NO:2, is selected from amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and/or E31 . A mutation of the amino acids at the constriction zone can be one or more of a substitution, a deletion or an insertion, for example, a substitution of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and/or E31 . In some embodiments, at least one amino acid mutation is located in the constriction zone of OmpG. In other embodiments, at least two, at least three, at least four, at least five, or at least six amino acids of the constriction zone are mutated. In some embodiments, the at least one amino acid mutation at the constriction zone is the substitution Y50K. In some other embodiments, the at least one amino acid mutation at the constriction zone is the substitution Y50N. The at least one amino acid mutation at the constriction zone can be combined with the deletion of one or more of the amino acids of loop 6. Thus, in some embodiments, the variant OmpG is derived from a parental OmpG, e.g., the OmpG depicted in SEQ ID NO:2, and comprises a deletion of amino acids 216- 227 and substation of Ala at amino acid 229,, i.e., Δ216-227/Ε229Α, and a mutation of at least one amino acid of the constriction zone of the wild-type OmpG, e.g., a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and/or E31. In other embodiments, the variant OmpG comprises a deletion of loop 6, a mutation of one or more amino acids at the constriction zone, and the deletion D215. For example, the variant
OmpG is a variant of a parental OmpG of SEQ ID NO:2, and comprises a deletion of amino acids 216-227 and substitution of Ala at amino acid 229, i.e., Δ216- 227/E229A, a mutation of at least one of amino acids of the constriction zone, e.g., a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and/or E31 , and del215. In one embodiment, the variant OmpG is a variant of a parental OmpG of SEQ ID NO:2, and comprises a deletion of amino acids 216-227, i.e., Δ216-227, substitution E229A, deletion of D215, and amino acid substitution Y50K.
[0064] In some embodiments, a "circular permutation variant of OmpG is provided wherein the C-terminal β-strand of the parental OmpG is moved to the N-terminus of the protein sequence, retaining the penultimate β-strand of the parental OmpG as the new C-terminus of the protein. This is depicted schematically in Figures 10 and 1 1. The result of movement of the C-terminal β-strand in this manner is that the new C-terminus of the variant is closer to the constriction site of the nanopore than in the parental OmpG from which the variant was derived (see Figure 1 1 ). The proximity to the constriction point is advantageous because it allows for improved capture of molecules for analysis by the nanopore. This improved capture may be due to a reduction in the energy barrier of NanoTag threading. The placement of the N-terminus and C-terminus on opposite sides of the lipid or polymer layer also allows for the attachment of two nucleic acid modifying enzymes, doubling the throughput of the nanopore-based instrument.
[0065] Optionally, a circular permutation variant as described herein includes a tag sequence {e.g., comprising a "SpyTag" or "His-SpyTag" sequence, optionally further comprising one or more linker sequences (e.g., SEQ ID NO:16) at the C- terminus, downstream from the penultimate β-strand of the parental OmpG. Optionally, the variant includes a linker sequence, e.g., GSG, between the new N- terminal β-strand that was moved from the C-terminus of the parental OmpG and the β-strand that was previously at the N-terminus of the parental OmpG. In one embodiment, the variant is a variant of the E. coli OmpG depicted in SEQ ID NO:2, or a homolog thereof. In one embodiment, the variant comprises movement of amino acid residues 267-280 of SEQ ID NO:2 to the N-terminus of SEQ ID NO:2, optionally with a linker, e.g., GSG, between previous residue 280 and the N- terminus of SEQ ID NO:2, and optionally with a methionine (M) residue at the N- terminus of the variant, prior to previous residue 267, and optionally with the amino acid sequence depicted in SEQ ID NO:16 at the C-terminus of the variant. In one embodiment, the variant has the amino acid sequence depicted in SEQ ID NO: 17.
DNA Sequence Encoding OmpG Variants
[0066] DNA sequences encoding a parent OmpG may be isolated from any cell or microorganism producing the OmpG in question, using various methods well known in the art. First, a genomic DNA and/or cDNA library can be constructed using chromosomal DNA or messenger RNA from the organism that produces the OmpG to be studied. Then, if the amino acid sequence of the OmpG is known, homologous, labeled oligonucleotide probes may be synthesized and used to identify OmpG-encoding clones from a genomic library prepared from the organism in question. Alternatively, a labeled oligonucleotide probe containing sequences homologous to a known OmpG gene can be used as a probe to identify OmpG-encoding clones, using hybridization and washing conditions of lower stringency.
[0067] Alternatively, the DNA sequence encoding the OmpG may be prepared synthetically by established standard methods, e.g., the phosphoroamidite method described by S. L. Beaucage and M. H. Caruthers (1981 ) Tetrahedron Letters 22:1859-1862 or the method described by Matthes et al. (1984) EMBO J.
3(4):801 -5. In the phosphoroamidite method, oligonucleotides are synthesized, e.g., in an automatic DNA synthesizer, purified, annealed, ligated and cloned in appropriate vectors.
[0068] Finally, the DNA sequence may be of mixed genomic and synthetic origin, mixed synthetic and cDNA origin or mixed genomic and cDNA origin, prepared by ligating fragments of synthetic, genomic or cDNA origin (as appropriate, the fragments corresponding to various parts of the entire DNA sequence), in accordance with standard techniques. The DNA sequence may also be prepared by polymerase chain reaction (PCR) using specific primers, for instance as described in U.S. Pat. No. 4,683,202 or R. K. Saiki et al. (1988) Science
239(4839):489-91.
Site-Directed Mutagenesis
[0069] Once an OmpG-encoding DNA sequence has been isolated, and desirable sites for mutation have been identified, mutations may be introduced using synthetic oligonucleotides. These oligonucleotides contain nucleotide sequences flanking the desired mutation sites; mutant nucleotides are inserted during oligonucleotide synthesis. In a specific method, a single-stranded gap of DNA, bridging the OmpG-encoding sequence, or portion thereof, is created in a vector carrying the OmpG gene. Then the synthetic nucleotide, bearing the desired mutation, is annealed to a homologous portion of the single-stranded DNA. The remaining gap is then filled in with DNA polymerase I (Klenow fragment) and the construct is ligated using T4 ligase. A specific example of this method is described in Morinaga et al. (1984) Nature Biotechnology 2:636-639. U.S. Pat. No. 4,760,025 discloses the introduction of oligonucleotides encoding multiple mutations by performing minor alterations of the cassette. However, an even greater variety of mutations can be introduced at any one time by the Morinaga method, because a multitude of oligonucleotides, of various lengths, can be introduced. Other methods that effect site-directed mutagenesis include Kunkel's method, cassette mutagenesis, and PCR site-directed mutagenesis. Alternative methods for providing variants include gene shuffling, e.g., as described in WO 95/22625 (from Affymax Technologies N.V.) or in WO 96/00343 (from Novo Nordisk A/S), or other corresponding techniques resulting in a hybrid enzyme comprising the mutation(s), e.g., substitution(s) and/or deletion(s), in question.
Expression of OmpG Variants
[0070] A DNA sequence encoding an OmpG variant can be used to express a variant OmpG, using an expression vector, which typically includes control sequences encoding a promoter, an operator, a ribosome binding site, a translation initiation signal, and, optionally, a repressor gene or various activator genes. Examples of vectors that can be used for expressing variant OmpGs include the vectors of the pET expression system (Novagen). [0071] A recombinant expression vector carrying DNA sequences encoding an OmpG variant may be any vector, which may conveniently be subjected to recombinant DNA procedures, and the choice of vector will often depend on the host cell into which it is to be introduced. Thus, the vector may be an
autonomously replicating vector, i.e., a vector which exists as an
extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, a bacteriophage or an extrachromosomal element, a minichromosome or an artificial chromosome. Alternatively, the vector may be one which, when introduced into a host cell, is integrated into the host cell genome and replicates together with the chromosome(s) into which it has been integrated.
[0072] The procedures used to ligate the DNA construct encoding an OmpG variant, and to insert it into suitable vectors containing the information necessary for replication, are well known to persons skilled in the art (cf., for instance, Sambrook et al., Molecular Cloning: A Laboratory Manual, Fourth Edition, Cold Spring Harbor, 2012).
[0073] An OmpG variant can be produced in a cell that may be of a higher organism such as a mammal or an insect, but is preferably a microbial cell, e.g., a bacterial or a fungal (including yeast) cell. Examples of suitable bacteria are gram- negative bacteria such as E. coli, or gram-positive bacteria such as Bacillus sp., e.g., Bacillus subtilis, Bacillus licheniformis, Bacillus lentus, Bacillus brevis,
Geobacillus stearothermophilus, Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus coagulans, Bacillus circulars, Bacillus lautus, Bacillus megaterium, or Bacillus thuringiensis, or Streptomyces sp., e.g., Streptomyces lividans or Streptomyces murinus. A yeast organism may be selected from a species of Saccharomyces or Schizosaccharomyces, e.g., Saccharomyces cerevisiae, or from a filamentous fungus, such as Aspergillus sp., e.g., Aspergillus oyzae or Aspergillus niger. The host cell is typically bacterial and preferably E. coli.
[0074] In a further aspect, a method of producing an OmpG variant is provided, which method comprises cultivating a host cell as described above under conditions conducive to the production of the variant and recovering the variant from the cells and/or culture medium. The medium used to cultivate the cells may be any conventional medium suitable for growing the host cell in question and obtaining expression of the OmpG variant. Suitable media are available from commercial suppliers or may be prepared according to published recipes (e.g., as described in catalogues of the American Type Culture Collection). [0075] The OmpG variant secreted from the host cells may conveniently be recovered from the culture medium by well-known procedures, including separating the cells from the medium by centrifugation or filtration, and
precipitating proteinaceous components of the medium by means of a salt such as ammonium sulfate, followed by the use of chromatographic procedures such as ion exchange chromatography, affinity chromatography, or the like. In some embodiments, purification of the variant OmpG may be obtained by affinity chromatography of OmpG polypeptides linked to an affinity tag. Several affinity or epitope tags that can be used in the purification of the OmpG variants include hexahistidine tag (SEQ ID NO: 18), FLAG tag, Strep II tag, streptavidin-binding peptide (SBP) tag, calmodulin-binding peptide (CBP), glutathione S-transferase (GST), maltose-binding protein (MBP), S-tag, HA tag, and c-Myc tag. In some embodiments, a hexahistidine tag (SEQ ID NO: 18) is used in the purification of OmpG. The affinity tag can be covalently attached to the variant OmpG
polypeptide by a protein linker. Specific linkers contemplated as useful in linking the nanopore to a polymerase include (GGGGS)i-3 (SEQ ID NO: 19), EKEKEKGS (SEQ ID NO: 20), His6-GSGGK (SEQ ID NO: 21 ), and AH I VMVDAYKPTK (SEQ ID NO: 22) (SpyTag). The protein linkers can be encoded by the nucleic acid that comprises the sequence encoding the variant OmpG, and may be expressed as a fusion protein. For example, the variant OmpG can be expressed as OmpG-(EK)3- HiSe-GSGG-SpyTag (EK EKEKGSHHHH HHGSGGAHIV MVDAYKPTK (SEQ ID NO:16)) expressed for example, as amino acids 269-299 of SEQ ID NO:3. In some instance, the His6 tag (SEQ ID NO: 18) is expressed N-terminal to the variant OmpG polypeptide.
Nanopore assembly
[0076] Characterization of the variant OmpG can include determining any property of the molecule that causes a variance in a measurable electrical signature. For example, reduction in gating frequency may be derived from measuring a decrease in upward and/or downward gating through the nanopore as a constant voltage is applied across the variant OmpG nanopore. Additionally,
characterization of the variant OmpG can include identifying tags of individual tagged nucleotides which are complementary to a DNA or RNA strand by measuring a variance in ionic current flow through the nanopore as the tags of individual nucleotides are detected in proximity or in passing through the OmpG nanopore. The base sequence of a segment of a DNA or RNA molecule can be determined by comparing and correlating the measured electrical signature(s) of tags of tagged nucleotides, as the growing nucleic acid strand is synthesized.
[0077] Typically, measurements of ionic current flow through the OmpG nanopore are made across nanopores that have been reconstituted into a lipid membrane. In some instances, the OmpG nanopore is inserted in the membrane (e.g., by electroporation). The nanopore can be inserted by a stimulus signal such as electrical stimulus, pressure stimulus, liquid flow stimulus, gas bubble stimulus, sonication, sound, vibration, or any combination thereof. In some cases, the membrane is formed with aid of a bubble and the nanopore is inserted in the membrane with aid of an electrical stimulus.
[0078] Methods for assembling a lipid bilayer, forming a nanopore in a lipid bilayer, and sequencing nucleic acid molecules can be found in PCT Patent Publication Nos. WO201 1/097028 and WO2015/061510, which are incorporated herein by reference in their entirety.
[0079] Figure 2 is a schematic diagram of a nanopore device 100 that can be used to characterize a polynucleotide or a polypeptide. The nanopore device 100 includes a lipid bilayer 102 formed on a lipid bilayer compatible surface 104 of a conductive solid substrate 106, where the lipid bilayer compatible surface 104 may be isolated by lipid bilayer incompatible surfaces 105 and the conductive solid substrate 106 may be electrically isolated by insulating materials 107, and where the lipid bilayer 102 may be surrounded by amorphous lipid 103 formed on the lipid bilayer incompatible surface 105. The lipid bilayer comprising the nanopore can be disposed over a well, where a sensor forms part of the surface of the well. Descriptions of the location of nanopores in lipid bilayers over wells can be found, for example, in WO2015/061509. The lipid bilayer 102 is embedded with a single nanopore structure 108 having a nanopore 1 10 large enough for passing of at least a portion of the molecule 1 12 being characterized and/or small ions (e.g., Na+, K+, Ca2+, CI") between the two sides of the lipid bilayer 102. A layer of water molecules 1 14 may be adsorbed on the lipid bilayer compatible surface 104 and sandwiched between the lipid bilayer 102 and the lipid bilayer compatible surface 104. The aqueous film 1 14 adsorbed on the hydrophilic lipid bilayer compatible surface 104 may promote the ordering of lipid molecules and facilitate the formation of lipid bilayer on the lipid bilayer compatible surface 104. A sample chamber 1 16 containing a solution of the molecule 1 12 may be provided over the lipid bilayer 102 for introducing the molecule 1 12 for characterization. The solution may be an aqueous solution containing electrolytes and buffered to an optimum ion concentration and maintained at an optimum pH to keep the nanopore 1 10 open. The device includes a pair of electrodes 1 18 (including a negative node 1 18a and a positive node 1 18b) coupled to a variable voltage source 120 for providing electrical stimulus (e.g., voltage bias) across the lipid bilayer and for sensing electrical characteristics of the lipid bilayer (e.g., resistance, capacitance, and ionic current flow). The surface of the positive electrode 1 18b is or forms a part of the lipid bilayer compatible surface 104. The conductive solid substrate 106 may be coupled to or forms a part of one of the electrodes 1 18. The device 100 may also include an electrical circuit 122 for controlling electrical stimulation and for processing the signal detected. In some embodiments, the variable voltage source 120 is included as a part of the electrical circuit 122. The electrical circuitry 122 may include amplifier, integrator, noise filter, feedback control logic, and/or various other components. The electrical circuitry 122 may be integrated electrical circuitry integrated within a silicon substrate 128 and may be further coupled to a computer processor 124 coupled to a memory 126.
[0080] In one example, the nanopore device 100 of Figure 2 is an OmpG nanopore device having a single OmpG protein 108, e.g., a variant OmpG as described herein, embedded in a lipid bilayer 102 formed over a lipid bilayer compatible silver-gold alloy surface 104 coated on a copper material 106. The lipid bilayer compatible silver-gold alloy surface 104 is isolated by lipid bilayer incompatible silicon nitride surfaces 105, and the copper material 106 is electrically insulated by silicon nitride materials 107. The copper 106 is coupled to electrical circuitry 122 that is integrated in a silicon substrate 128. A silver-silver chloride electrode placed on-chip or extending down from a cover plate 128 contacts an aqueous solution containing dsDNA molecules.
[0081] The lipid bilayer may comprise or consist of phospholipid, for example, selected from diphytanoyl-phosphatidylcholine (DPhPC), 1 ,2-diphytanoyl-sn- glycero-3phosphocholine, 1 ,2-Di-0-Phytanyl-sn-Glycero-3-phosphocholine
(DoPhPC), palmitoyl-oleoyl-phosphatidylcholine (POPC), dioleoyl-phosphatidyl- methylester (DOPME), dipalmitoylphosphatidylcholine (DPPC),
phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine, phosphatidic acid, phosphatidylinositol, phosphatidylglycerol, sphingomyelin, 1 ,2-di-O-phytanyl- sn-glycerol; 1 ,2-dipalmitoyl-sn-glycero-3-phosphoethanolamine-N- [methoxy(polyethylene glycol)-350] ; 1 ,2-dipalmitoyl-sn-glycero-3- phosphoethanolamine-N-[methoxy(polyethylene glycol)-550]; 1 ,2-dipalmitoyl-sn- glycero-3-phosphoethanolamine-N-[methoxy(polyethylene glycol)-750]; 1 ,2- dipalmitoyl-sn-glycero-3-phosphoethanolamine-N-[methoxy(polyethylene glycol)- 1000]; 1 ,2-dipalmitoyl-sn-glycero-3-phosphoethanolamine-N- [methoxy(polyethylene glycol)-2000]; 1 ,2-dioleoyl-sn-glycero-3- phosphoethanolamine-N-lactosyl; GM1 Ganglioside, Lysophosphatidylcholine (LPC) or any combination thereof.
[0082] The nanopores can form an array. The disclosure provides an array of nanopore detectors (or sensors). Figure 3A is a top view of a schematic diagram of an embodiment of a nanopore chip 300 having an array 302 of individually addressable nanopore devices 100 having a lipid bilayer compatible surface 104 isolated by lipid bilayer incompatible surfaces 105. Each nanopore device 100 is complete with a control circuit 122 integrated on a silicon substrate 128. In some embodiments, side walls 136 may be included to separate groups of nanopore devices 100 so that each group may receive a different sample for characterization. In some embodiments, the nanopore chip 300 may include a cover plate 128. The nanopore chip 300 may also include a plurality of pins 304 for interfacing with a computer processor. In some embodiments, the nanopore chip 300 may be coupled to (e.g., docked to) a nanopore workstation 306, which may include various components for carrying out (e.g., automatically carrying out) the various embodiments of the processes of the present invention, including for example, analyte delivery mechanisms such as pipettes for delivering lipid suspension, analyte solution, and/or other liquids, suspension or solids, robotic arms, computer processor, and/or memory. Figure 3B is a cross sectional view of the nanopore chip 300. With reference to Figures 3A and 3B, a plurality of polynucleotides may be detected on an array of nanopore detectors. Here, each nanopore location comprises a nanopore, in some cases attached to a polymerase enzyme as described elsewhere herein. Each of the nanopores can be individually
addressable.
[0083] The methods of the invention involve the measuring of a current passing through the pore during interaction with a nucleotide. In some embodiments, sequencing a nucleic acid molecule can require applying a direct current (e.g., so that the direction at which the molecule moves through the nanopore is not reversed). However, operating a nanopore sensor for long periods of time using a direct current can change the composition of the electrode, unbalance the ion concentrations across the nanopore and have other undesirable effects. Applying an alternating current (AC) waveform can avoid these undesirable effects and have certain advantages as described below. The nucleic acid sequencing methods described herein that utilize tagged nucleotides are fully compatible with AC applied voltages, and AC can therefore be used to achieve said advantages.
[0084] Suitable conditions for measuring ionic currents through transmembrane protein pores are known in the art and examples are provided herein in the Experimental section. The method is carried out with a voltage applied across the membrane and pore. The voltage used is typically from -400 mV to +400 mV. The voltage used is preferably in a range having a lower limit selected from -400 mV, - 300 mV, -200 mV, -150 mV, -100 mV, -50 mV, -20 mV and 0 mV and an upper limit independently selected from +10 mV, +20 mV, +50 mV, +100 mV, +150 mV, +200 mV, +300 mV and +400 mV. The voltage used is more preferably in the range 100 mV to 240 mV and most preferably in the range of 160 mV to 240 mV. It is possible to increase discrimination between different nucleotides by a pore of the invention by using an increased applied potential. Sequencing nucleic acids using AC waveforms and tagged nucleotides is described in US Patent Publication US2014/0134616 entitled "Nucleic Acid Sequencing Using Tags", filed on
November 6, 2013, which is herein incorporated by reference in its entirety. In addition to the tagged nucleotides described in US2014/0134616, sequencing can be performed using nucleotide analogs that lack a sugar or acyclic moiety, e.g., (S)-Glycerol nucleoside triphosphates (gNTPs) of the five common nucleobases: adenine, cytosine, guanine, uracil, and thymidine (Horhota et al. Organic Letters, 8:5345-5347 [2006]).
Nanopore-Polymerase Complex
[0085] In some cases, a polymerase (e.g., DNA polymerase) is attached to and/or is located in proximity to the nanopore. The polymerase can be attached to the nanopore before or after the nanopore is incorporated into the membrane. In some cases, the polymerase is attached to the OmpG protein monomer and then the nanopore polymerase complex can then be inserted into the membrane.
[0086] An exemplary method for attaching a polymerase to a nanopore involves attaching a linker molecule to an OmpG monomer or mutating the OmpG to have an attachment site or attachment linker, and then attaching a polymerase to the attachment site or attachment linker (e.g., in bulk, before inserting into the membrane). The polymerase can also be attached to the attachment site or attachment linker after the nanopore is formed in the membrane. In some cases, a plurality of nanopore-polymerase pairs is inserted into a plurality of membranes (e.g., disposed over the wells and/or electrodes) of a biochip, thereby forming a nanopore chip as described herein. In some instances, the attachment of the polymerase to the nanopore to form a nanopore-polymerase complex occurs on the biochip above each electrode.
[0087] The polymerase can be attached to the nanopore with any suitable chemistry (e.g., covalent bond and/or linker). In some instances, the polymerase is expressed as a fusion protein that comprises a SpyCatcher polypeptide, which can be covalently bound to an OmpG nanopore that comprises a SpyTag peptide. In some instances, the polymerase is attached to the nanopore with molecular staples. In some instances, molecular staples comprise three amino acid sequences (denoted linkers A, B and C). Linker A can extend from an OmpG polypeptide, Linker B can extend from the polymerase, and Linker C then can bind Linkers A and B (e.g., by wrapping around both Linkers A and B) and thus bind the polymerase to the nanopore. Linker C can also be constructed to be part of Linker A or Linker B, thus reducing the number of linker molecules.
[0088] In some instances, the polymerase is linked to the nanopore using
Solulink™ chemistry. Solulink™ can be a reaction between HyNic (6-hydrazino- nicotinic acid, an aromatic hydrazine) and 4FB (4-formylbenzoate, an aromatic aldehyde). In some instances, the polymerase is linked to the nanopore using Click chemistry (available from LifeTechnologies for example). In some cases, zinc finger mutations are introduced into the OmpG molecule and then a molecule is used (e.g., a DNA intermediate molecule) to link the polymerase to the zinc finger sites on the OmpG.
[0089] Other linkers that may find use in attaching the polymerase to a nanopore are direct genetic linkage (e.g., (GGGGS)i-3 amino acid linker (SEQ ID NO: 19)), transglutaminase mediated linking (e.g., RSKLG (SEQ ID NO: 23)), sortase mediated linking, and chemical linking through cysteine modifications. Specific linkers contemplated as useful herein are (GGGGS)i-3 (SEQ ID NO: 19), K-tag (RSKLG (SEQ ID NO: 23)) on N-terminus, ATEV site (12-25), ATEV site + N- terminus of SpyCatcher (12-49). [0090] The polymerase may be coupled to the nanopore by any suitable means. See, for example, PCT/US2013/068967 (published as WO2014/074727; Genia Technologies, Inc.), PCT/US2005/009702 (published as WO2006/028508;
President and Fellows of Harvard College), and PCT/US201 1/065640 (published as WO2012/083249; Columbia University).
[0091] In some instances, the nanopore and polymerase are produced as a fusion protein (i.e., single polypeptide chain), and are incorporated into the membrane as such.
[0092] The polymerase can be mutated to reduce the rate at which the
polymerase incorporates a nucleotide into a nucleic acid strand (e.g., a growing nucleic acid strand). In some cases, the rate at which a nucleotide is incorporated into a nucleic acid strand can be reduced by functionalizing the nucleotide and/or template strand to provide steric hindrance, such as, for example, through methylation of the template nucleic acid strand. In some instances, the rate is reduced by incorporating methylated nucleotides.
Methods for sequencing polynucleotides
[0093] The molecules being characterized using the variant OmpG polypeptides described herein can be of various types, including charged or polar molecules such as charged or polar polymeric molecules. Specific examples include ribonucleic acid (RNA) and deoxyribonucleic acid (DNA) molecules. The DNA can be a single-strand DNA (ssDNA) or a double-strand DNA (dsDNA) molecule.
[0094] In one aspect, provided are methods for sequencing nucleic acids using the instant OmpG variant nanopores. The OmpG variants provided in the present disclosure can be used for determining the sequence of nucleic acids according to other nanopore sequencing platforms known in the art. For example, the OmpG variant provided in this disclosure may be suitable for sequencing nucleic acids according to the exonuclease-based method of Oxford Nanopore (Oxford, UK), the nanopore-based sequencing-by-hybridization of NABsys (Providence, Rl), the fluorescence-based optical nanopore sequencing of NobleGen Biosciences (Concord, MA), lllumina (San Diego, CA), and the nanopore sequencing-by- expansion of Stratos Genomics (Seattle, WA). In some embodiments, sequencing of nucleic acids using the OmpG variants can be performed using tagged nucleotides as is described in PCT/US2013/068967 (entitled "Nucleic Acid Sequencing Using Tags" filed on November 7, 2013, which is herein incorporated by reference in its entirety). For example, a variant OmpG nanopore that is situated in a membrane {e.g., a lipid bilayer) adjacent to or in sensing proximity to one or more sensing electrodes, can detect the incorporation of a tagged nucleotide by a polymerase as the nucleotide base is incorporated into a polynucleotide strand and the tag of the nucleotide is detected by the nanopore. The polymerase can be associated with the nanopore as described above.
[0095] Tags of the tagged nucleotides can include chemical groups or molecules that are capable of being detected by a nanopore. Examples of tags used to provide tagged nucleotides are described at least at paragraphs [0414] to [0452] of PCT/US2013/068967. Nucleotides may be incorporated from a mixture of different nucleotides, e.g., a mixture of tagged dNTPs where N is adenosine (A), cytidine (C), thymidine (T), guanosine (G) or uracil (U). Alternatively, nucleotides can be incorporated from alternating solutions of individual tagged dNTPs, i.e., tagged dATP followed by tagged dCTP, followed by tagged dGTP, etc.
Determination of a polynucleotide sequence can occur as the nanopore detects the tags as they flow through or are adjacent to the nanopore, as the tags reside in the nanopore and/or as the tags are presented to the nanopore. The tag of each tagged nucleotide can be couple to the nucleotide base at any position including, but not limited to a phosphate (e.g., gamma phosphate), sugar or nitrogenous base moiety of the nucleotide. In some cases, tags are detected while tags are associated with a polymerase during the incorporation of nucleotide tags. The tag may continue to be detected until the tag translocates through the nanopore after nucleotide incorporation and subsequent cleavage and/or release of the tag. In some cases, nucleotide incorporation events release tags from the tagged nucleotides, and the tags pass through a nanopore and are detected. The tag can be released by the polymerase, or cleaved/released in any suitable manner including without limitation cleavage by an enzyme located near the polymerase. In this way, the incorporated base may be identified {i.e., A, C, G, T or U) because a unique tag is released from each type of nucleotide {i.e., adenine, cytosine, guanine, thymine or uracil). In some situations, nucleotide incorporation events do not release tags. In such a case, a tag coupled to an incorporated nucleotide is detected with the aid of a nanopore. In some examples, the tag can move through or in proximity to the nanopore and may be detected with the aid of the nanopore.
[0096] In some cases, tagged nucleotides that are not incorporated pass through the nanopore. The method can distinguish between tags associated with un- incorporated nucleotides and tags associated with incorporated nucleotides based on the length of time the tagged nucleotide is detected by the nanopore. In one embodiment, an un-incorporated nucleotide is detected by the nanopore for less than about 1 millisecond and an incorporated nucleotide is detected by the nanopore for at least about 1 millisecond.
[0097] Thus, in one aspect, the disclosure provides for a method for sequencing a nucleic acid with the aid of a variant OmpG nanopore. In one embodiment, a method is provided for sequencing a nucleic acid with the aid of a variant OmpG nanopore adjacent to a sensing electrode by (a) providing tagged nucleotides into a reaction chamber comprising the nanopore, wherein an individual tagged nucleotide of the tagged nucleotides contains a tag coupled to a nucleotide, which tag is detectable with the aid of the nanopore; (b) carrying out a polymerization reaction, with the aid of a polymerase, thereby incorporating an individual tagged nucleotide of the tagged nucleotides into a growing strand complementary to a single stranded nucleic acid molecule from the nucleic acid sample; and (c) detecting, with the aid of the nanopore, a tag associated with the individual tagged nucleotide during and/or upon incorporation of the individual tagged nucleotide, wherein the tag is detected with the aid of the nanopore when the nucleotide is associated with the polymerase. Other embodiments of the sequencing method that comprise the use of tagged nucleotides with the present variant OmpG nanopores for sequencing polynucleotides are provided in WO2014/074727, which is incorporated herein by reference in its entirety.
EXAMPLES
Example 1
Expression and purification of OmpG-EXT
[0098] A DNA encoding a form of the mature OmpG protein (residues 22-301 ; Uniprot entry P76045) lacking loop 6 and having substitution E229A (OmpG-EXT) was synthesized (Genscript, NJ) based on the ΔΙ.6ΙΙ construct described by
Grosse et al. {Biochemistry 53:4826-4838 [2014]). The synthetic DNA (OmpG- ΔΙ.6/Ε229Α) encodes an OmpG construct having a deletion of loop 6 and a C- terminal sequence: linkerl -His tag-linker2-Spytag (SEQ ID NO:3). A synthetic DNA sequence derived from the wild-type sequence (SEQ ID NO:4) and encoding the truncation of loop 6 (ΔΙ.6) and substitution E229A, was cloned in the pET-26b vector and produced in OmpG-deficient BI21 DE3 E. coli
(www.neb.com/products/c2527-bl21 de3-competent-e-coli) as inclusion bodies.
[0099] Expression of OmpG-AL6/E229A was obtained by IPTG-induced transcription of the OmpG- AL6/E229A DNA in BI21 DE3 E. coli cells growing in MagicMedia™ (Invitrogen, Carlsbad, CA) for approximately 24 hours. The cells were centrifuged then resuspended in 50mM Tris, PH8.0 (5ml buffer to 1 g cell pellet). Next, the cells were sonicated, and the lysate centrifuged (10,000 x g/20 min/4 °C). The pellet was washed twice by centrifugation and resuspension, and the final pellet was resuspended in 50 mM Tris pH8.0 at a concentration of 200 mg/ml. The inclusion bodies were aliquoted and stored at - 80°C.
[00100] The inclusion bodies were solubilized in 50 mM Tris pH8.0, 6 M urea, and 2.4mM TCEP at 60 °C for 10 minutes. Unsolubilized inclusion bodies were removed by centrifugation, and the solubilized protein present in the supernatant was diluted to 1 mg/ml in refolding buffer (25mM Tris, pH8.0, 1.8M urea, 1 mM TCEP, and 3% β-OG). The OmpG-AL6/E229A protein was refolded for 16 hours at 37 °C, then diluted using 50 mM Tris, pH8.0, 200mM NaCI, 5mM imidazole 1 % β- OG to obtain a final concentration of 1 % β-OG and 0.8M urea. The refolded protein was purified by affinity chromatography using TALON® (Clontech,
Mountain View, CA), and eluted in 50 mM Tris, pH8.0, 200 mM NaCI, 200 mM imidazole, and 0.1 % Tween-20. TALON is an immobilized metal affinity chromatography (IMAC) resin charged with cobalt, which binds to his-tagged proteins with higher specificity than nickel-charged resins.
[00101] Variants of the OmpG-AL6/E229A protein were obtained by site-directed mutagenesis on the basis of the DNA encoding the OmpG-AL6/E229A construct in the pET-26b vector. All OmpG-AL6/E229A variants were expressed and purified as described for OmpG-AL6 as described above.
Example 2
Characterization of OmpG variants
[00102] To demonstrate the ability of the variant OmpG polypeptides to reduce spontaneous gating, individual variants were reconstituted as pores in a lipid bilayer over a well on a semiconductor sensor chip (a), and single channel recordings obtained for each of the OmpG variant pores (b).
[00103] (a) Reconstitution of OmpG variants into lipid bilayers [00104] Variant OmpG proteins comprising the deletion of loop 6 (AL6), and amino acid substitution E229A, i.e., ΔΙ.6/Ε229Α, in combination with one of amino acid substitutions selected from Y50K, Y50N, R68N, R21 1 N, or E17K, and/or in combination with deletion of D215, were expressed and purified as described in Example 1 . The lipid bilayer was formed and the nanopore was inserted as described in PCT/US2014/061853 (entitled "Methods for Forming Lipid Bilayers on Biochips" and filed 22 October 2014).
[00105] (b) Single-channels recordings in lipid bilayers
[00106] To determine the effect of the mutations on current flow, single channel recordings were made for currents passing through variants of the OmpG-EXT nanopore. Measurements of ionic current flow through the OmpG nanopore were made using DC.
[00107] Chambers were filled with 20 mM HEPES, pH 8.0, 300 mM NaCI, 3mM CaCI2 unless otherwise noted. The current was measured using in-house built GeniaChip™ DNA sequencers.
[00108] Channel current traces of OmpG-AL6/E229A -Y50K (SEQ ID NO:6), OmpG- AL6/E229A -R68N (SEQ ID NO:7), OmpG- AL6/E229A -R21 1 N (SEQ ID NO:8), OmpG-AL6/E229A -E17K (SEQ ID NO:9), and OmpG-AL6/E229A -del215 (SEQ ID NO: 10) are shown in Figures 4A-4E. The traces show that OmpG- AL6/E229A -R68N (B), OmpG-AL6/E229A -R21 1 N (C), OmpG-AL6/E229A-E17K (D), display substantial flickering, i.e., upward and downward currents. OmpG- AL6/E229A-Y50K (A) displays less flickering, and a lower open channel current level of about 30 pA. OmpG-AL6/E229A-del215 (E) shows the least level of spontaneous gating, and maintains an open channel level of about 35pA.
[00109] The number of resistive events measured for the OmpG-AL6/E229A variants and for variant OmpG-AL6/E229A-del215-Y50K (Figures 4A-4E) was analyzed to calculate the effect of the mutations on gating frequency. Figure 5A shows a histogram that relates the mean and S.D. of the open channel (OC) current noise (upward and downward current) to the percent of the same measurements that occurred outside 1 S.D. from the mean measurements. The histogram shows that OmpG-AL6/E229A-del215 (del215) ("AL6-del215" in Figs. 5A and 5B) has the lowest amount of noise when compared to the parental OmpG-AL6/E229A ("AL6" in Figs. 5A and 5B), and to other variants: OmpG- AL6/E229A-Y50K ("AL6-Y50K" in Figs. 5A and 5B), OmpG-AL6/E229A-R68N ("AL6-R68N" in Figs. 5A and 5B), OmpG-AL6/E229A-R21 1 N ("AL6-R21 1 N" in Figs. 5A and 5B), OmpG-AL6/E229A-E17K ("AL6-E17K" in Figs. 5A and 5B), , and OmpG-AL6/E229A-del215-Y50K ("ΔΙ.6-Υ50Κ" in Figs. 5A and 5B), and maintains an open channel current of about 35 pA.
[00110] The mean and S.D. of the downward current only is provided in Figure 5B. The
OmpG-AL6/E229A-Y50K, OmpG-AL6/E229A-del215, and OmpG-AL6/E229A- del215-Y50K showed the smallest amount of downward current when compared, for example, to the parental OmpG-AL6/E229A . Example 3
Attachment of a polymerase to an OmpG nanopore
[00111] A DNA sequence that encodes a His-tagged polymerase, pol6, was purchased from a commercial source (DNA 2.0, Menlo Park, California), and then engineered to comprise a SpyCatcher domain at its C-terminus. (Li et al., J Mol S/'o/ 23:426(2):309-317 [2014]). The Pol6 was ligated into the pD441 vector
(expression plasmid), which was subsequently transformed into competent E. coli. 1 ml starter culture in LB with 0.2% Glucose and 100 μg/ml Kanamycin for approximately 8 hrs. 25 μΙ of log phase starter culture was transferred into 1 ml of expression media (Terrific Broth (TB) autoinduction media supplemented with 0.2%glucose, 50 mM Potassium Phosphate, 5mM MgCI2 and 100 μg/ml
Kanamycin) in 96-deep well plates. The plates were incubated with shaking at 250-300rpm for 36-40 hrs at 28°C.
[00112] Cells were then harvested via centrifugation at 3200 x g for 30 minutes at 4°C. The media was decanted off and the cell pellet resuspended in 200 μΙ pre- chilled lysis buffer (20mM Potassium Phosphate pH 7.5, 100 mM NaCI, 0.5% Tween20, 5mM TCEP, 10mM Imidazole, 1 mM PMSF, 1 X BugBuster® protein extraction reagent, 100 μg/ml Lysozyme and protease inhibitors) and incubate at room temperature for 20 min with mild agitation. 20 μΙ of reagent was then added from a 10x stock to a final concentration of 100 μg/ml DNase, 5 mM MgCI2, 100 μg/ml RNase I, and incubated on ice for 5-10 min to produce a lysate. The lysate was supplemented with 200 μΙ of 1 M Potassium Phosphate, pH 7.5 (final concentration was about 0.5M Potassium phosphate in 400 μΙ lysate) and filtered through Pall filter plates (Part# 5053, 3 micron filters) via centrifugation at approximately 1500 rpm at 4°C for 10 minutes. The clarified lysates were then applied to equilibrated 96-well His-Pur Cobalt plates (Pierce Part# 90095) and bound for 15-30 min.
[00113] The flow through (FT) was collected by centrifugation at 500 x g for 3 min. The FT was then washed 3 times with 400μΙ of wash buffer 1 (0.5M Potassium Phosphate pH 7.5, 1 M NaCI 5mM TCEP, 20mM Imidazole, and 0.5%Tween20).
The FT was then washed twice in 400μΙ wash buffer 2 (50mM Tris pH 7.4, 200mM KCI, 5mM TCEP, 0.5% Tween20, 20mM lmidazole).The Pol6 was eluted using 200 μΙ elution buffer (50mM Tris Ph7.4, 200mM KCI, 5mM TCEP, 0.5% Tween20, 300mM Imidazole, 25%Glycerol) and collected after 1-2 min incubation. Eluate was reapplied to the same His-Pur plate 2-3 times to obtain concentrated Pol6.
The purified polymerase was >95% pure as evaluated by SDS-PAGE. The protein concentration was ~3uM (0.35mg/ml) with a 260/280 ratio of 0.6 as evaluated by NanoDrop®. Polymerase activity was checked by fluorescence displacement assay.
[00114] The Pol6-His-SpyCatcher protein was incubated overnight at 4 °C in 3 mM SrCI2 with the OmpG-EXT-His-SpyTag (SEQ ID NO:3) to allow for the covalent attachment of the SpyCatcher with the SpyTag, thereby forming an OmpG-polymerase complex. The OmpG-polymerase complex was purified using affinity chromatography, and tested for its ability to capture and identify tagged nucleotides as described in Example 4.
Example 4
Detection of nucleotide bases by polymerase-variant OmpG complexes
[00115] The ability of OmpG-EXT-del215 (SEQ ID NO:10), i.e., OmpG- AL6/E229A-del215, to identify nucleotides captured by a polymerase was assessed using OmpG-EXT-del215 complexed with polymerase Pol6 in the presence of DNA template JAM1A in 300 mM NaCI, 3 mM CaCI2, 20 mM HEPES, pH 7.5. Template JAM1A is a DNA template that provides an adenine nucleotide base that is complementary to the tagged thymidine nucleotide used in the assay (Synthesized by Roche Penzberg, Germany) and that would be captured by the polymerase.
[00116] DC current measurements were made at a constant voltage of 100 mV applied for 10 minutes. Different sets of tagged nucleotides were used. Figure 6 shows an example of a trace that demonstrates that the OmpG-EXT-del215 - polymerase complex identifies four different tagged nucleotides that were captured by the polymerase: T-T30, T-dSp30, T-Tmp6, and T-dSp5. Capture of each of the four nucleotides is reflected by four different changes in the current flowing thorough OmpG-EXT-del215 nanopore as the corresponding nucleotide tags are detected by the nanopore (Figure 6). The arrows indicate the four tags of the tagged nucleotides diminish channel current to four different levels. Each of the four nucleotides were identified in similar measurements during which current was measured as the nucleotides were individually added to the nanopore. Figure 7 shows the identification of the nucleotides by the individual effects of the corresponding tags on reducing the open channel current as they were detected by the nanopore. Figure 8 shows an expanded view of the capture of the four nucleotides detected under DC conditions and indicated by the arrows in Figure 6.
SEQUENCE LISTING FREE TEXT
SEQ ID N0:1 (Wild-type OmpG; >sp|P76045|OMPG_ECOLI Outer membrane protein G; OS=Escherichia coli (strain K12)) KKLLPCTAL V CAGMACAQ AEERNDWHFN IGAMYEIENV EGYGEDMDGL 50
AEPSVYFNAA NGPWRIALAY YQEGPVDYSA GKRGTWFDRP ELEVHYQFLE 100 NDDFSFGLTG GFRNYGYHYV DEPGKDTANM QRWKIAPDWD VKLTDDLRFN 150
GWLSMYKFAN DLNTTGYADT RVETETGLQY TFNETVALRV NYYLERGFNM 200
DDSRNNGEFS TQEIRAYLPL TLGNHSVTPY TRIGLDRWSN WDWQDDIERE 250 GHDFNRVGLF YGYDFQNGLS VSLEYAFEWQ DHDEGDSDKF HYAGVGVNYS 300
F
SEQ ID NO:2 (Mature wild-type OmpG from E.coli (strain K12);
sequence for numbering))
EERNDWHFNI GAMYEIENVE GYGEDMDGLA EPSVYFNAAN GPWRIALAYY 50 QEGPVDYSAG KRGTWFDRPE LEVHYQFLEN DDFSFGLTGG FRNYGYHYVD 100 EPGKDTANMQ RWKIAPDWDV KLTDDLRFNG WLSMYKFAND LNTTGYADTR 150
VETETGLQYT FNETVALRVN YYLERGFNMD DSRNNGEFST QEIRAYLPLT 200 LGNHSVTPYT RIGLDRWS W DWQDDIEREG HDFNRVGLFY GYDFQNGLSV 250 SLEYAFEWQD HDEGDSDKFH YAGVGVNYSF 280
SEQ ID NO:3 (Synthetic OmpG-AL6 fusion protein - HisTag-SpyTag as expressed in E.coli)
EERNDWHFNI GAMYEIENVE GYGEDMDGLA EPSVYFNAAN GPWRIALAYY 50
QEGPVDYSAG KRGTWFDRPE LEVHYQFLEN DDFSFGLTGG FRNYGYHYVD 100
EPGKDTANMQ RWKIAPDWDV KLTDDLRFNG WLSMYKFAND LNTTGYADTR 150
VETETGLQYT FNETVALRVN YYLERGFNMD DSRNNGEFST QEIRAYLPLT 200
LGNHSVTPYT RIGLDRAGHD FNRVGLFYGY DFQNGLSVSL EYAFEWQDHD 250 EGDSDKFHYA GVGVNYSFEK EKEKGSHHHH HHGSGGAHIV MVDAYKPTK
299
SEQ ID NO:4 (Wild-Type OmpG; Escheridia coli porin (ompG) gene GM806593)
atgaaaaagttattaccctgtaccgcactggtgatgtgtgcgggaatggcctgcgcacaggccgagg aaaggaacgactggcactttaatatcggcgcgatgtacgaaatagaaaacgtcgagggttatggcga agatatggatgggctggcggagccttcagtctattttaatgccgccaacgggccgtggagaattgct ctggcctattatcaggaagggccggtagattatagcgcgggtaaacgtggaacgtggtttgatcgcc cggagctggaggtgcattatcagttcctcgaaaacgatgatttcagtttcggcctgaccggcggttt ccgtaattatggttatcactacgttgatgaaccgggtaaagacacggcgaatatgcagcgctggaaa atcgcgccagactgggatgtgaaactgactgacgatttacgtttcaacggttggttgtcgatgtata aatttgccaacgatctgaacactaccggttacgctgatacccgtgtcgaaacggaaacaggtctgca atataccttcaacgaaacggttgccttgcgagtgaactattatctcgagcgcggcttcaatatggac gacagccgcaataacggtgagttttccacgcaagaaattcgcgcctatttgccgctgacgctcggca accactcggtgacgccgtatacgcgcattgggctggatcgctggagtaactgggactggcaggatga tattgaacgtgaaggccatgattttaaccgtgtaggtttattttacggttatgatttccagaacgga ctttccgtttcgctggaatacgcgtttgagtggcaggatcacgacgaaggcgacagtgataaattcc attatgcaggtgtcggcgtaaattactcgttctgataat
SEQ ID NO:5 (OmpG-AL6)
EERNDWHFNI GAMYEIENVE GYGEDMDGLA EPSVYFNAAN GPWRIALAYY 50 QEGPVDYSAG KRGTWFDRPE LEVHYQFLEN DDFSFGLTGG FRNYGYHYVD 100 EPGKDTANMQ RWKIAPDWDV KLTDDLRFNG WLSMYKFAND LNTTGYADTR 150 VETETGLQYT FNETVALRVN YYLERGFNMD DSRNNGEFST QEIRAYLPLT 200 LGNHSVTPYT RIGLDRAGHD FNRVGLFYGY DFQNGLSVSL EYAFEWQDHD 250 EGDSDKFHYA GVGVNYSFEK EKEKGSHHHH HHGSGGAHIV MVDAYKPTK 268
SEQ ID NO:6 (Ompg-AL6-Y50K)
EERNDWHFNI GAMYEIENVE GYGEDMDGLA EPSVYFNAAN GPWRIALAYK 50 QEGPVDYSAG KRGTWFDRPE LEVHYQFLEN DDFSFGLTGG FRNYGYHYVD 100 EPGKDTANMQ RWKIAPDWDV KLTDDLRFNG WLSMYKFAND LNTTGYADTR 150 VETETGLQYT FNETVALRVN YYLERGFNMD DSRNNGEFST QEIRAYLPLT 200 LGNHSVTPYT RIGLDRAGHD FNRVGLFYGY DFQNGLSVSL EYAFEWQDHD 250 EGDSDKFHYA GVGVNYSFEK EKEKGSHHHH HHGSGGAHIV MVDAYKPTK 299 SEQ ID NO:7 (AL6-R68N OmpG)
EERNDWHFNI GAMYEIENVE GYGEDMDGLA EPSVYFNAAN GPWRIALAYY 50 QEGPVDYSAG KRGTWFDNPE LEVHYQFLEN DDFSFGLTGG FRNYGYHYVD 100 EPGKDTANMQ RWKIAPDWDV KLTDDLRFNG WLSMYKFAND LNTTGYADTR 150 VETETGLQYT FNETVALRVN YYLERGFNMD DSRNNGEFST QEIRAYLPLT 200 LGNHSVTPYT RIGLDRAGHD FNRVGLFYGY DFQNGLSVSL EYAFEWQDHD 250 EGDSDKFHYA GVGVNYSFEK EKEKGSHHHH HHGSGGAHIV MVDAYKPTK 299
SEQ ID NO:8 (AL6-R211 N OmpG)
EERNDWHFNI GAMYEIENVE GYGEDMDGLA EPSVYFNAAN GPWRIALAYY 50 QEGPVDYSAG KRGTWFDRPE LEVHYQFLEN DDFSFGLTGG FRNYGYHYVD 100 EPGKDTANMQ RWKIAPDWDV KLTDDLRFNG WLSMYKFAND LNTTGYADTR 150 VETETGLQYT FNETVALRVN YYLERGFNMD DSRNNGEFST QEIRAYLPLT 200 LGNHSVTPYT NIGLDRAGHD FNRVGLFYGY DFQNGLSVSL EYAFEWQDHD 250 EGDSDKFHYA GVGVNYSFEK EKEKGSHHHH HHGSGGAHIV MVDAYKPTK 299
SEQ ID NO:9 (AL6-E17K OmpG)
EERNDWHFNI GAMYEIKNVE GYGEDMDGLA EPSVYFNAAN GPWRIALAYY 50 QEGPVDYSAG KRGTWFDRPE LEVHYQFLEN DDFSFGLTGG FRNYGYHYVD 100 EPGKDTANMQ RWKIAPDWDV KLTDDLRFNG WLSMYKFAND LNTTGYADTR 150 VETETGLQYT FNETVALRVN YYLERGFNMD DSRNNGEFST QEIRAYLPLT 200 LGNHSVTPYT RIGLDRAGHD FNRVGLFYGY DFQNGLSVSL EYAFEWQDHD 250 EGDSDKFHYA GVGVNYSFEK EKEKGSHHHH HHGSGGAHIV MVDAYKPTK 299
SEQ ID NO:10 (AL6-del215 OmpG)
EERNDWHFNI GAMYEIENVE GYGEDMDGLA EPSVYFNAAN GPWRIALAYY 50
QEGPVDYSAG KRGTWFDRPE LEVHYQFLEN DDFSFGLTGG FRNYGYHYVD 100
EPGKDTANMQ RWKIAPDWDV KLTDDLRFNG WLSMYKFAND LNTTGYADTR 150
VETETGLQYT FNETVALRVN YYLERGFNMD DSRNNGEFST QEIRAYLPLT 200
LGNHSVTPYT RIGLRAGHDF NRVGLFYGYD FQNGLSVSLE YAFEWQDHDE 250
GDSDKFHYAG VGVNYSFEKE KEKGSHHHHH HGSGGAHIVM VDAYKPTK 298
SEQ ID NO:11 (AL6-del215-Y50K OmpG)
EERNDWHFNI GAMYEIENVE GYGEDMDGLA EPSVYFNAAN GPWRIALAYK QEGPVDYSAG KRGTWFDRPE LEVHYQFLEN DDFSFGLTGG FRNYGYHYVD EPGKDTANMQ RWKIAPDWDV KLTDDLRFNG WLSMYKFAND LNTTGYADTR 150 VETETGLQYT FNETVALRVN YYLERGFNMD DSRNNGEFST QEIRAYLPLT 200 LGNHSVTPYT RIGLRAGHDF NRVGLFYGYD FQNGLSVSLE YAFEWQDHDE 250 GDSDKFHYAG VGVNYSFEKE KEKGSHHHHH HGSGGAHIVM VDAYKPTK 298
SEQ ID NO:12 (membrane protein (Shigella flexneri); KGY82041)
MKKLLPCTAL VMCAGMACAQ AEEK DWHFN IGAMYEIENV EGYGEDMDGL 50
AEPSVYFNAA NGPWRIALAY YQEGPVDYSA GKRGTWFDRP ELEVHYQFLE 100
SDDFSFGLTG GFRNYGYHYV DEPGKDTANM QRWKIAPDWD VKLTDDLRFN 150
GWLSMYKFAN DLNTTGYADT RVETETGLQY TFNETVALRV NYYLERGFNM 200
DDSRNNGEFS TQEIRAYLPL TLGNHSVTPY TRIGLDRWSN WDWQDDIERE 250
GHDFNRVGLF YGYDFQNGLS VSLEYAFEWQ DHDEGDSDKF HYAGVGVNYS 300
F 301
SEQ ID NO:13 (outer membrane protein G (Salmonella enterica);
WP_023246462)
MKTLLSSTAL VMCAGMACAQ AAEK DWHFN IGAMYEIENV EGQAEDMDGL 50
GEPSIYFNAA NGPWKISLAY YQEGPVDYSA GKRGTWFDRP ELEIRYQLLE 100
SDDVNFGLTG GFRNYGYHYV DEPGKDTANM QRWKVQPDWD IKLSDDLRFG 150
GWLAMYQFVN ELSITGYSDS RVESETGFTY KINDMFSMVT NYYLERGFNI 200
DKSRNNGEFS TQEIRAYLPI SLGNTTLTPY TRIGLDRWTN WDWQDDPERE 250
GHDFNRLGLL YAYDFQNGLS MTLEYAFECE DHDEGESDKF HYAGIGINYA 300
F 301
SEQ ID NO:14 (outer membrane protein G (Salmonella enterica);
WP_023220551)
MKTLLSSTAL VMCAGMACAQ AAEKNDWHFN VGAMYEIENV EGQGENMDGL 50
AEPSIYFNAA NGPWRISVAY YQEGPVDYSA GKRGTWFDRP EFEVHYQFLE 100
SDDVNFGLTG GFRNYGYHYV DEPGKDTANM QRWKVQPDWD IKLSDDLRFG 150
GWFAMYQFVN DLSITGYSDS RVETETGFTY KINDTFSMVT NYYLERGFNI 200
DKSRNNGEFS TQEIRAYLPI SLGNTTLTPY TRLGLDRWSN WDWQDDPERE 250
GHDFNRLGLL YAYDFQNGLS MTLEYAFEWE DHDEGESDKF HYAGVGINYA 300 SEQ ID NO:15 (membrane protein (Citrobacter farmeri);
WP_042318786)
MKTLLSTTAL MLCAAISCAQ AAEK DWHFN IGAMYEIENV EGYGEDMDGL 50 AEPSVYFNAA NGPWRISLAY YQEGPVDYSA GKRGTWFDRP ELEVHYQIQE 100
SDEFSFGLTG GFRNYGYHYV NEAGKDTANM QRWKVQPDWD VKITDDLRFS 150
GWLSMYQFVN DLSTTGYADS RLESETGLHY TFNETVGVIV NYYLERGFNL 200
ADHRNNGEFS TQEIRAYLPL SLGNTTLTPY TRIGLDRWSN WDWRDDPERE 250
GHDFNRLGLQ YAYDFQNGLS MTLEYAYEWE DHDEGESDRF HYAGVGVNYA 300 F 301
SEQ ID NO:16 ((EK)3-His6-GSGG-SpyTag - Mnker-HisSpyTag construct)
EKEKEKGSHH HHHHGSGGAH IVMVDAYKPT K 31
SEQ ID NO:17 Circular permutation variant of E. coli OmpG
DKFHY¾GVGVIVySFGSGEERNDWHFNIGAMYEIENVEGYGEDMDGLAEPSVYFNAANGPWRIALAY KQEGPVDYSAGKRGTWFDRPELEVHYQFLENDDFSFGLTGGFRNYGYHYVDEPGKDTANMQRWKIAP DWDVKLTDDLRFNGWLSMYKFANDLNTTGYADTRVETETGLQYTFNETVALRVNYYLERGFNMDDSR NNGEFSTQEIRAYLPLTLGNHSVTPYTRIGLRAGHDFNRVGLFYGYDFQNGLSVSLEYAFEWQDHDE GDSEKEKEKGSHHHHHHGSGGAHIVMVDAYKPTK
CITATION LIST
Patent Literature
[1] PCT/US2005/009702 (published as WO2006/028508 on 16 March 2006; President and Fellows of Harvard College; entitled METHODS AND APPARATUS FOR CHARACTERIZING POLYNUCLEOTIDES.
[2] PCT/US201 1/065640 (published as WO2012/083249 on 21 June 2012; Columbia University; entitled DNA SEQUENCING BY SYNTHESIS USING MODIFIED NUCLEOTIDES AND NANOPORE DETECTION).
[3] PCT/US2013/068967 (published as WO2014/074727 on 15 May 2014; Genia Technologies; entitled NUCLEIC ACID SEQUENCING USING TAGS).
[4] US20140134616 (published on May 15 2014; Genia Technologies; entitled NUCLEIC ACID SEQUENCING USING TAGS).
[5] PCT/US2014/061853 (published AS WO2015/061510 on April 30, 2015; Genia Technologies; entitled METHODS FOR FORMING LIPID BILAYERS ON BIOCHIPS).
[6] PCT/US201 1/000205 (Genia Technologies, Inc. entitled SYSTEMS FOR MANIPULATING A MOLECULE IN A NANOPORE, published August 1 1 , 201 1 as WO201 1/097028) Non-Patent Literature
[1] Conlan and Bayley, Folding of a Monomeric Porin, OmpG, in Detergent Solution; Biochemistry 42;9453-9465 (2003).
[2] Subbarao and van den Berg, Crystal Structure of the monomeric Porin OmpG; J Mol Biol 360:750-759 (2006).
[3] Grosse et al., Structural and functional characterization of a synthetically modified OmpG; Bioorganic and Medicinal Chem 18:7716-7723 (2010).
[4] Anbazhagan et al., Incorporation of Outer Membrane Protein OmpG in Lipid Membranes: Protein-lipid Interactions and β-Barrel Orientation; Biochemistry 47:6189-698 (2008).
[5] Fahie et al., Resolved Single-Molecule Detection of Individual Species within a Mixture of anti-Biotin Antibodies using an Engineered Monomeric Nanopore; ACS Nano 9:1089-1098 (2015).
[6] Chen et al., Outer membrane protein G: Engineering a quiet pore for biosensing, Proc Natl Acad Sci 105:6272-6277 (2008). [7] Grosse et al., Structure-based Engineering of a Minimal Porin Reveals Loop- Independent Channel closure; Biochemistry 53:4826-4838 (2014).
[8] Astier et al., J Am Chem Soc 10.1021/ja057123+, published online on December 30, 2005.

Claims

PATENT CLAIMS
1. An isolated OmpG variant of the parental E. coli OmpG of SEQ ID NO:2 or homolog thereof, wherein said variant comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31 , and wherein said variant retains the ability to form a nanopore.
2. An isolated OmpG variant of the parental OmpG of SEQ ID NO:2 or homolog thereof, wherein said variant comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and a deletion of amino acid D215, and retains the ability to form a nanopore.
3. The isolated OmpG variant of Claim 2, further comprising a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31 of SEQ ID NO:2.
4. The isolated OmpG variant of Claim 1 or 3, wherein said mutation of said one or more of amino acids in SEQ ID NO:2 is selected from R21 1 N, R68N, Y50K,
Y50N, and E17K.
5. The isolated OmpG variant of Claim 1 or 3, wherein said mutation of said one or more amino acids is amino acid substitution Y50K.
6. The isolated OmpG variant of Claim 1 or 3, further comprising a polymerase, wherein said polymerase is operably linked to said variant OmpG.
7. The isolated OmpG variant of any one of Claims 1 -6, wherein said OmpG variant retains the ability to form a nanopore in a lipid layer.
8. An isolated nucleic acid comprising a polynucleotide sequence encoding a variant of the parental OmpG of SEQ ID NO:2, wherein said variant OmpG comprises a deletion of one or more of amino acids 216-227, amino acid substitution E229A, and (i) a deletion of amino acid D215; and/or
(ii) a mutation of one or more of amino acids R21 1 ,E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, and E31 .
9. A method for sequencing a nucleic acid sample with the aid of a variant OmpG nanopore in a membrane adjacent to a sensing electrode, said method
comprising:
(a) providing tagged nucleotides into a reaction chamber comprising said nanopore, wherein an individual tagged nucleotide of said tagged nucleotides comprises a tag coupled to a nucleotide, which tag is detectable with the aid of said nanopore;
(b) carrying out a polymerization reaction with the aid of a single polymerase coupled to said variant OmpG nanopore, thereby incorporating an individual tagged nucleotide of said tagged nucleotides into a growing strand complementary to a single stranded nucleic acid molecule from said nucleic acid sample; and
(c) detecting, with the aid of a variant OmpG nanopore of any one of Claims 1 -1 1 , a tag associated with said individual tagged nucleotide during incorporation of said individual tagged nucleotide, wherein said tag is detected with the aid of said nanopore while said nucleotide is associated with said polymerase.
10. A chip for sequencing a nucleic acid sample, said chip comprising: a plurality of a variant OmpG nanopores according to any one of claims 1-1 1 , an OmpG nanopore of said plurality being disposed adjacent or in proximity to an electrode, wherein said nanopore is individually addressable and has a single polymerase attached to said nanopore; and wherein an individual nanopore detects a tag associated with a tagged nucleotide during incorporation of the nucleotide into a growing nucleic acid chain by said polymerase.
1 1 . The isolated OmpG variant of any one of Claims 1 -1 1 , wherein said variant comprises the linker-His-SpyTag construct of SEQ ID NO:16.
12. An isolated OmpG variant of the parental OmpG of SEQ ID NO:2 or homolog thereof, wherein said variant comprises (i) a deletion of one or more of amino acids 216-227, or (ii) amino acid substitution E229A, or (iii) a mutation of one or more of amino acids R21 1 , E15, R68, Y50, E152, E174, E17, D215, Y259, K1 14, E174, F66, E31 , and wherein said variant retains the ability to form a nanopore.
13. An isolated OmpG variant of the parental E. coli OmpG of SEQ ID NO:2 or homolog thereof, wherein said variant comprises one or more C-terminal β- strand(s) of the parental OmpG moved to the N-terminus of the protein sequence, and wherein said variant retains the ability to form a nanopore.
14. A method for sequencing a nucleic acid sample with the aid of a variant OmpG nanopore in a membrane adjacent to a sensing electrode, said method comprising:
(a) providing tagged nucleotides into a reaction chamber comprising said nanopore, wherein an individual tagged nucleotide of said tagged nucleotides comprises a tag coupled to a nucleotide, which tag is detectable with the aid of said nanopore;
(b) carrying out a polymerization reaction with the aid of a single polymerase coupled to said variant OmpG nanopore, thereby incorporating an individual tagged nucleotide of said tagged nucleotides into a growing strand complementary to a single stranded nucleic acid molecule from said nucleic acid sample; and
(c) detecting, with the aid of a variant OmpG nanopore of any one of claims 27- 32, a tag associated with said individual tagged nucleotide during incorporation of said individual tagged nucleotide, wherein said tag is detected with the aid of said nanopore while said nucleotide is associated with said polymerase.
15. A chip for sequencing a nucleic acid sample, said chip comprising: a plurality of a variant OmpG nanopores according to any one of claims 27-32, an OmpG nanopore of said plurality being disposed adjacent or in proximity to an electrode, wherein said nanopore is individually addressable and has a single polymerase attached to said nanopore; and wherein an individual nanopore detects a tag associated with a tagged nucleotide during incorporation of the nucleotide into a growing nucleic acid chain by said polymerase.
PCT/EP2016/072224 2015-09-22 2016-09-20 Ompg variants WO2017050722A1 (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
CA2998970A CA2998970C (en) 2015-09-22 2016-09-20 Ompg variants
JP2018514905A JP6956709B2 (en) 2015-09-22 2016-09-20 OMPG variant
AU2016326867A AU2016326867B2 (en) 2015-09-22 2016-09-20 OmpG variants
US15/762,092 US10752658B2 (en) 2015-09-22 2016-09-20 OmpG variants
EP16775518.0A EP3353194B1 (en) 2015-09-22 2016-09-20 Ompg variants
CN201680054910.5A CN108449941B (en) 2015-09-22 2016-09-20 OmpG variants
US16/925,848 US11767348B2 (en) 2015-09-22 2020-07-10 OmpG variants
US18/449,904 US20240010688A1 (en) 2015-09-22 2023-08-15 Ompg variants

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201562222197P 2015-09-22 2015-09-22
US62/222,197 2015-09-22
US201662333672P 2016-05-09 2016-05-09
US62/333,672 2016-05-09

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US15/762,092 A-371-Of-International US10752658B2 (en) 2015-09-22 2016-09-20 OmpG variants
US16/925,848 Continuation US11767348B2 (en) 2015-09-22 2020-07-10 OmpG variants

Publications (1)

Publication Number Publication Date
WO2017050722A1 true WO2017050722A1 (en) 2017-03-30

Family

ID=57068053

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2016/072224 WO2017050722A1 (en) 2015-09-22 2016-09-20 Ompg variants

Country Status (7)

Country Link
US (3) US10752658B2 (en)
EP (1) EP3353194B1 (en)
JP (1) JP6956709B2 (en)
CN (1) CN108449941B (en)
AU (1) AU2016326867B2 (en)
CA (1) CA2998970C (en)
WO (1) WO2017050722A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022033998A1 (en) 2020-08-11 2022-02-17 F. Hoffmann-La Roche Ag Nucleoside-5'-oligophosphates tagged with positively-charged polymers, nanopores incorporating negative charges, and methods and systems using the same
WO2022263489A1 (en) 2021-06-17 2022-12-22 F. Hoffmann-La Roche Ag Nucleoside-5 -oligophosphates having a cationically-modified nucleobase
WO2022263496A1 (en) 2021-06-17 2022-12-22 F. Hoffmann-La Roche Ag Engineered nanopore with a negatively charged polymer threaded through the channel

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2016326867B2 (en) * 2015-09-22 2018-09-13 F. Hoffmann-La Roche Ag OmpG variants
KR101838687B1 (en) * 2016-01-26 2018-03-15 한국생명공학연구원 A screening method of protein-protein interaction inhibitors using a nanopore
CN112480204A (en) * 2020-04-13 2021-03-12 南京大学 Protein/polypeptide sequencing method adopting Aerolysin nanopores

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4683202A (en) 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US4760025A (en) 1984-05-29 1988-07-26 Genencor, Inc. Modified enzymes and methods for making same
WO1995022625A1 (en) 1994-02-17 1995-08-24 Affymax Technologies N.V. Dna mutagenesis by random fragmentation and reassembly
WO1996000343A1 (en) 1994-06-24 1996-01-04 Audi Ag Method of controlling the electric heating of a catalytic converter
WO2006028508A2 (en) 2004-03-23 2006-03-16 President And Fellows Of Harvard College Methods and apparatus for characterizing polynucleotides
WO2011097028A1 (en) 2010-02-08 2011-08-11 Genia Technologies, Inc. Systems and methods for manipulating a molecule in a nanopore
WO2012083249A2 (en) 2010-12-17 2012-06-21 The Trustees Of Columbia University In The City Of New York Dna sequencing by synthesis using modified nucleotides and nanopore detection
US20140013616A1 (en) 2012-07-13 2014-01-16 Samsung Electronics Co., Ltd. Water level sensing device and clothing dryer including the same
WO2014074727A1 (en) 2012-11-09 2014-05-15 Genia Technologies, Inc. Nucleic acid sequencing using tags
US20150080242A1 (en) * 2013-07-22 2015-03-19 University Of Massachusetts Nanopore sensors and uses thereof
WO2015061509A1 (en) 2013-10-23 2015-04-30 Genia Technologies, Inc. High speed molecular sensing with nanopores

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009013115A2 (en) * 2007-07-20 2009-01-29 Basf Se Vesicles comprising a transmembrane transport trigger system
AU2014335915B2 (en) * 2013-10-18 2020-12-17 Oxford Nanopore Technologies Limited Modified helicases
US9322062B2 (en) 2013-10-23 2016-04-26 Genia Technologies, Inc. Process for biosensor well formation
US10359395B2 (en) * 2014-02-19 2019-07-23 University Of Washington Nanopore-based analysis of protein characteristics
AU2016326867B2 (en) * 2015-09-22 2018-09-13 F. Hoffmann-La Roche Ag OmpG variants

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4760025A (en) 1984-05-29 1988-07-26 Genencor, Inc. Modified enzymes and methods for making same
US4683202A (en) 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US4683202B1 (en) 1985-03-28 1990-11-27 Cetus Corp
WO1995022625A1 (en) 1994-02-17 1995-08-24 Affymax Technologies N.V. Dna mutagenesis by random fragmentation and reassembly
WO1996000343A1 (en) 1994-06-24 1996-01-04 Audi Ag Method of controlling the electric heating of a catalytic converter
WO2006028508A2 (en) 2004-03-23 2006-03-16 President And Fellows Of Harvard College Methods and apparatus for characterizing polynucleotides
WO2011097028A1 (en) 2010-02-08 2011-08-11 Genia Technologies, Inc. Systems and methods for manipulating a molecule in a nanopore
WO2012083249A2 (en) 2010-12-17 2012-06-21 The Trustees Of Columbia University In The City Of New York Dna sequencing by synthesis using modified nucleotides and nanopore detection
US20140013616A1 (en) 2012-07-13 2014-01-16 Samsung Electronics Co., Ltd. Water level sensing device and clothing dryer including the same
WO2014074727A1 (en) 2012-11-09 2014-05-15 Genia Technologies, Inc. Nucleic acid sequencing using tags
US20150080242A1 (en) * 2013-07-22 2015-03-19 University Of Massachusetts Nanopore sensors and uses thereof
WO2015061509A1 (en) 2013-10-23 2015-04-30 Genia Technologies, Inc. High speed molecular sensing with nanopores

Non-Patent Citations (22)

* Cited by examiner, † Cited by third party
Title
"UPI0004E5BCD2 Escherichia coli (strain K12)", 8 August 2014 (2014-08-08), XP055320050, Retrieved from the Internet <URL:Database Entry: UNIPARC> [retrieved on 20161116] *
ANBAZHAGAN ET AL.: "Incorporation of Outer Membrane Protein OmpG in Lipid Membranes: Protein-lipid Interactions and ?-Barrei Orientation", BIOCHEMISTRY, vol. 47, 2008, pages 6189 - 698
ASTIER, J AM CHEM SOC, 30 December 2005 (2005-12-30)
CHEN ET AL.: "Outer membrane protein G: Engineering a quiet pore for biosensing", PROC NATL ACAD SCI, vol. 105, 2008, pages 6272 - 6277, XP002550990, DOI: doi:10.1073/pnas.0711561105
CHEN M ET AL: "Outer membrane protein G: Engineering a quiet pore for biosensing", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, NATIONAL ACADEMY OF SCIENCES, US, vol. 105, no. 17, 29 April 2008 (2008-04-29), pages 6272 - 6277, XP002550990, ISSN: 0027-8424, DOI: 10.1073/PNAS.0711561105 *
CONLAN; BAYLEY: "Folding of a Monomeric Porin, OmpG, in Detergent Solution", BIOCHEMISTRY, vol. 42, 2003, pages 9453 - 9465, XP002454012, DOI: doi:10.1021/bi0344228
FAHIE ET AL.: "Resolved Single-Molecule Detection of Individual Species within a Mixture of anti-Biotin Antibodies using an Engineered Monomeric Nanopore", ACS NANO, vol. 9, 2015, pages 1089 - 1098
GROSSE ET AL., BIOCHEMISTRY, vol. 53, 2014, pages 4826 - 4838
GROSSE ET AL.: "Structural and functional characterization of a synthetically modified OmpG", BIOORGANIC AND MEDICINAL CHEM, vol. 18, 2010, pages 7716 - 7723, XP027452429, DOI: doi:10.1016/j.bmc.2010.03.044
GROSSE ET AL.: "Structure-based Engineering of a Minimal Porin Reveals Loop-Independent Channel closure", BIOCHEMISTRY, vol. 53, 2014, pages 4826 - 4838
HORHOTA ET AL., ORGANIC LETTERS, vol. 8, 2006, pages 5345 - 5347
KORKMAZ-OZKAN F ET AL: "Correlation between the OmpG Secondary Structure and Its pH-Dependent Alterations Monitored by FTIR", JOURNAL OF MOLECULAR BIOLOGY, ACADEMIC PRESS, UNITED KINGDOM, vol. 401, no. 1, 6 August 2010 (2010-08-06), pages 56 - 67, XP027143127, ISSN: 0022-2836, [retrieved on 20100616], DOI: 10.1016/J.JMB.2010.06.015 *
LI ET AL., J MOL BIOL, vol. 426, no. 2, 23 December 2013 (2013-12-23), pages 309 - 317
MARIA FYTA: "Threading DNA through nanopores for biosensing applications", JOURNAL OF PHYSICS: CONDENSED MATTER, INSTITUTE OF PHYSICS PUBLISHING, BRISTOL, GB, vol. 27, no. 27, 10 June 2015 (2015-06-10), pages 273101, XP020287187, ISSN: 0953-8984, [retrieved on 20150610], DOI: 10.1088/0953-8984/27/27/273101 *
MATTHES ET AL., EMBO J., vol. 3, no. 4, 1984, pages 801 - 805
MORINAGA ET AL., NATURE BIOTECHNOLOGY, vol. 2, 1984, pages 636 - 639
R. K. SAIKI ET AL., SCIENCE, vol. 239, no. 4839, 1988, pages 489 - 491
S. L. BEAUCAGE; M. H. CARUTHERS, TETRAHEDRON LETTERS, vol. 22, 1981, pages 1859 - 1862
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 2012, COLD SPRING HARBOR
SUBBARAO; VAN DEN BERG, J MOL BIOL, vol. 360, 2006, pages 750 - 759
SUBBARAO; VAN DEN BERG: "Crystal Structure of the monomeric Porin OmpG", J MOL BIOL, vol. 360, 2006, pages 750 - 759, XP024951220, DOI: doi:10.1016/j.jmb.2006.05.045
WATSON, J. D. ET AL.: "Molecular Biology of the Gene", 1987, W. A. BENJAMIN, INC.

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022033998A1 (en) 2020-08-11 2022-02-17 F. Hoffmann-La Roche Ag Nucleoside-5'-oligophosphates tagged with positively-charged polymers, nanopores incorporating negative charges, and methods and systems using the same
WO2022263489A1 (en) 2021-06-17 2022-12-22 F. Hoffmann-La Roche Ag Nucleoside-5 -oligophosphates having a cationically-modified nucleobase
WO2022263496A1 (en) 2021-06-17 2022-12-22 F. Hoffmann-La Roche Ag Engineered nanopore with a negatively charged polymer threaded through the channel

Also Published As

Publication number Publication date
EP3353194B1 (en) 2023-08-30
EP3353194A1 (en) 2018-08-01
CN108449941A (en) 2018-08-24
AU2016326867A1 (en) 2018-03-08
CA2998970C (en) 2021-07-13
US20240010688A1 (en) 2024-01-11
US20180362594A1 (en) 2018-12-20
US11767348B2 (en) 2023-09-26
US10752658B2 (en) 2020-08-25
JP6956709B2 (en) 2021-11-02
AU2016326867B2 (en) 2018-09-13
CN108449941B (en) 2022-03-11
JP2018531002A (en) 2018-10-25
US20200392191A1 (en) 2020-12-17
CA2998970A1 (en) 2017-03-30

Similar Documents

Publication Publication Date Title
US11767348B2 (en) OmpG variants
JP6799072B2 (en) Mutant pore
JP6608944B2 (en) Alpha-hemolysin variants with altered characteristics
US20200190572A1 (en) Polymerase variants
JP6169976B2 (en) Mutant pore
CN107002151B (en) Method of delivering an analyte to a transmembrane pore
JP2018531002A6 (en) OMPG variant
US10752948B2 (en) Alpha-hemolysin variants
US20200216887A1 (en) Nanopore sequencing complexes
CN114605507A (en) Mutant CSGG pore
EP3423576B1 (en) Polymerase variants
JP2024012307A (en) Modified nanopores, compositions containing the same, and uses thereof
JP7157164B2 (en) Alpha-hemolysin variants and their uses
CN112119033A (en) Nanopore sensor from bacteriophage

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16775518

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2016326867

Country of ref document: AU

Date of ref document: 20160920

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2998970

Country of ref document: CA

ENP Entry into the national phase

Ref document number: 2018514905

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE