Plaque It!
|
[0002] Gene therapy can be defined as the transfer of genetic material into a cell or an organism. Gene therapy was originally conceived of as a specific gene replacement therapy for correction of heritable diseases by delivering functionally active therapeutic genes into the affected cells. The first protocol applied to man was initiated in the USA in September 1990 on a patient suffering from adenine deaminase (ADA) deficiency. This first encouraging experiment has been followed by numerous new applications and promising clinical trials are currently ongoing (see for example clinical trials listed at http://cnetdb.nci.nih.gov/trialsrch.shtml or http://www.wiley.co.uk/genetherapy/clinical/ and Mountain et al., 2000, Tibtech 18, 119-128). Beneficial therapeutic effects have been reported during the last two years for a number of them (Isner and Asahara, 1999, J. Clin. Invest. 103, 1231-1236; Kay et al., 2000, Nature Genetics 24, 257-261; Cavazzana-Calvo et al., 2000, Science 288, 669-672; Khuri et al., 2000 Nature Medicine 6, 879-885).
[0003] Successful gene therapy principally depends on the efficient delivery of the herapeutic gene to the cells of a living organism and the expression of the genetic information. Functional genes can be introduced into cells by a variety of techniques resulting in either transient expression or permanent transformation of the host cells with incorporation of said genes into the host genome. The majority of the gene therapy protocols employs viral or synthetic (non-viral) vectors but naked nucleic acids (i.e. plasmid DNA) can also be used for carrying the genes of interest into target cells (Wolff et al., 1990, Science 247, 1465-1468).
[0004] Viruses have developed diverse and highly sophisticated mechanisms to achieve transport across the cellular membrane, to escape from lysosomal degradation, for delivery of their genome to the nucleus and, consequently, have been used as vectors in many gene delivery applications. While those derived from retroviruses, adeno-associated viruses (AAV) and adenoviruses have been extensively used (for reviews, see Crystal, 1995, Science 270, 404-410; Kovesdi et al., 1997, Curr. Opinion Biotechnol 8, 583-589 Miller, 1997, Human Gene Ther. 8, 803-815), other viral vectors such as poxvirus-derived vectors, are emerging as promising candidates for in vivo gene transfer. Synthetic vectors refer to a special combination of nucleic acids (e.g. plasmid DNA) with one or more transfection-facilitating agent(s), such as lipids (DNA-lipoplex or liposomes) or polymers (DNA-polyplex) which facilitate cellular uptake of the vector. Various lipid- and/or polymer-based vectors are currently available (see for example Rolland, 1998, Critical Reviews in Therapeutic Drug Carrier Systems 15, 143-198; Wagner et al., 1990, Proc. Natl. Acad. Sci. USA 87, 3410-3414 and Gottschalk et al., 1996, Gene Ther. 3, 448-457). Although less efficient than viral vectors, synthetic vectors present potential advantages with respect to large-scale production, safety, low immunogenicity and cloning capacity (Ledley, 1995, Human Gene Ther. 6, 1129-1144).
[0005] The direct intramuscular injection of plasmid DNA promises to be an effective way for carrying out gene therapy for muscle-associated diseases which account for a large proportion of the morbidity and mortality, such as cardiovascular diseases (e.g. myocardial infraction, atherosclerosis, restenosis, ischemia) and diseases affecting skeletal muscles (e.g. muscular dystrophies, sclerosis). Moreover, muscle can be used as an in vivo expression system for disorders that involve the gene product being secreted into the bloodstream (Dai et al., 1995, Proc. Natl. Acad. Sci. USA 92, 1401-1405). Another interesting application of this technology is DNA vaccination, especially to induce humoral responses against various pathogens (i.e. bacteria, viruses, parasitic and mycoplasmic organisms) (Barry et al., 1995, Nature 377, 632-635; Davis et al., 1996, Vaccine 14, 910-915).
[0006] Following injection of gene expressing plasmids into skeletal muscles, long term expression and persistence of gene product (e.g. several months) in rodents and primates has been reported, but this observation seems related to the fact that the particular foreign protein was not very immunogenic to the injected host (Wolff et al., 1992, Hum. Mol. Genet. 1, 363-369). However, in a number of cases, persistence of the foreign gene product in skeletal muscle fibers can be limited by the development of host immune responses (cellular, humoral and/or innate) (Davis et al., 1997, Gene Ther. 4, 181-188). The presence of a β-galactosidase-specific CTL response was indeed observed by Wells et al (1997, FEBS letter 407, 164-168) in animals injected with a β-galactosidase-expressing plasmid, consistent with the decrease of enzyme levels and destruction of the gene-expressing fibers. There was no evidence of similar destruction in adjacent non-transfected muscle fibers. Moreover, Ferrer et al. (2000, Gene Ther. 7, 1439-1446) reported destruction of dystrophin-producing fibers after intramuscular injection of an expression plasmid. The presence of CD8+ T cells at proximity of the dystrophin-positive fibers strongly suggests a cell-mediated immune response through cytotoxic CD8+ cells. The preservation of gene-expressing fibers which is observed in immunodeficient (SCID) mice supports that the drop of dystrophin levels in immunocompetent mice is due to the development of an immune response rather than a loss of gene expression or a cytotoxic effect of the dystrophin protein.
[0007] The mechanisms underlying the induction of immune responses after DNA injection in muscle are only partially understood. It is likely that dentritic cells or macrophages present in the muscle are directly transfected with the plasmid DNA or/and capture antigens released by muscle. It was shown after intra-muscular administration of plasmid DNA, that transfected macrophages or dentritic cells displaying activation markers and T cells costimulatory ligands were present in the peripheral compartment as well as lymph nodes of mice (Chattergon et al., 1998, J. Immunol. 160, 5707-5718). In the case of viral vectors, it was also shown that transduction of dentritic cells was important for the efficient induction of a cellular immune response to muscle cells producing the foreign antigen (Jooss et al., 1998, J. Virol. 72, 4212-4223). If professional antigen-presenting cells are not directly transfected, they may receive the foreign gene product released from damaged or leaky transfected fibers and present peptides on MHC1 via a “cross-priming pathway” (Ulmer et al., 1996, Immunology 89, 59-67).
[0008] Beside the undesired immune responses, inflammatory events were also reported following administration of viral vectors or nacked DNA (see for example Kliman et al., 1997, J. Immunol. 158, 3635-3639).
[0009] Modification of DNA sequences by removing specific “proinflammatory elements” has been envisaged but this approach is difficult to achieve insofar as a large number of alterations might be required and some should interfere with gene expression. Moreover, this strategy gave rise to moderate improvement until now (Yew et al., 2000, Mol. Ther. 1, 255-262).
[0010] The present invention discloses that a nucleic acid fragment, which in the natural context controls the expression of a desmin gene and is present in the 5′ flanking region upstream of the promoter and enhancer of said desmin gene, contributes to improve persistence of transfected cells expressing a gene of interest in a host cell or organism. Such an improvement is likely to be due to a reduction of the host's immune responses, especially a cellular immune response, towards the gene product or the expressing cells. Desmin is a muscle-specific member of the intermediate filament protein family, which is produced at early stages of myogenesis, such as in replicating myoblasts and satellites cells and at high levels in differentiated myotubes (Li et al., 1993, Development 117, 947-959). It is encoded by a single gene (Li et al., 1989, Gene 78, 243-254). Functional analysis of the proximal upstream region of the human (Li and Paulin, 1991, J. Biol. Chem. 266, 6562-6570) and mouse (Li and Capetanaki, 1993, Nucleic Acids Res. 21, 335-343) desmin genes has revealed a common enhancer element (between −693 and −973 in human gene and −578 to −976 in mouse) which appears to be at least in part responsible for the muscle-specific expression of these genes. This enhancer region contains a combination of “muscle-specific” cis-acting sequences recognized by myogenic transcription factors (MEF-2 and E-box) as well as cis-acting sequences which bind ubiquitous transcriptional factors (Mt, GC-rich region with potential binding sites for SP1 and Krox-20) (Li and Paulin, 1991, J. Biol. Chem. 266, 6562-6570; Li et al., 1993, Development 117, 947-959; Li and Capetanaki, 1993, Nucleic Acids Res. 21, 335-343; Li and Capetanaki, 1994, EMBO J. 13, 3580-3589). The proximal 1 kb upstream desmin region which encompasses promoter/enhancer elements has been used to drive reporter genes in transgenic mice. The results show that the human desmin promoter/enhancer is only capable of conferring myotomal and skeletal muscle expression in embryos (Li et al., 1993, Development 117, 947-959). In addition, the transgene expression is completely silenced even in skeletal muscles after 15 days postpartum (Lescaudron et al., 1993, Neuromusc. Disord. 3, 419-422).
[0011] Additional regulatory elements were also identified upstream of the desmin promoter/enhancer region. EP 999 278 discloses the presence between positions −4000 to −2500 relative to the transcriptional initiation site of the mouse desmin gene of cis-acting sequences controlling gene expression specifically in arterial smooth muscle cells. On the other hand, Raguz et al. (1998, Developmental Biology 201, 26-42) generated transgenic mice harbouring a 240 kb genomic clone spanning the human desmin locus (entire desmin gene with 220 and 10 kb of upstream and downstream sequences respectively). A reproductible physiological level of human desmin gene expression was observed in all three types of muscles (skeletal, cardiac and smooth) in adult animals. However, in mouse embryos, human desmin gene expression from “the 240 kb transgene” is markedly delayed compared to the endogenous (murine) gene, especially within cardiac and smooth muscle tissues. The variance in the timing of expression between human and mouse desmin genes is in all probability due to differences in the regulatory elements of these two genes which allows the murine gene to be activated at much earlier stages of development. However, Raguz et al. did not identify the genetic elements within this 220 kb 5′ flanking region of the human desmin gene which are assumed to confer muscle-specific expression in adult transgenic mice.
[0012] Altogether, these studies make clear that there is still a need in the art to to identify DNA elements that are critical to achieve significant and sustained expression of genes while avoiding or minimising the induction of detrimental immune responses against the gene product and/or the transfected cells.
[0013] This technical problem is solved by the provision of the embodiments as defined in the claims.
[0014] Accordingly, the present invention provides the use of a nucleic acid fragment comprising a portion of at least 100 contigous nucleotide bases which portion has a sequence the same as, or homologous to a portion of corresponding length of the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 1 and ending at nucleotide approximately 16879 or the same as, or homologous to a portion of the corresponding length of the sequence complementary to the sequence set out in SEQ ID NO:1 starting at nucleotide approximately 1 and ending at nucleotide approximately 16879, for improving the persistence of transfected cells expressing one or more gene(s) of interest associated to said nucleic acid fragment.
[0015] Within the context of the present invention, the term “nucleic acid” and “polynucleotide” are used interchangeably and define a polymeric form of any length of nucleotides, either deoxyribonucleotides (DNA) or ribonucleotides (RNA) or analogs thereof. A polynucleotide may also comprise modified nucleotides, such as methylated nucleotides or nucleotide analogs (see U.S. Pat. No. 5,525,711, U.S. Pat. No. 4,711,955 or EPA 302 175 as examples of modifications). If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may also be interrupted by non-nucleotide elements. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.
[0016] The term “fragment” is intended to include restriction endonuclease-generated and PCR-generated nucleic acid molecules that can be obtained from the sequence as set out is SEQ ID NO:1 or from existing nucleic acid sources comprising that particular sequence (e.g. from genomic DNA present in naturally-occuring DNA upstream of a desmin gene and especially between approximately position −50 kb to approximately −1 kb relative to the transcription initiation site of said desmin gene) or from fragment thereof or from homologous sequence thereof. The present invention also encompasses synthetic fragments (e.g. produced by oligonucleotide synthesis).
[0017] The nucleic acid fragment may be single or double stranded, linear or circular. Single stranded fragment may be “plus” strands having a sequence the same as the sequence as set out in SEQ ID NO:1 within the specified nucleotide (nt) positions or a part thereof of at least 100 contigous nucleotide bases or a sequence homologous thereto. Alternatively, the single-stranded fragment may be “minus” strands having a sequence complementary to the sequence as set out in SEQ ID NO:1 within the specified nt positions or a part thereof of at least 100 contigous nucleotide bases or a sequence homologous thereto. Double-stranded fragments contain a complementary pair of strands (e.g. one plus strand and one minus strand). RNA fragments used in the present invention will, of course, contain uridyl acid (U) residues in place of the deoxythymidylic acid residues (T) of the template strand set out in SEQ ID NO:1 within the specified nt positions or, if complementary to the template sequence, they will contain U residues in positions complementary to the adenylic acid (A) residues in the sequence set out in SEQ ID NO:1 within the specified nt positions. Preferably, the nucleic acid fragment used in the present invention is a double-stranded DNA fragment. Advantageously, it contains a portion of at least 500 contigous nucleotide base pairs (bp), advantageously at least 1000 bp, preferably at least 2000 bp, more preferably at least 5000 bp, and yet more preferably at least 10 000 bp having a sequence the same as or homologous to the sequence as set out in SEQ ID NO:1 within the specified nt positions.
[0018] The term “homologous” as used herein refers to a pourcentage of homology of, at least 70%, preferably at least 80%, more preferably at least 90%, and still more preferably at least 95% with the corresponding portion of the sequence illustrated in SEQ ID NO:1. Nevertheless, absolute identity (100% homologous) is preferred. A nucleic acid having a sequence approximately (or essentially) the same as the specified sequence or fragment thereof is also encompassed by the present invention.
[0019] To determine the homology of the sequence of the acid nucleic fragment in use in the present invention with respect to the corresponding portion of the sequence set out in SEQ ID NO:1, both sequences are aligned so as to obtain a maximum match using gaps and inserts. Two sequences are said to be <<identical >> if the sequence of residues is the same when aligned for maximum correspondence as described below. Optimal alignment of sequences for comparison can be conducted by the local homology algorithm of Smith and Waterman (1981, Adv. Appl. Math. 2, 482-489), by the homology alignment method of Needlemen and Wunsch (1970, J. Mol. Biol. 48, 443-453), by the search for similarity method of Pearson and Lippman (1988, Proc. Natl. Acad. Sci. USA 85, 2444-2448) or the like. Computer implementations of the above algorithms are known as part of the Genetics Computer Group (GCG) Wisconsin genetics Software Package (GAP, BESTFIT, BLASTA, FASTA, TFASTA and FASTDB, Madison, Wis.). These programs are preferably run using default value for all parameters. A preferred method for determining the best overall match between the aligned sequences, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. (1990, Comp. App. Biosci. 6, 237-245).
[0020] Percentage of sequence homology (or identity) is determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the sequence in the comparison window may comprise additions or deletions (gaps) as compared to the reference sequence for optimal alignment of the two sequences being compared. The percentage of homology is calculated by determining the number of positions at which the identical residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the comparison window and multiplying the result by 100 to yield the percentage of sequence identity. For example, if 8 of 10 of the positions in the two sequences are the same, then they are 80% homologous or have 80% sequence identity. Total identity is then determined as the average identity over all the windows that cover the complete query sequence.
[0021] The term “approximately” refers to a variation of 0 to 12 nucleotide(s) with respect to the specified position. For example, if the specified position is located within a restriction site, it is possible to modify such a restriction site by routine molecular biology techniques (e.g. digestion by nuclease activities to fill in overhang extremities) which results in a shift of the specified position to some nucleotides in the 5′ or 3′ direction.
[0022] The nucleic acid fragment is used in the context of the invention to improve the persistence of cells expressing the gene of interest, and more particularly for the preparation of transfected cells expressing the gene of interest. The improvement of persistence of transfected cells conferred by the nucleic acid fragment can be easily determined by routine experimentation. In particular, persistence of transfected cells is correlated to the persistence of expression of said gene of interest in the host cell or organism, whatever the nature of the gene product which can be produced intracellularly, anchored at the surface of the host cell or secreted outside the cell (e.g. in the body fluid). Such an improvement can be evaluated by cloning said nucleic acid fragment upstream of a gene of interest (e.g. a reporter gene encoding for example the bacterial enzyme chloramphenicol acetyltransferase (CAT), luciferase or eGFP) in the presence of appropriate transcriptional and/or translational control sequences (e.g. a promoter and, optionally, an enhancer), by introducing the resulting construct in an appropriate vector (e.g. a plasmid vector) and by evaluating how long is the gene product produced in the host cell, preferably in vivo (in transgenic animals or by direct administration to animal models). Examples of such gene expression analysis is provided in the Example section of the present specification, however other methods well known to those skilled in the art are also usable in the context of the invention, such as flow cytometry, ELISA, immunofluorescence, Western blotting, biological activity measurement and the like. Improvement of persistence of transfected cells is established when the product of the gene of interest is detected for a longer period of time than with a conventional construct devoid of the nucleic acid fragment under the same experimental conditions, and especially when production (whatever the level) of the gene product is stably retained for more than one month period.
[0023] Preferably, such a persistence of transfected cells expressing the gene of interest is obtained by minimising or reducing the induction and/or development of a host immune response against the product of said gene(s) of interest or the transfected cell expressing said gene(s) of interest. By way of illustration, a reduction of the host's immune response can be correlated to for example a reduction of the inflammation status in the host organism (which can be evaluated by observation of cell morphology especially a close proximity of the injected site) and/or a reduction of cell infiltration in the expressing tissues (especially CD4+ and CD8+ cells, i.e. by immunohistology) and/or a reduction of cytokine production following vector administration (such as TNF (Tumor Necrosis factor) alpha, IFN (interferon) gamma, IL (interleukin>6 and IL-12).
[0024] The term “associated” as used herein refers to a functional juxtaposition permitting the nucleic acid fragment to exert its optimal effect on the persistence of the expressing cells or persistence of the said gene-product(s) of interest in the transfected host cell or organism. For this purpose, the nucleic acid fragment can be inserted on either side of the gene(s) of interest, whatever its orientation (genomic or reverse genomic orientation with respect to the transcription direction). Preferably, the nucleic acid fragment is positioned upstream of the gene of interest in the genomic orientation. Moreover, there may be additional residues (e.g. one or more control sequence(s) such as a promoter and/or enhancer) between the nucleic acid fragment and the gene of interest so long as this functional relationship is preserved.
[0025] The sequence as set out in SEQ ID NO:1 within the specified nt positions originates from the upstream region of the human desmin gene, between positions approximately −18662 to approximately −1784 relative to the transcription initiation site (representing position +1). This particular sequence may include regulatory elements controlling in the natural context expression of that gene. In this respect, the nucleic acid fragment in use in the present invention can comprise one or more DnaseI hypersensitive sites. The DnaseI hypersensitive sites reflect the binding of proteins to the nucleic acid. These proteins may be transcription factors. According to this embodiment, 4 prominent muscle-specific DnaseI hypersensitive sites have been identified respectively at positions approximately −15.1 kb, −13.8 to −13.2 kb, −11.8 kb and −10.2 kb relative to the transcriptional start site of the human desmin gene. A weaker 5
[0026] Of course, the nucleic acid fragment in use in the present invention may include additional sequences, preferably sequences of the human desmin gene extending either in the 5′ or 3′ direction or both in the 5′ and 3′ directions with respect to the portion of the desmin gene specified in SEQ ID NO:1. According to this embodiment, the nucleic acid fragment used in the context of the present invention can extend in the 5′ direction advantageously up to approximately −50 kb, preferably up to approximately 40 kb, more preferably up to approximately −30 kb, and still more preferably up to approximately −20 kb upstream of the transcription initiation site of the desmin gene. Additional desmin sequences can extend in the 3′ direction up to approximately −1.4 kb, or preferably up to approximately −1 kb (e.g. up to position approximately −974 which corresponds to the 5′ end of the enhancer sequence) relative to the transcription initiation site of the desmin gene and can also encompasse the promoter or the enhancer or non-coding exonic sequence or any combination thereof.
[0027] As mentioned previously, the present invention also encompasses the use of a nucleic acid fragment homologous to the sequence set out in SEQ ID NO:1 or a portion thereof of at least 100 contigous nucleotides. According to this embodiment, a nucleic acid fragment having an “homologous” sequence exhibits one or more variation(s) compared to the corresponding portion of the sequence set out in SEQ ID NO:1 within the confines of appropriate levels of sequence homology. The term “variation” as used herein includes addition, deletion and/or substitution of one or more nucleotide(s). Such variation(s) can be obtained by recombinant techniques (e.g. site-directed mutagenesis). Alternatively, the nucleic acid fragment used in the present invention can be isolated or obtained (derived) from the usptream region of a desmin gene of a different species or subspecies which region may possess sequence polymorphisms that render its sequence substantially the same as, but not identical to, the sequence of the human desmin gene set forth herein. According to this advantageous embodiment, the nucleic acid fragment in use in the present invention can be cloned or obtained from the genomic portion of a non-human desmin gene, which genomic portion corresponds to positions approximately −18662 to approximately −1784 of the human desmin gene. Suitable desmin genes include those of any vertebrate, and preferably any mammalian species, including illustratively the following species, primates (e.g. simian), ruminants (e.g. bovine, ovine), fowls (e.g. chicken), avians, felines, horses, canines, swines (e.g. porcine), rodents (e.g. rat, mouse, hamster . . . ). For example a detailed comparative sequence analysis of the 5′ flanking sequences of the human and an animal (e.g. mouse) desmin genes can be conducted in an attempt to discover regions of high homology which may correspond to conserved transcriptional regulatory elements, such as those corresponding to the four major Dnase hypersensitive sites that have been indentified between −15.1 and −10.2 kb relative to the transcriptional start site of the human desmin gene. The initiation site of transcription of various desmin genes are known from the available literature but may otherwise be determined by standard techniques such as S
[0028] Preferably, the present invention is intended to encompass portion or homologous nucleic acid fragments (as defined above) which essentially preserve the overall function conferred by the “native” nucleic acid fragment (e.g. displaying the sequence as set out in SEQ ID NO:1), in terms of improving the persistence of transfected cells expressing an associated gene. One skilled in the art would be able to determine whether a particular portion or homologous nucleic acid fragment is active by linking it to a reporter gene placed under the control of appropriate control sequences (e.g. a promoter and, optionally an enhancer), to generate a construct, introducing the construct into a host cell or organism and measuring persistence of gene product relative to the result obtained when controlled by the non-modified nucleic acid fragment under the same experimental conditions. The functionality is preserved when the particular portion/homologue is capable of conferring persistence of gene product in a given host cell to the same extend (at least 80%) as the native sequence, whatever the level of gene product.
[0029] Advantageous nucleic acid fragments for use in the context of the present invention comprise a sequence the same as or homologous to all or part of the portion of the sequence as set out in SEQ ID NO:1:
[0030] starting at nucleotide approximately 1 and ending at nucleotide approximately 15465 or the same as, or homologous to all or part of the portion of the sequence complementary to the sequence set out in SEQ ID NO:1 starting at nucleotide approximately 1 and ending at nucleotide approximately 15465; or
[0031] starting at nucleotide approximately 7569 and ending at nucleotide approximately 10067 or the same as, or homologous to all or part of the portion of the sequence complementary to the sequence set out in SEQ ID NO:1 starting at nucleotide approximately 7569 and ending at nucleotide approximately 10067.
[0032] According to a preferred embodiment, the nucleic acid fragment in use in the present invention comprises a sequence the same as, or homologous to the portion of the sequence as set out in SEQ ID NO:1:
[0033] starting at nucleotide approximately 1 and ending at nucleotide approximately 16879,
[0034] starting at nucleotide approximately 2358 and ending at nucleotide approximately 10067, or
[0035] starting at nucleotide approximately 7569 and ending at nucleotide approximately 16879.
[0036] As an absolute preference, said nucleic acid fragment comprises a sequence the same as the sequence as set out in SEQ ID NO:1:
[0037] starting at nucleotide approximately 1 and ending at nucleotide approximately 16879,
[0038] starting at nucleotide approximately 2358 and ending at nucleotide approximately 10067, or
[0039] starting at nucleotide approximately 7569 and ending at nucleotide approximately 16879.
[0040] Other restriction fragments of the sequence as set out in SEQ ID NO:1 are also suitable in the context of the present inventon, e.g. the PstI fragment extending from nucleotide approximately 8221 to nucleotide approximately 14750, the BamHI fragment extending from nucleotide approximately 5128 to nucleotide approximately 16786, the EcoRI-XhoI fragment extending from nucleotide approximately 7569 to nucleotide approximately 16879, the BglII-XhoI fragment extending from nucleotide approximately 10383 to nucleotide approximately 16879 and the SacI fragment extending from nucleotide approximately 5366 to nucleotide approximately 11300.
[0041] According to another embodiment, the nucleic acid fragment in use in the present invention is operably linked to one or more control sequence(s) that permit expression of said gene of interest in a host cell or organism.
[0042] The term <<control sequence >> as used herein refers to any sequence that allows, contributes or modulates the functional regulation of a polynucleotide, including replication, duplication, transcription, splicing, translation, stability and/or transport of the polynucleotide or one of its derivative (i.e. mRNA) into the host cell or organism. The regulation may affect the frequency, speed and/or specificity of the process, and may be enhancing or inhibiting in nature. Apart the nucleic acid fragment as defined herein, control sequences are known in the art. Representative examples include, but are not limited to promoters, enhancers, transcriptional termination signals and elements that affect mRNA stability.
[0043] “Operably linked” refers to a juxtaposition of the control sequence(s) and the gene of interest, which are in relationship permitting them to operate in the expected manner. For instance, a promoter is operably linked to a gene of interest if the promoter allows transcription of the gene in the host cell or organism. There may be additional residues between the promoter and the gene of interest so long as this functional relationship is preserved. An enhancer is operably linked to a promoter if the enhancer enhances transcription, resulting in an enhancement of the expression of the associated gene in the host cell or organism. Preferably, the term “operably linked” as used herein also refers to the juxtaposition of a gene of interest with the control sequences controlling its transcription.
[0044] The nucleotide positions referenced in the present application for the cited promoters and enhancers are numbered relative to the presumed transcription initiation site (or cap site representing position +1) of the (native) gene concerned. By way of illustration, the first nucleotide directly upstream from the transcription initiation site is numbered −1 whereas the nucleotide following it is numbered +2. The initiation site of transcription can be determined by standard techniques such as S1 mapping or primer extension (Sambrook et al., 2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor N.Y.).
[0045] A “host cell” is a cell where expression of the gene of interest is expected. The term <<host cell >> as used herein refers to a single entity, or can be part of a larger collection of cells. Such a larger collection of cells can comprise, for instance, a cell culture (either mixed or pure), a tissue (e.g., epithelial or other tissue), an organ (e.g., heart, lung, liver, urinary bladder, muscle or other organ), an organ system (e.g., circulatory system, respiratory system, gastrointestinal system, urinary system, nervous system, integumentary system or other organ system), or an organism (e.g., a mammal, particularly a human, or the like). In the context of the present invention, the host cell is preferably a muscle cell. “Muscle” refers to any types of muscles, including skeletal, cardiac and smooth muscles. “Smooth muscles” include visceral and vascular smooth muscles and more especially arterial smooth muscle cells (SMCs), with a special preference for neointimal and medial SMCs of aorta, coronary, mammary, femoral and carotid arteries as well as of saphenous vein. <<Skeletal muscle cells>> include myoblasts, myotubes, myofibers, myofibrills and satellite cells. <<Cardiac muscle cells>> include cardiomyocytes and satellite cells. Skeletal muscles are preferred in the context of the present invention.
[0046] Advantageously, said control sequence comprises a promoter which is, preferably, obtained from a mammalian nuclear gene or is a viral promoter, with a special preference for relatively weak mammalian promoters (e.g. to the same extend as the desmin promoter).
[0047] The term <<promoter>> as used herein refers to a DNA region capable of binding a RNA polymerase under certain conditions and initiating transcription of a gene positioned downstream from said promoter. Such a promoter contains at least the cis-acting sequences sufficient to initiate transcription of the gene of interest at the proper initiation site even at low levels in a host cell or organism. Preferably, it includes at least a TATA box (consensus sequence TATAAAA) or a TATA box-like element (an AT rich sequence having a TATA box function), usually located 25-35 bp of the transcriptional start site. The promoter used in the context of the present invention may further comprise one or more additional cis-acting sequences, e.g. to substantially increase gene transcription and/or render gene expression cell type specific, tissue-specific or inducible by external signal or agents, with a special preference for muscle-specific cis-acting sequences. Representative examples include without limitation CAAT box (consensus GGCCAATCT) bound by the NF-1 factor, GC box (consensus GGGCGG) bound by the SP1 factor, octamer ATTTGCAT bound by the Oct factor, icB (consensus GGGACTTTCC) bound by NFκB, ATF (consensus GTGACGT) bound by ATF factor Ap2, Sp1, Egr1, YY1, TGT3-3, E box (CANNTG), CarG box (CC(A/T)
[0048] The promoter which may be used in accordance with the present invention can be any promoter capable of functioning in the host cell or organism. Such a promoter lies within up to approximately 500 base pairs (bp), advantageously, within up to approximately 400 bp and more preferably up to approximately 200 bp immediately upstream of the transcription initiation site of a gene locus. Optionally, the retained promoter can be modified in order to improve its transcriptional activity, delete negative sequences, modify its regulation, introduce appropriate restriction sites etc.
[0049] For example, the minimal IE CMV promoter is suitable in the context of the invention, and especially, the 119 pb immediately upstream of the transcription initiation site in the CMV genome (Boshart et al, 1985, Cell 41, 521-530).
[0050] According to a preferred embodiment, the present invention employs a muscle-specific promoter. Examples of muscle-specific promoters include those isolated from myosin, myogenin (Edmondson et al., 1992, Mol. Cell. Biol. 12, 3665-3677), desmin (European application EP 0 999 278; Mericskay et al., 1999, Current Topics in Pathology Vol 93, pp 7-17; Eds Desmouliere and Tuchweber, Springer-Verlag Berlin Heidelberg), troponin, beta-enolase (Feo et al., 1995, Mol. Cell. Biol. 15, 5991-6002), creatine kinase (Stemberg et al., 1988, Mol. Cell. Biol. 8, 2896-2909; Trask et al., 1988, J. Biol. Chem. 263, 17142-17149), and skeletal alpha-actin (Taylor et al., 1988, Genomics 3, 323-336) genes.
[0051] A particularly preferred muscle-specific promoter is the promoter isolated or obtained (derived) from a desmin gene. Any desmin gene promoter of any species can be used in the context of the present invention, and in particular of human, mouse, rat, hamster, rabbit, pig. The promoter of various desmin genes can be obtained using conventional molecular biology techniques or by PCR from appropriate genomic libraries on the basis of the sequence disclosed in the literature or specialised data banks (such as Genebank under accession numbers X53154 for the human sequence, AJ250633 for the mouse sequence and X73524 for the rat sequence). The position of the key cis-acting sequences can be determined on the basis of the published data.
[0052] A promoter isolated or obtained from the human desmin gene is particularly preferred in the context of the present invention, and especially the portion comprised within a 537 bp fragment upstream of the transcription initiation site (i.e. in SEQ ID NO: 1 from positions approximately 18122 to approximately 18663) or a functional portion thereof.
[0053] Accordingly, the present invention uses preferably a promoter obtained from the human desmin gene comprising a portion of at least 100 contigous nucleotide bases which portion has a sequence the same as, or homologous to a portion of corresponding length of the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 18122 and ending at nucleotide approximately 18663 or the same as, or homologous to a portion of the corresponding length of the sequence complementary to the sequence set out in SEQ ID NO: 1 starting at nucleotide approximately 18122 and ending at nucleotide approximately 18663.
[0054] As an absolute preference, said promoter comprises a sequence the same as the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 18122 and ending at nucleotide approximately 18663.
[0055] Optionally, the promoter may be used in tandem with another promoter and/or with one or more enhancer(s).
[0056] According to another embodiment, the control sequence(s) used in the present invention comprise(s) an enhancer.
[0057] The term <<enhancer>> as used herein refers to a nucleotide sequence to which (a) factor(s) bind(s) directly or indirectly (i.e. through interaction with another cellular factor), thereby enhancing gene expression driven by the other control sequence(s) (e.g. the promoter) used in the context of the invention.
[0058] Such an enhancement can be determined for example by comparing the expression of a reporter gene under the control of a given promoter in the presence or in the absence of the enhancer, either in vitro (e.g. in cultured host cells) or in vivo (e.g. in transgenic animals or by direct administration to animals) and under the same experimental conditions. Gene expression can be determined by standard methods, as cited above.
[0059] A large number of enhancers from a variety of different sources are well known in the art and available as or within cloned polynucleotide sequences (e.g. from depositories such as ATCC or other commercial and individual sources). Advantageously, said enhancer is isolated or obtained from a mammalian nuclear gene or is a viral enhancer. For example, a suitable enhancer useful according to this aspect of the invention includes the human Cytomegalovirus (CMV) enhancer, and especially the portion thereof extending from positions −750 to −120 relative to the transcriptional start site (Genbank accession number X03922).
[0060] According to a preferred embodiment, the enhancer used in the context of the present invention is a muscle-specific enhancer. Representative examples of muscle-specific enhancers which may be utilized in the context of the present invention include, but are not limited to those isolated or obtained from the following mammalian genes encoding:
[0061] a) alpha-actin (Shimizu et al., 1995, J. Biol. Chem. 270, 7631-7643), especially the cardiac, the skeletal or the smooth muscle alpha-actin (Genbank accession number D00618);
[0062] b) troponin, especially troponin C (Genbank accession number M37984), 1 (Genbank accession number X90780) or T (Genbank accession number AJ011712);
[0063] c) myogenin (Genbank accession number X62155);
[0064] d) myosin. Several myosin enhancers have been identified to date from both myosin light chain and myosin heavy chain genes (for example Donoghue et al., 1988, Genes and Development 2, 1779-1790). Preferred is a myosin heavy chain enhancer, more preferred one of rabbit, with a special preference for the enhancer located between positions approximately −1332 and approximately −1225 upstream of the transcription initiation site of the rabbit myosin heavy chain encoding gene (Kallmeier et al., 1995, J. Biol. Chem. 270, 30949-30957);
[0065] e) creatine kinase, especially of human (Trask et al., 1988, J. Biol. Chem., 263, 17142-17149; Genbank accession number AH003460) or mouse (Jaynes et al., 1988, Mol. Cell. Biol. 8, 62-70). A preferred muscle-specific enhancer employs preferably the sequence located between positions approximately −919 and approximately −711 upstream of the transcription initiation site of the human creatine kinase gene;
[0066] f) smoothelin (Genbank accession number AH007691);
[0067] g) calponin, with a special preference for the sequence located between positions approximately +138 and approximately +1875 within the first intron of the murine calponin gene (Miano et al., 2000, J. Biol. Chem. 275, 9814-9822);
[0068] h) beta-enolase, especially the human ENO-3 gene with a special preference for the enhancer fragment comprising the sequence extending from positions approximately +504 to approximately +637 downstream of the transcription initiation site in the first intron of the human beta-enolase gene (Feo et al., 1995, Mol. Cell Biol. 15, 5991-6002);
[0069] i) desmin.
[0070] Preferred is a muscle-specific enhancer isolated or obtained from a desmin gene and especially from the human desmin gene, with a special preference for a portion thereof comprising at least nt −973 to nt −693 relative to the transcription initiation site of the human desmin gene (Li and Paulin, 1991, J. Biol. Chem. 266, 6562-6570). Alternatively, an enhancer isolated or obtained from the mouse desmin gene is also suitable, especially a portion comprising at least nt −578 to nt −976 relative to the transcription initiation site (Li and Capetanaki, 1993, Nucleic Acids Res. 21, 335-343; Li and Capetanaki, 1994, EMBO J. 13, 3580-3589).
[0071] According to a preferred embodiment, the enhancer in use in the present invention is obtained from the human desmin gene and comprises a portion of at least 100 contigous nucleotide bases which portion has a sequence the same as, or homologous to a portion of corresponding length of the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 16880 and ending at nucleotide approximately 18121 (i.e. corresponding to the XhoI-BamHI restriction fragment extending from positions approximately −1784 to approximately −537 relative to the transcription initiation site of the human desmin gene) or the same as, or homologous to a portion of the corresponding length of the sequence complementary to the sequence set out in SEQ ID NO:1 starting at nucleotide approximately 16880 and ending at nucleotide approximately 18121.
[0072] As previously mentioned, the enhancer is operably linked with the promoter if the enhancer increases gene expression driven by the promoter. An operably linked enhancer can be placed upstream or downstream of the promoter within the gene sequence or downstream of said gene sequence. Furthermore, the enhancer can be adjacent, at a close distance or over a distance of up to several kb to the promoter. Advantageously, the enhancer is positioned upstream of the promoter, advantageously with a distance separating the promoter and the enhancer by less than 500 bp, preferably less than 200 bp and, more preferably immediately adjacent to the promoter. Moreover, the orientation of the enhancer may be sense (genomic) or antisense (reverse genomic) relative to the transcriptional direction conferred by the promoter. The optimal location and orientation of each element relative to the others can be determined by routine experimentation for any particular construct. In the context of the present invention, the enhancer (e.g. the human desmin enhancer) is positioned in sense orientation upstream of the promoter (e.g. the human desmin promoter).
[0073] The optimal location and orientation of the promoter and/or enhancer relative to the nucleic acid fragment can also be determined by routine experimentation. By way of illustration, the nucleic acid fragment is positioned in genomic orientation and upstream of the enhancer at a distance of at least about 200 bp to about 1 kb, preferably about 400 bp to about 800 bp. Nevertheless, those skilled in the art would be aware that the distances of the nucleic acid fragment relative to the promoter and/or the enhancer and/or the gene of interest are provided for guidance and may depend upon the relative sizes of these different elements. Alternatively, the nucleic acid fragment may be placed distantly from the promoter and/or enhancer and even downstream the gene of interest. As an absolute preference, the nucleic acid fragment is positioned in sense (i.e. genomic) orientation upstream of the enhancer which is positioned upstream of the promoter which is positioned upstream of the gene(s) of interest.
[0074] The present invention also encompasses the use of more than one enhancer as defined hereinabove.
[0075] The operability of the juxtaposition may be easily determined by measuring its capability to drive gene expression (e.g. of a reporter gene) in the host cell or organism, either in vitro in appropriate cultured cells or in vivo (in transgenic animals or by direct administration to animal models). Gene expression can be determined by standard methods as indicated previously.
[0076] The promoter and the enhancer in use in the present invention can for example be isolated by cloning techniques from DNA libraries or by amplification methods (PCR) using appropriate probes or primers. Alternatively, they may be produced by chemical synthesis based upon sequence data available in the art. The promoter- and enhancer-bearing fragments can be associated by means of using restriction enzymes and ligases.
[0077] In the context of the present invention, the promoter or enhancer for use in the present invention or both can be modified by deletion, addition and/or substitution of one or several nucleotide(s), provided that their respective activity as defined above be substantially preserved (at least 80% of the activity of the native sequence). Such modifications can be aimed to remove (i) positive cis-acting sequences in order to improve tissue-specificity; or (ii) negative cis-acting sequences (<<silencers >>) which reduce expression levels. Site-directed mutagenesis can be used to modify the native sequence.
[0078] In a preferred embodiment, the present invention enables the juxtaposition of a nucleic acid fragment, an enhancer and a promoter isolated or obtained from the human desmin gene, for conferring improvement of the persistence of transfected cells expressing one or more gene(s) of interest in the host cell or organism, especially skeletal muscle cells. According to this embodiment, the present invention uses a sequence the same as or homologous to the sequence as set out in SEQ ID NO:1 starting at nt approximately 1 and ending at nt approximately 18663 (i.e. corresponding to the portion of the human desmin gene extending from approximately position-18662 to approximately position +1 relative to the transcription initiation site) or to the complementary sequence therof.
[0079] The term “gene of interest” refers to a nucleic acid which can be of any origin and isolated from a genomic DNA, a cDNA, or any DNA encoding a RNA, such as a genomic RNA, an mRNA, an antisense RNA, a ribosomal RNA, a ribozyme or a transfer RNA. The gene of interest can also be an oligonucleotide (i.e. a nucleic acid having a short size of less than 100 bp).
[0080] In a preferred embodiment, the gene of interest in use in the present invention, is a therapeutic gene, i.e. encodes a gene product of therapeutic interest, preferably other than desmin. A “gene product of therapeutic interest” is one which has a therapeutic or protective activity when administered appropriately to a patient, especially a patient suffering from a disease or illness condition or who should be protected against a disease or condition. Such a therapeutic or protective activity can be correlated to a beneficial effect on the course of a symptom of said disease or said condition. It is within the reach of the man skilled in the art to select a gene encoding an appropriate gene product of therapeutic interest, depending on the disease or condition to be treated. In a general manner, his choice may be based on the results previously obtained, so that he can reasonably expect, without undue experimentation, i.e. other than practicing the invention as claimed, to obtain such therapeutic properties.
[0081] In the context of the invention, the gene of interest can be homologous or heterologous with respect to to the host cell or organism into which it is introduced. Advantageously, it encodes a polypeptide, a ribozyme or an antisense RNA. The term <<polypeptide>> is to be understood as any translational product of a polynucleotide whatever its size is, and includes polypeptides having as few as 7 amino acid residues (peptides), but more typically proteins. In addition, it may be of any origin (prokaryotes, lower or higher eukaryotes, plant, virus etc). It may be a native polypeptide, a variant, a chimeric polypeptide having no counterpart in nature or fragments thereof. Advantageously, the gene of interest in use in the present invention encodes at least one polypeptide that can compensate for one or more defective or deficient cellular proteins in an animal or a human organism, or that acts through toxic effects to limit or remove harmful cells from the body. A suitable polypeptide may also be immunity conferring and acts as an antigen, e.g.to provoke a humoral response.
[0082] Representative examples of polypeptides encoded by the gene of interest include without limitation polypeptides selected from the group consisting of:
[0083] polypeptides involved in the cellular cycle, such as p21, p16, the expression product of the retinoblastoma (Rb) gene, kinase inhibitors (preferably of the cyclin-dependent type), GAX, GAS-1, GAS-3, GAS-6, Gadd45 and cyclin A, B and D;
[0084] angiogenic polypeptides, such as members of the family of vascular endothelial growth factors (VEGF; i.e. heparin-binding VEGF Genbank accession number M32977), transforming growth factor (TGF, and especially TGFalpha and beta), epithelial growth factors (EGF), fibroblast growth factor (FGF and especially FGF alpha and beta), tumor necrosis factors (TNF, especially TNF alpha and beta), CCN (including CTGF, Cyr61, Nov, Elm-1, Cop-1 and Wisp-3), scatter factor/hepatocyte growth factor (SH/HGF), angiogenin, angiopoyetin (especially 1 and 2), angiotensin-2, plasminogen activator (tPA) and urokinase (uPA);
[0085] cytokines, including interleukins (in particular IL-2, IL-6, IL-8, IL-12), colony stimulating factors (such as GM-CSF, G-CSF, M-CSF), interferons (such as IFN beta; Genbank accession number M25460; IFN gamma; Genbank accession number M29383) or IFN alpha);
[0086] chemokines, including RANTES, MIP alpha, MIP-1 beta, DCCKI, MDC, IL-10 (Genbank accession number U 16720) and MCP-1;
[0087] polypeptides capable of decreasing or inhibiting a cellular proliferation, including antibodies, toxins, immunotoxins, polypeptides inhibiting an oncogen expression products (e.g. ras, map kinase, tyrosine kinase receptors, growth factors), Fas ligand (Genbank accession number U08137), polypeptides activating the host immune system (MUC-1, early or late antigen(s) of a papilloma virus and the like);
[0088] polypeptides capable of inhibiting a bacterial, parasitic or viral infection or its development, such as antigenic determinants, transdominant variants inhibiting the action of a viral native protein by competition (EP 614980, WO95/16780), the extracellular domain of the HIV receptor CD4 (Traunecker et al., 1988, Nature 331, 84-86), immunoadhesin (Capon et al., 1989, Nature 337, 525-531; Byrn et al., 1990, Nature 344, 667-670), immunotoxins (Kurachi et al., 1985, Biochemistry 24, 5494-5499) and antibodies (Buchacher et al., 1992, Vaccines 92, 191-195);
[0089] enzymes, such as urease, renin, thrombin, metalloproteinase, nitric oxide synthases (eNOS (Genbank accession number M95296) and iNOS), SOD, catalase, heme oxygenase (Genbank accession number X06985), enase, the lipoprotein lipase family;
[0090] oxygen radical scavengers;
[0091] enzyme inhibitors, such as alpha1-antitrypsin, antithrombin III, plasminogen activator inhibitor PAI-1, tissue inhibitor of metalloproteinase 1-4
[0092] polypeptides capable of restoring at least partially a deficient cellular function responsible for a pathological condition, such as dystrophin or minidystrophin (in the context of myopathies; England et al., 1990, Nature 343, 180-182), insulin (in the context of diabetes), hemophilic factors (for the treatment of hemophilias and blood disorders such as Factor VIIa (U.S. Pat. No. 4,784,950), Factor VIII (U.S. Pat. No. 4,965,199) or derivative thereof (U.S. Pat. No. 4,868,112 having the B domain deleted) and Factor IX (U.S. Pat. No. 4,994,371)), CFTR (in the context of cystic fibrosis; Riordan et al., 1989, Science 245, 1066-1072), erythropoietin (anemia), lysosomal storage enzymes, including glucocerebrosidase (Gaucher's disease; U.S. Pat. No. 5,879,680 and U.S. Pat. No. 5,236,838), alpha-galactosidase (Fabry disease; U.S. Pat. No. 5,401,650), acid alpha-glucosidase (Pompe's disease; WO00/12740), alpha n-acetylgalactosaminidase (Schindler disease; U.S. Pat. No. 5,382,524), acid sphingomyelinase (Niemann-Pick disease; U.S. Pat. No. 5,686,240) and alpha-iduronidase (WO93/10244);
[0093] angiogenesis inhibitors, such as angiostatin, endostatin, platelet factor-4;
[0094] transcription factors, such as nuclear receptors comprising a DNA binding domain, a ligand binding domain and domain activating or inhibiting transcription (e.g. fusion products derived from oestrogen, steroid and progesterone receptors);
[0095] reporter genes (such as CAT, luciferase, eGFP . . . );
[0096] an antibody (whole immunoglobulins of any class, chimeric antibodies and hybrid antibodies with dual or multiple antigen or epitope specificities and fragments thereof such as F(ab)
[0097] any polypeptides that are recognized in the art as being useful for the treatment or prevention of a clinical condition.
[0098] It is within the scope of the present invention that the gene of interest may include addition(s), deletion(s) and/or modification(s) of one or more nucleotide(s) with respect to the native sequence.
[0099] As muscle cells are the preferred host cells in the context of the invention, the gene of interest preferably encodes a therapeutic polypeptide that is expressed by the muscle lo cells and eventually secreted into the blood stream to provide therapy to the host organism. One important aspect of the invention is a process for the treatment of muscular dystrophy wherein said gene of interest codes for dystrophin or minidistrophin.
[0100] As mentioned above, the gene of interest also includes genes encoding antisense sequences and ribozymes capable of binding and inactivating specific cellular RNA, preferably that of selected positively-acting growth regulatory genes, such as oncogenes and protooncogenes (c-myc, c-fos, c-jun, c-myb, c-ras, Kc and JE).
[0101] The present invention may use one or more gene(s) of interest. The different genes of interest may be controlled by common (polycistronic cassette) or independent control sequences that are positioned either in the same or in opposite directions. In this variation, the nucleic acid fragment in use in the present invention may regulate two genes of interest and placed between the promoter/enhancer controlling expression of each gene. Alternatively, it is possible to use more than one nucleic acid fragment, preferably positioned upstream of the promoter/enhancer elements used to drive each gene.
[0102] Those skilled in the art will appreciate that the present invention may further use additional control sequences for proper initiation, regulation and/or termination of transcription and translation of the gene(s) of interest into the host cell or organism. Such control sequences include but are not limited to non coding exons, introns, targeting sequences, transport sequences, secretion signal sequences, nuclear localisation signal sequences, IRES, polyA transcription termination sequences, tripartite leader sequences, sequences involved in replication or integration. Said control sequences have been reported in the literature and can be readily obtained by those skilled in the art.
[0103] In this respect, it is preferred that the control sequences comprise at least a non-coding exon, or a portion thereof, and optionally, one or more intron(s). Such exon and/or intron sequences may be advantageous for optimising expression and may for example be obtained from the gene from which the nucleic acid fragment and/or control sequence(s) originates (e.g. desmin) or from any other origin (e.g. eukaryotic, viral, synthetic). The large variety of exon/intron sequences described in the state of the art are suitable in the context of the present invention (see for example WO98/55639). They are preferably inserted after the transcription initiation site and before the initiation codon of the gene of interest. Referring to the preferred embodiment, an appropriate exon sequence comprises the portion of the first non-coding exon extending from position +2 to approximately position +60 relative to the transcriptional initiation site (i.e. the sequence shown in SEQ ID NO:1 from approximately nt 18664 to approximately nt 18722). With respect to intron(s), it (they) may be homologous (to the gene of interest) or heterologous (i.e., derived from any eukaryotic nuclear gene or of synthetic origin). Introns have been published in the literature and can be readily obtained by those skilled in the art. Illustrative examples include the introns isolated from the genes encoding alpha or beta globin (i.e. the second intron of the rabbit beta globin gene; Green et al., 1988, Nucleic Acids Res. 16, 369; Karasuyama et al., 1988, Eur. J. Immunol. 18, 97-104), ovalbumin, apolipoprotein, immunoglobulin, factor IX, factor VIII and CFTR and the synthetic introns such as the intron present in the pCI vector (Promega Corp, pCI mammalian expression vector E1731) made of the human beta globin donor fused to the mouse immunoglobin acceptor or the intron 16S/19S of SV40 (made of the association of the sequence that is spliced in the formation of SV40 16S and SV40 19S late mRNA Okayma and Berg, 1983, Mol. Cell. Biol. 3, 280-289).
[0104] The control sequence used in the context of the present invention may also contain a polyadenylation signal operably linked to the gene(s) of interest. A polyadenylation sequence is operably linked to the gene to be transcribed, when it allows termination of the transcription. It is preferably positioned downstream of the gene of interest.
[0105] According to another embodiment, the present invention also provides an expression cassette for the expression of one or more gene(s) of interest in a host cell or organism comprising at least said gene(s) of interest which expression is controlled by one or more control sequence(s) and a nucleic acid fragment, wherein said nucleic acid fragment comprises a portion of at least 100 contigous nucleotide bases having a sequence the same as, or homologous to a portion of corresponding length of the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 1 and ending at nucleotide approximately 16879 or the same as, or homologous to a portion of the corresponding length of the sequence complementary to the sequence set out in SEQ ID NO:1 starting at nucleotide approximately 1 and ending at nucleotide approximately 16879.
[0106] Preferably, the nucleic acid fragment and/or the control sequence(s) (e.g. promoter and/or enhancer and/or exonic sequence and/or polyadenylation sequence . . . ) and/or the gene of interest have the characteristics as defined above. The term “and/or” whereever used herein includes the meaning of “and”, “or” and “all or any other combination of the elements connected by said term”.
[0107] In particular, the expression cassette of the present invention shows a propensity to direct gene expression in muscle cells, whereas in non-muscle cells, it is not at all or not very active (reduced activity by a factor of at least 5, preferably at least 10 and, more preferably, at least 100 as compared to the expression activity in muscle cells), whereby these host cells may be cultured cells or cells of a host organism. Thus, the expression cassette of the present invention results in maximal gene expression in muscle cells and thus provides the possibility of transcriptionally targeting expression of said gene of interest preferentially to muscle cells. Preferably, the muscle cell is a skeletal muscle cell.
[0108] According to a preferred embodiment, the expression cassette of the invention, comprises at least:
[0109] a sequence the same as, or homologous to the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 1 and ending at nucleotide approximately 18663,
[0110] a sequence the same as, or homologous to the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 1 and ending at nucleotide approximately 18722,
[0111] a sequence the same as, or homologous to the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 2358 and ending at nucleotide approximately 10067 and starting at nucleotide approximately 16880 and ending at nucleotide approximately 18663,
[0112] a sequence the same as, or homologous to the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 2358 and ending at nucleotide approximately 10067 and starting at nucleotide approximately 16880 and ending at nucleotide approximately 18722,
[0113] a sequence the same as, or homologous to the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 7569 and ending at nucleotide approximately 18663, or
[0114] a sequence the same as, or homologous to the sequence as set out in SEQ ID NO:1 starting at nucleotide approximately 7569 and ending at nucleotide approximately 18722.
[0115] The expression cassette of the present invention may be constructed by standard molecular biology techniques well known in the art. The different control sequences and the nucleic acid frgament may be obtained from natural sources or by synthetic means. For example, they may be isolated by cloning techniques from DNA libraries or by amplification methods (PCR) using appropriate probes. Alternatively, they may be produced by chemical synthesis based upon sequence data available in the art. The promoter, enhancer and nucleic acid fragment may be associated by means of using restriction enzymes and ligases.
[0116] The present invention also provides a vector comprising an expression cassette according to the invention. The skilled person may choose the appropriate vector out of a wide range of vectors.
[0117] According to a preferred embodiment, the vector of the present invention is a naked DNA molecule, for instance a plasmid vector. The term “plasmid” encompasses both extrachromosomal circular DNA (i.e. episomal plasmid) and integrative plasmids which are capable of integration within the host's genome. Preferably, the plasmid is designed for amplification in bacteria and for expression in an eukaryotic host cell. It may also comprise a selection gene in order to select or to identify the transfected cells (e.g. by complementation of a cell auxotrophy or by antibiotic resistance), stabilising elements (e.g. cer sequence; Summers and Sherrat, 1984, Cell 36, 1097-1103), integrative elements (e.g. LTR viral sequences and transposons) as well as elements providing a self-replicating function and enabling the vector to be stably maintained in cells, independently of copy number of the vector in the cell. The self-replicating function is provided by using a viral origin of replication and providing one or more viral replication factors that are required for replication mediated by that particular viral origin (WO95/32299). Origins of replication and any replication factors may be obtained from a variety of viruses, including Epstein-Barr virus (EBV), human and bovine papilloma viruses and papovavirus BK. However, a preferred embodiment of the vector of the invention relates to a non self-replicating plasmid vector.
[0118] The range of suitable plasmids is very large and can be purchased from a variety of manufacturers. Suitable plasmids include but are not limited to those derived from pBR322 (Gibco BRL), pUC (Gibco BRL), pBluescript (Stratagene), pREP4, pCEP4 (Invitrogene), pCI (Promega) and p Poly (Lathe et al., Gene 57 (1987), 193-201). It can also be engineered by standard molecular biology techniques (Sambrook et al. 2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor N.Y.). Additionally, it is well known that the form of the vector can affect the expression efficiency, and it is preferable that a large fraction of the vector be in supercoiled form.
[0119] The present invention also encompasses viral vectors. Such viral vectors can be derived from any wild-type viruses, especially selected from the group consisting of herpes viruses, cytomegaloviruses, foamy viruses, lentiviruses, Semliki forrest viruses, AAV (adeno-associated viruses), poxviruses, adenoviruses and retroviruses. Such viral vectors are well known in the art. <<Derived>> means genetically engineered from the wild-type viral genome by introducing one or more modifications, such as deletion(s), addition(s) and/or substitution(s) of one or several nucleotide(s) present in a coding or a non-coding portion of the viral genome.
[0120] A viral vector can be an adenoviral vector. The adenoviral genome consists of a linear double-standed DNA molecule of approximately 36 kb carrying more than about thirty genes necessary to complete the viral cycle. The early genes are divided into 4 regions (E1 to E4) that are essential for viral replication (Pettersson and Roberts, 1986, In Cancer Cells (Vol 4): DNA Tumor Viruses, Botchan and Glodzicker Sharp Eds pp 37-47, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.; Halbert et al., 1985, J. Virol. 56, 250-257) with the exception of the E3 region, which is believed dispensable for viral replication based on the observation that naturally-occuring mutants or hybrid viruses deleted within the E3 region still replicate like wild-type viruses in cultured cells (Kelly and Lewis, 1973, J. Virol. 12, 643-652). The E1 gene products encode proteins responsible for the regulation of transcription of the viral genome. The E2 gene products are required for initiation and chain elongation in viral DNA synthesis. The proteins encoded by the E3 prevent cytolysis by cytoxic T cells and tumor necrosis factor (Wold and Gooding, 1991, Virology 184, 1-8). The proteins encoded by the E4 region are involved in DNA replication, late gene expression and splicing and host cell shut off (Halbert et al., 1985, J. Virol. 56, 250-257). The late genes (L1 to L5) encode in their majority the structural proteins constituting the viral capsid. They overlap at least in part with the early transcription units and are transcribed from a unique promoter (MLP for Major Late Promoter). In addition, the adenoviral genome carries at both extremities cis-acting 5′ and 3′ ITRs (Inverted Terminal Repeat) and packaging sequences essential for DNA replication. The ITRs harbor origins of DNA replication whereas the encapsidation region is required for the packaging of adenoviral DNA into infectious particles.
[0121] In one embodiment, the adenoviral vector of the present invention is engineered to be conditionally replicative (CRAd vectors) in order to replicate selectively in specific cells (e.g. proliferative cells) as described in Heise and Kirn (2000, J. Clin. Invest. 105, 847-851).
[0122] According to another and preferred embodiment, the adenoviral vector of the invention is replication-defective, at least for the E1 function by total or partial deletion and/or mutation of one or more genes constituting the E1 region. Advantageously, the E1 deletion covers nucleotides (nt) 458 to 3328 or 458 to 3510 by reference to the sequence of the human adenovirus type 5 disclosed in the Genebank data base under the accession number M 73260.
[0123] Furthermore, the adenoviral backbone of the vector may comprise additional modifications (deletions, insertions or mutations in one or more viral genes). An example of an E2 modification is illustrated by the thermosensible mutation localized on the DBP (DNA Binding Protein) encoding gene (Ensinger et al., 1972, J. Virol. 10, 328-339). The adenoviral sequence may also be deleted of all or part of the E4 region. A partial deletion retaining the ORFs 3 and 4 or ORFs 3 and 6/7 may be advantageous (see for example European application EP 974 668; Christ et al., 2000, Human Gene Ther. 11, 415-427; Lusky et al., 1999, J. Virol. 73, 8308-8319). Additional deletions within the non-essential E3 region may increase the cloning capacity (for a review see for example Yeh et al. FASEB Journal 11 (1997) 615-623), but it may be advantageous to retain all or part of the E3 sequences coding for the polypeptides (e.g. gp19k) allowing to escape the host immune system (Gooding et al., 1990, Critical Review of Immunology 10, 53-71) or inflammatory reactions (EP00440267.3). Second generation vectors retaining the ITRs and packaging sequences and containing substantial genetic modifications aimed to abolish the residual synthesis of the viral antigens may also be envisaged, in order to improve long-term expression of the expressed gene in the transduced cells (WO94/28152; Lusky et al., 1998, J. Virol 72, 2022-2032). Gutless adenoviral vectors retaining only the ITRs and packaging sequences and devoid of all viral genes can be envisaged in the context of the present invention.
[0124] The expression cassette of the present invention can be inserted in any location of the adenoviral genome, with the exception of the cis-acting sequences. Preferably, it is inserted in replacement of a deleted region (E1, E3 and/or E4), with a special preference for the deleted E1 region. In addition, the expression cassette may be positioned in sense or antisense orientation relative to the transcriptional direction of the region in question.
[0125] Adenoviruses adaptable for use in accordance with the present invention, can be derived from any human or animal source, in particular canine (e.g. CAV-1 or CAV-2; Genbank ref CAV1GENOM and CAV77082 respectively), avian (Genbank ref AAVEDSDNA), bovine (such as BAV3; Seshidhar Reddy et al., 1998, J. Virol. 72, 1394-1402), murine (Genbank ref ADRMUSMAV1), ovine, feline, porcine or simian adenovirus or alternatively from a hybrid thereof. Any serotype can be employed. However, the human adenoviruses of the C sub-group are preferred and especially adenoviruses 2 (Ad2) and 5 (Ad5). Generally speaking, the cited viruses are available in collections such as ATCC and have been the subject of numerous publications describing their sequence, organization and biology, allowing the artisan to apply them.
[0126] In addition, adenoviral particles or empty adenoviral capsids can also be used to transfer nucleic acids (e.g. a plasmidic vector) by a virus-mediated cointernalization process as described in U.S. Pat. No. 5,928,944. This process can be accomplished in the presence of (a) cationic agent(s) such as polycarbenes or lipid vesicles comprising one or more lipid layers.
[0127] A retroviral vector is also suitable in the context of the present invention. Retroviruses are a class of integrative viruses which replicate using a virus-encoded reverse transcriptase, to replicate the viral RNA genome into double stranded DNA which is integrated into chromosomal DNA of the infected cells. The numerous vectors described in the literature may be used within the framework of the present invention and especially those derived from murine leukemia viruses, especially Moloney (Gilboa et al., 1988, Adv. Exp.Med. Biol. 241, 29) or Friend's FB29 strains (WO95/01447). Generally, a retroviral vector is deleted of all or part of the viral genes gag, pol and env and retains 5′ and 3′ LTRs and an encapsidation sequence. These elements may be modified to increase expression level or stability of the retroviral vector. Such modifications include the replacement of the retroviral encapsidation sequence by one of a retrotransposon such as VL30 (U.S. Pat. No. 5,747,323). The expression cassette of the invention is inserted downstream of the encapsidation sequence, preferably in opposite direction relative to the retroviral genome.
[0128] A poxyiral vector is also suitable in the context of the present invention. Poxviruses are a group of complex enveloped viruses that distinguish from the above-mentioned viruses by their large DNA genome and their cytoplasmic site of replication. The genome of several members of poxyiridae has been mapped and sequenced. It is a double-stranded DNA of approximately 200 kb coding for about 200 proteins of which approximately 100 are involved in virus assembly. In the context of the present invention, a poxyiral vector may be obtained from any member of the poxyiridae, in particular canarypox, fowlpox and vaccinia virus, the latter being preferred. Suitable vaccinia viruses include without limitation the Copenhagen strain (Goebel et al., 1990, Virol. 179, 247-266 and 517-563; Johnson et al., 1993, Virol. 196, 381-401), the Wyeth strain and the modified Ankara (MVA) strain (Antoine et al., 1998, Virol. 244, 365-396). The general conditions for constructing a vaccinia virus comprising an expression cassette according to the present invention are well known in the art (see for example EP 83 286 and EP 206 920 for Copenhagen vaccinia viruses and Mayr et al., 1975, Infection 3, 6-14 and Sutter and Moss, 1992, Proc. Natl. Acad. Sci. USA 89, 10847-10851 for MVA viruses). The expression cassette of the present invention is preferably inserted within the poxyiral genome in a non-essential locus, such as non-coding intergenic regions or any gene for which inactivation or deletion does not significantly impair viral growth and replication. Thymidine kinase gene is particularly appropriate for insertion in Copenhagen vaccinia viruses (Hruby et al., 1983, Proc. Natl. Acad. Sci USA 80, 3411-3415; Weir et al., 1983, J. Virol. 46, 530-537). As far as MVA is concerned, insertion of the expression cassette can be performed in any of the excisions I to VII, and preferably in excision II or III (Meyer et al., 1991, J. Gen. Virol. 72, 1031-1038; Sutter et al., 1994, Vaccine 12, 1032-1040) or in D4R locus. For fowlpox virus, although insertion within thymidine kinase gene may be considered, the expression cassette is preferably introduced into a non-coding intergenic region (see for example EP 314 569 and U.S. Pat. No. 5,180,675). One may also envisage insertion in an essential viral locus provided that the defective function be supplied in trans, via a helper virus or by expression in the producer cell line.
[0129] The present invention also provides a viral particle comprising a vector according to the invention, preferably a viral vector. Such viral particle can be produced in appropriate cell lines according to standard techniques.
[0130] Adenoviral particles may be prepared and propagated according to any conventional technique in the field of the art (e.g. as described in Graham and Prevect, 1991, Methods in Molecular Biology, Vol 7, Gene Transfer and Expression Protocols; Ed E. J. Murray, The Human Press Inc, Clinton, N.J. or in WO96/17070) using a complementation cell line or a helper virus, which supplies in trans the viral genes for which the adenoviral vector of the invention is defective. The cell lines 293 (Graham et al., 1977, J. Gen. Virol. 36, 59-72) and PERC6 (Fallaux et al., 1998, Human Gene Therapy 9, 1909-1917) are commonly used to complement the E1 function. Other cell lines have been engineered to complement doubly defective vectors (Yeh et al., 1996, J. Virol. 70, 559-565; Krougliak and Graham, 1995, Human Gene Ther. 6, 1575-1586; Wang et al., 1995, Gene Ther. 2, 775-783; Lusky et al., 1998, J. Virol. 72, 2022-2033; EP919627 and WO97/04119). The adenoviral particles can be recovered from the culture supernatant but also from the cells after lysis and optionally further purified according to standard techniques (e.g. chromatography, ultracentrifugation, as described in WO96/27677, WO98/00524 WO98/26048 and WO00/50573).
[0131] Retroviral particles are prepared in the presence of a helper virus or in an appropriate complementation (packaging) cell line which contains integrated into its genome the retroviral genes for which the retroviral vector is defective (e.g. gag/pol and env). Such cell lines are described in the prior art (Miller and Rosman, 1989, BioTechniques 7, 980; Danos and Mulligan, 1988, Proc. Natl. Acad. Sci. USA 85, 6460; Markowitz et al., 1988, Virol. 167, 400). The product of the env gene is responsible for the binding of the viral particle to the viral receptors present on the surface of the target cell and, therefore determines the host range of the retroviral particle. In the context of the invention, it is advantageous to use a packaging cell line, such as the PA317 cells (ATCC CRL 9078) or 293E16 (WO97/35996) containing an amphotropic envelope protein, to allow infection of human and other species target cells. The retroviral particles are preferably recovered from the culture supernatant and may optionally be further purified according to standard techniques (e.g. chromatography, ultracentrifugation).
[0132] Poxyiral particles are prepared as described in numerous documents accessible to the artisan skilled in the art (Piccini et al., 1987, Methods of Enzymology 153, 545-563; U.S. Pat. No. 4,769,330; U.S. Pat. No. 4,772,848; U.S. Pat. No. 4,603,112; U.S. Pat. No. 5,100,587 and U.S. Pat. No. 5,179,993). The major techniques that have been developed utilize homologous recombination between a donor plasmid containing the expression cassette of the invention flanked on both sides by pox DNA sequences (encompassing the desired insertion site) and the wild type poxyiral genome. Generally, the donor plasmid is constructed, amplified by growth in
[0133] According to another advantageous alternative, the vector of the invention may be complexed with various compounds that can improve vector delivery efficiency. Such compounds include but are not limited to lipids, polymers, peptides, condensing agents (spermine, spermidine, histones, peptides) and their derivatives. These compounds are widely described in the scientific literature accessible to the man skilled in the art.
[0134] In this respect, preferred lipids are cationic lipids which have a high affinity for nucleic acids (e.g. the vector of the present invention) and which interact with cell membranes (Felgner et al., 1989, Nature 337, 387-388). As a result, they are capable of complexing the nucleic acid, thus generating a compact particle capable of entering the cells. Cationic lipids or mixtures of cationic lipids which may be used in the present invention include Lipofectin™, DOTMA: N-[1-(2,3-dioleyloxyl)propyl]-N,N,N-trimethylammonium (Felgner, 1987, Proc. Natl. Acad. Sci. USA 84, 7413-7417), DOGS: dioctadecylamidoglycylspermine or Transfectam™ (Behr, 1989, Proc. Natl. Acad. Sci. USA 86, 6982-6986), DMRIE: 1,2-dimiristyloxypropyl-3-dimethyl-hydroxyethylammonium and DORIE: 1,2-diooleyloxypropyl-3-dimethyl-hydroxyethylammnoium (Felgner, 1993, Methods 5, 67-75), DC-CHOL: 3 [N-(N′,N′-dimethylaminoethane)-carbamoyl]cholesterol (Gao, 1991, BBRC 179, 280-285), DOTAP (McLachlan, 1995, Gene Therapy 2, 674-622), Lipofectamine™, spermine- and spermidine-cholesterol, Lipofectace™ (for a review see for example Legendre, 1996, Medecine/Science 12, 1334-1341 or Gao, 1995, Gene Therapy 2, 710-722) and the cationic lipids disclosed in patent applications WO 98/34910, WO 98/14439, WO 97/19675, WO 97/37966 and their isomers. Nevertheless, this list is not exhaustive and other cationic lipids well known in the art can be used in connection with the present invention as well.
[0135] Cationic polymers or mixtures of cationic polymers which may be used in the present invention include chitosan (WO98/17693), poly(aminoacids) such as polylysine (U.S. Pat. No. 5,595,897 or FR 2 719 316); polyquatemary compounds; protamine; polyimines; polyethylene imine or polypropylene imine (WO96/02655); polyvinylamines; polycationic polymer derivatized with DEAE, such as DEAE dextran (Lopata et al., 1984, Nucleic Acid Res. 12, 5707-5717); polyvinylpyridine; polymethacrylates; polyacrylates; polyoxethanes; polythiodiethylaminomethylethylene (P(TDAE)); polyhistidine; polyomithine; poly-p-aminostyrene; polyoxethanes; co-polymethacrylates (eg copolymer of HPMA; N-(2hydroxypropyl/methacrylamide); the compound disclosed in U.S. Pat. No. 3,910,862, polyvinylpyrrolid complexes of DEAE with methacryl ate, dextran, acrylamide, polyimines, albumin, onedimethylaminomethylmethacrylates and polyvinylpyrrolidone methylacrylaminopropyltrimethyl ammonium chlorides; polyamidoamine (Haensler and Szoka, 1993, Bioconjugate Chem. 4, 372-379); telomeric compounds (patent application filing number EP 98401471.2); dendritic polymers (WO 95/24221). Nevertheless, this list is not exhaustive and other cationic polymers well known in the art can be used in the composition according to the invention as well.
[0136] Colipids may be optionally included in order to facilitate entry of the vector into the cell. Such colipids can be neutral or zwitterionic lipids. Representative examples include phosphatidylethanolamine (PE), phosphatidylcholine, phosphocholine, dioleylphosphatidylethanolamine (DOPE), sphingomyelin, ceramide or cerebroside and any of their derivatives.
[0137] The ratio of vector to the cationic compound (e.g. lipid or polymer) on a molar basis varies from between 0.1 and 20, preferably between 0.3 and 10, and most preferably between 0.5 and 5. The ratio is characterized by the theoretical charge ratio (+/−) obtained by dividing the number of positive charges provided by the cationic compound and the number of negative charges provided by the vector, assuming that all potentially cationic groups are in fact in the cationic state and all potentially anionic groups are in fact in the anionic state. In general, an excess of positive charges on the composition facilitates binding of the composition to the negatively-charged cell surface
[0138] The ratio of cationic lipids and/or cationic polymers to colipid(s) (on a weight to weight basis), when the co-lipid(s) is (are) co-existing in the complex, can range from 1:0 to 1:10. In preferred embodiments, this ratio ranges from 1:0.5 to 1:4.
[0139] The vector complexed to such compounds may be characterised by a <<small> average diameter (less than 2 μm). In a preferred embodiment, this average diameter is between about 20 and 800 nm, preferably between about 50 and 500 nm, more preferably between about 75 and 200 nm, and still more preferably about 100 nm. Measurements of the average diameter can be achieved by a number of techniques including, but not limited to, dynamic laser light scattering (photon correlation spectroscopy, PCS), as well as other techniques known to those skilled in the art (see, Washington, Particle Size Analysis in Pharmaceutics and other Industries, Ellis Horwood, N.Y., 1992, 135-169). Sizing procedure may also be applied on vector complexes in order to select a determined diameter. Methods which can be used in this sizing step include, but are not limited to, extrusion, sonication and microfluidization, size exclusion chromatography, field flow fractionation, electrophoresis and ultracentrifugation.
[0140] The complexation of the vector and one or more of the precited compound may e performed through reactive functional groups at the surface of the vector, optionally with the intermediary use of a cross linker or other activating agent (see for example Bioconjugate techniques 1996; ed G Henmanson; Academic Press), directly or via a spacer using heterobifunctional reactives such as SPDP or SMCC, or functionalized PEG which are well known by the person skilled in the art (Mattson et al., 1993, Mol. Biol. Reports, 17, 167-183).
[0141] The complexation of the vector of the invention with one or more of the above-cited compounds can be performed according to standard techniques. For example, the compound(s) (e.g. cationic lipids) is (are) dissolved in an appropriate organic solvent such as chloroform. The mixture is then dried under vaccum. The film obtained is further dissolved in an appropriate amount of solvent or mixture of solvents which are miscible in water, in particular ethanol, dimethylsulfoxide (DMSO), or preferably a 1:1 (v:v) ethanol DMSO mixture, so as to form lipid aggregates according to a known method (WO 96/03977), or alternatively, is suspended in an appropriate quantity of a solution of detergent such as an octylglucoside (e.g. n-octyl-beta-D-glucopyranoside or 6-O-(N-heptylcarbamoyl/methyl-alpha-D-glucopyranoside). The suspension may then be mixed with a solution comprising the desired amount of polynucleotide. Subsequent dialysis may be carried out in order to remove the detergent and to recover the composition of the invention. The principle of such a method is described by Hofland et al. (1996, Proc. Natl. Acad. Sci. USA 93, 7305-7309).
[0142] It is also feasible to suspend one or more compound(s) in a buffer and then the suspension is subjected to sonication until visual homogeneity is obtained. The suspension is then extruded through two microporous membranes under appropriate pressure. The suspension is then mixed with a solution of vector. This so-called sonication-extrusion technique is well known by those skilled in the art.
[0143] Other compounds may be included in the vector-synthetic vector complex, such as labelling molecules (see, for example, molecules disclosed in U.S. Pat. No. 4,711,955) enabling, for example, visualization of the distribution of the complex after in vitro or in vivo administration; targeting moieties (ligands), anchoring molecules; fusogenic peptides, nuclear localization peptides or elements facilitating penetration into the cell, lysis of endosomes (JTS1 peptides for example, Gottchalk et al., 1996, Gene Therapy, 3, 448-457), or even transport to the nucleus or a combination of said compounds. These compounds may be composed of all or part of sugars, glycol, peptides (e.g. GRP, Gastrin Releasing Peptide), oligonucleotides, lipids (especially those with C2-C22), hormones, vitamins, antigens, antibodies (or fragments thereof). A characteristic feature of a targeting moiety is its ability to recognize and bind a cellular and surface-exposed component. Such targeting moieties include without limitation chemical conjugates, hormones, sugars, polypeptides, oligonucleotides, vitamins, antigens, lectins, antibodies and fragments thereof. They are preferably capable of recognizing and binding to cell-specific markers, tissue-specific markers, cellular receptors, viral antigens, antigenic epitopes or tumor-associated markers. Representative examples include for example galactosyl residues to target the asialoglycoprotein receptor on the surface of hepatocytes. Fusogenic peptides include the INF-7 fusogenic peptide derived from the HA-2 subunit of the influenza virus hemagglutinin (Plank et al. 1994, J. Biol. Chem. 269, 12918-12924) for membrane fusion, A nuclear signal sequence can be derived from the T-antigen of the SV40 virus (Lanford and Butel, 1984, Cell 37, 801-813) or from the EBNA-1 protein of the Epstein Barr virus (Ambinder et al., 1991, J. Virol. 65, 1466-1478). Furthermore, the reactive groups can be substituted with alkyl C1-C6, leading for example to permethylated compositions. The reactive groups might also be substituted with amino groups. Such substituted polynucleotide or compound can be obtained easily using the techniques described in the literature, especially by chemical coupling, notably by using protective groups such as trifluoroacetyl, Fmoc (9-fluorenylmethoxycarbonyl) or BOC (tert-butyl oxycarbonyl) on the amine moiety. Selective removal of a protective group then allows coupling of the compound, and then complete deprotection (Greene and Wuts, 1991, Protective groups in organic synthesis. Ed. J. Wiley & Sons, Inc. New York).
[0144] Of course, before introducing the vector of the present invention in the host cell or organism, it is advantageous to purify said polynucleotide so that it is sufficiently free of undesirable contaminants, such as endotoxins and other pyrogens such that it does not cause ay untoward reactions in individual receving the vector construct. A preferred means of purifying the vector involves the use of buoyant density gradients (i.e. cesium chloride gradient centrifugation) or techniques as described in WO98/11208 and WO00/50573.
[0145] The present invention also provides a eukaryotic host cell comprising the expression cassette or the vector or the viral particle according to the invention.
[0146] For the purpose of the invention, the term “cells” should be understood broadly without any limitation concerning particular organization in tissue, organ, etc or isolated cells of a mammalian (preferably a human). Such cells may be unique type of cells or a group of different types of cells and encompass cultured cell lines, primary cells and proliferative cells from mammalian origin, with a special preference for human origin. Suitable host cells include but are not limited to hematopoietic cells (totipotent, stem cells, leukocytes, lymphocytes, monocytes, macrophages, APC, dendritic cells, non-human cells and the like), pulmonary cells, tracheal cells, hepatic cells, epithelial cells, endothelial cells or fibroblasts with a special preference for muscle cells (as defined above) and especially skeletal muscle cells. In the context of the present invention, the vector of the invention may be integrated in the genome of the host cell or maintained as an extrachromosomal element.
[0147] Moreover, according to a specific embodiment, the eukaryotic host cell can be further encapsulated. Cell encapsulation technology has been previously described (Tresco et al., 1992, ASAIO J. 38, 17-23; Aebischer et al., 1996, Human Gene Ther. 7, 851-860). According to said specific embodiment, transfected or infected host cells are encapsulated with compounds which form a microporous membrane and said encapsulated cells can further be implanted in vivo. Capsules containing the cells of interest may be prepared employing a hollow microporous membrane from poly-ether sulfone (PES) (Akzo Nobel Faser AG, Wuppertal, Gennany; Deglon et al. 1996, Human Gene Ther. 7, 2135-2146). This membrane has a molecular weight cutoff greater than 1 Mda which permits the free passage of proteins and nutrients between the capsule interior and exterior, while preventing the contact of transplanted cells with host cells.
[0148] The present invention also provides a non human transgenic animal comprising a cell having incorporated in its genome an expression cassette or a vector according to the invention. This transgenic animal may be a complete transgenic (all its cells possess the exogenous transgene) or alternatively a mosaic (the exogenous transgene occurs in a certain pourcentage of cells, but not in all. This transgenic animal is a non-human mammal, more especially a rodent and most preferably a mouse. Usually, a transgenic animal is obtained by the development of an egg which has been injected with the expression cassette or the vector (preferably linearized) of the invention. The techniques for generating transgenic animals are conventional in the domain of the art (see for example Jaenisch et al., 1988, Science 240, 1468-1474 and Capechi et al., 1989, Science 244, 1288-1292).
[0149] The present invention also provides a composition, and more especially a pharmaceutical composition, comprising the expression cassette, the vector the viral particle or the eukaryotic host cell according to the present invention and a pharmaceutically acceptable vehicle. In a special case, the composition may comprise two or more expression cassettes, vectors viral particles or eukaryotic host cells, which may differ by the nature (i) of the control sequence and/or (ii) of the gene of interest and/or (iii) of the vector backbone.
[0150] The composition according to the invention may be manufactured in a conventional manner for a variety of modes of administration including systemic, topical and localized administration (e.g. topical, aerosol, instillation, oral). For systemic administration, injection is preferred, e.g. subcutaneous, intradermal, intramuscular, intravenous, intraperitoneal, intrathecal, intracardiac (such as transendocardial and pericardial), intratumoral, intravaginal, intrapulmonary, intranasal, intratracheal, intravascular, intraarterial, intracoronary or intracerebroventricular. Intramuscular constitutes the preferred mode of administration. The administration may take place in a single dose or a dose repeated one or several times after a certain time interval. The appropriate administration route and dosage may vary in accordance with various parameters, as for example, the condition or disease to be treated, the stage to which it has progressed, the need for prevention or therapy and the therapeutic gene to be transferred. As an indication, a composition based on viral particles may be formulated in the form of doses of between 10
[0151] Moreover, the composition of the present invention can further comprise a pharmaceutically acceptable carrier for delivering said expression cassette, vector, viral particle or eukaryotic host cell into a human or animal body. The carrier is preferably a pharmaceutically suitable injectable carrier or diluent which is non-toxic to a human or animal organism at the dosage and concentration employed (for examples, see Remington's Pharmaceutical Sciences, 16
[0152] In addition, the composition according to the present invention may include one or more <<stabilizing>> additive(s), capable of preserving its degradation within the human or animal and/or of improving uptake into the host cell. Such additives may be used alone or in combination and include hyaluronidase (which is thought to destabilize the extra cellular matrix of the host cells as described in WO98/53853), chloroquine, protic compounds such as propylene glycol, polyethylene glycol, glycerol, ethanol, 1-methyl L-2-pyrrolidone or derivatives thereof, aprotic compounds such as dimethylsulfoxide (DMSO), diethylsulfoxide, di-n-propylsulfoxide, dimethylsulfone, sulfolane, dimethyl-formamide, dimethylacetamide, tetramethylurea, acetonitrile (see EP 890 362), cytokines, especially interleukin-10 (IL-10) (PCT/EP/99 03082), nuclease inhibitors such as actin G (WO99/56784) and cationic salts such as magnesium (Mg