Plaque It!
Sponsored by: Flash of Genius |
| 4520113 | Serological detection of antibodies to HTLV-III in sera of patients with AIDS and pre-AIDS conditions | May, 1985 | Gallo | 436/504 |
| 4708818 | Human immunodeficiency viruses associated with Acquired Immune Deficiency Syndrome (AIDS), a diagnostic method for AIDS and pre-AIDS, and a kit therefor | November, 1987 | Montagnier | 435/5 |
| 4716102 | Purified AIDS-associated virus ARV-2 | December, 1987 | Levy | 435/5 |
| CA443605 | December, 1983 | |||
| DE3587394 | June, 1993 | |||
| EP0020251 | December, 1980 | |||
| EP0060057 | September, 1982 | Expression of polypeptides in yeast. | ||
| EP0062574 | October, 1982 | Virus protein synthesis. | ||
| EP0073635 | March, 1983 | Expression vectors | ||
| EP0088632 | September, 1983 | Expression, processing and secretion of heterologous protein by yeast. | ||
| EP0116201 | August, 1984 | Secretory expression in eukaryotes | ||
| EP0136798 | April, 1985 | Titre plate and assay kit for detection of antibodies in human serum and their production and use. | ||
| EP0138667 | April, 1985 | Method for the diagnosis of lymphadenopathy and acquired immune depression syndrome. | ||
| EP0139216 | May, 1985 | Adult T cell leukemia virus antigen peptide. | ||
| EP0152030 | August, 1985 | Adult T cell leukemia virus antigen polypeptide. | ||
| EP0165120 | December, 1985 | Lymphadenopathy-associated retrovirus adapted to continuous lymphoblastoid-B cell lines capable of continuously producing said retrovirus, and process for preparing the cell lines. | ||
| EP0173529 | March, 1986 | Molecular clones of the genome of HTLV-III. | ||
| EP0178978 | April, 1986 | Cloned DNA sequences, hybridizable with genomic RNA of lymphadenopathy-associated virus (LAV) | ||
| EP0181150 | May, 1986 | Recombinant proteins of viruses associated with lymphadenopathy syndrome and/or acquired immune deficiency syndrome | ||
| EP0185444 | June, 1986 | Cloning and expression of HTLV-III DNA. | ||
| EP0187041 | July, 1986 | Fusions of AIDS-related polypeptides | ||
| EP0201540 | November, 1986 | ENVELOPE ANTIGENS OF LYMPHADENOPATHY ASSOCIATED VIRUS AND THEIR APPLICATIONS. | ||
| GB2104902 | March, 1983 | |||
| WO/1984/023659 | September, 1984 | |||
| WO/1984/016013 | October, 1984 | |||
| WO/1984/029099 | November, 1984 | |||
| WO/1985/001473 | January, 1985 | ELASTIC BAND SEALER APPARATUS AND METHOD FOR FORMING A CONTINUOUS NON-OVERLAPPING BAND | ||
| WO/1985/004897 | November, 1985 | METHOD AND CELL LINE FOR CONTINUOUS PRODUCTION OF RETROVIRUSES (HTLV-III) RELATED TO AIDS | ||
| WO/1985/004903 | November, 1985 | ISOLATION OF PROTEINS OF HTLV-III, SEROLOGICAL DETECTION OF ANTIBODIES TO HTLV-III IN SERA OF PATIENTS WITH AIDS AND PRE-AIDS CONDITIONS, AND DETECTION OF HTLV-III INFECTION BY IMMUNO-ASSAYS USING HTLV-III AND ITS PROTEINS | ||
| WO/1986/002383 | April, 1986 | ENVELOPE ANTIGENS OF LYMPHADENOPATHY ASSOCIATED VIRUS AND THEIR APPLICATIONS | ||
| WO/1986/006414 | November, 1986 | SYNTHETIC ANTIGENS FOR THE DETECTION OF AIDS-RELATED DISEASE | ||
| ZA847005 | September, 1984 |
This application is a divisional, of application Ser. No. 08/089,407, filed Jul. 8, 1993 now U.S. Pat. No. 7,273,695, which is a continuation of application Ser. No. 07/931,154, filed Aug. 17, 1992 now abandoned, which is a continuation of application Ser. No. 07/138,894, filed Dec. 24, 1987, now U.S. Pat. No. 5,156,949.
This application is a continuation-in-part of U.S. patent application Ser. No. 06/773,447, filed 6 Sep. 1985, which is a continuation-in-part of U.S. patent application Ser. No. 06/696,534, filed 30 Jan. 1985, now abandoned, which is a continuation-in-part of U.S. patent application Ser. No. 06/667,501, filed 31 Oct. 1984, now abandoned. The disclosures of the above application are incorporated herein by reference.
The present invention is directed to nucleotide sequences, such as DNA, encoding human immunodeficiency virus polypeptides, the use of such nucleotide sequences in diagnostic procedures and in the production of recombinant protein, as well as the use of such proteins in diagnostic, prophylactic, and therapeutic applications.
Acquired immune deficiency syndrome (AIDS) is now recognized as one of the greatest health threats facing modern medicine. There is, as yet, no cure for this almost invariably fatal disease. This state of affairs has made the prevention of the disease an extremely high priority in the medical community. An individual who is infected with human immunodeficiency virus (HIV), the etiologic agent of AIDS, can transmit the disease, and yet remain asymptomatic for many years. The ability to accurately screen large numbers of asymptomatic individuals (e.g., healthy appearing blood donors) for HIV infection is of great importance. Furthermore, the development of a vaccine would be particularly desirable, since it would afford some protection against transmission of AIDS by individuals who either are not detected by a diagnostic test, or evade such a test.
In 1983-1984, three groups independently identified the suspected etiological agent of AIDS. See, e.g., Barre-Sinoussi et al. (1983) Science 220:868-871; Montagnier et al., in Human T - Cell Leukemia Viruses (Gallo, Essex & Gross, eds., 1984); Vilmer et al. (1984) The Lancet 1:753; Popovic et al. (1984) Science 224:497-500; Levy et al. (1984) Science 225:840-842. These isolates were variously called lymphadenopathy-associated virus (LAV), human T-cell lymphotropic virus type III (HTLV-III), or AIDS-associated retrovirus (ARV). All of these isolates are strains of the same virus, and were later collectively named human immunodeficiency virus (HIV). With the isolation of a related AIDS-causing virus, the strains originally called HIV are now termed HIV-1 and the related virus is called HIV-2. See, e.g., Guyader et al. (1987) Nature 326:662-669; Brun-Vezinet et al. (1986) Science 233:343-346; Clavel et al. (1986) Nature 324:691-695.
Initially, HIV was propagated in culture in human mitogen-activated T cells. This method, however, could not produce the large quantities of virus required for serology assays on the scale required to protect public health and safety. It was not until immortalized cell lines capable of becoming chronically infected in vitro were discovered that HIV could be produced in any substantial quantities. See, e.g., Montagnier et al. (1984) Science 225:63-66; Levy et al., supra; Popovic et al., supra. The ability to grow the virus in culture led to the development of immunoassays for the detection of anti-HIV antibodies in the blood of patients suspected of having been infected, as well as for screening blood donors. See, e.g., Schupbach et al. (1984) Science 224:503-505; Sarngadharan et al. (1984) Science 224:506-508; Feorino et al. (1984) Science 225:69-72; Kalyanaraman et al. (1984) Science 225:321-323; Culliton et al. (1984) Science 226:1128-1131; Groopman et al. (1984) Science 226:447-449; Ho et al. (1984) Science 226:451-453; U.S. Pat. No. 4,520,113.
Due to the great hazard of cultivating HIV in vitro, the number of facilities and individuals capable of working with the virus is necessarily limited. Furthermore, while tissue culture may provide viral polypeptides suitable for use in diagnostic assays, it is highly undesirable to employ polypeptides produced by tissue culture in vaccine compositions due to the risk of infectivity posed by live, intact virus.
While production of viral polypeptides by recombinant means could be considered to be a solution to the problems described above, the production of recombinant proteins was not possible prior to the present invention. For example, HIV nucleotide sequences were not available and sequenced so as to enable the production of recombinant proteins. Even more importantly, it was unknown whether recombinantly produced viral protein would be sufficiently similar in antigenic properties to native HIV polypeptides so as to be generally useful in diagnostic assays or vaccine production. In addition, homology between the genome of HIV and human T-cell leukemia virus type I and type II (HTLV-I and -II) had been reported. See, e.g., Arya et al. (1984) Science 225:927-930. Thus, it was unclear that sufficiently unique epitopes of HIV could be produced by recombinant means to distinguish HIV from HTLV-I or HTLV-II. Furthermore, it was unclear prior to the present invention whether the various HIV isolates possessed sufficiently related epitopes so that a recombinant polypeptide based on one isolate could be useful in a general diagnostic assay or vaccine composition.
Prior to the present invention, therefore, recombinant HIV polypeptides could not be produced and it was not clear that such polypeptides would be generally useful in diagnostic, prophylactic, or therapeutic methods or products.
Nucleotide sequences and expression of nucleotide sequences are provided for detecting the presence of complementary sequences associated with a retroviral etiologic agent (HIV, e.g., HIV-1 or -2) for lymphadenopathy syndrome (LAS), acquired immune deficiency syndrome (AIDS) or AIDS-related complex (ARC), and for producing polypeptides. The single-stranded sequences are at least 20, more usually of at least about 50 nucleotides in length, and may find use as probes. The double-stranded sequences may find use as genes coding for expression of polypeptides, either fragments or complete polypeptides expressed by the virus or fused proteins, for use in diagnosis of HIV infection or evaluating stage of infection, the production of antibodies to HIV, and the production of vaccines. Based on the nucleotide sequences, synthetic peptides may also be prepared.
Specific aspects of the invention include:
1. A DNA construct comprising a replication system recognized by a unicellular microorganism and a DNA sequence coding for at least 20 bp of a human immunodeficiency virus (HIV) genome, said replication system being a non-HIV replication system;
2. A DNA construct comprising a replication system recognized by a unicellular microorganism and a DNA sequence of at least about 21 bp having an open reading frame and having a sequence substantially complementary to a sequence found in the gag, env, or pol region of an HIV, coding for a polypeptide which is immunologically non-cross-reactive with HTLV-I and HTVL-II, and reactive with an HIV;
3. A restriction endonuclease fragment of at least about 1.5 kbp derived from restriction enzyme digestion by at least one restriction endonuclease of a DNA sequence coding for an HIV of the class HIV-1;
4. A DNA sequence comprising a fragment of at least about 20 bp, wherein the strands are complementary to a restriction endonuclease fragment described in 3 above, said sequence duplexing with an HIV nucleic acid sequence and not duplexing with HTLV-I or HTLV-II under comparable selective hybridization conditions;
5. A method for detecting the presence of an HIV nucleic acid sequence present in a nucleic acid sample obtained from a physiological sample, which comprises:
(a) combining said nucleic acid sample with a single-stranded nucleic acid sequence of at least about 20 bases complementary to a sequence in said HIV and non-cross-reactive with HTLV-I and -II under conditions of predetermined stringency for hybridization; and
(b) detecting duplex formation between said DNA sequence and nucleic acid present in said sample;
6. A method for cloning DNA specific for an HIV, which comprises growing a unicellular microorganism containing the above-described DNA construct, whereby said DNA sequence is replicated;
7. A method for producing an expression product of HIV which comprises:
(a) transforming a unicellular microorganism host with a DNA construct having transcriptional and translational initiation and termination regulatory signals functional in said host and an HIV DNA sequence of at least 21 bp having an: open reading frame and under the regulatory control of said signals; and
(b) growing said host in a nutrient medium, whereby said expression product is produced;
8. A method for producing an expression product of HIV which comprises growing mammalian host cells having a DNA construct comprising transcriptional and translational initiation and termination regulatory signals functional in said host cells and a DNA sequence of at least 21 bp and less than the whole HIV genome, said sequence having an open reading frame and an initiation codon at its 5′-terminus and under the transcriptional and translational control of said regulatory signals, whereby a polypeptide encoded by said sequence is expressed;
9. A method of detecting antibodies to HIV in a sample suspected of containing said antibodies comprising:
10. Recombinant HIV polypeptides including, but not limited to:
11. An article of manufacture for use in an assay for anti-HIV antibodies comprising at least one of the above-described HIV polypeptides bound to a solid support.
12. A vaccine composition, and a method of producing antibodies in a mammal comprising administering to said mammal said vaccine composition wherein the vaccine composition comprises an antigenically effective amount of a recombinant HIV polypeptide.
Other embodiments will also be apparent from the description below.
FIG. 1 is a restriction map of proviral DNA from HIV strain ARV-2.
FIGS. 2 and 3 are restriction maps of recombinant λ phages containing ARV-2 sequences.
FIG. 4 is a complete nucleotide sequence of ARV-2, derived from partial sequences of several ARV clones. Corresponding amino acid sequences are indicated for the open reading frames of the individual genes.
FIG. 5 is the nucleotide sequence of ARV-2(9B). The amino acid sequences for the products of the gag, pol, and env genes are indicated. The U3, R, and U5 regions of the LTRs are also designated. The cap site is position +1. The nucleotides at the beginning of each line are numbered, and the amino acids at the end of each line are indicated. FIG. 5 herein shows the same sequence as that in FIG. 5 of both U.S. Ser. No. 06/773,447 (filed 6 Sep. 1985) and U.S. Ser. No. 06/696,534 (filed 30 Jan. 1985), the nucleotides in the figure of the earlier applications being numbered from the beginning of the integrated sequences.
FIG. 6 is a flow diagram showing the procedure for making the plasmid of pSV-7c/env, an expression vector for ARV-2-env gene.
FIG. 7 is a flow diagram showing the procedures for making the plasmids pGAG25-10 and pGAG41-10.
FIG. 8 is the nucleotide sequence of the p25gag gene cloned in plasmid pGAG25-10 and the amino acid sequence encoded by that gene.
FIG. 9 is the coding strand of the nucleotide sequence cloned in pGAG41-10 for producing the fusion protein p41gag and the corresponding amino acid.
FIG. 10 is a nucleotide sequence coding for p16gag protein that was cloned into plasmid ptac5 to make an expression plasmid for producing p16gag in bacteria.
FIG. 11 is a nucleotide sequence that encodes ARV-2 env protein that was used to prepare plasmid pDPC303.
FIG. 12 is a nucleotide sequence that encodes ARV-2 p31 protein and is contained in plasmid pTP31.
FIG. 13 is a map of the ARV env gene showing the regions env-1, env-2, env-3, env-4, and env-5.
FIG. 14 is a restriction map of plasmid pDM15, which was used to construct S. cerevisiae strain JSC302.
FIG. 15 is the synthetic nucleotide sequence env-5b, which encodes the amino acid sequence of the ARV env-5 region.
FIG. 16 is the results of an indirect ELISA in which an AIDS patient's serum (⋄) was titrated against microtiter plates coated with recombinant polypeptides from env regions. A pool of serum samples from random blood donors was used as a control (∘). Panel A shows the results for purified, recombinant env-2. Panel B shows the results with purified, recombinant env-5b. The insert in each panel shows a Coomassie-stained gel (lane 1) and an immunoblot with the AIDS patient's serum (lane 2) of the purified antigens used in these ELISAs.
FIG. 17 shows the results of an ELISA, employing recombinant env-2 (top panel) and env-5b (bottom panel) polypeptides, ruin on seronegative blood donors.
FIG. 18 shows the results of an ELISA, employing recombinant env-2 (top panel) and env-5b (bottom panel) polypeptides, run on HIV seropositive patients, including those diagnosed as having AIDS or AIDS-related complex (ARC), as well as those having contacts with AIDS patients.
FIG. 19 shows the results of ELISAs used to measure antibody titers in the AIDS seropositive patients of FIG. 18.
FIG. 20 is a flow diagram showing the procedure for making plasmid pAB24/RT4, an expression vector for HIV reverse transcriptase.
FIG. 21 is a flow diagram showing the construction of pCl/1-p25-ADH-GAP, a yeast expression vector for p25gag.
FIG. 22 shows the DNA and amino acid sequences of the p25gag structural region in pCl/1-p25-ADH-GAP.
FIG. 23 is a flow diagram showing the construction of pCl/1-pSP31-ADH-GAP (pCl/1-pSP31-GAP-ADH2), a yeast expression vector for a SOD/p31pol fusion protein.
FIG. 24 shows the DNA and amino acid sequences of the SOD/p31pol structural region in pCl/1-pSP31-ADH-GAP.
FIG. 25 is a flow diagram showing the construction of pSOD/env5b from pSODCF2 and a synthetic env-5b sequence.
FIG. 26 shows the nucleotide sequence and putative amino acid sequence of the SOD/env-4 fusion construct in pBS24/SOD-SFenv4.
FIG. 27 is a restriction map for yeast shuttle vector pAB24.
FIG. 28 is a restriction map for yeast expression vector pAB-GAP-env2.
FIG. 29 is a restriction map of pCMV6a.
FIG. 30 is an immunoblot performed with AIDS patient serum on env-1 (lanes A, B), env-2 (lanes C, D) and env-3 (lanes E, F). Lanes A, C and E are immunoblots with normal sera, while lanes B, D and F are immunoblots with serum from an AIDS patient.
FIG. 31 shows an ELISA survey for p31 antibodies. Panel (a) shows the results for random, normal blood donors. Panel (b) shows the results for virus-seropositive individuals. The shaded bars are for sera that scored negative in the virus immunoblot assays.
Nucleotide sequences are provided which are at least in part specific for sequences present in HIV retroviruses, which are the etiological agent of AIDS. HIV is an art-recognized family of viruses, e.g., HIV-1 and HIV-2. The original isolates of these viruses were variably referred to as lymphadenopathy virus (LAV) [Barre-Sinoussi et al. (1983) Science 220:868-871], human T-cell lymphotropic virus-III (HTLV-III) [Popovic et al. (1984) Science 224:497] and AIDS-associated retrovirus (ARV) [Levy et al. (1984) Science 225:840-842]. Applicants originally termed these isolates “human T-cell lymphotropic retrovirus (hTLR)”. Subsequently, the name HIV has been given to these retroviruses by an international committee. Thus, HIV (and particularly HIV-1) shall be used herein as an equivalent to hTLR. Examples of HIV-1 were previously called LAV, ARV and HTLV-III. Among the identifying characteristics of HIV retroviruses are (i) being an etiologic of AIDS, (ii) being cytopathic in vitro, (iii) having a tropism for CD4-bearing cells, and (iv) having elements trans-activating the expression of viral genes acting at the LTR level.
New HIVs may be shown to be of the same class by being similar in their morphology, serology, reverse transcriptase optima, cytopathology, amino acid sequence, and nucleotide sequence as known HIV strains. Coffin et al. (1986) Nature 321:10. Within different HIV-1 isolates, for example, the gag and pol proteins shows about 90-95% homology at the amino acid level, and the env precursor shows about 65-85% homology (most of the variations being confined to certain “hypervariable” regions), with all 23 env cysteines being conserved. Alizon et al. (1986) Cell 46:63-74. HIV-2, however, is a new class of the HIV family that is not a strain of HIV-1 according to the recommended criteria of the international taxonomy committee. See, e.g., Guyader et al. (1987) Nature 326:662-669. HIV-1 and HIV-2 show an overall approximate amino-acid homology of about 42%, with about 60% amino acid homology for the gag and pol proteins, and about 40% for the env precursor.
The nucleotide sequences of this invention may be the entire sequence of the retrovirus and/or the provirus or may be fragments thereof based on restriction enzyme digestion of HIV (provirus and/or other dsDNA homologous to retrovirus RNA), which fragments may be all or part of the LTR, gag, pol, env, and/or other open reading frames, such as Q (or sor), R, tat, and art (or trs) (sometimes referred to by the designation “orf” herein), untranslated regions intermediate coding regions, and fragments and combinations thereof. The minimum size single-stranded fragment will be at least 20 bases and usually at least 50 bases and may be 100 bases or more, where the entire HIV is about 9.5 kb. The sequence may be obtained as a fragment from the HIV or be synthesized.
The fragments can be used in a wide variety of ways, depending upon their size, their natural function, the use for which they are desired, and the degree to which they can be manipulated to modify their function. Thus, sequences of at least 20 bases, more usually at least 50 bases, and usually not exceeding about 1000 bases, more usually not exceeding about 500 bases, may serve as probes for detection of the presence of HIV in a host cell, including the genome, or in a physiological fluid, such as blood, lymph, saliva, spinal fluid, or the like. These sequences may include coding and/or non-coding sequences. The coding sequences may involve the gag, pol, env or other open reading frames, either in whole or in part. Where splicing occurs between, for example, a region in the LTR sequence and a coding sequence in another region, the joined DNA from the provirus, linked by in vitro manipulation, or from cDNA or cloned cDNA, may be employed.
It is found that HIV is highly polymorphic. Therefore, not only may DNA prepared from various isolates vary by one or more point mutations, but even the passage of a single isolate may result in variation in the progeny. Thus, where the nucleotide sequences are used for duplex formation, hybridization, or annealing, for example, for diagnosis or monitoring of the presence of the virus in vivo or in vitro , complete base pairing will not be required. One or more mismatches are permissible. To ensure that the presence of one or a few, usually not more than three, mismatches still allows for stable duplexes under the predetermined stringency of hybridizing or annealing conditions, probes will normally be greater than 20 bases, preferably at least about 50 bases or more.
The method of detection will involve duplex formation by annealing or hybridization of the oligonucleotide probe, either labeled or unlabeled, depending upon the nature of the detection system, with the DNA or RNA of a host suspected of harboring the provirus or virus. A physiological sample may include tissue, blood, saliva, serum, etc. Particularly, blood samples will be taken, more particularly blood samples containing peripheral mononuclear cells, which may be lysed and the DNA or RNA isolated in accordance with known techniques. Cells may be cultured to amplify virus in vitro, or treated to stimulate PBLs, thereby producing more virus. Conveniently, the cells are treated with a detergent, nucleic acids are extracted with organic solvents and precipitated in an appropriately buffered medium, and the DNA or RNA isolated. Depending upon the particular protocol, the DNA may be fragmented by mechanical shearing or restriction endonuclease digestion.
The sample polynucleotide mixture obtained from the human host can be bound to a support or may be used in solution depending upon the nature of the protocol. The well-established Southern technique [(1975) J. Mol. Biol. 98:503] may be employed with denatured DNA, by binding the single-stranded fragments to a nitrocellulose filter. Alternatively, RNA can be blotted on nitrocellulose following the procedure described by Thomas, (1980) Proc. Natl. Acad. Sci . ( USA ) 77:5201. Desirably, the fragments will be electrophoresed prior to binding to a support, so as to be able to select for various sized fractions. Other techniques may also be used such as described in Meinkoth & Wahl, (1984) Anal. Biochem. 138:267-284.
The oligonucleotide probe may be DNA or RNA, usually DNA. The oligonucleotide sequence may be prepared synthetically or in vivo by cloning, where the complementary sequence may then be excised from the cloning vehicle or retained with the cloning vehicle. Various cloning vehicles are available, such as pBR322, M13, Charon 4A, or the like, desirably a single-stranded vehicle, such as M13.
As indicated, the oligonucleotide probe may be labeled or unlabeled. A wide variety of techniques exist for labeling DNA and RNA. As illustrative of such techniques, is radiolabeling using nick translation, tailing with terminal deoxytransferase, or the like, where the bases which are employed carry radioactive 32 P. Alternatively, radioactive nucleotides can be employed where carbon, nitrogen or other radioactive atoms may be part of the nucleoside structure. Other labels which may be used include fluorophores, enzymes, enzyme substrates, enzyme cofactors, enzyme inhibitors, or the like. Alternatively, instead of having a label which provides for a detectable signal by itself or in conjunction with other reactive agents, ligands can be used to which receptors bind, where the receptors are labeled such as with the above-indicated labels, which labels provide detectable signals by themselves or in conjunction with other reagents. See, e.g., Leary et al. (1983) Proc. Natl. Acad. Sci . ( USA ) 80:4045-4049; Cosstick et al. (1984) Nucleic Acids Res. 12:1791-1810; PCT Pub. No. WO 83/02277.
The oligonucleotide probes are hybridized with the denatured human host nucleic acid, substantially intact or fragmented, or fractions thereof, under conditions of predetermined stringency. The stringency will depend upon the size and composition of the probe, the degree of mismatching, the desired cross reactivity with other strains of the subject HIV, and the like. Usually, an organic solvent such as formamide will be present in from about 30 to 60 vol percent, more usually from about 40 to 50 vol percent, with salt concentration from 0.5 to 1 M. Temperatures will generally range from about 30° C. to 65° C., more usually from about 35° C. to 50° C. The times for duplex formation may be varied widely, although minimum times will usually be at least about one hour and not more than about 72 hours, the time being selected in accordance with the amount of DNA or RNA available, the proportion of DNA or RNA as compared to total DNA or RNA, or the like. Stringency may also be modified by ionic strength and temperature. The hybridization and annealing can be carried out in two stages: a first stage in a hybridization medium; and, a second stage, involving washings at a higher stringency, by varying either or both temperature and ionic strength.
As understood in the art, the term “stringent hybridization conditions” as used herein refers to hybridization conditions which allow for closely related nucleic acid sequences to duplex (e.g., greater than about 90% homology), but not unrelated sequences. The appropriate conditions can be established by routine procedures, such as running Southern hybridization at increasing stringency until only related species are resolved and the background and/or control hybridization has disappeared (i.e., selective hybridization).
The oligonucleotide probe may be obtained in a variety of ways. Viral RNA from HIV may be isolated from the supernatant of cells infected (e.g., HIV-1 or HIV-2) in culture, and the high molecular weight materials precipitated and the DNA removed, for example, employing DNase. The residual RNA may then be divided into molecular weight fractions, where the fraction associated with the molecular weight of the retrovirus is isolated. This fraction will be from about 8 to 10 kb viral RNA. The viral RNA may be further purified by conventional techniques, such as electrophoresis, chromatography, or the like.
Nucleotide probes may be prepared employing reverse transcriptase using primers, e.g., random primers or specific primers. The cDNA may be prepared employing a radioactive label, e.g., 32 P, present with one or more of the dNTPs. Reverse transcription will provide various sized fragments depending on the primers, the efficiency of transcription, the integrity of the RNA, and the like. The resulting cDNA sequences may be cloned, separated and used for detection of the presence of a provirus in the human genome or for isolation of pure retroviral RNA.
Using specific primers of 10 to 20 bases, or more, HIV may be reverse transcribed and the resulting ss DNA used as a probe specific for the region which hybridized to the primer. By employing one or more radionucleotide-labeled bases, the probes will be radiolabeled to provide a detectable signal. Alternatively, modified bases may be employed which will be randomly incorporated into the probe and may be used to provide for a detectable signal. For example, biotin-modified bases may be employed. The resulting biotin-containing probe may then be used in conjunction with labeled avidin to provide for a detectable signal upon hybridization and duplex formation.
Of particular interest is employing the region containing the gag or env genes, where fragments may be employed to screen proviral DNA in infected cells, to determine the identity of retroviruses associated with AIDS or LAS obtained from different human hosts. Probes providing for the desired degree of cross-reactivity or absence of cross-reactivity may then be prepared in a form, either labeled or unlabeled, useful for diagnostic assays employing hybridization and annealing.
The double-stranded DNA sequences, either isolated and cloned from proviral DNA or cDNA or synthesized, may be used for expression of polypeptides which may be a precursor protein subject to further manipulation by cleavage, or a complete mature protein or fragment thereof. The smallest sequence of interest, so as to encode an amino acid sequence capable of specific binding, for example, to a receptor or an immunoglobulin, will be 21 bp, usually at least 45 bp, exclusive of the initiation codon. The sequence may code for any greater portion of or the complete polypeptide, or may include flanking regions of a precursor polypeptide, so as to include portions of sequences or entire sequences coding for two or more different mature polypeptides. The sequence will usually be less than about 5 kbp, more usually less than about 3 kbp.
The sequences having open reading frames as numbered in FIG. 4 are the genes beginning at nucleotide (nt) 838 to 2298 (gag); 2347 to 2825 (small polypeptide between gag and pol regions); 2965 to 5103 (pol); and 6236 to 8800 (env). It is to be understood that the above sequences may be spliced to other sequences present in the retrovirus, so that the 5′-end of the sequence may not code for the N-terminal amino acid of the expression product. The splice site may be at the 5′-terminus of the open reading frame or internal to the open reading frame. The initiation codon for the protein may not be the first codon for methionine, but may be the second or third methionine, so that employing the entire sequence indicated above may result in an extended protein. However, for the gag and env genes there will be proteolytic processing in mammalian cells, which processing may include the removal of extra amino acids.
In isolating the different domains the provirus may be digested with restriction endonucleases, the fragments electrophoresed and fragments having the proper size and duplexing with a probe, when available, are isolated, cloned in a cloning vector, and excised from the vector. The fragments may then be manipulated for expression. Superfluous nucleotides may be removed from one or both termini using Bal31 digestion. By restriction mapping, convenient restriction sites may be located external or internal to the coding region. Primer repair or in vitro mutagenesis may be employed for defining a terminus, for insertions, deletion, point or multiple mutations, or the like, where codons may be changed, either cryptic or changing the amino acid, restriction sites introduced or removed, or the like. Where the gene has been truncated, the lost nucleotides may be replaced using an adaptor. Adaptors are particularly useful for joining coding regions to ensure the proper reading frame.
The env domain of HIV can be obtained by digestion of the provirus with EcoRI and KpnI and purification of a 3300 base pair (bp) fragment, which fragment contains about 400 bp of 5′ non-coding and about 200 bp of 3′ non-coding region. Three different methionines coded for by the sequence in the 5′ end of the open reading frame may serve as translational initiation sites.
The open reading frame of the env gene of ARV-2 has a coding capacity of 863 amino acids. Portions of the env gene coding for the polypeptides shown in FIG. 5 were produced in S. cerevisiae using yeast expression vectors. See FIG. 13. Env-2, encompassing amino acid residues 26 to 510, corresponds to the major portion of the mature envelope glycoprotein, gp120, that is external to viral and infected cell membranes. Env-1 includes amino acid residues 26 to 276 and represents approximately the amino-terminal half of the gp120 polypeptide. Env-3, stretching between amino acid residues 529 to 855, corresponds to the portion of the env gene which encodes gp41, the viral glycoprotein that spans membranes and serves as an anchor for the envelope glycoprotein complex. Env-4, amino acid residues 272 to 509, correspond to the carboxyl terminal half of gp120. Env-5b, encompassing amino acid residues 557 to 677, corresponds to the region of gp41 stretching between the two hydrophobic domains. These various recombinant portions of the env domain are valuable in diagnostic assays for HIV infections, particularly env-2 and env-5b.
Digestion of proviral sequences with SacI and EcoRV provides a fragment of about 2300 bp which contains the gag domain and a second small open reading frame towards the 3′ end of the gag region. The gag domain is about 1500 bp and codes for a large precursor protein which is processed to yield proteins of about 25,000 (p25), 16,000 (p16) and 12,000 (p12) daltons. Digestion with SacI and BglII may also be used to obtain exclusively the gag domain with p12, p25 and partial p16 regions.
Digestion of the previous with KpnI and SstI provides a fragment containing the portion of the pol domain that encodes p31. Native HIV reverse transcriptase (RT) is purified from virions in p66 and p51 forms. Both of these forms have identical N-termini, apparently differing at the C-termini. RT is encoded within a domain of the viral pol gene. The mature enzyme is derived by proteolytic processing from a large precursor polypeptide whose cleavage is thought to be mediated by a viral protease. This protease, by analogy with other retroviruses, also cleaves the gag gene precursor. For direct expression of the RT domain in yeast, the N- and C-termini of the mature protein were estimated by drawing on homology comparisons with the amino acid sequences of pol gene products of other retroviruses. Precise amino acid choices for termini were based on the target specificities of retroviral proteases, including the AIDS virus protease, from known gag subunit sequences. Accordingly, the Phe-Pro at positions 155 and 156 of the ARV-2 pol open reading frame and the Val-Pro at positions 163 and 164 were selected as likely N-termini. A likely C terminal processing site was estimated at the Val-Pro of positions 691 and 692. See FIG. 5. Recombinant RT is valuable in diagnostic assays for HIV infections.
The polypeptides which are expressed by the above DNA sequences may find use in a variety of ways. The polypeptides or immunologically active fragments thereof, may find use as diagnostic reagents, being used in labeled or unlabeled form or immobilized (i.e., bound to a solid surface), as vaccines, in the production of monoclonal antibodies, e.g., inhibiting antibodies, or the like.
The DNA sequences may be joined with other sequences, such as viruses, e.g., vaccinia virus or adenovirus, to be used for vaccination. Particularly, the DNA sequence of the viral antigen may be inserted into the vaccinia virus at a site where it can be expressed, so as to provide an antigen of HIV recognized as an immunogen by the host. The gag, pol, or env genes or fragments thereof that encode immunogens could be used.
Another alternative is to join the gag, env, or pol regions or portions thereof to HBsAg gene or pre-S HBsAg gene or immunogenic portions thereof, which portion is capable of forming particles in a unicellular microorganism host, e.g., yeast or mammalian cells. Thus, particles are formed which will present the HIV immunogen to the host in immunogenic form, when the host is vaccinated with assembled particles.
As vaccines, the various forms of the immunogen can be administered in a variety of ways, orally, parenterally, intravenously, intra-arterially, subcutaneously, intramuscularly, or the like. Usually, these will be provided in a physiologically acceptable vehicle, generally distilled water, phosphate-buffered saline, physiological saline, buffers containing SDS or EDTA, and the like. Various adjuvants may be included, such as aluminum hydroxide, MTP in saline and Tween 80, and the dosages, number of times of administration and manner of administration determined empirically.
In order to obtain the HIV sequence (e.g., HIV-1 or HIV-2), virus can be pelleted from the supernatant of infected host cells. A 9 kb RNA species is purified by electrophoresis of the viral RNA in low-melting agarose gels, followed by phenol extraction. The purified RNA may then be used as a template with random primers in a reverse transcriptase reaction. The resulting cDNA is then screened for hybridization to polyA+ RNA from infected and uninfected cells, or to one of λ vectors containing HIV DNA disclosed herein. For the polyA+ RNA, hybridization occurring from infected, but not uninfected cells, is related to HIV.
Genomic DNA from infected cells can be digested with restriction enzymes and used to prepare a bacteriophage library. Based upon restriction analysis of the previously obtained fragments of the retrovirus, the viral genome can be partially digested with EcoRI and 9 kb-15 kb DNA fragments isolated and employed to prepare the library. The resulting recombinant phage. may be screened using a double-lift screening method employing the viral cDNA probe, followed by further purification, e.g., plaque-purification and propagation in large liquid cultures. From the library, the complete sequence of the virus can be obtained and detected with the previously described probe.
HIV DNA (either provirus or cDNA) may be cloned in any convenient vector. Constructs can be prepared, either circular or linear, where the HIV DNA, either the entire HIV or fragments thereof, may be ligated to a replication system functional in a microorganism host, either prokaryotic or eukaryotic cells (mammalian, yeast, arthropod, plant). Micro-organism hosts include E. coli, B. subtilis, P. aeruqenosa, S. cerevisiae, N. crassa , etc. Replication systems may be derived from ColE1, 2 mu plasmid, λ SV40, bovine papilloma virus, or the like, that is, both plasmids and viruses. Besides the replication system and the HIV DNA, the construct will usually also include one or more markers, which allow for selection of transformed or transfected hosts. Markers may include biocide resistance, e.g., resistance to antibiotics, heavy metals, etc., complementation in an auxotrophic host to provide prototrophy, and the like.
To produce recombinant polypeptides, expression vectors will be employed. For expression in microorganisms, the expression vector may differ from the cloning vector in having transcriptional and translational initiation and termination regulatory signal sequences and may or may not include a replication system which is functional in the expression host. The coding sequence is inserted between the initiation and termination regulatory signals so as to be under their regulatory control. Expression vectors may also include the use of regulable promoters, e.g., temperature-sensitive or inducible by chemicals, or genes which will allow for integration and amplification of the vector and HIV DNA such as tk, dhfr, metallothionein, or the like.
The expression vector is introduced into an appropriate host where the regulatory signals are functional in such host. The expression host is grown in an appropriate nutrient medium, whereby the desired polypeptide is produced and isolated from cells or from the medium when the polypeptide is secreted.
Where a host is employed in which the HIV transcriptional and translational regulatory signals are functional, then the HIV DNA sequence may be manipulated to provide for expression of the desired polypeptide in proper juxtaposition to the regulatory signals.
The polypeptide products can be obtained in substantially pure form, particularly free of debris from human cells, which debris may include such contaminants as proteins, polysaccharides, lipids, nucleic acids, viruses, bacteria, fungi, etc., and combinations thereof. Generally, the polypeptide products will have less than about 10-15 weight percent, preferably less than about 5 weight percent, of contaminating materials from the expression host. Depending upon whether the desired polypeptide is produced in the cytoplasm or secreted, the manner of isolation will vary. Where the product is in the cytoplasm, the cells are harvested, lysed, the product extracted and purified, using solvent extraction, chromatography, gel exclusion, electrophoresis, or the like. Where secreted, the desired product will be extracted from the nutrient medium and purified in accordance with the methods described above.
In many cases it will be desirable to express the recombinant HIV polypeptide as a fusion protein. This is particularly true with polypeptides such as p31pol and the transmembrane region of gp41env (env-5), to obtain improved levels of expression. The fusion proteins approach allows the addition of a signal sequence to the HIV polypeptide so that the product is secreted by the expression host. Generally, the DNA sequence for the HIV polypeptide is in the C-terminal portion of the fused gene, the heterologous sequence making up the N-terminal. The choice of the appropriate heterologous sequence for fusion to the HIV sequence is a matter of choice within the skill of the art. Preferred heterologous sequences include the N-termini of β-galactosidase and human superoxide dismutase. It is usually preferable that the heterologous sequence be non-immunogenic to humans. In one embodiment, however, two HIV sequences from different immunogenic domains of the virus, such as gag and env, are fused together. This produces a single fusion protein with the immunogenic potential of the two parent polypeptides.
The expression products of the env, gag, and pol genes and immunogenic fragments thereof having immunogenic sites may be used for screening antisera from patients' blood to determine whether antibodies are present which bind to HIV antigens. One or more of the antigens may be used in the assay. Preferred modes of the assay employ a combination of gag and env antigens or pol and env antigens. A combination of p25gag, p16gag, or p31pol and env antigens is particularly preferred. A wide variety of assay techniques can be employed, involving labeled or unlabeled antigens or immobilized antigens. The label may be fluorescers, radionuclides, enzymes, chemiluminescers, magnetic particles, enzyme substrates, cofactors or inhibitors, ligands, or the like.
A particularly convenient technique is to bind the antigen to a support that will bind proteins, such as the surface of an assay tube, a well of an assay plate, or a strip of material like nitrocellulose or nylon, and then contact the sample with the immobilized antigen. After washing the support to remove non-specifically bound antisera, labeled antibodies to human Ig are added and specifically bound label determined.
ELISA and “dot-blot” assays are particularly useful for screening blood or serum samples for anti-HIV antibodies. The ELISA assay uses microtiter trays having wells that have been coated with the antigenic HIV polypeptides(s). The wells are also typically post-coated with a nonantigenic protein to avoid nonspecific binding of antibodies in the sample to the well surface. The sample is deposited in the wells and incubated therein for a suitable period under conditions favorable to antigen-antibody binding.
Anti-HIV antibodies present in the sample will bind to the antigen(s) on the well wall. The sample is then removed and the wells are washed to remove any residual, unbound sample. A reagent containing enzyme-labeled antibodies to human immunoglobulin is then deposited in the wells and incubated therein to permit binding between the labeled anti-human Ig antibodies and HIV antigen-human antibody complexes bound to the well wall. Upon completion of the incubation, the reagent is removed and the wells washed to remove unbound labeled reagent. A substrate reagent is then added to the wells and incubated therein. Enzymatic activity on the substrate is determined visually or spectrophotometrically and is an indication of the presence and amount of anti-HIV antibody-containing immune complex bound to the well surface.
The “dot-blot” procedure involves using HIV antigen(s) immobilized on a piece or strip of bibulous support material, such as nitrocellulose filter paper or nylon membrane, rather than antigen-coated microtiter trays. The support will also be treated subsequently with a nonantigenic protein to eliminate nonspecific binding of antibody to the support. The antigen-carrying support is contacted with (e.g., dipped into) the sample and allowed to incubate therein. Again, any anti-HIV antibodies in the sample will bind to the antigen(s) immobilized on the support. After a suitable incubation period the support is withdrawn from the sample and washed in buffer to remove any unbound sample from the paper. The support is then incubated with the enzyme-labeled antibody to human Ig reagent for a suitable incubation period. Following treatment with the labeled reagent the support is washed in buffer, followed by incubation in the substrate solution. Enzymatic activity, indicating the presence of anti-HIV antibody-containing complexes on the support, causes color changes on the support which may be detected optically.
Either of these techniques may be modified to employ labels other than enzymes, or to detect non-human anti-HIV antibodies (e.g., primate). The reading or detection phases will be altered accordingly.
The antigenic HIV polypeptide may also be used as immunogens by themselves or joined to other antigens for the production of antisera or monoclonal antibodies which may be used for therapy or diagnosis. When used as immunogens, the HIV polypeptides can be prepared as vaccine compositions, as is known in the art. The immunoglobulins may be from any mammalian source, e.g., rodent, such as rat or mouse, primate, such as baboon, monkey or human, or the like. For diagnosis, the antibodies can be used in conventional ways to detect HIV in a clinical sample.
The following examples are offered for illustrative purposes only, and are not intended to limit the scope of the claims in any way. The examples are organized as follows:
The DNA and RNA sequences of HIV are provided, as well as fragments thereof, which find extensive use in the detection of the presence of HIV, for the expression of protein specific for HIV and the use of such proteins for the production of monoclonal antibodies for in vitro and in vivo use, in diagnostics, therapy, or the like. In addition, due to the observed polymorphism of HIV, probes are indicated which can be used to detect the presence of HIV or a particular polymorphism thereof. The probes are at least about 20 bases and will usually not be more than about 500 bases and may be in the gag, env or LTR region. Furthermore, a strategy is provided for analyzing the various polymorphisms, using restriction enzyme analysis, whereby different isolates can be related in accordance with different families.
1.1 Purification and Preparation of Viral RNA
HUT-78 cells infected with ARV-2 (ATCC Accession No. CRL 8597, deposited on Aug. 7, 1984) were obtained from Dr. Jay Levy, University of California, San Francisco. Cultures were grown for two weeks in RPMI medium with 10% fetal calf serum. Cells were removed by low-speed centrifugation (1,000×g for 10 min), and the resulting supernatants centrifuged at 2 Krpm for 1 h at 4° C. using a SW-28 rotor. The pellet, containing the virus, was resuspended in 10 mM Tris-HCl, pH 7.5 on ice. The resuspended pellet was treated with 10 μg of DNase (Boehringer-Mannhein) and was layered onto a linear sucrose gradient (15-50% in 10 mM Tris-HCl, pH 7.5, 1 mM EDTA, 20 mM NaCl). The gradient was spun at 34 Krpm for 4 h at 4° C., in SW-41 rotor. Five 2.5 ml fractions were collected and an aliquot of each was electrophoresed in a 1% agarose, 5 mM methyl mercury hydroxide gel [Bailey et al. (1976) Anal. Biochem. 70:75-85] to determine which contained the 9 kb viral RNA. The fraction containing the viral RNA (identified by gel analysis) was diluted to 10 ml in 10 mM Tris-HCl, pH 7.5, 1 mM EDTA and was centrifuged at 34 Krpm for 2 h at 4° C. The pellet was resuspended in 20 mM Tris-HCl, pH 7.6, 10 mM EDTA, 0.1% SDS, and 200 μg/ml proteinase K. Incubation was carried out for 15 min at room temperature. The mixture was extracted with phenol and the aqueous phase was made 400 mM NaCl and precipitated with ethanol. The pellet was resuspended in water and stored at −70° C.
To purify the viral RNA from the nucleic acid pellet obtained as described above, a sample was electrophoresed in a low-melting 1% agarose gel containing 5 mM Methyl mercury hydroxide. After electrophoresis, the gel was stained with 0.1% ethidium bromide and nucleic acid bands were visualized under UV light. The region corresponding to 9 kb was cut from the gel and the agarose was melted at 70° C. for 2 to 3 min in three volumes of 0.3 M NaCl, 10 mM Tris, pH 7.5, 1 mM EDTA. The mixture was extracted with an equal volume of phenol. The aqueous phase was reextracted with phenol and was precipitated with ethanol. The pellet was washed with cold 95% ethanol, air dried, resuspended in water and stored at −70° C. until use. One hundred ml of culture medium yielded 0.5 to 1 pg of purified RNA.
1.2. Synthesis of Labeled Viral Probe
A 32 P-labeled cDNA was made to the gel purified viral RNA using random primers (calf thymus primers) prepared as described in Maniatis et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratories 1982). The reaction mixture contained 2 μl of 0.5 M MgCl 2 ; 5 μl of 0.1 M dithiothreitol; 2.5 μl each of 10 mM DATP, 10 mM dGTP and 10 mM dTTP; 2.5 μl calf thymus primer (100A 260 /ml); 0.5 μg viral RNA; 5 μl of actinomycin D (200 μg/ml); 10 μl of 32 P-dCTP (>3000 Ci/mmole, 1 mCi/ml) and 1 μl of AMV reverse transcriptase (17 units/μl) in a 50 μl reaction volume. The reaction was incubated for 1 h at 37° C. The probe was purified away from free nucleotides by gel filtration using a Sephadex G50 column. The void volume was pooled, NaCl was added to a final concentration of 400 mM and carrier single-stranded DNA to 100 μg/ml, and the cDNA was precipitated with ethanol. The pellet was resuspended in water and incorporated 32 P counts were determined.
1.3. Detection of HIV RNA and DNA in Mammalian Cells
PolyA+ RNA was prepared from HUT-78 cells infected with ARV-2, ARV-3 or ARV-4 (three different isolates from three different AIDS patients) and from uninfected HUT-78 cells. The polyA+ RNA was electrophoresed on 1% agarose gels containing 5 mM methyl mercury hydroxide (Bailey et al. supra), was transferred to nitrocellulose filters, and hybridized with the homologous probe prepared as described in Section 1.2. Hybridizations were carried out in 50% formamide, 3×SSC at 42° C. Washes were at 50° C. in 0.2×SSC. A 9 kbp band was present in all three samples of infected HUT-78 cells. This band was absent in polyA+ from uninfected cells.
High molecular weight DNA (chromosomal) was prepared from cultures of HUT-78 cells infected with ARV-2 and from non-infected HUT-78 cells following the procedure of Luciw et al. (1984) Molec . & Cell Biol. 4:1260-1269. The DNA was digested with restriction enzyme(s), electrophoresed in 1% agarose gels and blotted onto nitrocellulose following the procedure described by Southern, (1975), supra. Blots were hybridized with the 32 P-labeled probe (10 6 cpm/blot) in a mixture containing 50% formamide, 3×SSC, 10 mM Hepes, pH 7.0, 100 μg/ml denatured carrier DNA, 100 μg/ml yeast RNA and 1×Denhardt's for 36 h at 42° C. Filters were washed once at room temperature in 2×SSC and twice at 42° C. in 0.2×SSC, 0.1% SDS. Filters were air dried and exposed to X-Omat film using an intensifying screen.
The homologous 32 P-probe to ARV-2 hybridized specifically to two bands in the DNA from infected cells restricted with SacI. These bands were absent when DNA of non-infected cells was used, indicating that the probe is hybridizing specifically to infected cells presumably to the provirus integrated in the chromosomal DNA. The molecular weight of the bands is approximately 5 kb and 3 kb.
In order to determine if different enzymes would cut the proviral sequence, several other restriction digestions of the cell DNA were carried out using EcoRI, SphI or KpnI or double digestions using two of them. Southern results show specific bands hybridizing when DNA of infected cells is used. FIG. 1 shows a schematic map of the positions of restriction enzyme sites in the proviral sequence, and indicates fragment sites.
1.4. Cloning of Proviral HIV DNA
High molecular weight cell DNA from infected HUT-78 cells was prepared following the procedure of Luciw et al., supra. The DNA was digested with EcoRI, which cuts once in the provirus, centrifuged in a sucrose gradient and fractions corresponding to 8-15 kb were pooled, dialyzed and concentrated by ethanol precipitation. The bacteriophage λ derivative cloning vector, EMBL-4 [Karn et al. (1983) Methods Enzymol. 101:3-19] was digested to completion with a mixture of EcoRI, BamHI and SalI restriction enzymes and the DNA then deproteinized by phenol-chloroform extraction, precipitated with cold ethanol and resuspended in ligation buffer. The EMBL-4 phage DNA and EcoRI digest of cellular DNA were mixed and ligated and the resultant recombinant phage genomes packaged in vitro . After phage infection of λ-sensitive E. coli (DP50supF), about 500,000 phage plaques were transferred onto nitrocellulose filters, DNA was fixed and the filters were screened with a homologous 32 P-probe prepared as described in Section 1.2. Eleven recombinant phage out of 500,000 phage annealed in the initial-double-lift screening method (Maniatis et al., supra) to viral cDNA probe, and these were further plaque-purified and propagated in large liquid cultures for preparation of recombinant DNA. Plaque-purified phage containing ARV DNA were propagated in liquid culture in E. coli DP50supF; phage particles were harvested and banded in CsCl gradients and recombinant phage DNA was prepared by phenol extraction followed by ethanol precipitation (Maniatis et al., supra). One microgram of purified phage DNA was digested with restriction enzymes, electrophoresed on 1% agarose gels, and visualized with ethidium bromide under ultraviolet light. The DNA from these gels was transferred to nitrocellulose and annealed with viral cDNA probe.
Described below is the analysis of the 11 recombinant phage DNA molecules utilizing restriction enzymes and viral cDNA probe. Two examples were selected for detailed description: λ-9B contained an insertion of full-length proviral DNA along with flanking cell sequences, and λ-7A harbored a full-length viral genome permuted with respect to the EcoRI site. Digestion of λ-9B DNA with SacI yielded viral DNA fragments of 3.8 kb and 5.7 kb (FIG. 2). EcoRI digestion of λ-9B produced virus containing DNA species at 6.4 kb and 8.0 kb; a double digest of SacI and EcoRI gave viral DNA fragments at 3.8 kb and 5.4 kb (FIG. 2). This pattern is consistent with that of a provirus linked to cell DNA. The patterns of the digestion with KpnI and with a mixture of KpnI and EcoRI support this conclusion for λ-9B. In particular, digestion of λ-9B DNA with KpnI showed a 5.2 kb species that represents the internal fragment seen in proviral DNA (FIGS. 1 and 2). Bacteriophage λARV-2(9B) was deposited at the ATCC on 25 Jan. 1985 and given Accession No. 40158. A second recombinant phage, λ-10C, also contained a full-length proviral DNA insertion.
When the DNA of λ-7A phage was treated with SacI, two viral DNA fragments were observed at 3.8 kb and 6.6 kb (FIG. 3). EcoRI digestion of λ-7A DNA produced a 9.4 kb viral species, and a digestion with a mixture of EcoRI and SacI yielded two viral DNA fragments at 3.8 kb and 5.4 kb (FIG. 3). Thus, λ-7A should represent a recombinant phage clone containing a full-length linear HIV genome that is permuted with respect to the EcoRI site. Analyses with KpnI support this model. After digestion with KpnI a DNA fragment at 4.2 kb was observed as well as other DNA species. See FIG. 3. The data indicated that the 4.2 kb DNA fragment represents the circle junction and the others represent HIV DNA linked to cell DNA and bacteriophage vector DNA. The double digestion with KpnI and EcoRI left the 4.2 kb DNA intact and produced two fragments at 1.6 kb and 3.6 kb (FIG. 3); these three DNA species added up to 9.4 kb and constituted the HIV genome predicted from a permuted configuration.
In addition to the two types of recombinant phage containing HIV DNA described above, phage was obtained that (1) possessed the left half of the viral genome from the EcoRI site in viral DNA extending into flanking cell DNA [λARV-2(8A)] and (2) phage that had the right half of the viral genome [λARV-2(7D)] from the EcoRI site in viral DNA extending into flanking cell DNA. The four types of recombinant phage DNA structures are predicted from the analysis of viral DNA observed in infected cells. Maps of restriction enzyme sites are shown in FIGS. 1-3. The data for these maps were compiled from studies of proviral DNA in infected cells, from characterization of recombinant phage DNA, and from preliminary DNA sequence information. Bacteriophages λARV-2(7D) (right) and λARV-2(8A) (left) were deposited at the ATCC on Oct. 26, 1984 and given Accession Nos. 40143 and 40144, respectively.
To validate the cloned HIV DNA, a radioactive probe was prepared from two regions of the cloned HIV genome that represent about 70% of the genome and this probe was used to detect HIV DNA in restriction enzyme digests of DNA from infected cells. Whole-cell DNA was prepared from HUT-78 cell cultures infected with HIV-1 strains ARV-2, ARV-3 and ARV-4, and analyzed by digestion with restriction enzymes as described in Section 1.3. A probe to cloned ARV-2 DNA was prepared as follows: DNA from recombinant phage λ-7A (FIG. 3) was digested with SacI or with a mixture of SacI and KpnI and digestion products were electrophoresed in 1% agarose gels (low-melting agarose); the 3.8 kb DNA fragment from the digestion with SacI and the 3.1 kb DNA fragment from the digestion with SacI and KpnI (FIG. 3) were eluted, pooled, denatured by boiling in water for 2 min, and used as templates with random DNA primers (calf thymus) with reverse transcriptase.
DNA from HUT-78 cells infected with ARV-2, ARV-3 or ARV-4 were digested with SacI, PstI or HindIII. Digested DNA was electrophoresed on 1% agarose gels, blotted onto nitrocellulose filters and annealed with probe to cloned ARV-2 DNA or to homologous cDNA probe prepared as described in Section 1.2. Results show that SacI, PstI, and HindIII fragments detected by both probes are identical in all isolates.
1.5. Polymorphism of HIV
To measure the relatedness of independent HIV isolates, restriction enzyme digests of DNA from HUT-78 cells infected with ARV-3 and ARV-4 were analyzed with the probe made from cloned ARV-2 DNA. The SacI digest of ARV-3 DNA was similar to that of ARV-2 whereas the HindIII digests displayed different patterns. The SacI digest and the PstI digest of ARV-4 DNA differed from the corresponding digests of ARV-2 DNA. The intensity of the annealing signals obtained with ARV-3 and ARV-4 samples was much lower (about 10-fold less) than that for ARV-2 DNA, probably as a result of the fact that fewer cells were infected in the ARV-3 and ARV-4 cultures. The viral-specific DNA fragments produced by SacI treatment of ARV-3 and ARV-4 DNA totaled 9.0-9.5 kbp, a value similar to that of ARV-2 and in consonance with the RNA genome sizes.
1.6. Sequencing of Proviral DNA
Fragments or subfragments of ARV-2 DNA were prepared from the recombinant phages and cloned into M13 according to conventional procedures (Maniatis et al., supra). Sequencing was performed according to Sanger et al. [(1977) Proc. Natl. Acad. Sci. USA 74:5463], using the universal M13 primer or chemically synthesized primers complementary to ARV-2 sequence. The sequence is shown in FIG. 4. Also indicated in this figure are the restriction sites present in the DNA and the open reading frames encoded by the sequence.
The sequence of the HIV-1 DNA from λ phage 9B was sequenced in a similar manner. This sequence is shown in FIG. 5. There are several differences between the sequences shown in FIGS. 4 and 5 which reflect the polymorphism in HIV and the fact that the sequence of FIG. 4 was derived as a composite from sequence data on several HIV isolates, whereas the sequence of FIG. 5 is from a single isolate. The main difference affecting polypeptide sequence is that the small open reading frame between the gag and pol genes in FIG. 4 is not independent, but is part of the pol gene in FIG. 5. This merger of these reading frames was the result of three base changes. The region of fusion and sequence change occurs roughly between nucleotides 2853 and 2941 in FIG. 5.
Furthermore, open reading frames in FIG. 4 were translated into amino acids beginning with the first methionine in the open reading frame, whereas in FIG. 5 translation into amino acids was begun immediately with the codon following the stop codon.
1.7. Amino Acid Sequences of Native HIV Proteins
ARV-2 was prepared and purified as described in Section 1.1. The viral proteins were electrophoresed on an acrylamide gel, and the band corresponding to a 24,000 dalton or 16,000 dalton protein was excised from the gel and used for sequencing. Micro-sequence analysis was performed using Applied Biosystems model 470A protein sequencer similar to that described by Hewick et al. (1981) J. Biol. Chem. 256:7990-7997. Phenylthiohydantoin amino acids were identified by HPLC using a Beckman Ultrasphere ODS column and a trifluoroacetic acid-acetonitrile buffer system as reported by Hawke et al. (1982) Anal. Biochem. 120:308-311. Table 1 shows the first 20 amino acids from the amino terminus determined for p25-gag protein and Table 2 shows the first 30 amino acids for p16-gag protein.
| TABLE 1 | |
| Amino-terminal sequence of p25gag | |
| Position | Amino acid |
| 1 | Pro |
| 2 | Ile |
| 3 | Val |
| 4 | Gln |
| 5 | Asn |
| 6 | Leu |
| 7 | Gln |
| 8 | Gly |
| 9 | Gln |
| 10 | Met |
| 11 | Val |
| 12 | (His) |
| 13 | |