Plaque It!
Sponsored by: Flash of Genius |
| 5030559 | Methods and compositions for the identification of metastatic human tumors | July, 1991 | Nicolson et al. | |
| 5118611 | Adenocarcinoma antigen binding methods and reagents | June, 1992 | Smith et al. | |
| 5362718 | Rapamycin hydroxyesters | November, 1994 | Skotnicki et al. | |
| 5629007 | Method of preventing prostatic cancer development | May, 1997 | Audia et al. | |
| 5635197 | Treatment and prevention of prostatic cancer metastasis | June, 1997 | Audia et al. | |
| 5674682 | Nucleic acid primers for detecting micrometastasis of prostate cancer | October, 1997 | Croce et al. | |
| 5688658 | Detection of prostate-specific antigen in breast tumors | November, 1997 | Diamandis | |
| 5723302 | Detection of prostate-specific antigen in breast tumors | March, 1998 | Diamandis | |
| 5763202 | Methods of detecting prostate carcinoma using a monoclonal antibody to a new antigenic marker in epithelial prostatic cells alone or with a monoclonal antibody to an antigen of LNCaP cells | June, 1998 | Horoszewicz | |
| 5801197 | Rotamase enzyme activity inhibitors | September, 1998 | Steiner et al. | |
| 5843994 | Compositions and methods for treating and preventing pathologies including cancer | December, 1998 | Samid | |
| 5854206 | Compounds and methods for treatment and diagnosis of prostate cancer | December, 1998 | Twardzik et al. | |
| 5856112 | Method for selectively inducing biomarker expression in urologic tumor tissue for diagnosis and treatment thereof | January, 1999 | Marley et al. | |
| 5859031 | Small molecule inhibitors of rotamase enzyme activity | January, 1999 | Hamilton et al. | |
| 5861248 | Biomarkers for detection of prostate cancer | January, 1999 | Russell et al. | |
| 5882864 | Biomarkers and targets for diagnosis, prognosis and management of prostate disease | March, 1999 | An et al. | |
| 5912253 | Rapamycin derivatives | June, 1999 | Cottens et al. | |
| 5939258 | Methods of detecting micrometastasis of prostate cancer | August, 1999 | Croce et al. | |
| 5945522 | Prostate cancer gene | August, 1999 | Cohen et al. | |
| 5968802 | Nuclear cyclophilin | October, 1999 | Wang et al. | |
| 5972615 | Biomarkers and targets for diagnosis, prognosis and management of prostate disease | October, 1999 | An et al. | |
| 5976794 | Method for molecular staging of prostate cancer | November, 1999 | Katz et al. | |
| 5976838 | Secreted proteins and polynucleotides encoding them | November, 1999 | Jacobs et al. | |
| 5985890 | Rapamycin derivatives | November, 1999 | Cottens et al. | |
| 6011018 | Regulated transcription of targeted genes and other biological events | January, 2000 | Crabtree et al. | |
| 6025137 | Compositions and methods for the diagnosis, prevention and treatment of tumor progression | February, 2000 | Shyjan | |
| 6028174 | Method of diagnosing and treating gliomas | February, 2000 | Ullrich et al. | |
| 6034218 | Compounds and methods for immunotherapy and immunodiagnosis of prostate cancer | March, 2000 | Reed et al. | |
| 6051230 | Compositions for targeting the vasculature of solid tumors | April, 2000 | Thorpe et al. | |
| 6054265 | Screening assays for compounds that inhibit membrane fusion-associated events | April, 2000 | Barney et al. | |
| 6166011 | FKBP inhibitors | December, 2000 | Wythes et al. | |
| 6200985 | Rapamycin derivatives | March, 2001 | Cottens et al. | |
| 6251932 | Immunophilin ligands | June, 2001 | Reichelt et al. | |
| 6252058 | Sequences for targeting metastatic cells | June, 2001 | Thompson | |
| 6261535 | Diagnostic methods for targeting the vasculature of solid tumors | July, 2001 | Thorpe et al. | |
| 6261766 | Detection of prostate-specific antigen in breast tumors | July, 2001 | Diamandis | |
| 6313264 | Effector proteins of Rapamycin | November, 2001 | Caggiano et al. | |
| 6319891 | Method of diagnosing and treating gliomas | November, 2001 | Sontheimer et al. |
| EP0698215 | July, 1998 | DETECTION OF PROSTATE-SPECIFIC ANTIGEN IN BREAST TUMORS | ||
| EP0939084 | September, 1999 | An estrogen binding proteinaceous substance, its possible role in estrogen action, and potential use. | ||
| EP0974652 | January, 2000 | CANCEROUS METASTASIS-ASSOCIATED GENE | ||
| EP0648126 | February, 2000 | PHARMACEUTICAL PREPARATIONS FOR INHIBITING TUMOURS ASSOCIATED WITH PROSTATE ADENOCARCINOMA, STOMACH CANCER AND BREAST CANCER. | ||
| WO/1997/008318 | March, 1997 | COMPOUNDS AND METHODS FOR TREATMENT AND DIAGNOSIS OF PROSTATE CANCER | ||
| WO/1997/033909 | September, 1997 | COMPOUNDS AND METHODS FOR IMMUNOTHERAPY AND IMMUNODIAGNOSIS OF PROSTATE CANCER | ||
| WO/1998/037093 | August, 1998 | COMPOUNDS FOR IMMUNOTHERAPY OF PROSTATE CANCER AND METHODS FOR THEIR USE | ||
| WO/1998/037418 | August, 1998 | COMPOUNDS FOR IMMUNODIAGNOSIS OF PROSTATE CANCER AND METHODS FOR THEIR USE | ||
| WO/1999/018210 | April, 1999 | COMPOUNDS AND METHODS FOR IMMUNOTHERAPY AND IMMUNODIAGNOSIS OF PROSTATE CANCER | ||
| WO/1999/037811 | July, 1999 | BIOMARKERS AND TARGETS FOR DIAGNOSIS, PROGNOSIS AND MANAGEMENT OF PROSTATE DISEASE | ||
| WO/2000/004149 | January, 2000 | COMPOSITIONS AND METHODS FOR THERAPY AND DIAGNOSIS OF PROSTATE CANCER | ||
| WO/2000/014234 | March, 2000 | PROSTATE SPECIFIC PROMOTER AND REGULATION OF GENE EXPRESSION | ||
| WO/2000/018961 | April, 2000 | EXPRESSION ANALYSIS OF SPECIFIC NUCLEIC ACIDS AND POLYPEPTIDES USEFUL IN THE DIAGNOSIS AND TREATMENT OF PROSTATE CANCER | ||
| WO/2000/055174 | September, 2000 | HUMAN PROSTATE CANCER ASSOCIATED GENE SEQUENCES AND POLYPEPTIDES | ||
| WO/2000/055350 | September, 2000 | HUMAN CANCER ASSOCIATED GENE SEQUENCES AND POLYPEPTIDES | ||
| WO/2002/031209 | April, 2002 | GENES RELATED TO DEVELOPMENT OF REFRACTORY PROSTATE CANCER |
This application is a continuation of U.S. patent application Ser. No. 09/997,423, filed Nov. 28, 2001, which is now U.S. Pat. No. 6,821,731, which claims the benefit of U.S. Provisional Patent Application Ser. No. 60/253,539, filed Nov. 28, 2000, entitled “Expression Analysis of FKBP54 Nucleic Acids and Polypeptides Useful in the Diagnosis and Treatment of Prostate Cancer,” each of which are hereby incorporated by reference in their entirety.
Prostate cancer is the second most common cause of cancer related death and will kill an estimated 37,000 people this year alone. The prostate gland, which is found exclusively in male mammals, produces several regulatory peptides. The prostate gland comprises stroma and epithelium cells, the latter group consisting of columnar secretory cells and basal non-secretory cells. A proliferation of these basal cells, as well as stroma cells gives rise to benign prostatic hyperplasia (BPH) which is one common prostate disease. Another common prostate disease is prostatic adenocarcinoma (CaP), the most common of the fatal pathophysiological prostate cancers. Prostatic adenocarcinoma involves a malignant transformation of epithelial cells in the peripheral region of the prostate gland. Prostatic adenocarcinoma and benign prostatic hyperplasia are two common prostate diseases which have a high rate of incidence in the aging human male population. Approximately one out of every four males above the age of 55 suffers from a prostate disease of some form or another.
To date, various substances that are synthesized and secreted by normal, benign and cancerous prostates are used as tumor markers to gain an understanding of the pathogenesis of the various prostate diseases and in the diagnosis of prostate disease. The three predominant proteins or peptides secreted by a normal prostate gland are Prostatic Acid Phosphatase (PAP), Prostate Specific Antigen (PSA) and prostatic inhibin (PIP) also known as human seminal plasma inhibin (HSPI). Both PSA and PAP have been studied as tumour markers in the detection of prostate disease but since both exhibit elevated levels in prostates having benign prostatic hyperplasia (BPM) neither marker is specific and therefore are of limited use.
Despite the available knowledge, little is known about the genetic basis underlying the prostate cancer disease and the androgen-regulated genes that may be involved with its progression. Although androgens have been known to play a major role in the biology of prostate cancer. However, the full complexity of the hormonal regulation has not been completely covered and more androgen related processes are being elucidated. Many of these processes involve several molecules associated in prostate cancer that remain elusive. In addition, there may be several known molecules that have not yet been associated with the pathogenesis of the disease. Accordingly, a need exists for identifying unknown molecules that may be involved in prostate cancer and the genes encoding them. A need also exists for identifying known molecules that have not yet been implicated in the pathogenesis of prostate cancer, particularly those that can serve as targets for the diagnosis, prevention, and treatment of prostate cancer.
The invention is based, in part, on the discovery of a number of genes which are androgen-inducible in androgen-dependent prostate cancer cells (e.g., LNCaP cells). These genes serve as markers suitable for detection, diagnosis and prognosis of prostate disorders. This invention provides methods and screening assays for the detection and diagnosis of prostate cancer. The primary screening assays detect an alteration in the expression level of genes associated with prostate cancer. In particular, this invention provides for the use of immunophilins, such as FK-Binding Proteins (FKBPs), e.g., FKBP54, as genetic markers for the detection, diagnosis and prognosis of prostate disorders. Immunophilins are proteins that serve as receptors for the immunosuppressant drugs such as cyclosporin A (CsA), FK506, and rapamycin. Known classes of immunophilins include cyclophilins, and FK506 binding proteins, such as FKBPs. Cyclosporin A binds to cyclophilin while FK506 and rapamycin bind to FKBP. These immunophilin-drug complexes interface with a variety of intracellular signal transduction systems. Immunophilins are known to have peptidyl-prolyl isomerase (PPIase) or rotamase enzyme activity. It has been determined that rotamase activity has a role in the catalyzation of the interconversion of the cis and trans isomer of immunophilin proteins.
FKBP54 is a member of the immunophilin family and has been associated with the progesterone receptor complex as described by Smith et al. (1993) J. Biol. Chem. 268: 18365-18371. The invention provides for use of immunophilins, e.g., FKBP54, that are up-regulated (increased mRNA and protein expression/activated/agonized) or down-regulated (decreased mRNA and protein expression/suppressed/antagonized) in the presence of androgens.
Using gene cluster analysis, the expression pattern of FKBP54 was found to be similar to that of prostate specific antigen (PSA), which has been used to diagnose prostate cancer patient. The present study described herein demonstrates the up-regulation of FKBP54 in the presence of androgen and can be used as a marker for the detection, diagnosis and prognosis of prostate disorders. In addition, quantitative PCR was used to confirm gene expression of the target marker. The transcription level of FKBP54 was found to be regulated by androgen, demonstrating a time dependent increase in transcription. Western blot analysis of the expressed FKBP54 protein further confirmed the time dependent increase in expression levels in the presence of androgen. The presence of FKBP54 in solid tumors was also demonstrated. Furthermore, transient cotransfection studies in COS cells showed that androgen receptor activation was enhanced by FKBP54.
In one embodiment, the invention provides a method of assessing whether a subject is afflicted with prostate cancer, by comparing the level of expression of the FK-binding proteins, e.g., FKBP54 marker in a sample from a subject, to the normal level of expression of the marker in a control sample, where a significant difference between the level of expression of the marker in the sample from the subject and the normal level is an indication that the subject is afflicted with prostate cancer. In a preferred embodiment, the marker corresponds to a transcribed polynucleotide or portion thereof, where the polynucleotide includes the marker. In a particularly preferred embodiment, the level of expression of the marker in the sample differs from the normal level of expression of the marker in a subject not afflicted with prostate cancer by a factor of at least two, and in an even more preferred embodiment, the expression levels differ by a factor of at least three. In another preferred embodiment, the marker is not significantly expressed in non-prostate cancer cells.
In another preferred embodiment, the sample includes cells obtained from the subject. In another preferred embodiment, the level of expression of the marker in the sample is assessed by detecting the presence in the sample of a protein corresponding to the marker. In a particularly preferred embodiment, the presence of the protein is detected using a reagent which specifically binds with the protein. In an even more preferred embodiment, the reagent is selected from the group of reagents including an antibody, an antibody derivative, and an antibody fragment. In another preferred embodiment, the level of expression of the marker in the sample is assessed by detecting the presence in the sample of a transcribed polynucleotide or portion thereof, where the transcribed polynucleotide includes the marker. In a particularly preferred embodiment, the transcribed polynucleotide is an mRNA or a cDNA. In another particularly preferred embodiment, the step of detecting further comprises amplifying the transcribed polynucleotide.
In yet another preferred embodiment, the level of expression of the FKBP marker, e.g., FKBP54 marker in the sample is assessed by detecting the presence in the sample of a transcribed polynucleotide which anneals with the marker or anneals with a portion of a polynucleotide under stringent hybridization conditions, where the polynucleotide includes the marker. The level of expression of the marker is significantly altered, relative to the corresponding normal levels of expression the marker, is an indication that the subject is afflicted with prostate cancer.
In another embodiment, the invention provides a method for monitoring the progression of prostate cancer in a subject, including detecting in a subject sample at a first point in time the expression of the FKBP marker, e.g., FKBP54 marker, repeating this detection step at a subsequent point in time, and comparing the level of expression detected in the two detection steps, and monitoring the progression of prostate cancer in the subject using this information. In another preferred embodiment, the marker corresponds to a transcribed polynucleotide or portion thereof, where the polynucleotide includes the marker. In another preferred embodiment, the sample includes cells obtained from the subject. In a particularly preferred embodiment, the cells are collected from skin or blood tissue.
In another embodiment, the invention provides a method of assessing the efficacy of a test compound for inhibiting prostate cancer in a subject, including comparing expression of the FKBP54 marker in a first sample obtained from the subject which is exposed to or maintained in the presence of the test compound, to expression of the FKBP marker, e.g., FKBP54 marker in a second sample obtained from the subject, where the second sample is not exposed to the test compound, where a significantly lower level of expression of the marker in the first sample relative to that in the second sample is an indication that the test compound is efficacious for inhibiting prostate cancer in the subject. In a preferred embodiment, the first and second samples are portions of a single sample obtained from the subject. In another preferred embodiment, the first and second samples are portions of pooled samples obtained from the subject.
In another embodiment, the invention provides a method of assessing the efficacy of a therapy for inhibiting prostate cancer in a subject, the method including comparing expression of the FKBP marker, e.g., FKBP54 marker in the first sample obtained from the subject prior to providing at least a portion of the therapy to the subject, to expression of the marker in a second sample obtained form the subject following provision of the portion of the therapy, where a significantly lower level of expression of the marker in the second sample relative to the first sample is an indication that the therapy is efficacious for inhibiting prostate cancer in the subject.
In another embodiment, the invention provides a method of selecting a composition for inhibiting prostate cancer in a subject, the method including obtaining a sample including cells from a subject, separately maintaining aliquots of the sample in the presence of a plurality of test compositions, comparing expression of the FKBP marker, e.g., FKBP54 marker in each of the aliquots, and selecting one of the test compositions which induces a lower level of expression of the FKBP marker, e.g., FKBP54 marker in the aliquot containing that test composition, relative to other test compositions.
In another embodiment, the invention provides a method of inhibiting prostate cancer in a subject, including obtaining a sample including cells from a subject, separately maintaining aliquots of the sample in the presence of a plurality of test compositions, comparing expression of the FKBP marker, e.g., FKBP54 marker in each of the aliquots, and administering to the subject at least one of the test compositions which induces a lower level of expression of the FKBP marker, e.g., FKBP54 marker in the aliquot containing that test composition, relative to other test compositions.
In another embodiment, the invention provides a method of assessing the potential of a test compound to trigger prostate cancer in a cell, including maintaining separate aliquots of cells in the presence and absence of the test compound, and comparing expression of the FKBP marker, e.g., FKBP54 marker in each of the aliquots, where a significantly enhanced level of expression of the FKBP marker, e.g., FKBP54 marker in the aliquot maintained in the presence of the test compound, relative to the aliquot maintained in the absence of the test compound, is an indication that the test compound possesses the potential for triggering prostate cancer in a cell.
In another embodiment, the invention provides a method of treating a subject afflicted with prostate cancer, including providing to cells of the subject afflicted with prostate cancer a protein corresponding to the FKBP marker, e.g., FKBP54 marker. In a preferred embodiment, the protein is provided to the cells by providing a vector including a polynucleotide encoding the FKBP protein, e.g., FKBP54 protein to the cells.
In another embodiment, the invention provides a method of treating a subject afflicted with prostate cancer an antisense oligonucleotide complementary to a polynucleotide corresponding to the FKBP marker, e.g., FKBP54 marker.
In another embodiment, the invention provides a method of inhibiting prostate cancer in a subject at risk for developing prostate cancer, including inhibiting expression of a gene corresponding to the FKBP marker, e.g., FKBP54 marker.
Other features and advantages of the invention will be apparent from the following detailed description and claims.
FIG. 1A is a bar chart depicting the effect of dihydrotestosterone (DHT) on the growth and PSA production of LNCaP cells plated at 20,000 cells/well in a 24-well plate with 1 ml of medium. Cells were treated with DHT as shown, and cell growth was determined by 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide (MTT) assay on day 3;
FIG. 1B is a graph depicting the effect of DHT on the growth and PSA production of LNCaP cells plated at 1×10 6 cells/well in a 175 cm 2 flask. Cells were treated with or without 10 nM DHT the next day, and were harvested for RNA preparation and PSA analysis.
FIG. 2 is a flowchart demonstrating the procedure for RNA sample preparation, Affymetrix GENECHIP® hybridizations and analysis;
FIG. 3A is a bar chart depicting the expression profile of PSA in response to androgen treatment. The mRNA frequencies are plotted on the Y-axis, and the DHT androgen treated and untreated cells for each time point plotted on the X-axis;
FIG. 3B is a bar chart depicting the expression profile of FKBP54 in response to androgen treatment. The mRNA frequencies are plotted on the Y-axis, and the DHT androgen treated and untreated cells for each time point plotted on the X-axis;
FIG. 4A is a bar chart demonstrating the quantitative RT-PCR analysis of PSA. Copy number is plotted on the Y-axis, and the DHT androgen treated and untreated cells for each time point plotted on the X-axis;
FIG. 4B is a bar chart demonstrating the quantitative RT-PCR analysis of FKBP54. Copy number is plotted on the Y-axis, and the DHT androgen treated and untreated cells for each time point plotted on the X-axis;
FIG. 5 is a bar chart demonstrating the transcriptional activation of androgen receptor (AR) by FKBP54 using COS-1 cells that were transiently transfected with GREe1bLuc reporter construct and 0.1 μg expression vector encoding FKBP54 (black bars) or empty vector (white bars). The COS-1 cells were treated with or without 10 −9 M of the synthetic androgen, R1881 for 20 hrs. Bars represent the mean of at least three independent experiments±SD.
The invention relates, in part, to newly discovered correlation between the expression of selected markers and the presence of prostate cancer in a subject, in particular, immunophilins such as the FK binding proteins (FKBPs), e.g., FKBP54. The term “FKBP54” is used herein synonymously with the term “FKBP51”. FKBP54 is a member of the immunophilin family. Other FK binding proteins include, but are not limited to, FKBP12 (Hidalgo et al. (2000) Oncogene 19:6680-6686, Genbank Accession No. AF 3222070), FKBP12.6 (Deivanayagam et al. (2000) Acta. Crystallogr. D. Biol. Crystallog 56: 266-271, Accession No. L37086), and FKBP52 (Yamamoto-Yamaguchi et al. (2001) Exp Hematol 29:582-588, Genbank Accession No. M88279).
The relative levels of expression of the FK binding protein marker, e.g., FKBP54 marker, has been found to be indicative of a predisposition in the subject to prostate cancer and/or diagnostic of the presence or potential presence of prostate cancer in a subject. The invention features the FKBP marker, e.g., FKBP54, methods for detecting the presence or absence of prostate cancer in a sample or subject, and methods of predicting the incidence of prostate cancer in a sample or subject using the FKBP marker, e.g., FKBP54. The invention also provides methods by which prostate cancer may be treated using the FKBP marker, e.g., FKBP54.
The present invention is based, at least in part, on the identification of the genetic marker, FKBP54, which is differentially expressed in samples from androgen dependent prostate cancer cells. A panel of 6800 known genes was screened for expression androgen dependent prostate cancer cells (see Example 1). Those genes with statistically significant (p<0.05) differences between the diseased and normal tissues were identified. This differential expression was observed either as a decrease in expression, or an increase in expression. The expression of these selected genes in androgen dependent prostate cancer cells was assessed by GENECHIP® analysis, as described in Example 1. FKBP54 was found to increase in expression in LNCaP prostate cancer cells. The growth of LNCaP cells and the production of PSA were responsive to a natural androgen receptor (AR) ligand, DHT, LNCaP and are suitable model for gene expression profiling. (See e.g., Kokontis et al. (1994) Cancer Res. 54: 1566-1573; Schuurmans et al. (1991) J Steroid Biochem. Mol. Biol. 40: 193-197; Swinnen et al. (1994) Molec. Cell. Endocrinol. 104: 153-162; Cleutjens et al. (1996) J. Biol. Chem. 271: 6379-6388; Henttu et al. (1992) Endocrinology 130: 766-772; Murtha et al. (1993) Biochem. 32: 6459-6464; Swinnen et al. (1996) Endocrinol. 137: 4468-4474).
As an internal control, the prostate specific antigen (PSA) gene, known in the art to be implicated in prostate cancer, was included to screen androgen dependent prostate cancer cells. PSA was found to be significantly increased in expression in androgen dependent prostate cancer cells.
Accordingly, the present invention pertains to the use of the FKBP genes (e.g., the DNA or cDNA of FKBP54), the corresponding mRNA transcripts, and the encoded polypeptides, as a marker for the presence or risk of development prostate cancer. The FKBP marker, e.g., FKBP54 is useful to correlate the extent and/or severity of disease. The FKBP marker, e.g., FKBP54 marker can be useful in the treatment of prostate cancer, or in assessing the efficacy of a treatment for cancer. In addition, the FKBP marker, e.g., FKBP54 marker can also be used in screening assays to identify compound or agents that modify the expression of the marker and the disease state.
In one aspect, the invention provides the FKBP marker, e.g., FKBP54 marker whose quantity or activity is correlated with the presence of prostate cancer. The FKBP marker, e.g., FKBP54 marker of the invention may be nucleic acid molecules (e.g., DNA, cDNA, or RNA) or polypeptides. The FKBP marker, e.g., FKBP54 marker can be either increased or decreased in quantity or activity in prostate cancer tissue as compared to non-prostate cancer tissue. For example, the gene designated “FKBP54” (accession number U42031) is increased in expression level in androgen dependent prostate cancer cell samples. Both the presence of increased or decreased mRNA for this gene, and also increased or decreased levels of the protein products of this gene serve as markers of prostate cancer. Preferably, increased and decreased levels of the FKBP marker, e.g., FKBP54 marker of the invention are increases and decreases of a magnitude that is statistically significant as compared to appropriate control samples (e.g., samples not affected with prostate cancer). In particularly preferred embodiments, the FKBP marker, e.g., FKBP54 marker is increased or decreased relative to control samples by at least 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, or 10-fold or more. Similarly, one skilled in the art will be cognizant of the fact that a preferred detection methodology is one in which the resulting detection values are above the minimum detection limit of the methodology.
Measurement of the relative amount of an RNA or protein marker of the invention may be by any method known in the art (see, e.g., Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2 nd, ed ., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989; and Current Protocols in Molecular Biology , eds. Ausubel et al. John Wiley & Sons: 1992). Typical methodologies for RNA detection include RNA extraction from a cell or tissue sample, followed by hybridization of a labeled probe (e.g., a complementary nucleic acid molecule) specific for the target RNA to the extracted RNA, and detection of the probe (e.g., Northern blotting). Typical methodologies for protein detection include protein extraction from a cell or tissue sample, followed by hybridization of a labeled probe (e.g., an antibody) specific for the target protein to the protein sample, and detection of the probe. The label group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Detection of specific protein and nucleic acid molecules may also be assessed by gel electrophoresis, column chromatography, direct sequencing, or quantitative PCR (in the case of nucleic acid molecules) among many other techniques well known to those skilled in the art.
In certain embodiments, the FKBP gene itself (e.g., the FKBP54 DNA or cDNA), may serve as a marker for prostate cancer. For example, the absence of nucleic acids corresponding to the FKBP54 gene, such as by deletion of all or part of the gene, may be correlated with disease. Similarly, an increase of nucleic acid corresponding to the FKBP54 gene, such as by duplication of the gene, may also be correlated with disease.
Detection of the presence or number of copies of all or a part of a FKBP gene, e.g., FKBP54 gene of the invention may be performed using any method known in the art. Typically, it is convenient to assess the presence and/or quantity of a DNA or cDNA by Southern analysis, in which total DNA from a cell or tissue sample is extracted, is hybridized with a labeled probe (e.g., a complementary DNA molecule), and the probe is detected. The label group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Other useful methods of DNA detection and/or quantification include direct sequencing, gel electrophoresis, column chromatography, and quantitative PCR, as is known by one skilled in the art.
The invention also encompasses nucleic acid and protein molecules which are structurally different from the molecules described above (e.g., which have a slightly altered nucleic acid or amino acid sequence), but which have the same properties as the molecules above (e.g., encoded amino acid sequence, or which are changed only in nonessential amino acid residues). Such molecules include allelic variants, and are described in greater detail in subsection I.
In another aspect, the invention provides the FKBP marker, e.g., FKBP54 marker whose quantity or activity is correlated with the severity of prostate cancer. This FKBP marker, e.g., FKBP54 marker is either increased or decreased in quantity or activity in prostate cancer tissue in a fashion that is either positively or negatively correlated with the degree of severity of prostate cancer. In yet another aspect, the invention provides the FKBP marker, e.g., FKBP54 marker whose quantity or activity is correlated with a risk in a subject for developing prostate cancer. The FKBP marker, e.g., FKBP54 marker is either increased or decreased in activity or quantity in direct correlation to the likelihood of the development of prostate cancer in a subject.
It will also be appreciated by one skilled in the art that the FKBP marker, e.g., FKBP54 of the invention may conveniently be provided on solid supports. For example, polynucleotides, such as mRNA, may be coupled to an array (e.g., a GENECHIP® array for hybridization analysis), to a resin (e.g., a resin which can be packed into a column for column chromatography), or a matrix (e.g., a nitrocellulose matrix for northern blot analysis). The immobilization of molecules complementary to the marker(s), either covalently or noncovalently, permits a discrete analysis of the presence or activity of each marker in a sample. In an array, for example, polynucleotides complementary to the full length or a portion of the FKBP marker, e.g., FKBP54 marker may individually be attached to different, known locations on the array. The array may be hybridized with, for example, polynucleotides extracted from a skin cell sample from a subject. The hybridization of polynucleotides from the sample with the array at any location on the array can be detected, and thus the presence or quantity of the marker in the sample can be ascertained. In a preferred embodiment, a GENECHIP® array is employed (Affymetrix). Similarly, Western analyses may be performed on immobilized antibodies specific for the FKBP polypeptide (e.g., FKBP54) marker hybridized to a protein sample from a subject. In addition, quantitative PCR was used to confirm gene expression of the target marker. The transcription level of FKBP54 was found to be regulated by androgen, demonstrating a time dependent increase in transcription. Western blot analysis of the expressed FKBP54 protein further confirmed the time dependent increase in expression levels in the presence of androgen. The presence of FKBP54 in solid tumors was also demonstrated. Furthermore, transient cotransfection studies in COS cells showed that androgen receptor activation was enhanced by FKBP54.
It will also be apparent to one skilled in the art that the entire FKBP marker, e.g., FKBP54 marker protein or nucleic acid molecule need not be conjugated to the support; a portion of the marker of sufficient length for detection purposes (e.g., for hybridization), for example, a portion of the marker which is 7, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 100 or more nucleotides or amino acids in length may be sufficient for detection purposes.
The FKBP, e.g., FKBP54 nucleic acid and protein marker of the invention may be isolated from any tissue or cell of a subject. In a preferred embodiment, the tissue is prostate cells or tissue. However, it will be apparent to one skilled in the art that other tissue samples, including bodily fluids (e.g., urine, bile, serum, lymph, saliva, mucus and pus) and other tissue samples may also serve as sources from which the markers of the invention may be isolated, or in which the presence, activity, and/or quantity of the markers of the invention may be assessed. The tissue samples containing one or more of the markers themselves may be useful in the methods of the invention, and one skilled in the art will be cognizant of the methods by which such samples may be conveniently obtained, stored, and/or preserved.
Several markers were known prior to the invention to be associated with prostate cancer, e.g., PSA. These markers are not included with the marker of the invention. However, these markers may be conveniently be used in combination with the marker of the invention in the methods, panels, and kits of the invention.
In another aspect, the invention provides methods of making an isolated hybridoma which produces an antibody useful for assessing whether a patient is afflicted with prostate cancer. In this method, a protein corresponding to the FKBP marker, e.g., FKBP54 marker is isolated (e.g., by purification from a cell in which it is expressed or by transcription and translation of a nucleic acid encoding the protein in vivo or in vitro using known methods. A vertebrate, preferably a mammal such as a mouse, rat, rabbit, or sheep, is immunized using the isolated protein or protein fragment. The vertebrate may optionally (and preferably) be immunized at least one additional time with the isolated protein or protein fragment, so that the vertebrate exhibits a robust immune response to the protein or protein fragment. Splenocytes are isolated from the immunized vertebrate and fused with an immortalized cell line to form hybridomas, using any of a variety of methods well known in the art. Hybridomas formed in this manner are then screened using standard methods to identify one or more hybridomas which produce an antibody which specifically binds with the protein or protein fragment. The invention also includes hybridomas made by this method and antibodies made using such hybridomas.
The invention provides methods of identifying prostate cancer, or risk of developing prostate cancer in a subject. These methods involve isolating a sample from a subject (e.g., a sample containing prostate cancer cells or blood cells), detecting the presence, quantity, and/or activity of the FKBP marker, e.g., FKBP54 marker of the invention in the sample relative to a second sample from a subject known not to have prostate cancer. The levels of the FKBP marker, e.g., FKBP54 marker in the two samples are compared, and a significant increase in the marker in the test sample indicates the presence or risk of presence of prostate cancer in the subject.
The invention also provides methods of assessing the severity of prostate cancer in a subject. These methods involve isolating a sample from a subject (e.g., a sample containing prostate cancer cells or blood cells), detecting the presence, quantity, and/or activity of the FKBP marker, e.g., FKBP54 marker of the invention in the sample relative to a second sample from a subject known not to have prostate cancer. The level of the FKBP marker, e.g., FKBP54 marker in the two samples are compared, and a significant increase in the marker in the test sample is correlated with the degree of severity of prostate cancer in the subject.
The invention also provides methods of treating (e.g., inhibiting prostate cancer in a subject. These methods involve isolating a sample from a subject (e.g., a sample containing prostate cancer cells or blood cells), detecting the presence, quantity, and/or activity of FKBP marker, e.g., FKBP54 in the sample relative to a second sample from a subject known not to have prostate cancer. The levels of the FKBP marker, e.g., FKBP54 marker in the two samples are compared, and significant increases or decreases in one or more markers in the test sample relative to the control sample are observed. For markers that are significantly decreased in expression or activity, the subject may be administered that expressed marker protein, or may be treated by the introduction of mRNA or DNA corresponding to the decreased marker (e.g., by gene therapy), to thereby increase the levels of the marker protein in the subject. For markers that are significantly increased in expression or activity, the subject may be administered mRNA or DNA antisense to the increased marker (e.g., by gene therapy), or may be administered antibodies specific for the marker protein, to thereby decrease the levels of the marker protein in the subject. In this manner, the subject may be treated for prostate cancer.
The invention also provides methods of preventing the development prostate cancer in a subject. These methods involve, for markers that are significantly decreased in expression or activity, the administration of that marker protein, or the introduction of mRNA or DNA corresponding to the decreased marker (e.g., by gene therapy), to thereby increase the levels of the marker protein in the subject. For markers that are significantly increased in expression or activity, the subject may be administered mRNA or DNA antisense to the increased marker (e.g., by gene therapy), or may be administered antibodies specific for the marker protein, to thereby decrease the levels of the marker protein in the subject. In this manner, the development prostate cancer in a subject may be prevented.
The invention also provides methods of assessing a treatment or therapy for prostate cancer condition in a subject. These methods involve isolating a sample from a subject (e.g., a sample containing prostate cancer cells or blood cells) suffering from prostate cancer who is undergoing a treatment or therapy, detecting the presence, quantity, and/or activity of the FKBP marker, e.g., FKBP54 marker of the invention in the first sample relative to a second sample from a subject afflicted prostate cancer who is not undergoing any treatment or therapy for the condition, and also relative to a third sample from a subject unafflicted by prostate cancer. The levels of FKBP marker, e.g., FKBP54 marker in the three samples are compared, and significant increases or decreases the FKBP marker, e.g., FKBP54 marker in the first sample relative to the other samples are observed, and correlated with the presence, risk of presence, or severity prostate cancer. By assessing prostate cancer has been lessened or alleviated in the sample, the ability of the treatment or therapy to treat prostate cancer is also determined.
The invention also provides methods for diagnosing androgen-dependent prostate cancer in a subject. The method involves isolating a sample from a subject (e.g., a sample containing prostate cancer cells or blood cells) who is suffering from prostate cancer, measuring the level of expression of the FKBP marker, e.g., FKBP54 marker in the presence and absence of androgen and comparing the difference in expression of the FKBP marker, e.g., FKBP54 marker in the presence and absence of androgen. The prostate cancer cells are androgen dependent if the expression of the marker is increased in the presence of androgen compared to the absence of androgen.
The invention also provides methods for determining the efficacy of androgen withdrawal treatment in a subject afflicted with prostate cancer. The method involves detecting in a subject sample at a first point in time, the expression level of the FKBP marker, e.g., FKBP54 marker; and detecting the expression level of the FKBP marker, e.g., FKBP54 marker at a subsequent point in time occurring after the subject begins androgen withdrawal treatment. The level of expression of the FKBP marker, e.g., FKBP54 marker detected at the first and second time points is compared. A decrease in the level of expression indicates that the androgen withdrawal treatment has decreased efficacy.
The invention also provides pharmaceutical compositions for the treatment of prostate cancer. These compositions may include a marker protein and/or nucleic acid of the invention (e.g., for those markers which are decreased in quantity or activity in prostate cancer cell sample versus non-prostate cancer cell sample), and can be formulated as described herein. Alternately, these compositions may include an antibody which specifically binds to a marker protein of the invention and/or an antisense nucleic acid molecule which is complementary to a marker nucleic acid of the invention (e.g., for those markers which are increased in quantity or activity in a prostate cancer cell sample versus non-prostate cancer cell sample), and can be formulated as described herein.
The invention also provides kits for assessing the presence of prostate cancer in a sample (e.g., a sample from a subject at risk for prostate cancer), the kit comprising an antibody, wherein the antibody specifically binds with a protein corresponding to the FKBP marker, e.g., FKBP54 marker.
The invention further provides kits for assessing the presence of prostate cancer in a sample from a subject (e.g., a subject at risk for prostate cancer), the kit comprising a nucleic acid probe wherein the probe specifically binds with a transcribed polynucleotide corresponding to the FKBP marker, e.g., FKBP54 marker.
The invention further provides kits for assessing the suitability of each of a plurality of compounds for inhibiting prostate cancer in a subject. Such kits include a plurality of compounds to be tested, and a reagent for assessing expression of the FKBP marker, e.g., FKBP54 marker.
Modifications to the above-described compositions and methods of the invention, according to standard techniques, will be readily apparent to one skilled in the art and are meant to be encompassed by the invention.
To facilitate an understanding of the present invention, a number of terms and phrases are defined below:
As used herein, the terms “polynucleotide” and “oligonucleotide” are used interchangeably, and include polymeric forms of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three-dimensional structure, and may perform any function, known or unknown. The following are non-limiting examples of polynucleotides: a gene or gene fragment, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. A polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component. The term also includes both double- and single-stranded molecules. Unless otherwise specified or required, any embodiment of this invention that is a polynucleotide encompasses both the double-stranded form and each of two complementary single-stranded forms known or predicted to make up the double-stranded form.
A polynucleotide is composed of a specific sequence of four nucleotide bases: adenine (A); cytosine (C); guanine (G); thymine (T); and uracil (U) for guanine when the polynucleotide is RNA. This, the term “polynucleotide sequence” is the alphabetical representation of a polynucleotide molecule. This alphabetical representation can be inputted into databases in a computer having a central processing unit and used for bioinformatics applications such as functional genomics and homology searching.
A “gene” includes a polynucleotide containing at least one open reading frame that is capable of encoding a particular polypeptide or protein after being transcribed and translated. Any of the polynucleotide sequences described herein may be used to identify larger fragments or full-length coding sequences of the gene with which they are associated. Methods of isolating larger fragment sequences are known to those of skill in the art, some of which are described herein.
A “gene product” includes an amino acid (e.g., peptide or polypeptide) generated when a gene is transcribed and translated.
A “probe” when used in the context of polynucleotide manipulation includes an oligonucleotide that is provided as a reagent to detect a target present in a sample of interest by hybridizing with the target. Usually, a probe will comprise a label or a means by which a label can be attached, either before or subsequent to the hybridization reaction. Suitable labels include, but are not limited to radioisotopes, fluorochromes, chemiluminescent compounds, dyes, and proteins, including enzymes.
A “primer” includes a short polynucleotide, generally with a free 3′-OH group that binds to a target or “template” present in a sample of interest by hybridizing with the target, and thereafter promoting polymerization of a polynucleotide complementary to the target. A “polymerase chain reaction” (“PCR”) is a reaction in which replicate copies are made of a target polynucleotide using a “pair of primers” or “set of primers” consisting of “upstream” and a “downstream” primer, and a catalyst of polymerization, such as a DNA polymerase, and typically a thermally-stable polymerase enzyme. Methods for PCR are well known in the art, and are taught, for example, in MacPherson et al., IRL Press at Oxford University Press (1991)). All processes of producing replicate copies of a polynucleotide, such as PCR or gene cloning, are collectively referred to herein as “replication”. A primer can also be used as a probe in hybridization reactions, such as Southern or Northern blot analyses (see, e.g., Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2 nd, ed ., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989).
The term “cDNAs” includes complementary DNA, that is mRNA molecules present in a cell or organism made into cDNA with an enzyme such as reverse transcriptase. A “cDNA library” includes a collection of mRNA molecules present in a cell or organism, converted into cDNA molecules with the enzyme reverse transcriptase, then inserted into “vectors” (other DNA molecules that can continue to replicate after addition of foreign DNA). Exemplary vectors for libraries include bacteriophage, viruses that infect bacteria (e.g., lambda phage). The library can then be probed for the specific cDNA (and thus mRNA) of interest.
A “gene delivery vehicle” includes a molecule that is capable of inserting one or more polynucleotides into a host cell. Examples of gene delivery vehicles are liposomes, biocompatible polymers, including natural polymers and synthetic polymers; lipoproteins; polypeptides; polysaccharides; lipopolysaccharides; artificial viral envelopes; metal particles; and bacteria, viruses and viral vectors, such as baculovirus, adenovirus, and retrovirus, bacteriophage, cosmid, plasmid, fungal vector and other recombination vehicles typically used in the art which have been described for replication and/or expression in a variety of eukaryotic and prokaryotic hosts. The gene delivery vehicles may be used for replication of the inserted polynucleotide, gene therapy as well as for simply polypeptide and protein expression.
A “vector” includes a self-replicating nucleic acid molecule that transfers an inserted polynucleotide into and/or between host cells. The term is intended to include vectors that function primarily for insertion of a nucleic acid molecule into a cell, replication vectors that function primarily for the replication of nucleic acid and expression vectors that function for transcription and/or translation of the DNA or RNA. Also intended are vectors that provide more than one of the above function.
A “host cell” is intended to include any individual cell or cell culture which can be or has been a recipient for vectors or for the incorporation of exogenous nucleic acid molecules, polynucleotides and/or proteins. It also is intended to include progeny of a single cell. The progeny may not necessarily be completely identical (in morphology or in genomic or total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation. The cells may be prokaryotic or eukaryotic, and include but are not limited to bacterial cells, yeast cells, insect cells, animal cells, and mammalian cells, e.g., murine, rat, simian or human cells.
The term “genetically modified” includes a cell containing and/or expressing a foreign gene or nucleic acid sequence which in turn modifies the genotype or phenotype of the cell or its progeny. This term includes any addition, deletion, or disruption to a cell's endogenous nucleotides.
As used herein, “expression” includes the process by which polynucleotides are transcribed into mRNA and translated into peptides, polypeptides, or proteins. If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA, if an appropriate eukaryotic host is selected. Regulatory elements required for expression include promoter sequences to bind RNA polymerase and transcription initiation sequences for ribosome binding. For example, a bacterial expression vector includes a promoter such as the lac promoter and for transcription initiation the Shine-Dalgamo sequence and the start codon AUG (Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2 nd, ed ., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989). Similarly, a eukaryotic expression vector includes a heterologous or homologous promoter for RNA polymerase II, a downstream polyadenylation signal, the start codon AUG, and a termination codon for detachment of the ribosome. Such vectors can be obtained commercially or assembled by the sequences described in methods well known in the art, for example, the methods described below for constructing vectors in general.
“Differentially expressed”, as applied to a gene, includes the differential production of mRNA transcribed from a gene or a protein product encoded by the gene. A differentially expressed gene may be overexpressed or underexpressed as compared to the expression level of a normal or control cell. In one aspect, it includes a differential that is 2.5 times, preferably 5 times or preferably 10 times higher or lower than the expression level detected in a control sample. The term “differentially expressed” also includes nucleotide sequences in a cell or tissue which are expressed where silent in a control cell or not expressed where expressed in a control cell.
The term “polypeptide” includes a compound of two or more subunit amino acids, amino acid analogs, or peptidomimetics. The subunits may be linked by peptide bonds. In another embodiment, the subunit may be linked by other bonds, e.g., ester, ether, etc. As used herein the term “amino acid” includes either natural and/or unnatural or synthetic amino acids, including glycine and both the D or L optical isomers, and amino acid analogs and peptidomimetics. A peptide of three or more amino acids is commonly referred to as an oligopeptide. Peptide chains of greater than three or more amino acids are referred to as a polypeptide or a protein.
“Hybridization” includes a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding may occur by Watson-Crick base pairing, Hoogstein binding, or in any other sequence-specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi-stranded complex, a single self-hybridizing strand, or any combination of these. A hybridization reaction may constitute a step in a more extensive process, such as the initiation of a PCR reaction, or the enzymatic cleavage of a polynucleotide by a ribozyme.
Hybridization reactions can be performed under conditions of different “stringency”. The stringency of a hybridization reaction includes the difficulty with which any two nucleic acid molecules will hybridize to one another. Under stringent conditions, nucleic acid molecules at least 60%, 65%, 70%, 75% identical to each other remain hybridized to each other, whereas molecules with low percent identity cannot remain hybridized. A preferred, non-limiting example of highly stringent hybridization conditions are hybridization in 6× sodium chloride/sodium citrate (SSC) at about 45° C., followed by one or more washes in 0.2 X SSC, 0.1% SDS at 50° C., preferably at 55° C., more preferably at 60° C., and even more preferably at 65° C.
When hybridization occurs in an antiparallel configuration between two single-stranded polynucleotides, the reaction is called “annealing” and those polynucleotides are described as “complementary”. A double-stranded polynucleotide can be “complementary” or “homologous” to another polynucleotide, if hybridization can occur between one of the strands of the first polynucleotide and the second. “Complementarity” or “homology” (the degree that one polynucleotide is complementary with another) is quantifiable in terms of the proportion of bases in opposing strands that are expected to hydrogen bond with each other, according to generally accepted base-pairing rules.
An “antibody” includes an immunoglobulin molecule capable of binding an epitope present on an antigen. As used herein, the term encompasses not only intact immunoglobulin molecules such as monoclonal and polyclonal antibodies, but also anti-idotypic antibodies, mutants, fragments, fusion proteins, bi-specific antibodies, humanized proteins, and modifications of the immunoglobulin molecule that comprises an antigen recognition site of the required specificity.
As used herein, the term “prostate cancer” (CaP) refers to the art recognized use of the term which commonly appears in men. The term “prostate cancer” refers to both the appearance of a palpable tumor of the prostate, and also to microscopically detectable neoplastic or transformed cells in the prostate gland. In the latter case, the said cytologically-detectable prostate cancer may be asymptomatic, in that neither the patient nor the medical practitioner detects the presence of the cancer cells. Cancer cells are generally found in the prostates of men who live into their seventies or eighties, however not all of these men develop prostate cancer. In the event that prostate cancer metastasizes to additional sites distal to the prostate, the condition is described as metastatic cancer (MC), to distinguish this condition from organ-confined prostate cancer. CaP fatality results from metastatic dissemination of prostatic adenocarcinoma cells to distant sites, usually in the axial skeleton.
As used herein, the term “marker” includes a polynucleotide or polypeptide molecule which is present or absent, or increased or decreased in quantity or activity in subjects afflicted with prostate cancer, or in cells involved in prostate cancer. The relative change in quantity or activity of the marker is correlated with the incidence or risk of incidence of prostate cancer.
As used herein, the term “panel of markers” includes a group of markers, the quantity or activity of each member of which is correlated with the incidence or risk of incidence of prostate cancer. In certain embodiments, a panel of markers may include only those markers which are either increased or decreased in quantity or activity in subjects afflicted with or cells involved in prostate cancer. In other embodiments, a panel of markers may include only those markers present in a specific tissue type which are correlated with the incidence or risk of incidence of prostate cancer.
Various aspects of the invention are described in further detail in the following subsections:
One aspect of the invention pertains to isolated nucleic acid molecules that either themselves are the genetic markers (e.g., mRNA) of the invention, or which encode the polypeptide markers of the invention, or fragments thereof. Another aspect of the invention pertains to isolated nucleic acid fragments sufficient for use as hybridization probes to identify the nucleic acid molecules encoding the markers of the invention in a sample, as well as nucleotide fragments for use as PCR primers for the amplification or mutation of the nucleic acid molecules which encode the markers of the invention. As used herein, the term “nucleic acid molecule” is intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule can be single-stranded or double-stranded, but preferably is double-stranded DNA.
The term “isolated nucleic acid molecule” includes nucleic acid molecules which are separated from other nucleic acid molecules which are present in the natural source of the nucleic acid. For example, with regards to genomic DNA, the term “isolated” includes nucleic acid molecules which are separated from the chromosome with which the genomic DNA is naturally associated. Preferably, an “isolated” nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5′ and 3′ ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. For example, in various embodiments, the isolated marker nucleic acid molecule of the invention, or nucleic acid molecule encoding a polypeptide marker of the invention, can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. Moreover, an “isolated” nucleic acid molecule, such as a cDNA molecule, can be substantially free of other cellular material, or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized.
A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule having the nucleotide sequence of the FKBP gene, e.g., the FKBP54 gene or a portion thereof, can be isolated using standard molecular biology techniques and the sequence information provided herein. Using all or portion of the nucleic acid sequence of the FKBP gene, e.g., the FKBP54 as a hybridization probe, a marker gene of the invention or a nucleic acid molecule encoding a polypeptide marker of the invention can be isolated using standard hybridization and cloning techniques (e.g., as described in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2 nd, ed ., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989).
A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to marker nucleotide sequences, or nucleotide sequences encoding a marker of the invention can be prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer.
In another preferred embodiment, an isolated nucleic acid molecule of the invention comprises a nucleic acid molecule which is a complement of the nucleotide sequence of a marker of the invention i.e., FKBP54, or a portion of any of these nucleotide sequences. A nucleic acid molecule which is complementary to such a nucleotide sequence is one which is sufficiently complementary to the nucleotide sequence such that it can hybridize to the nucleotide sequence, thereby forming a stable duplex.
The nucleic acid molecule of the invention, moreover, can comprise only a portion of the nucleic acid sequence of a marker nucleic acid of the invention, or a gene encoding a marker polypeptide of the invention, for example, a fragment which can be used as a probe or primer. The probe/primer typically comprises substantially purified oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes under stringent conditions to at least about 7 or 15, preferably about 20 or 25, more preferably about 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 400 or more consecutive nucleotides of a marker nucleic acid, or a nucleic acid encoding a marker polypeptide of the invention.
Probes based on the nucleotide sequence of a marker gene or of a nucleic acid molecule encoding a marker polypeptide of the invention can be used to detect transcripts or genomic sequences corresponding to the marker gene(s) and/or marker polypeptide(s) of the invention. In preferred embodiments, the probe comprises a label group attached thereto, e.g., the label group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can be used as a part of a diagnostic test kit for identifying cells or tissue which misexpress (e.g., over- or under-express) a marker polypeptide of the invention, or which have greater or fewer copies of a marker gene of the invention. For example, a level of a marker polypeptide-encoding nucleic acid in a sample of cells from a subject may be detected, the amount of mRNA transcript of a gene encoding a marker polypeptide may be determined, or the presence of mutations or deletions of a marker gene of the invention may be assessed.
The invention further encompasses nucleic acid molecules that differ from the nucleic acid sequences of the FKBP gene, e.g., FKBP54 gene due to degeneracy of the genetic code and which thus encode the same proteins as those encoded by the FKBP gene, e.g., FKBP54 gene.
In addition to the nucleotide sequences of the FKBP gene, e.g., FKBP54 gene it will be appreciated by those skilled in the art that DNA sequence polymorphisms that lead to changes in the amino acid sequences of the proteins encoded by the FKBP gene, e.g., FKBP54 gene may exist within a population (e.g., the human population). Such genetic polymorphism in the FKBP gene, e.g., FKBP54 gene may exist among individuals within a population due to natural allelic variation. An allele is one of a group of genes which occur alternatively at a given genetic locus. In addition it will be appreciated that DNA polymorphisms that affect RNA expression levels can also exist that may affect the overall expression level of that gene (e.g., by affecting regulation or degradation). As used herein, the phrase “allelic variant” includes a nucleotide sequence which occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. As used herein, the terms “gene” and “recombinant gene” refer to nucleic acid molecules which include an open reading frame encoding a marker polypeptide of the invention.
Nucleic acid molecules corresponding to natural allelic variants and homologues of the FKBP gene, e.g., FKBP54 marker gene, or genes encoding the FKBP, e.g., FKBP54 marker protein of the invention can be isolated based on their homology to the FKBP genes, e.g., FKBP54 gene using the cDNAs disclosed herein, or a portion thereof, as a hybridization probe according to standard hybridization techniques under stringent hybridization conditions. Nucleic acid molecules corresponding to natural allelic variants and homologues of the marker genes of the invention can further be isolated by mapping to the same chromosome or locus as the marker genes or genes encoding the marker proteins of the invention.
In another embodiment, an isolated nucleic acid molecule of the invention is at least 15, 20, 25, 30, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000 or more nucleotides in length and hybridizes under stringent conditions to a nucleic acid molecule corresponding to a nucleotide sequence of a marker gene or gene encoding a marker protein of the invention. As used herein, the term “hybridizes under stringent conditions” is intended to describe conditions for hybridization and washing under which nucleotide sequences at least 60% homologous to each other typically remain hybridized to each other. Preferably, the conditions are such that sequences at least about 70%, more preferably at least about 80%, even more preferably at least about 85% or 90% homologous to each other typically remain hybridized to each other. Such stringent conditions are known to those skilled in the art and can be found in Current Protocols in Molecular Biology , John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization conditions are hybridization in 6× sodium chloride/sodium citrate (SSC) at about 45° C., followed by one or more washes in 0.2×SSC, 0.1% SDS at 50° C., preferably at 55° C., more preferably at 60° C., and even more preferably at 65° C. Preferably, an isolated nucleic acid molecule of the invention that hybridizes under stringent conditions to the sequence of the FKBP gene, e.g., FKBP54 gene. As used herein, a “naturally-occurring” nucleic acid molecule includes an RNA or DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a natural protein).
In addition to naturally-occurring allelic variants of the marker gene and gene encoding a marker protein of the invention sequences that may exist in the population, the skilled artisan will further appreciate that changes can be introduced by mutation into the nucleotide sequences of the marker genes or genes encoding the marker proteins of the invention, thereby leading to changes in the amino acid sequence of the encoded proteins, without altering the functional activity of these proteins. For example, nucleotide substitutions leading to amino acid substitutions at “non-essential” amino acid residues can be made. A “non-essential” amino acid residue is a residue that can be altered from the wild-type sequence of a protein without altering the biological activity, whereas an “essential” amino acid residue is required for biological activity. For example, amino acid residues that are conserved among allelic variants or homologs of a gene (e.g., among homologs of a gene from different species) are predicted to be particularly unamenable to alteration.
Accordingly, another aspect of the invention pertains to nucleic acid molecules encoding a marker protein of the invention that contain changes in amino acid residues that are not essential for activity. Such proteins differ in amino acid sequence from the marker proteins encoded by the FKBP genes, e.g., the FKBP54 gene, yet retain biological activity. In one embodiment, the protein comprises an amino acid sequence at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or more homologous to a marker protein of the invention.
An isolated nucleic acid molecule encoding a protein homologous to a marker protein of the invention can be created by introducing one or more nucleotide substitutions, additions or deletions into the nucleotide sequence of the gene encoding the marker protein, such that one or more amino acid substitutions, additions or deletions are introduced into the encoded protein. Mutations can be introduced into the FKBP gene, e.g., the FKBP54 gene of the invention by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at one or more predicted non-essential amino acid residues. A “conservative amino acid substitution” is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Alternatively, mutations can be introduced randomly along all or part of a coding sequence of a gene of the invention, such as by saturation mutagenesis, and the resultant mutants can be screened for biological activity to identify mutants that retain activity. Following mutagenesis, the encoded protein can be expressed recombinantly and the activity of the protein can be determined.
Another aspect of the invention pertains to isolated nucleic acid molecules which are antisense to the marker genes and genes encoding marker proteins of the invention. An “antisense” nucleic acid comprises a nucleotide sequence which is complementary to a “sense” nucleic acid encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be complementary to an entire coding strand of the FKBP gene, e.g., the FKBP54 gene, or to only a portion thereof. In one embodiment, an antisense nucleic acid molecule is antisense to a “coding region” of the coding strand of a nucleotide sequence of the invention. The term “coding region” includes the region of the nucleotide sequence comprising codons which are translated into amino acid. In another embodiment, the antisense nucleic acid molecule is antisense to a “noncoding region” of the coding strand of a nucleotide sequence of the invention. The term “noncoding region” includes 5′ and 3′ sequences which flank the coding region that are not translated into amino acids (i.e., also referred to as 5′ and 3′ untranslated regions).
Antisense nucleic acids of the invention can be designed according to the rules of Watson and Crick base pairing. The antisense nucleic acid molecule can be complementary to the entire coding region of an mRNA corresponding to a gene of the invention, but more preferably is an oligonucleotide which is antisense to only a portion of the coding or noncoding region. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be constructed using chemical synthesis and enzymatic ligation reactions using procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously modified nucleotides designed to increase the biological stability of the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides can be used. Examples of modified nucleotides which can be used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl)uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5′-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N-6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl)uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically using an expression vector into which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described further in the following subsection).
The antisense nucleic acid molecules of the invention are typically administered to a subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or genomic DNA encoding a marker protein of the invention to thereby inhibit expression of the protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule which binds to DNA duplexes, through specific interactions in the major groove of the double helix. An example of a route of administration of antisense nucleic acid molecules of the invention include direct injection at a tissue site (e.g., in skin). Alternatively, antisense nucleic acid molecules can be modified to target selected cells and then administered systemically. For example, for systemic administration, antisense molecules can be modified such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules to peptides or antibodies which bind to cell surface receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using the vectors described herein. To achieve sufficient intracellular concentrations of the antisense molecules, vector constructs in which the antisense nucleic acid molecule is placed under the control of a strong pol II or pol III promoter are preferred.
In yet another embodiment, the antisense nucleic acid molecule of the invention is an α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual β-units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids. Res. 15:6625-6641). The antisense nucleic acid molecule can also comprise a 2′-o-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-330).
In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave mRNA transcripts of the FKBP54 gene of the invention, to thereby inhibit translation of this mRNA. A ribozyme having specificity for a marker protein-encoding nucleic acid can be designed based upon the nucleotide sequence of a gene of the invention, disclosed herein. For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the active site is complementary to the nucleotide sequence to be cleaved in a marker protein-encoding mRNA. See, e.g., Cech et al. U.S. Pat. No. 4,987,071; and Cech et al. U.S. Pat. No. 5,116,742. Alternatively, mRNA transcribed from a gene of the invention can be used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel, D. and Szostak, J. W. (1993) Science 261:1411-1418.
Alternatively, expression of FKBP genes, e.g., the FKBP54 gene can be inhibited by targeting nucleotide sequences complementary to the regulatory region of these genes (e.g., the promoter and/or enhancers) to form triple helical structures that prevent transcription of the gene in target cells. See generally, Helene, C. (1991) Anticancer Drug Des. 6(6):569-84; Helene, C. et al. (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher, L. J. (1992) Bioassays 14(12):807-15.
In yet another embodiment, the nucleic acid molecules of the present invention can be modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic acid molecules can be modified to generate peptide nucleic acids (see Hyrup B. et al. (1996) Bioorganic & Medicinal Chemistry 4 (1): 5-23). As used herein, the terms “peptide nucleic acids” or “PNAs” refer to nucleic acid mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are retained. The neutral backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under conditions of low ionic strength. The synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis protocols as described in Hyrup B. et al. (1996) supra; Perry-O'Keefe et al. Proc. Natl. Acad. Sci. 93: 14670-675.
PNAs can be used in therapeutic and diagnostic applications. For example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene expression by, for example, inducing transcription or translation arrest or inhibiting replication. PNAs of the nucleic acid molecules of FKBPs, e.g., FKBP54 can also be used in the analysis of single base pair mutations in a gene, (e.g., by PNA-directed PCR clamping); as ‘artificial restriction enzymes’ when used in combination with other enzymes, (e.g., S1 nucleases (Hyrup B. (1996) supra)); or as probes or primers for DNA sequencing or hybridization (Hyrup B. et al. (1996) supra; Perry-O'Keefe supra).
In another embodiment, PNAs can be modified, (e.g., to enhance their stability or cellular uptake), by attaching lipophilic or other helper groups to PNA, by the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery known in the art. For example, PNA-DNA chimeras of the nucleic acid molecules of the invention can be generated which may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes, (e.g., RNAse H and DNA polymerases), to interact with the DNA portion while the PNA portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking, number of bonds between the nucleobases, and orientation (Hyrup B. (1996) supra). The synthesis of PNA-DNA chimeras can be performed as described in Hyrup B. (1996) supra and Finn P. J. et al. (1996) Nucleic Acids Res. 24 (17): 3357-63. For example, a DNA chain can be synthesized on a solid support using standard phosphoramidite coupling chemistry and modified nucleoside analogs, e.g., 5′-(4-methoxytrityl)amino-5′-deoxy-thymidine phosphoramidite, can be used as a between the PNA and the 5′ end of DNA (Mag, M. et al. (1989) Nucleic Acid Res. 17: 5973-88). PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5′ PNA segment and a 3′ DNA segment (Finn P. J. et al. (1996) supra). Alternatively, chimeric molecules can be synthesized with a 5′ DNA segment and a 3′ PNA segment (Peterser, K. H. et al. (1975) Bioorganic Med. Chem. Lett. 5: 1119-11124).
In other embodiments, the oligonucleotide may include other appended groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. USA 86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652; PCT Publication No. WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. WO89/10134). In addition, oligonucleotides can be modified with hybridization-triggered cleavage agents (See, e.g., Krol et al. (1988) Bio - Techniques 6:958-976) or intercalating agents. (See, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide may be conjugated to another molecule, (e.g., a peptide, hybridization triggered cross-linking agent, transport agent, or hybridization-triggered cleavage agent). Finally, the oligonucleotide may be detectably labeled, either such that the label is detected by the addition of another reagent (e.g., a substrate for an enzymatic label), or is detectable immediately upon hybridization of the nucleotide (e.g., a radioactive label or a fluorescent label (e.g., a molecular beacon, as described in U.S. Pat. No. 5,876,930.
One aspect of the invention pertains to isolated marker proteins, and biologically active portions thereof, as well as polypeptide fragments suitable for use as immunogens to raise anti-marker protein antibodies. In one embodiment, native marker proteins can be isolated from cells or tissue sources by an appropriate purification scheme using standard protein purification techniques. In another embodiment, marker proteins are produced by recombinant DNA techniques. Alternative to recombinant expression, a marker protein or polypeptide can be synthesized chemically using standard peptide synthesis techniques.
An “isolated” or “purified” protein or biologically active portion thereof is substantially free of cellular material or other contaminating proteins from the cell or tissue source from which the marker protein is derived, or substantially free from chemical precursors or other chemicals when chemically synthesized. The language “substantially free of cellular material” includes preparations of marker protein in which the protein is separated from cellular components of the cells from which it is isolated or recombinantly produced. In one embodiment, the language “substantially free of cellular material” includes preparations of marker protein having less than about 30% (by dry weight) of non-marker protein (also referred to herein as a “contaminating protein”), more preferably less than about 20% of non-marker protein, still more preferably less than about 10% of non-marker protein, and most preferably less than about 5% non-marker protein. When the marker protein or biologically active portion thereof is recombinantly produced, it is also preferably substantially free of culture medium, i.e., culture medium represents less than about 20%, more preferably less than about 10%, and most preferably less than about 5% of the volume of the protein preparation.
The language “substantially free of chemical precursors or other chemicals” includes preparations of marker protein in which the protein is separated from chemical precursors or other chemicals which are involved in the synthesis of the protein. In one embodiment, the language “substantially free of chemical precursors or other chemicals” includes preparations of protein having less than about 30% (by dry weight) of chemical precursors or non-protein chemicals, more preferably less than about 20% chemical precursors or non-protein chemicals, still more preferably less than about 10% chemical precursors or non-protein chemicals, and most preferably less than about 5% chemical precursors or non-protein chemicals.
As used herein, a “biologically active portion” of a marker protein includes a fragment of a marker protein comprising amino acid sequences sufficiently homologous to or derived from the amino acid sequence of the marker protein, which include fewer amino acids than the full length marker proteins, and exhibit at least one activity of a marker protein. Typically, biologically active portions comprise a domain or motif with at least one activity of the marker protein. A biologically active portion of a marker protein can be a polypeptide which is, for example, 10, 25, 50, 100, 200 or more amino acids in length. Biologically active portions of a marker protein can be used as targets for developing agents which modulate a marker protein-mediated activity.
In a preferred embodiment, marker protein is encoded by the FKBP genes, e.g., FKBP54 gene. In other embodiments, the marker protein is substantially homologous to a marker protein encoded by the FKBP genes, e.g., FKBP54 gene, and retains the functional activity of the marker protein, yet differs in amino acid sequence due to natural allelic variation or mutagenesis, as described in detail in subsection I above. Accordingly, in another embodiment, the marker protein is a protein which comprises an amino acid sequence at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or more homologous to the amino acid sequence encoded by the FKBP genes, e.g., FKBP54 gene.
To determine the percent identity of two amino acid sequences or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). In a preferred embodiment, the length of a reference sequence aligned for comparison purposes is at least 30%, preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, and even more preferably at least 70%, 80%, or 90% of the length of the reference sequence. The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein amino acid or nucleic acid “identity” is equivalent to amino acid or nucleic acid “homology”). The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences.
The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm. In a preferred embodiment, the percent identity between two amino acid sequences is determined using the Needleman and Wunsch ( J. Mol. Biol . (48):444-453 (1970)) algorithm which has been incorporated into the GAP program in the GCG software package, using either a Blossom 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent identity between two nucleotide sequences is determined using the GAP program in the GCG software package using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the percent identity between two amino acid or nucleotide sequences is determined using the algorithm of E. Meyers and W. Miller (CABIOS, 4:11-17 (1989)) which has been incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4.
The nucleic acid and protein sequences of the present invention can further be used as a “query sequence” to perform a search against public databases to, for example, identify other family members or related sequences. Such searches can be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be performed with the NBLAST program, score=100, wordlength=12 to obtain nucleotide sequences homologous to nucleic acid molecules of the invention. BLAST protein searches can be performed with the XBLAST program, score=50, wordlength=3 to obtain amino acid sequences homologous to marker protein molecules of the invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used.
The invention also provides chimeric or fusion marker proteins. As used herein, a marker “chimeric protein” or “fusion protein” comprises a marker polypeptide operatively linked to a non-marker polypeptide. An “marker polypeptide” includes a polypeptide having an amino acid sequence encoded by the FKBP genes, e.g., FKBP54 gene, whereas a “non-marker polypeptide” includes a polypeptide having an amino acid sequence corresponding to a protein which is not substantially homologous to the marker protein, e.g., a protein which is different from marker protein and which is derived from the same or a different organism. Within a marker fusion protein the polypeptide can correspond to all or a portion of a marker protein. In a preferred embodiment, a marker fusion protein comprises at least one biologically active portion of a marker protein. Within the fusion protein, the term “operatively linked” is intended to indicate that the marker polypeptide and the non-marker polypeptide are fused in-frame to each other. The non-marker polypeptide can be fused to the N-terminus or C-terminus of the marker polypeptide.
For example, in one embodiment, the fusion protein is a GST-marker fusion protein in which the marker sequences are fused to the C-terminus of the GST sequences. Such fusion proteins can facilitate the purification of recombinant marker proteins.
In another embodiment, the fusion protein is a marker protein containing a heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian host cells), expression and/or secretion of marker proteins can be increased through use of a heterologous signal sequence. Such signal sequences are well known in the art.
The marker fusion proteins of the invention can be incorporated into pharmaceutical compositions and administered to a subject in vivo, as described herein. The marker fusion proteins can be used to affect the bioavailability of a marker protein substrate. Use of marker fusion proteins may be useful therapeutically for the treatment of disorders (e.g., prostate cancer) caused by, for example, (i) aberrant modification or mutation of a gene encoding a markerprotein; (ii) mis-regulation of the marker protein-encoding gene; and (iii) aberrant post-translational modification of a marker protein.
Moreover, the marker-fusion proteins of the invention can be used as immunogens to produce anti-marker protein antibodies in a subject, to purify marker protein ligands and in screening assays to identify molecules which inhibit the interaction of a marker protein with a marker protein substrate.
Preferably, a marker chimeric or fusion protein of the invention is produced by standard recombinant DNA techniques. For example, DNA fragments coding for the different polypeptide sequences are ligated together in-frame in accordance with conventional techniques, for example by employing blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using anchor primers which give rise to complementary overhangs between two consecutive gene fragments which can subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for example, Current Protocols in Molecular Biology , eds. Ausubel et al. John Wiley & Sons: 1992). Moreover, many expression vectors are commercially available that already encode a fusion moiety (e.g., a GST polypeptide). A marker protein-encoding nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the marker protein.
A signal sequence can be used to facilitate secretion and isolation of the secreted protein or other proteins of interest. Signal sequences are typically characterized by a core of hydrophobic amino acids which are generally cleaved from the mature protein during secretion in one or more cleavage events. Such signal peptides contain processing sites that allow cleavage of the signal sequence from the mature proteins as they pass through the secretory pathway. Thus, the invention pertains to the described polypeptides having a signal sequence, as well as to polypeptides from which the signal sequence has been proteolytically cleaved (i.e., the cleavage products). In one embodiment, a nucleic acid sequence encoding a signal sequence can be operably linked in an expression vector to a protein of interest, such as a protein which is ordinarily not secreted or is otherwise difficult to isolate. The signal sequence directs secretion of the protein, such as from a eukaryotic host into which the expression vector is transformed, and the signal sequence is subsequently or concurrently cleaved. The protein can then be readily purified from the extracellular medium by art recognized methods. Alternatively, the signal sequence can be linked to the protein of interest using a sequence which facilitates purification, such as with a GST domain.
The present invention also pertains to variants of the marker proteins of the invention which function as either agonists (mimetics) or as antagonists to the marker proteins. Variants of the marker proteins can be generated by mutagenesis, e.g., discrete point mutation or truncation of a marker protein. An agonist of the marker proteins can retain substantially the same, or a subset, of the biological activities of the naturally occurring form of a marker protein. An antagonist of a marker protein can inhibit one or more of the activities of the naturally occurring form of the marker protein by, for example, competitively modulating an activity of a marker protein. Thus, specific biological effects can be elicited by treatment with a variant of limited function. In one embodiment, treatment of a subject with a variant having a subset of the biological activities of the naturally occurring form of the protein has fewer side effects in a subject relative to treatment with the naturally occurring form of the marker protein.
Variants of a marker protein which function as either marker protein agonists (mimetics) or as marker protein antagonists can be identified by screening combinatorial libraries of mutants, e.g., truncation mutants, of a marker protein for marker protein agonist or antagonist activity. In one embodiment, a variegated library of marker protein variants is generated by combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A variegated library of marker protein variants can be produced by, for example, enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences such that a degenerate set of potential marker protein sequences is expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage display) containing the set of marker protein sequences therein. There are a variety of methods which can be used to produce libraries of potential marker protein variants from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an appropriate expression vector. Use of a degenerate set of genes allows for the provision, in one mixture, of all of the sequences encoding the desired set of potential marker protein sequences. Methods for synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, S. A. (1983) Tetrahedron 39:3; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477).
In addition, libraries of fragments of a protein coding sequence corresponding to a marker protein of the invention can be used to generate a variegated population of marker protein fragments for screening and subsequent selection of variants of a marker protein. In one embodiment, a library of coding sequence fragments can be generated by treating a double stranded PCR fragment of a marker protein coding sequence with a nuclease under conditions wherein nicking occurs only about once per molecule, denaturing the double stranded DNA, renaturing the DNA to form double stranded DNA which can include sense/antisense pairs from different nicked products, removing single stranded p