Plaque It!
Sponsored by: Flash of Genius |
| 4496778 | Process for the hydroxylation of olefins using molecular oxygen, an osmium containing catalyst, a copper Co-catalyst, and an aromatic amine based promoter | January, 1985 | Myers et al. | 568/860 |
| 4650764 | Helper cell | March, 1987 | Temin et al. | 435/240 |
| 4657760 | Methods and compositions using monoclonal antibody to human T cells | April, 1987 | Kung et al. | 424/85 |
| 4683195 | Process for amplifying, detecting, and/or-cloning nucleic acid sequences | July, 1987 | Mullis et al. | 435/6 |
| 4683202 | Process for amplifying nucleic acid sequences | July, 1987 | Mullis | 435/91 |
| 4797368 | Adeno-associated virus as eukaryotic expression vector | January, 1989 | Carter et al. | 435/320 |
| 4861719 | DNA constructs for retrovirus packaging cell lines | August, 1989 | Miller | 435/236 |
| 4873191 | Genetic transformation of zygotes | October, 1989 | Wagner et al. | 435/172.3 |
| 4940838 | Process for the incorporation of foreign dna into the genome of dicotyledonous plants | July, 1990 | Schilperoort et al. | 800/205 |
| 4946787 | N-(ω,(ω-1)-dialkyloxy)- and N-(ω,(ω-1)-dialkenyloxy)-alk-1-yl-N,N,N-tetrasubstituted ammonium lipids and uses therefor | August, 1990 | Eppstein et al. | 435/240.2 |
| 4965188 | Process for amplifying, detecting, and/or cloning nucleic acid sequences using a thermostable enzyme | October, 1990 | Mullis et al. | 435/6 |
| 4980289 | Promoter deficient retroviral vector | December, 1990 | Temin et al. | 435/235 |
| 5096815 | Generation and selection of novel DNA-binding proteins and polypeptides | March, 1992 | Ladner et al. | 435/69.1 |
| 5124263 | Recombination resistant retroviral helper cell and products produced thereby | June, 1992 | Temin et al. | 435/240.2 |
| 5139941 | AAV transduction vectors | August, 1992 | Myzyczka et al. | 435/172.3 |
| 5173410 | Transfer vector | December, 1992 | Ahlquist | 435/91 |
| 5198346 | Generation and selection of novel DNA-binding proteins and polypeptides | March, 1993 | Ladner et al. | 435/69.1 |
| 5206344 | Interleukin-2 muteins and polymer conjugation thereof | April, 1993 | Katre et al. | 530/351 |
| 5223409 | Directed evolution of novel binding proteins | June, 1993 | Ladner et al. | 435/69.7 |
| 5225212 | Microreservoir liposome composition and method | July, 1993 | Martin et al. | 424/450 |
| 5399346 | Gene therapy | March, 1995 | Anderson et al. | 424/93.21 |
| 5459127 | Cationic lipids for intracellular delivery of biologically active molecules | October, 1995 | Felgner et al. | 514/7 |
| 5474796 | Method and apparatus for conducting an array of chemical reactions on a support surface | December, 1995 | Brennan | 427/2.13 |
| 5500360 | RNA transformation vector | March, 1996 | Ahlquist et al. | 435/172.3 |
| 5501967 | Process for the site-directed integration of DNA into the genome of plants | March, 1996 | Offringa et al. | 435/172.3 |
| 5538848 | Method for detecting nucleic acid amplification using self-quenching fluorescence probe | July, 1996 | Livak et al. | 435/5 |
| 5580859 | Delivery of exogenous DNA sequences in a mammal | December, 1996 | Felgner et al. | 514/44 |
| 5584807 | Gas driven gene delivery instrument | December, 1996 | McCabe | 604/71 |
| 5589466 | Induction of a protective immune response in a mammal by injecting a DNA sequence | December, 1996 | Felgner et al. | 514/44 |
| 5589616 | Monocot seed storage proteins in dicots | December, 1996 | Hoffman | 800/205 |
| 5605798 | DNA diagnostic based on mass spectrometry | February, 1997 | Koster | 435/6 |
| 5608153 | Semiconductor acceleration sensor and testing method thereof | March, 1997 | Ueyanagi | 73/1 |
| 5614396 | Methods for the genetic modification of endogenous genes in animal cells by homologous recombination | March, 1997 | Bradley et al. | 435/172.3 |
| 5618682 | Bioluminescence measurement system | April, 1997 | Scheirer | 435/8 |
| 5674713 | DNA sequences encoding coleoptera luciferase activity | October, 1997 | McElroy et al. | 435/69.7 |
| 5733731 | Peptide library and screening method | March, 1998 | Schatz et al. | 436/6 |
| 5777324 | Method and apparatus for maldi analysis | July, 1998 | Hillenkamp | 250/288 |
| 5811238 | Methods for generating polynucleotides having desired characteristics by iterative selection and recombination | September, 1998 | Stemmer et al. | 435/6 |
| 5824877 | Method for soybean transformation and regeneration | October, 1998 | Hinchee et al. | 800/205 |
| 5830721 | DNA mutagenesis by random fragmentation and reassembly | November, 1998 | Stemmer et al. | 435/172.1 |
| 5837458 | Methods and compositions for cellular and metabolic engineering | November, 1998 | Minshull et al. | 435/6 |
| 5843654 | Rapid detection of mutations in the p53 gene | December, 1998 | Heisler et al. | 435/6 |
| 5843669 | Cleavage of nucleic acid acid using thermostable methoanococcus jannaschii FEN-1 endonucleases | December, 1998 | Kaiser et al. | 435/6 |
| 5846717 | Detection of nucleic acid sequences by invader-directed cleavage | December, 1998 | Brow et al. | 435/6 |
| 5846795 | RNA transformation vector | December, 1998 | Ahlquist et al. | 435/172.3 |
| 5858659 | Polymorphism detection | January, 1999 | Sapolsky et al. | 435/6 |
| 5866785 | Recombinant plant viral nucleic acids | February, 1999 | Donson et al. | 800/205 |
| 5888780 | Rapid detection and identification of nucleic acid variants | March, 1999 | Dahlberg et al. | 435/91.53 |
| 5891840 | Stabilization of somatotropins by modification of cysteine residues utilizing site directed mutagenesis | April, 1999 | Cady et al. | 514/2 |
| 5919626 | Attachment of unmodified nucleic acids to silanized solid phase surfaces | July, 1999 | Shi et al. | 435/6 |
| 5925525 | Method of identifying nucleotide differences | July, 1999 | Fodor et al. | 435/6 |
| 5952174 | Ligase/polymerase-mediated genetic bit analysis of single nucleotide polymorphisms and its use in genetic analysis | September, 1999 | Nikiforov et al. | 435/6 |
| 5965794 | Viral amplification of recombinant messenger RNA in transgenic plants | October, 1999 | Turpen | 800/288 |
| 5976796 | Construction and expression of renilla luciferase and green fluorescent protein fusion genes | November, 1999 | Szalay et al. | 435/6 |
| 5977438 | Production of peptides in plants as viral coat protein fusions | November, 1999 | Turpen et al. | 800/288 |
| 5981839 | Methods and compositions for regulated transcription and expression of heterologous genes | November, 1999 | Knauf et al. | 800/287 |
| 5985551 | Method and apparatus for conducting an array of chemical reactions on a support surface | November, 1999 | Brennan | 435/6 |
| 5985557 | Invasive cleavage of nucleic acids | November, 1999 | Prudent et al. | 435/6 |
| 5994069 | Detection of nucleic acids by multiple sequential invasive cleavages | November, 1999 | Hall et al. | 435/6 |
| 6001311 | Apparatus for diverse chemical synthesis using two-dimensional array | December, 1999 | Brennan | 422/131 |
| 6001567 | Detection of nucleic acid sequences by invader-directed cleavage | December, 1999 | Brow et al. | 435/6 |
| 6017696 | Methods for electronic stringency control for molecular biological analysis and diagnostics | January, 2000 | Heller | 435/6 |
| 6043031 | DNA diagnostics based on mass spectrometry | March, 2000 | Koster et al. | 435/6 |
| 6045996 | Hybridization assays on oligonucleotide arrays | April, 2000 | Cronin et al. | 435/6 |
| 6051380 | Methods and procedures for molecular biological analysis and diagnostics | April, 2000 | Sosnowski et al. | 435/6 |
| 6051757 | Regeneration of plants containing genetically engineered T-DNA | April, 2000 | Barton et al. | 800/294 |
| 6063947 | Canola oil having increased oleic acid and decreased linolenic acid content | May, 2000 | DeBonte et al. | 554/223 |
| 6068818 | Multicomponent devices for molecular biological analysis and diagnostics | May, 2000 | Ackley et al. | 422/50 |
| 6074859 | Mutant-type bioluminescent protein, and process for producing the mutant-type bioluminescent protein | June, 2000 | Hirokawa et al. | 435/189 |
| 6080912 | Methods for creating transgenic animals | June, 2000 | Bremel et al. | 800/23 |
| 6090543 | Cleavage of nucleic acids | July, 2000 | Prudent et al. | 435/6 |
| EP0025949 | September, 1980 | Acylated triazolyl-gamma-fluoropinacolyl derivatives, process for their preparation and their use as fungicides. | ||
| EP0078220 | October, 1985 | Polycrystalline silicon interconnections for bipolar transistor flip-flop. | ||
| EP0185573 | November, 1985 | Expression and excretion of polypeptides in eucaryotes under the control of an adenovirus promoter | ||
| EP0453242 | April, 1991 | Transfer and expression of gene sequences into central nervous system cells using herpes simplex virus mutants with deletions in genes for viral replication | ||
| EP0488528 | October, 1991 | Recombinant adeno-associated virus vectors | ||
| WO/1993/003367 | February, 1983 | DIFFERENTIAL BINDING AFFINITIES AND DISSOCIATION ASSAYS BASED THEREON | ||
| WO/1984/003564 | September, 1984 | METHOD OF DETERMINING ANTIGENICALLY ACTIVE AMINO ACID SEQUENCES | ||
| WO/1990/008832 | August, 1990 | RETROVIRAL VECTOR AND ITS USE IN THE PRODUCTION OF TRANSGENIC ANIMALS | ||
| WO/1991/018088 | November, 1991 | ADENO-ASSOCIATED VIRUS (AAV)-BASED EUCARYOTIC VECTORS | ||
| WO/1992/005263 | April, 1992 | VIRAL DEFECTIVE VACCINE PRODUCED BY TRANSCOMPLEMENTING CELL LINE | ||
| WO/1994/012649 | June, 1994 | GENE THERAPY FOR CYSTIC FIBROSIS | ||
| WO/1994/021807 | September, 1994 | DEFECTIVE MUTANT NON-RETROVIRAL VIRUS (E.G. HSV) AS VACCINE | ||
| WO/1994/026914 | November, 1994 | ADENOVIRAL VECTORS OF ANIMAL ORIGIN AND USE THEREOF IN GENE THERAPY | ||
| WO/1994/028152 | December, 1994 | DEFECTIVE ADENOVIRUSES AND CORRESPONDING COMPLEMENTATION LINES | ||
| WO/1994/028938 | December, 1994 | ADENOVIRUS VECTORS FOR GENE THERAPY SPONSORSHIP | ||
| WO/1995/002697 | January, 1995 | DEFECTIVE ADENOVIRUS VECTORS AND USE THEREOF IN GENE THERAPY | ||
| WO/1995/021931 | August, 1995 | NUCLEIC ACID-CONTAINING COMPOSITION, ITS PREPARATION AND USE | ||
| WO/1996/025508 | August, 1996 | NUCLEIC ACID-CONTAINING COMPOSITION, PREPARATION AND USE THEREOF | ||
| WO/1999/067641 | December, 1999 | DECODING OF ARRAY SENSORS WITH MICROSPHERES | ||
| WO/2000/039587 | July, 2000 | COMPOSITE ARRAYS UTILIZING MICROSPHERES |
This patent application was supported in part by grant CA-64556 from the National Institutes of Health. The government has certain rights in the invention.
The present invention relates to intracellular signaling molecules, in particular the Nod2 protein and nucleic acids encoding the Nod2 protein. The present invention provides methods of identifying modulators of Nod2 signaling. In particular, the present invention additionally provides methods of screening immune modulators such as adjuvants using Nod2. The present invention further provides methods of altering Nod2 signaling.
Inflammatory bowel diseases (IBD) are defined by chronic, relapsing intestinal inflammation of obscure origin. IBD refers to two distinct disorders, Crohn's disease and ulcerative colitis (UC). Both diseases appear to involve either a dysregulated immune response to GI tract antigens, a mucosal barrier breach, and/or an adverse inflammatory reaction to a persistent intestinal infection. The GI tract luminal contents and bacteria constantly stimulate the mucosal immune system, and a delicate balance of proinflammatory and anti-inflammatory cells and molecules maintains the integrity of the GI tract, without eliciting severe and damaging inflammation. It is unknown how the IBD inflammatory cascade begins, but constant GI antigen-dependent stimulation of the mucosal and systemic immune systems perpetuates the inflammatory cascade and drives lesion formation.
There is no known cure for IBD, which afflicts 2 million Americans. Current are herein incorporated by reference; Mann et al., Cell 33:153 [1983]; Markowitz et al., J. Virol., 62:1120 [1988]; PCT/US95/14575; EP 453242; EP178220; Bernstein et al. Genet. Eng., 7:235 [1985]; McCormick, BioTechnol., 3:689 [1985]; WO 95/07358; and Kuo et al., Blood 82:845 [1993]). The retroviruses are integrating viruses that infect dividing cells. The retrovirus genome includes two LTRs, an encapsidation sequence and three coding regions (gag, pol and env). In recombinant retroviral vectors, the gag, pol and env genes are generally deleted, in whole or in part, and replaced with a heterologous nucleic acid sequence of interest. These vectors can be constructed from different types of retrovirus, such as, HIV, MoMuLV (“murine Moloney leukaemia virus” MSV (“murine Moloney sarcoma virus”), HaSV (“Harvey sarcoma virus”); SNV (“spleen necrosis virus”); RSV (“Rous sarcoma virus”) and Friend virus. Defective retroviral vectors are also disclosed in WO95/02697.
In general, in order to construct recombinant retroviruses containing a nucleic acid sequence, a plasmid is constructed that contains the LTRs, the encapsidation sequence and the coding sequence. This construct is used to transfect a packaging cell line, which cell line is able to supply in trans the retroviral functions that are deficient in the plasmid. In general, the packaging cell lines are thus able to express the gag, pol and env genes. Such packaging cell lines have been described in the prior art, in particular the cell line PA317 (U.S. Pat. No. 4,861,719, herein incorporated by reference), the PsiCRIP cell line (See, WO90/02806), and the GP+envAm-12 cell line (See, WO89/07150). In addition, the recombinant retroviral vectors can contain modifications within the LTRs for suppressing transcriptional activity as well as extensive encapsidation sequences that may include a part of the gag gene (Bender et al., J. Virol., 61:1639 [1987]). Recombinant retroviral vectors are purified by standard techniques known to those having ordinary skill in the art.
Alternatively, the vector can be introduced in vivo by lipofection. For the past decade, there has been increasing use of liposomes for encapsulation and transfection of nucleic acids in vitro. Synthetic cationic lipids designed to limit the difficulties and dangers encountered with liposome mediated transfection can be used to prepare liposomes for in vivo transfection of a gene encoding a marker (Felgner et. al., Proc. Natl. methods of managing IBD symptoms cost an estimated $1.2 billion annually in the United States alone.
In patients with IBD, ulcers and inflammation of the inner lining of the intestines lead to symptoms of abdominal pain, diarrhea, and rectal bleeding. Ulcerative colitis occurs in the large intestine, while in Crohn's, the disease can involve the entire GI tract as well as the small and large intestines. For most patients, IBD is a chronic condition with symptoms lasting for months to years. It is most common in young adults, but can occur at any age. It is found worldwide, but is most common in industrialized countries such as the United States, England, and northern Europe. It is especially common in people of Jewish descent and has racial differences in incidence as well. The clinical symptoms of IBD are intermittent rectal bleeding, crampy abdominal pain, weight loss and diarrhea. Diagnosis of IBD is based on the clinical symptoms, the use of a barium enema, but direct visualization (sigmoidoscopy or colonoscopy) is the most accurate test. Protracted IBD is a risk factor for colon cancer. The risk for cancer begins to rise significantly after eight to ten years of IBD.
Some patients with UC only have disease in the rectum (proctitis). Others with UC have disease limited to the rectum and the adjacent left colon (proctosigmoiditis). Yet others have UC of the entire colon (universal IBD). Symptoms of UC are generally more severe with more extensive disease (larger portion of the colon involved with disease).
The prognosis for patients with disease limited to the rectum (proctitis) or UC limited to the end of the left colon (proctosigmoiditis) is better then that of full colon UC. Brief periodic treatments using oral medications or enemas may be sufficient. In those with more extensive disease, blood loss from the inflamed intestines can lead to anemia, and may require treatment with iron supplements or even blood transfusions. Rarely, the colon can acutely dilate to a large size when the inflammation becomes very severe. This condition is called toxic megacolon. Patients with toxic megacolon are extremely ill with fever, abdominal pain and distention, dehydration, and malnutrition. Unless the patient improves rapidly with medication, surgery is usually necessary to prevent colon rupture.
Crohn's disease can occur in all regions of the gastrointestinal tract. With this disease intestinal obstruction due to inflammation and fibrosis occurs in a large number of patients. Granulomas and fistula formation are frequent complications of Crohn's disease. Disease progression consequences include intravenous feeding, surgery and colostomy.
The most commonly used medications to treat IBD are anti-inflammatory drugs such as the salicylates. The salicylate preparations have been effective in treating mild to moderate disease. They can also decrease the frequency of disease flares when the medications are taken on a prolonged basis. Examples of salicylates include sulfasalazine, olsalazine, and mesalamine. All of these medications are given orally in high doses for maximal therapeutic benefit. These medicines are not without side effects. Azulfidine can cause upset stomach when taken in high doses, and rare cases of mild kidney inflammation have been reported with some salicylate preparations.
Corticosteroids are more potent and faster-acting than salicylates in the treatment of IBD, but potentially serious side effects limit the use of corticosteroids to patients with more severe disease. Side effects of corticosteroids usually occur with long term use. They include thinning of the bone and skin, infections, diabetes, muscle wasting, rounding of faces, psychiatric disturbances, and, on rare occasions, destruction of hip joints.
In IBD patients that do not respond to salicylates or corticosteroids, medications that suppress the immune system are used. Examples of immunosuppressants include azathioprine and 6-mercaptopurine. Immunosuppressants used in this situation help to control IBD and allow gradual reduction or elimination of corticosteroids. However, immunosuppressants cause increased risk of infection, renal insufficiency, and the need for hospitalization.
Clearly there is a great need for identification of the molecular basis of IBD, or its associated disorders Crohn's disease and ulcerative colitis.
The present invention relates to intracellular signaling molecules, in particular the Nod2 protein and nucleic acids encoding the Nod2 protein. The present invention provides methods of identifying modulators of Nod2 signaling. In particular, the present invention additionally provides methods of screening immune modulators such as adjuvants using Nod2. The present invention further provides methods of altering Nod2 signaling.
Accordingly, in some embodiments the present invention provides a method of screening compounds, comprising providing a cell expressing a Nod2 variant, wherein the Nod2 variant is defective in responding to bacterial muropeptides; and a plurality of test compounds; and contacting the cell with the plurality of test compounds; and determining the presence or absence of a response to a bacterial muropeptide. In some embodiments, the response to a bacterial muropeptide comprises the activation of NF-κB. In some embodiments, the activation of NF-κB is detected using a reporter gene assay. In some embodiments, the Nod2 variant is a Crohn's disease associated variant (e.g., including, but not limited to, SEQ ID NOs: 55, 57, 59, and 61). In some embodiments, the cell expressing a Nod2 variant comprises peripheral blood mononuclear cells isolated from an individual diagnosed with Crohn's disease. In some embodiments, the response to a bacterial muropeptide comprises the activation of NF-κB. In some embodiments, the activation of NF-κB is detected using an electrophoretic mobility shift assay. In some embodiments, the test compound is a drug. In some embodiments, the test compound is a muramyl dipeptides (e.g., including, but not limited to, MurNAc-L-Ala-D-isoGln, MuraNac-L-Ala-D-isoGln, MuraNac-L-Ala-D-Glu, analogs of MurNAc-L-Ala-D-isoGln, analogs of MuraNac-L-Ala-D-isoGln, and analogs of MuraNac-L-Ala-D-Glu). In some embodiments, the present invention provides a drug identified by the method.
The present invention also provides a method of screening compounds, comprising providing a cell expressing Nod2; and a plurality of test compounds; and contacting the cell with the plurality of test compounds; and determining the level of Nod2 activity in the cell in response to the test compound. In some embodiments, the Nod2 activity comprises activation of NF-κB. In some embodiments, the activation of NF-κB is detected using a reporter gene assay. In some embodiments, the test compound is a muramyl dipeptides (e.g., including, but not limited to, MurNAc-L-Ala-D-isoGln, MuraNac-L-Ala-D-isoGln, MuraNac-L-Ala-D-Glu, analogs of MurNAc-L-Ala-D-isoGln, analogs of MuraNac-L-Ala-D-isoGln, and analogs of MuraNac-L-Ala-D-Glu). In some embodiments, the test compound is a vaccine adjuvant. In other embodiments, the test compound in an anti-adjuvant. In some embodiments, the Nod2 activity is increased in response to the test compound. In other embodiments, the Nod2 activity is decreased in response to the test compound.
The present invention further provides a method of modulating Nod2 signaling in a subject, comprising providing a compound that is capable of altering a subject's Nod2 activity in response to bacterial muropeptides; and administering the compound to a subject under conditions such that the subject's Nod2 activity is response to bacterial muropeptides is altered. In some embodiments, the subject is diagnosed with Crohn's disease and the compound increases the subjects Nod2 activity is response to bacterial muropeptides. In other embodiments, the compound is a vaccine adjuvant and the compound increases the subjects Nod2 activity is response to bacterial muropeptides. In still further embodiments, the compound is an anti-adjuvant and the compound decreases the subjects Nod2 activity is response to bacterial muropeptides. In some embodiments, the compound is a muramyl dipeptides (e.g., including, but not limited to, MurNAc-L-Ala-D-isoGln, MuraNac-L-Ala-D-isoGln, MuraNac-L-Ala-D-Glu, analogs of MurNAc-L-Ala-D-isoGin, analogs of MuraNac-L-Ala-D-isoGln, and analogs of MuraNac-L-Ala-D-Glu). In some embodiments, the method further comprises the step of determining the presence or absence of a variant Nod2 allele (e.g., including, but not limited to, SEQ ID NOs: 33, 54, 56, 58, and 60) in the subject diagnosed with Crohn's disease.
FIG. 1 shows the deduced Amino Acid Sequence and Domain Structure of Human Nod2. FIG. 1A shows the amino acid sequence of Nod2 (SEQ ID NO:4). Caspase recruitment domains (CARD 1 and 2; SEQ ID NOs: 5 and 6), nucleotide binding domain (NBD; SEQ ID NO:7) and leucine-rich repeats (LRRS; SEQ ID NOs:8-17) are indicated by reverse highlight, underline and arrows, respectively. The consensus sequence of the P-loop (Walker A box; SEQ ID NO: 18) and the Mg 2+ binding site (Walker B box; SEQ ID NO: 19) are indicated by boxes. FIG. 1B shows the domain structure of Nod2. Numbers corresponds to amino acid residues shown in panel A. The region homologous to the CARDS, NBD and LRRs are indicated by black closed, dark closed, and hatched boxes, respectively.
FIG. 2 shows an alignment of Human Nod2 and Related Proteins. FIG. 1A shows an alignment of CARDs of Nod2 (SEQ ID NOs:5 and 6), Nod1 (GeneBank accession number AF113925; SEQ ID NO:20), RICK (AF027706; SEQ ID NO:21), ARC (AF043244; SEQ ID NO:22), RAIDD (U79115; SEQ ID NO:23), Caspase-2 (U13021; SEQ ID NO:24), Ced-3 (L29052; SEQ ID NO:25), Ced-4 (X69016; SEQ ID NO:26), Caspase-9 (U56390; SEQ ID NO:27), Apaf-I (AF013263; SEQ ID NO:28) and c-IAP-1 (L49431; SEQ ID NO:29). Hydrophobic residues are shown in reverse highlighting. Negatively and positively charged residues are highlighted in light and dark gray, respectively. Proline and glycine residues ((αβ breaker) are bolded. The putative (αhelices, H1 to H5, are shown according to the three dimensional structure of the CARD of RAIDD (Chou et al., Cell, 94:171 [1998]). FIG. 2B shows an alignment of NBDs of Nod2 (SEQ ID NO:7), Nod1 (SEQ ID NO:30), Apaf-I (SEQ ID NO:31) and Ced-4 (SEQ ID NO:32). The residues identical and similar to those of Nod2 are shown by reverse and dark highlighting, respectively. The consensus sequence of the P-loop (Walker A box) and the Mg 2+ binding site (Walker B box) are indicated by boxes. The residues identical and similar to those of Nod2 are shown by reverse and dark highlighting, respectively. FIG. 2C shows an alignment of LRRs of Nod2 (SEQ ID NOs: 8-17). The conserved positions with leucine and other hydrophobic residues are indicated by dark and light gray highlighting, respectively. The putative (α helix and β sheet are shown according to the three dimensional structure of the ribonuclease inhibitor (Kobe and Deisenhofer, Curr. Opin. Struct Biol., 5:409-416 [1995]).
FIG. 3 shows an expression Analysis of Nod2. FIG. 3A shows a northern blot analysis of nod2 expression in human tissues; PBL (peripheral blood leukocytes). FIG. 3B shows RT-PCR analysis of nod2 expression in granulocyte, monocyte and lymphocyte enriched populations. Two sets of Nod2 oligonucleotide primers (P1-P2 and P3-P4) were used to amplify the nucleotide sequences of the CARDs and LRRs, respectively. FIG. 3C shows the nucleotide sequence of the 5′ region of Nod2. Two potential in-frame translation initiation sites separated by 81 nucleotides are indicated by arrows. FIG. 3D shows immunoblotting of nod2 gene products expressed in HEK293T cells. Cells were transfected with control plasmid (lane 1), or constructs containing both potential translation initiator sites of Nod2 (lane 2), or as a control the second translation initiation site corresponding to that of Nod2b (lane 3) or the most NH2-terminal translation initiation site (lane 4) in the context of a canonical translation initiation motif. In all cases, a Nod2 protein lacking residues 302-1040 and HA tagged at its COOH terminus was expressed to facilitate detection of nod2 gene products. Nod2 proteins were detected by immunoblotting with anti-HA antibody and indicated by a and b.
FIG. 4 shows mutational Analysis of Nod2. FIG. 4A shows wt and mutant Nod2 proteins. CARDs, NBD and LRRs are indicated by black closed, dark closed, and hatched boxes, respectively. Numbers represent amino acid residue in Nod2 protein. FIG. 4B shows expression analysis of wt and mutant Nod2 proteins. HEK293T cells were transfected with control plasmid (−) or 5 μg of plasmids producing the indicated HA-tagged Nod2 proteins. Extracts from equal number of cells were immunoprecipitated with rabbit anti-HA antibody and immunoblotted with mouse monoclonal anti-HA antibody. The expected size of CARDs, CARD1 and LRRs mutant proteins are indicated by black arrowheads. FIG. 4C shows NF-κB activation by Nod2 proteins. Induction of NF-κB activation was determined from triplicate culture of HEK293T cells co-transfected with the indicated amount of wt or mutant Nod2 expression plasmids in the presence of pBVIx-Luc and pEF-BOS-β-gal as described below. Values represent mean ±SD of triplicate cultures.
FIG. 5 shows that Nod2 Acts through the IKK complex to activate NF-KB. FIG. 5A shows inhibition of Nod2 and TNFα-induced NF-KB activation by dominant negative mutant proteins of the NF-κB pathway. Induction of NF-κB activation was determined in triplicate cultures of HEK293T cells transfected with 30 ng of Nod2 plasmid (open bars) or stimulated with 10 ng/ml of TNFα for 4 h (closed bars) and 70 ng of I-κBα S32A/S36A, IKKα K44A, IKKβ K44A, RICK (406-540) or RIP (558-671) expression plasmid in the presence of pBVIx-Luc and pEF-BOS-β-gal. Results are presented as a percent of values obtained with Nod2 and control plasmid. In the experiment shown, Nod2 and TNFα induced 58±8-fold and 14±1-fold activation of NF-κB, respectively. Values represent mean ±SD of triplicate cultures. FIG. 5B shows induction of NF-κB in parental Rat-1 and derivative 5R cells. Induction of NF-κB activation was determined from triplicate cultures of I×10 5 HEK293T cells co-transfected with the indicated plasmids and pBVIx-Luc in the presence of control plasmid pEF-BOS-β-gal. Values represent mean ±SD of triplicate cultures.
FIG. 6 shows the interaction of Nod2 with RICK. FIGS. 6A and B show the interaction between wt and mutant Nod2 with RICK. HEK293T cells were co-transfected with wt or mutant Nod2 and RICK expression plasmid. The co-immunoprecipitated RICK was detected by immunoblotting with anti-Flag antibody (upper panel). Nod2 immunoprecipitates are shown in lower panel. Total lysates were blotted with anti-Flag antibody (middle panel). FIG. 6C shows the interaction between Nod2 and wt and mutant RICK. HEK293T cells were co-transfected with wt Nod2 and wt or mutant RICK-ΔCARD (residues 1-374) or RICK-CARD (residues 374540) expression plasmid. The co-immunoprecipitated Nod2 was detected by immunoblotting with anti-HA antibody (upper panel). Total lysates were blotted with anti-Flag (middle panel) or anti-HA (lower panel) antibody. A background band is shown by asterisk.
FIG. 7 shows that enforced oligomerization of Nod2 induces NF-κB activation. FIG. 7A shows an expression analysis of wt and mutant Fpk3-Nod2 chimeric proteins. HEK293T cells were transfected with of control plasmid (−) or plasmids producing the indicated Myc-tagged Fpk3Nod2 proteins. Extracts from equal number of cells was immunoprecipitated and immunoblotted with rabbit anti-Myc antibody. FIG. 7B shows that enforced oligomerization of Nod2 induces NF-κB activation. 2×10 5 HEK293T cells were transfected with 1 ng of the indicated plasmids in the presence of pBVIx-luc and pEF-BOS-β-gal. 8 hr post-transfection, cells were treated with 500 nM AP1510 (black bars) or left untreated (white bars). 24 hr post-transfection, the κB-dependent transcription was determined as described below. Values represent mean ±SD of triplicate cultures.
FIG. 8 shows the response of HEK293T cells expressing Nod1 to bacterial and fungal pathogen components. FIG. 8A shows data from 1×10 5 HEK293T cells that were transfected with 0.3 ng of pcDNA3-Flag (white bars) or pcDNA3Nod1-Flag (black bars) in the presence of 600 ng of pcDNA3, 73 ng pEFIBOS-βgal and 7.3 ng pBXIV-Iuc. 8 hr post-transfection, cells were treated with 10 μg/ml of each pathogen product, lipoteichoic acid (LTA) or peptidoglycan (PGN) from Staphylococcus aureus , lipopolysaccharide (LPS) from Escherichia coli 055:B5, mannan from Candida albicans 20A, synthetic soluble bacterial lipoprotein (SBLP) or left untreated (Control). 24 hr post-transfection, κB-dependent transcription was determined by luciferase activity relative and values normalized to β-galactosidase in triplicate cultures. As control, the inset showed Nod1 proteins immunodetected with anti-FLAG Ab in lysates from cells transfected with 10 ng pcDNA3-Nod1 in presence (right) and absence (left) of 10 μg/ml LPS. FIG. 8B shows data from 1×10 5 HEK293T cells that were transfected with 0.3 ng of pcDNA3-Flag (−), pcDNA3-Nod1-Flag (Nod1) or pcDNA3-Nod1(I-648)-Flag (Nod1ΔLRR), 300 ng pcDNA3-FLAG-TLR4, 3 ng pCMVIL1R1 plus 100 ng pcDNA3-IL1β-HA (IL1) or I ng pcDNA3-RIP-Flag (RIP) in the presence of 600 ng of pcDNA3, 73 ng pEF 1BOS-βgal and 7.3 ng pBXIV-luc. Eight hr post-transfection, cells were treated with 10 μg/n-A LPS (black bars) or left untreated (white bars). Twenty-four hr post-transfection, κB-dependent transcription was determined as described above.
FIG. 9 shows differential responsiveness of Nod1 and Nod2 to LPS from various sources. 1×10 5 HEK293T cells were transfected with 0.3 ng of pcDNA3-Flag (−), pcDNA3-Nod1-Flag (Nod1) or pcDNA3-Nod1(1-648)-Flag (Nod1ΔLRR), 0.03 ng of pcDNA3-Nod2 or pcDNA3-Nod2(1-744)-Flag (Nod2ΔLRR) in the presence of 600 ng of pcDNA3, 73 ng pEF1BOS-βgal and 7.3 ng pBXIV-luc. 8 hr post-transfection, cells were treated with 10 μg/ml each pathogen, LTA from S. aureus or S. sanguis , PGN from S. aureus , LPS from Pseudomonas aeruginosa, Shigella flexneri 1A, Sarratia marcescens, Salmonella typhimurium, Klebsiella pneumoniae or E. coli 055:B5, or left alone without treatment. For TNFα stimulation, 22 hrs after transfection, cells were incubated with 10 ng/ml of TNFα for 2 hr.
FIG. 10 shows the physical Interaction between Nod1 and LPS. 1×10 8 HEK293T cells were transfected with 30 μg of pcDNA3-Flag-Nod1, pRK7-FLAG-IKKβ, pcDNA3-FLAG-IKKi, pcDNA3-FLAG-IKKγ or pcDNA3-CIPER-FLAG (Takeuchi et al., Immunity, 4:443 [1999]). 24 hr post-transfection, S100 fractions were prepared from transfected cells as described below. The radioactivity of [ 3 H] LPS co-immunoprecipitated with anti-FLAG Ab was determined as described below. FIG. 10A shows S100 lysate from transfected cells was incubated with [ 3 H] LPS, anti-FLAG M2 Ab, Protein A-Sepharose and Protein G-Sepharose. FIG. 10B shows data for proteins that were immunopurified first from 20 mg of S100 lysate and incubated with [ 3 H] LPS in the presence of 10 mg BSA. The co-imunoprecipitated radioactivity was determined as described in detail below. Expression of each protein in 50 μg of S100 lysate was immunodetected with anti-FLAG Ab.
FIG. 11 shows the nucleic acid sequence of SEQ ID NO:33.
FIG. 12 shows the nucleic acid sequence of SEQ ID NO:1.
FIG. 13 shows the polypeptide sequence of SEQ ID NO:2.
FIG. 14 shows the polypeptide sequence of SEQ ID NO:3.
FIG. 15 shows the polypeptide sequence of SEQ ID NO:34.
FIG. 16 shows the nucleic acid (SEQ ID NOs: 35 (wild type) and 36 (mutant)) and polypeptide (SEQ ID NO:51 (wild type) and SEQ ID NO:52 (mutant)) of Nod2 Exon 11.
FIG. 17 shows recognition of peptidoglycans by NOD2. FIG. 17A shows that NOD2 and TLR4 recognize different bacterial components. FIG. 17B shows activation of NF-κB by cells stimulated with gel-filtration fractions of PGN from B. subtilis digested or undigested with Mutanolysin.
FIG. 18 shows stimulation of NOD2 by synthetic muropeptides. FIG. 18A shows a schematic representation of synthetic muropeptide structures used in the study. FIG. 18B shows the ability of HEK293T cells transfected with 66 ng pMX-puro-NOD2 (NOD2) or vector control (−) and reporter pB×IV-luc and pEF-BOS-β-gal plasmids to activate NF-κB. FIG. 18C shows the ability of MDP and its analogs MurNAc-L-Ala-L-isoGln (LL) and MurNAc-D-Ala-D-isoGln (DD) at 100 ng/ml to stimulate NF-κB activation in the presence of NOD2 or control plasmid (−). FIG. 18D shows the ability of MDP (100 ng/ml) and sBLP (1000 ng/ml) to activate NF-κB.
FIG. 19 shows activation of normal and Crohn's disease-associated NOD2 proteins by MDP.
FIG. 20 shows stimulation of PBMNC from individuals carrying normal, heterozygous or homozygous L1007fs alleles with LPS and MDP. FIG. 20A shows representative DNA analysis from normal donors (WT) and Crohn's disease patient carrying the L1007fs mutation in one (heterozygous, HT) or both (homozygous, HM) alleles. FIG. 20B shows PBMNC stimulated with 1 μg/ml of LPS from Salmonella typhimurium (L), 10 ng/ml of MDP (M) for 1 hour or left untreated. FIG. 20C shows an analysis of nuclear extracts for the presence of NF-κB binding activity by EMSA. FIGS. 20C and D show PBMNC analyzed for expression of IL-1β (C) and A1 (D) transcripts by quantitative RT-PCR.
FIG. 21 shows the nucleic acid sequence of SEQ ID NO: 54.
FIG. 22 shows the amino acid sequence of SEQ ID NO: 55.
FIG. 23 shows the nucleic acid sequence of SEQ ID NO: 56.
FIG. 24 shows the amino acid sequence of SEQ ID NO: 57.
FIG. 25 shows the nucleic acid sequence of SEQ ID NO: 58.
FIG. 26 shows the amino acid sequence of SEQ ID NO: 59.
FIG. 27 shows the nucleic acid sequence of SEQ ID NO: 60.
FIG. 28 shows the amino acid sequence of SEQ ID NO: 61.
The present invention relates to intracellular signaling molecules, in particular the Nod2 protein and nucleic acids encoding the Nod2 protein. The Nod2 protein was found to have structural homology to the Nod1 protein. Apaf-1 and Nod1 (also called CARD4) are members of a family of intracellular proteins that are composed of an NH2-terminal caspase-recruitment domain (CARD), a centrally located nucleotide-binding domain (NBD) and a COOH-terminal regulatory domain (Bertin et al., J. Biol. Chem. 274: 12955-12958 [1999], Inohara et al., J. Biol. Chem. 274: 14560-14568 [1999]). While Apaf-1 possesses WD40 repeats, Nod1 contains leucine-rich repeats (LRRs) in its C-terminus. The structural and functional similarities between Apaf-1 and Nod1 suggest that these proteins share a common molecular mechanism for activation and effector function. In the case of Apaf-1, the WD-40 repeats act as a recognition domain for mitochondrial damage through binding to cytochrome c, allowing Apaf-1 to oligomerize and interact with procaspase-9 through a CARD-CARD homophilic interaction (Li et al., Cell 91: 479-489 [1997], Zou et al., J. Bio. Chem. 274: 11549-11556 [1999]). Apaf-1 oligomerization is mediated by the NBD and is thought to induce the proximity and proteolytic activation of procaspase-9 molecules in the apoptosome complex (Srinivasula et al., Mol. Cell 1: 949-957 [1998], Hu et al., J. Bio. Chem. 273: 33489-34494 [1998]).
Previous studies showed that Nod1 promotes apoptosis when overexpressed in cells, but unlike Apaf-1, it induces NF-κB activation (Bertin et al., supra, Inohara et al., supra). NF-κB activation induced by Nod1 is mediated by the association of the CARD of Nod1 with the corresponding CARD of RICK (also called RIP2 and CARDIAK), a protein kinase that activates NF-κB (Bertin et al., supra, Inohara et al., supra, Inohara et al., J. Biol. Chem. 273: 12296-12300 [1998], McCarthy et al., J. Bio. Chem. 273, 16968-16975 [1998], Thome et al., Curr. Biol. 8: 885-888 [1998]). Analyses with wild-type (wt) and mutant forms of both Nod1 and RICK have suggested that Nod1 and RICK act in the same pathway of NF-κB activation, where RICK functions as a downstream mediator of Nod1 signaling (Bertin et al., supra, Inohara et al., [1999] supra, Inohara et al., J. Biol. Chem. 275: 27823-27831 [2000]). Nod1 self-associates through its NBD and Nod1 oligomerization promotes proximity of RICK molecules and NF-κB activation (Inohara et al., [2000], supra). Nod1 also displays striking similarity to a class of disease resistance (R) proteins found in plants (Pamiske et al., Cell 91: 821-832 [1997], Dixon et al., Proc. Natl. Acad. Sci. U. S. A. 97: 8807-8814 [2000]). Like Nod1, these intracellular R proteins contain N-terminal effector domains linked to a NBD and share with Nod1 the presence of multiple LRRs located C-terminally of the NBD (Bertin et al., supra, Dixon et al., supra). After specific recognition of pathogen products, these R proteins mediate a defense response associated with metabolic alterations and localized cell death at the site of pathogen invasion (Dixon et al., supra). The LRRs of R proteins are highly diverse and appear to be involved in the recognition of a wide array of pathogen components (Parniske et al., supra, Dixon et al., supra). The binding partner of the LRRs of Nod1 remains unknown. The structural homology of Nod1 with plant R proteins suggest that other LRR-containing Nod1-like molecules may exist in the human genome to allow activation of these molecules by different sets of intracellular stimuli.
The identification and characterization of Nod2, a LRR-containing protein with structural and functional similarity to Nod1 is disclosed herein. These studies indicate that Nod2 activates NF-κB, but unlike Nod1, this new homologue is primarily expressed in monocytes. The present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism of action is not necessary to practice the present invention. Nevertheless, Nod2 is a member of the Nod1/Apaf-I family that activates NF-κB through interactions with its NH 2 -terminal CARDS, as these domains were necessary and sufficient for NF-κB activation. Nod2 associated with RICK via a homophilic CARD-CARD interaction. The NF-κB-inducing activity of Nod2 correlated with its ability to associate with RICK and was inhibited by a RICK mutant, suggesting that RICK is a direct downstream target of Nod2. Thus, the signaling pathways of both Nod1 and Nod2 appear to utilize RICK as a downstream mediator of NF-κB activation. In contrast to Nod1, two tandem CARDs are present in the NH 2 -terminus of Nod2 and both were required for association with RICK and NF-κB activation.
Nod2 is the first molecule known to contain two CARDS. The molecular basis underlying the requirement of both CARDs of Nod2 for RICK binding remains unclear. The present invention is not limited to any particular mechanism of action. Indeed, an understanding of the mechanism of action is not necessary to practice the present invention. Nevertheless, it is contemplated that the presence of both CARDs may enhance the affinity for the CARD of RICK. Another possibility is that upon an initial interaction involving a CARD of Nod2 and the CARD of RICK, Nod2 may undergo a conformational change that allows the second CARD to associate with high affinity to RICK. The intermediate region of RICK associates with IKKγ (Inohara et al., [2000], supra), providing a direct link between Nod1/Nod2 and the IKK complex. Consistent with this model, NF-κB activation induced by Nod2 as well as that induced by Nod I required IKKγ and was inhibited by dominant negative forms of IKKγ, IKKα and IKKβ. The functional role for the LRRs of Nod1 and Nod2 remains unclear. The LRR is a repeated protein-protein interaction module that is presumably involved in the activation of Nod1 and Nod2 by upstream signals. In the case of plant NBD/LRR-containing R proteins, their LRRs appear to be important for the recognition of pathogen components and their N-terminal domains appear to mediate a signaling cascade that regulates gene expression (Parniske et al., supra, Dixon et al., supra). Because both Nod1 and Nod2 activate NF-κB, their LRRs may act to recognize a different set of intracellular stimuli that mediate Nod1 and Nod2 oligomerization and association with RICK. Nod2 is expressed in monocytes, dendritic cells, and paneth cells in the gut (Gutierrez et al., J Biol Chem 277(44):41701-5 [2002]). Because Nod2 is expressed in monocytes, Nod2 might serve as an intracellular receptor that transduces signals in the monocyte/macrophage that lead to activation of NF-κB and transcription of regulatory genes.
The Nod2 proteins of the present invention are also involved in the recognition of microbial pathogens. The innate immune system regulates the immediate response to microbial pathogens in multiple organisms including humans. The innate immune response is initiated by recognition of specific pathogen components by host immune cells. Mammalian cells have cell surface receptors and intracellular mechanisms that initiate the defense response against microbial pathogens (Aderem and Ulevitch, Nature, 406:785-787 [2000]; Philpott et al., J. Immunol., 165:903-914 [2000]). Toll like receptors (TLRs) comprise a family of cell surface receptors that are related to the Drosophila Toll protein, a molecule involved in defense against fungal infection in the fly (Aderem and Ulevitch, Supra). Ten mammalian TLRs have been identified (Aderem and Ulevitch, Supra). Two members of the family, TLR2 and TLR4, have been better characterized and shown to mediate the response to multiple bacterial cell-wall components including lipopolysaccharide (LPS), lipopeptides, peptidoglycans (PGN) and lipoteichoic acid (LTA) (Yang et al., Nature, 395:284-288 [1998]; Poltorak et al., Science, 282:2085-2088 [1998]; Aliprantis et al., Science, 285:736-739 [1999]; Chow et al., J. Biol. Chem., 274:10689-10692 [2000]; and Schwandner et al., J. Biol. Chem., 274: 17406-17409 [2000]). Mammalian TLRs have multiple leucine-rich repeats in the ectodomain and an intracellular Toll-IL1 receptor (TIR) domain that mediates a signaling cascade to the nucleus (Aderem and Ulevitch, Supra). Stimulation of TLR2 and TLR4 leads to the recruitment of the adaptor molecule MyD88 and the serine kinase IL-1R-associated kinase (IRAK), two signaling components that together with TRAF-6 mediate activation of NF-κB (Aderem and Ulevitch, Supra).
Plants have several classes of genes that regulate the defense against invading pathogens. An important class of these molecules is termed disease resistance (R) proteins, and members include both membrane-bound and cytosolic proteins. These are essential for the defense against multiple pathogens including bacteria, fungi and viruses (Dixon et al., PNAS, 97:8807-8814 [2000]). The cytosolic type of R proteins which include the Tobacco N gene product and up to 200 gene products in Arabinopsis thaliana are comprised of an N-terminal TIR or zinc finger effector domain, a centrally located nucleotide-binding domain (NBD) and C-terminal leucine-rich repeats (LRRs) (Dixon et al., Supra). The LRRs of cytosolic R proteins are highly diverse and appear to be involved in the recognition of a wide array of microbial components (Dixon et al., Supra). This class of disease resistant proteins mediates the hypersensitive (HS) response in plants that includes metabolic alterations and localized cell death at the site of pathogen invasion (Dixon et al., Supra). The cytosolic R proteins of plants have remarkable structural homology to Nod1/CARD4, a recently described protein related to the apoptosis regulator Apaf-1 (Zou et al., Cell, 90:405-413 [1997]; Bertin et al., J. Biol. Chem., 274:12955-12958; and Inohara et al., J. Biol. Chem., 274:14560-14568 [1999]). Like plant R proteins, Nod1 is comprised of an N-terminal effector domain, a centrally located NBD and multiple LRRs at the C-terminus (Bertin et al., Supra; Inohara et al., Supra). Nod1 induces NF-κB activation which is mediated through the association of its N-terminal caspase-recruitment domain (CARD) with that of RICK, a protein kinase that also activates NF-κB (Bertin et al., Supra; Inohara et al., Supra; Inohara et al., J. Biol. Chem., 273:12296-12300 [1998]; McCarthyetal., J. Biol. Chem., 273:16968-16975; Thome et al., Curr. Biol., 8:885-888 [1998]; Inohara et al., J. biol. Chem., 275:27823-27831 [2000]). However, the trigger molecule(s) which activates Nod1 to mediate NF-κB activation remains unknown.
The present invention also demonstrates that lipopolysaccharide (LPS) induces NF-κB activation in HEK293T cell expressing Nod1, whereas parental HEK293Tcells are insensitive to LPS. The present invention is not limited to a particular mechanism of action. Indeed, an understanding of the mechanism of action is not necessary to practice the present invention. Nevertheless, in the human system, the TLR4/MD2/CD14 complex has been demonstrated to serve as a surface receptor for LPS (Aderem and Ulevitch, Supra). In addition to the cell surface TLR4 complex, there is mounting evidence that mammalian cells have an intracellular receptor that detects LPS in the cytoplasm of bacteria infected cells (Philpott et al., Supra). For example, epithelial cells are unresponsive to extracellular LPS either purified or presented in the context of non-invasive Gram negative bacterial strains (Philpott et al., Supra). Yet, LPS introduced inside of the epithelial cells activates NF-κB (Philpott et al., Supra). However, to date, the identification of an intracellular recognition system for LPS and/or other microbial products remains elusive. Nod1 is involved in the recognition of peptidoglycan fragments. Nod1 function might be important in the intracellular response of epithelial cells against invading bacteria, as Nod1 is expressed in intestinal, lung and nasal epithelial surfaces in the late mouse embryo (Inohara et al., Supra). The presence of an intracellular detection system for bacterial LPS would be expected in epithelial surfaces such as those of the gut that are highly exposed to bacteria and bacterial products. In such organs, triggering of an inflammatory response to bacterial products through surface receptors such as TLR4 would be detrimental to the organism. HEK293T cells expressing Nod2, another member of Nod family, respond to LPS but Nod1 and Nod2 appear to have different preferences for LPS preparations from different bacteria. These observations suggest that in addition to TLRs, Nod family members may represent another innate immune system for the recognition of a wide array of pathogen products. For example, the genome of the plant Arabidopsis thaliana contains approximately 200 disease resistance genes encoding intracellular NBD-LRR proteins related to Nod1 and Nod2 (Dixon et al., Supra).
Three genetic variants within the coding region of NOD2, L1007fsinsC, G908R, and R702W, have been genetically associated with susceptibility to Crohn's disease in European and American populations (Hugot et al., Nature 411:599 (2001); Ogura et al., Nature 411:603 (2001); Hampe et al., Lancet 357:1925 (2001); Ahmad et al., Gastroenterology 122:854 (2002)). NOD2 has been shown to recognize preparations of lipopolysaccharides (LPS) and peptidoglycan (PGN) through its COOH terminal LRRs (Inohara et al., J. Biol. Chem. 276:2551 [2001]), and this activity is deficient in the disease-associated variants (Ogura et al., [2001], supra). However, the precise bacterial structure recognized by NOD2 remains unknown. Experiments conducted during the course of development of the present invention (See e.g., Example 9) identified muramyl dipeptide derived from peptidoglycan as the essential structure in bacteria recognized by NOD2. Peripheral blood mononuclear cells from individuals homozygous for the major disease-associated L1007fsinsC NOD2 mutation responded to lipopolysaccharide but not to synthetic muramyl dipeptide.
The present invention is not limited to a particular mechanism. Indeed, an understanding of the mechanism of the present invention is not necessary to practice the present invention. Nonetheless, it is contemplated that the lack of response to MDP in apparently normal individuals suggest that the presence of certain bacteria in the intestine and/or additional genetic factors may be required for clinical disease. MDP is the minimal essential structure of bacterial peptidoglycan required for biological effects including activity in Freund's complete adjuvant (H. Takada, S. Kotani, In The Theory and Practical Application of Adjuvants , D. E. S. Stewart, Ed., (Wiley, Chichester, England, 1995), pp. 171-202). MDP has been shown to signal through TLR2 and TLR4-independent mechanisms (Yang et al., Infect. Immun. 69:2045 [2001]; Wolfert et al., J. Biol. Chem. 277:39179 [2002]), but the host recognition system for MDP has not been identified. The present invention is not limited to a particular mechanism. Indeed, an understanding of the mechanism is not necessesary to understand the present invention. Nonetheless, it is contemplated that experiments described herein support the conclusion that Nod2 mediates the recognition of MDP in mammalian cells. Macrophages contain intracellular hydrolases, which digest bacterial PGN and release PGN fragments including GlcNAc-MurNAcdipeptide (Johnnsen et al., Am. J. Physiol. 260:R126 [1991]; Vermeulen et al., Infect. Immun. 46:476 [1984]). These muropeptides derived from intracellular and/or phagocytosed bacteria as well as PGN fragments released during bacterial growth are available for recognition by Nod2. The synthetic MDP and GlcNAc-MurNAc-dipeptides mimicking the natural muropeptides induced Nod2-dependent activation of NF-κB. Thus, it is contemplated that Nod2 is activated by muropeptides derived from bacteria in vivo. This observation provides a role for Nod2 in both inflammatory responses in Crohn's disease and vaccine and adjuvant function (e.g., by stimulating immune system function in response to adjuvants).
Crohn's disease associated Nod2 variants and PBMNC from individuals homozygous for L1007fsinsC are defective in their response to muramyl dipeptide. This result is consistent with the observation that homozygocity for L1007fsinsC is important for susceptibility to Crohn's disease. Because activation of NF- κ B in response to bacterial components mediates protection of the host against pathogens, it is contemplated that Nod2-mediated susceptibility to disease is caused by a failure to trigger a protective NF- κ B pathway in response to muropeptides.
Such a defective response against certain bacterial products may result secondarily in the diffuse activation of NF- κ B found in intestinal tissue by Nod2-independent mechanisms. Accordingly, in some embodiments, the present invention provides methods of screening for compounds that restore the activity against bacterial muropeptides. The present invention further provides pharmaceuticals identified by such screening methods. It is contemplated that therapeutics that restore activity against bacterial muropeptides are beneficial to Crohn's disease patients harboring Nod2 mutations.
Definitions
To facilitate understanding of the invention, a number of terms are defined below.
As used herein, the term “Nod2” when used in reference to a protein or nucleic acid refers to a protein or nucleic acid encoding a protein that, in its wild type form, activates NF- κ B and contains two CARDs (caspase recruitment domains). The term Nod2 encompasses both proteins that are identical to wild-type Nod2 and those that are derived from wild type Nod2 (e.g., variants of Nod2 or chimeric genes constructed with portions of Nod2 coding regions).
As used herein, the term “activates NF-κB,” when used in reference to any molecule that activates NF-κB, refers to a molecule (e.g., a protein) that induces the activity of the NF-κB transcription factor through a cell signaling pathway. Assays for determining if a molecule activates NF-κB utilize, for example, NF-κB responsive reporter gene constructs. Suitable assays include, but are not limited to, those described in Examples 4 and 5.
As used herein, the term “activity of Nod2” refers to any activity of wild type Nod2. The term is intended to encompass all activities of Nod2 (e.g., including, but not limited to, activating NF-κB, binding to RICK, and enhancing apoptosis).
The term “apoptosis” means non-necrotic cell death that takes place in metazoan animal cells following activation of an intrinsic cell suicide program. Apoptosis is a normal process in the development and homeostasis of metazoan animals. Apoptosis involves characteristic morphological and biochemical changes, including cell shrinkage, zeiosis, or blebbing, of the plasma membrane, and nuclear collapse and fragmentation of the nuclear chromatin, at intranucleosomal sites, due to activation of an endogenous nuclease.
As used herein, the term “symptoms of Crohn's disease” refers to symptoms associated with Crohn's disease, including, but not limited to abdominal pain, diarrhea, rectal bleeding, weight loss, fever, loss of appetite, and other more serious complications, such as dehydration, anemia and malnutrition. A number of such symptoms are subject to quantitative analysis (e.g., weight loss, fever, anemia, etc.). Some symptoms are readily determined from a blood test (e.g., anemia) or a test that detects the presence of blood (e.g., rectal bleeding).
The phrase “under conditions such that symptoms of Crohn's disease are reduced” refers to a qualitative or quantitative reduction in detectable symptoms (e.g., “symptoms of Crohn's disease”), including but not limited to a detectable impact on the rate of recovery from disease (e.g., rate of weight gain).
As used herein, the term “adjuvants” refers to a compound used to increase the immunological response (e.g., to a vaccine or for the generation of antibodies in non-human animals). Adjuvants include, but are not limited to, depending on the host species, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, BCG (Bacille Calmette-Guerin) and Corynebacterium parvum ).
As used herein, the term “anti-adjuvant” refers to a compound that inhibit adjuvant activity. In some embodiments, anti-adjuvants inhibit Nod2 signaling that has been enhanced by an adjuvant compound such as MDP or MDP analogs. In some embodiments, anti-adjuvants are used to reduce side effects of an adjuvant such as fever or inflammation.
The term “gene” refers to a nucleic acid (e.g., DNA) sequence that comprises coding sequences necessary for the production of a polypeptide or precursor (e.g., Nod2). The polypeptide can be encoded by a full length coding sequence or by any portion of the coding sequence so long as the desired activity or functional properties (e.g., enzymatic activity, ligand binding, signal transduction, etc.) of the full-length or fragment are retained. The term also encompasses the coding region of a structural gene and the including sequences located adjacent to the coding region on both the 5′ and 3′ ends for a distance of about 1 kb on either end such that the gene corresponds to the length of the full-length mRNA. The sequences that are located 5′ of the coding region and which are present on the mRNA are referred to as 5′ untranslated sequences. The sequences that are located 3′ or downstream of the coding region and that are present on the mRNA are referred to as 3′ untranslated sequences. The term “gene” encompasses both cDNA and genomic forms of a gene. A genomic form or clone of a gene contains the coding region interrupted with non-coding sequences termed “introns” or “intervening regions” or “intervening sequences.” Introns are segments of a gene that are transcribed into nuclear RNA (hnRNA); introns may contain regulatory elements such as enhancers. Introns are removed or “spliced out” from the nuclear or primary transcript; introns therefore are absent in the messenger RNA (mRNA) transcript. The mRNA functions during translation to specify the sequence or order of amino acids in a nascent polypeptide.
In particular, the term “Nod2 gene” refers to the full-length Nod2 nucleotide sequence (e.g., contained in SEQ ID NO: 1). However, it is also intended that the term encompass fragments of the Nod2 sequence, as well as other domains within the full-length Nod2 nucleotide sequence. Furthermore, the terms “Nod2 nucleotide sequence” or “Nod2 polynucleotide sequence” encompasses DNA, cDNA, and RNA (e.g., mRNA) sequences.
Where “amino acid sequence” is recited herein to refer to an amino acid sequence of a naturally occurring protein molecule, “amino acid sequence” and like terms, such as “polypeptide” or “protein” are not meant to limit the amino acid sequence to the complete, native amino acid sequence associated with the recited protein molecule.
In addition to containing introns, genomic forms of a gene may also include sequences located on both the 5′ and 3′ end of the sequences that are present on the RNA transcript. These sequences are referred to as “flanking” sequences or regions (these flanking sequences are located 5′ or 3′ to the non-translated sequences present on the mRNA transcript). The 5′ flanking region may contain regulatory sequences such as promoters and enhancers that control or influence the transcription of the gene. The 3′ flanking region may contain sequences that direct the termination of transcription, post-transcriptional cleavage and polyadenylation.
The-term “wild-type” refers to a gene or gene product that has the characteristics of that gene or gene product when isolated from a naturally occurring source. A wild-type gene is that which is most frequently observed in a population and is thus arbitrarily designed the “normal” or “wild-type” form of the gene. In contrast, the terms “modified”, “mutant”, and “variant” refer to a gene or gene product that displays modifications in sequence and or functional properties (i.e., altered characteristics) when compared to the wild-type gene or gene product. It is noted that naturally-occurring mutants can be isolated; these are identified by the fact that they have altered characteristics when compared to the wild-type gene or gene product.
As used herein, the terms “nucleic acid molecule encoding,” “DNA sequence encoding,” and “DNA encoding” refer to the order or sequence of deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these deoxyribonucleotides determines the order of amino acids along the polypeptide (protein) chain. The DNA sequence thus codes for the amino acid sequence.
DNA molecules are said to have “5′ ends” and “3′ ends” because mononucleotides are reacted to make oligonucleotides or polynucleotides in a manner such that the 5′ phosphate of one mononucleotide pentose ring is attached to the 3′ oxygen of its neighbor in one direction via a phosphodiester linkage. Therefore, an end of an oligonucleotides or polynucleotide, referred to as the “5′ end” if its 5′ phosphate is not linked to the 3′ oxygen of a mononucleotide pentose ring and as the “3′ end” if its 3′ oxygen is not linked to a 5′ phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid sequence, even if internal to a larger oligonucleotide or polynucleotide, also may be said to have 5′ and 3′ ends. In either a linear or circular DNA molecule, discrete elements are referred to as being “upstream” or 5′ of the “downstream” or 3′ elements. This terminology reflects the fact that transcription proceeds in a 5′ to 3′ fashion along the DNA strand. The promoter and enhancer elements that direct transcription of a linked gene are generally located 5′ or upstream of the coding region. However, enhancer elements can exert their effect even when located 3′ of the promoter element and the coding region. Transcription termination and polyadenylation signals are located 3′ or downstream of the coding region.
As used herein, the terms “an oligonucleotide having a nucleotide sequence encoding a gene” and “polynucleotide having a nucleotide sequence encoding a gene,” means a nucleic acid sequence comprising the coding region of a gene or, in other words, the nucleic acid sequence that encodes a gene product. The coding region may be present in either a cDNA, genomic DNA, or RNA form. When present in a DNA form, the oligonucleotide or polynucleotide may be single-stranded (i.e., the sense strand) or double-stranded. Suitable control elements such as enhancers/promoters, splice junctions, polyadenylation signals, etc. may be placed in close proximity to the coding region of the gene if needed to permit proper initiation of transcription and/or correct processing of the primary RNA transcript. Alternatively, the coding region utilized in the expression vectors of the present invention may contain endogenous enhancers/promoters, splice junctions, intervening sequences, polyadenylation signals, etc. or a combination of both endogenous and exogenous control elements.
As used herein, the term “regulatory element” refers to a genetic element that controls some aspect of the expression of nucleic acid sequences. For example, a promoter is a regulatory element that facilitates the initiation of transcription of an operably linked coding region. Other regulatory elements include splicing signals, polyadenylation signals, termination signals, etc.
As used herein, the terms “complementary” or “complementarity” are used in reference to polynucleotides (i.e., a sequence of nucleotides) related by the base-pairing rules. For example, for the sequence “A-G-T,” is complementary to the sequence “T-C-A.” Complementarity may be “partial,” in which only some of the nucleic acids′ bases are matched according to the base pairing rules. Or, there may be “complete” or “total” complementarity between the nucleic acids. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands. This is of particular importance in amplification reactions, as well as detection methods that depend upon binding between nucleic acids.
The term “homology” refers to a degree of complementarity. There may be partial homology or complete homology (i.e., identity). A partially complementary sequence is one that at least partially inhibits a completely complementary sequence from hybridizing to a target nucleic acid and is referred to using the functional term “substantially homologous.” The term “inhibition of binding,” when used in reference to nucleic acid binding, refers to inhibition of binding caused by competition of homologous sequences for binding to a target sequence. The inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency. A substantially homologous sequence or probe will compete for and inhibit the binding (i.e., the hybridization) of a completely homologous to a target under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction. The absence of non-specific binding may be tested by the use of a second target that lacks even a partial degree of complementarity (e.g., less than about 30% identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target.
The art knows well that numerous equivalent conditions may be employed to comprise low stringency conditions; factors such as the length and nature (DNA, RNA, base composition) of the probe and nature of the target (DNA, RNA, base composition, present in solution or immobilized, etc.) and the concentration of the salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of low stringency hybridization different from, but equivalent to, the above listed conditions. In addition, the art knows conditions that promote hybridization under conditions of high stringency (e.g., increasing the temperature of the hybridization and/or wash steps, the use of formamide in the hybridization solution, etc.).
When used in reference to a double-stranded nucleic acid sequence such as a cDNA or genomic clone, the term “substantially homologous” refers to any probe that can hybridize to either or both strands of the double-stranded nucleic acid sequence under conditions of low stringency as described above.
A gene may produce multiple RNA species that are generated by differential splicing of the primary RNA transcript. cDNAs that are splice variants of the same gene will contain regions of sequence identity or complete homology (representing the presence of the same exon or portion of the same exon on both cDNAs) and regions of complete non-identity (for example, representing the presence of exon “A” on cDNA 1 wherein cDNA 2 contains exon “B” instead). Because the two cDNAs contain regions of sequence identity they will both hybridize to a probe derived from the entire gene or portions of the gene containing sequences found on both cDNAs; the two splice variants are therefore substantially homologous to such a probe and to each other.
When used in reference to a single-stranded nucleic acid sequence, the term “substantially homologous” refers to any probe that can hybridize (i.e., it is the complement of) the single-stranded nucleic acid sequence under conditions of low stringency as described above.
As used herein, the term “competes for binding” is used in reference to a first polypeptide with an activity which binds to the same substrate as does a second polypeptide with an activity, where the second polypeptide is a variant of the first polypeptide or a related or dissimilar polypeptide. The efficiency (e.g., kinetics or thermodynamics) of binding by the first polypeptide may be the same as or greater than or less than the efficiency substrate binding by the second polypeptide. For example, the equilibrium binding constant (K D ) for binding to the substrate may be different for the two polypeptides. The term “K m ” as used herein refers to the Michaelis-Menton constant for an enzyme and is defined as the concentration of the specific substrate at which a given enzyme yields one-half its maximum velocity in an enzyme catalyzed reaction.
As used herein, the term “hybridization” is used in reference to the pairing of complementary nucleic acids. Hybridization and the strength of hybridization (i.e., the strength of the association between the nucleic acids) is impacted by such factors as the degree of complementary between the nucleic acids, stringency of the conditions involved, the T m of the formed hybrid, and the G:C ratio within the nucleic acids.
As used herein, the term “T m ” is used in reference to the “melting temperature.” The melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half dissociated into single strands. The equation for calculating the T m of nucleic acids is well known in the art. As indicated by standard references, a simple estimate of the T m value may be calculated by the equation: T m =81.5+0.41(% G+C), when a nucleic acid is in aqueous solution at 1 M NaCl (See e.g., Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid Hybridization [ 1985]). Other references include more sophisticated computations that take structural as well as sequence characteristics into account for the calculation of T m .
As used herein the term “stringency” is used in reference to the conditions of temperature, ionic strength, and the presence of other compounds such as organic solvents, under which nucleic acid hybridizations are conducted. Those skilled in the art will recognize that “stringency” conditions may be altered by varying the parameters just described either individually or in concert. With “high stringency” conditions, nucleic acid base pairing will occur only between nucleic acid fragments that have a high frequency of complementary base sequences (e.g., hybridization under “high stringency” conditions may occur between homologs with about 85-100% identity, preferably about 70-100% identity). With medium stringency conditions, nucleic acid base pairing will occur between nucleic acids with an intermediate frequency of complementary base sequences (e.g., hybridization under “medium stringency” conditions may occur between homologs with about 50-70% identity). Thus, conditions of “weak” or “low” stringency are often required with nucleic acids that are derived from organisms that are genetically diverse, as the frequency of complementary sequences is usually less.
“High stringency conditions” when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5× SSPE (43.8 g/l NaCl, 6.9 g/l NaH 2 PO 4 .H 2 O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5× Denhardt's reagent and 100 μg/ml denatured salmon sperm DNA followed by washing in a solution comprising 0.1× SSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
“Medium stringency conditions” when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5× SSPE (43.8 g/l NaCl, 6.9 g/l NaH 2 PO 4 .H 2 O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5× Denhardt's reagent and 100 μg/ml denatured salmon sperm DNA followed by washing in a solution comprising 1.0× SSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
“Low stringency conditions” comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5× SSPE (43.8 g/l NaCl, 6.9 g/l NaH 2 PO 4 .H 2 O and 1.85 g/l EDTA, pH adjusted to 7.4 with NaOH), 0.1% SDS, 5× Denhardt's reagent [50× Denhardt's contains per 500 ml: 5 g Ficoll (Type 400, Pharamcia), 5 g BSA (Fraction V; Sigma)] and 100 μg/ml denatured salmon sperm DNA followed by washing in a solution comprising 5× SSPE, 0.1% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
The following terms are used to describe the sequence relationships between two or more polynucleotides: “reference, sequence”, “sequence identity”, “percentage of sequence identity”, and “substantial identity”. A “reference sequence” is a defined sequence used as a basis for a sequence comparison; a reference sequence may be a subset of a larger sequence, for example, as a segment of a full-length cDNA sequence given in a sequence listing or may comprise a complete gene sequence. Generally, a reference sequence is at least 20 nucleotides in length, frequently at least 25 nucleotides in length, and often at least 50 nucleotides in length. Since two polynucleotides may each (1) comprise a sequence (i.e., a portion of the complete polynucleotide sequence) that is similar between the two polynucleotides, and (2) may further comprise a sequence that is divergent between the two polynucleotides, sequence comparisons between two (or more) polynucleotides are typically performed by comparing sequences of the two polynucleotides over a “comparison window” to identify and compare local regions of sequence similarity. A “comparison window”, as used herein, refers to a conceptual segment of at least 20 contiguous nucleotide positions wherein a polynucleotide sequence may be compared to a reference sequence of at least 20 contiguous nucleotides and wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or less as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. Optimal alignment of sequences for aligning a comparison window may be conducted by the local homology algorithm of Smith and Waterman [Smith and Waterman, Adv. Appl. Math. 2: 482 (1981)] by the homology alignment algorithm of Needleman and Wunsch [Needleman and Wunsch, J. Mol. Biol. 48:443 (1970)], by the search for similarity method of Pearson and Lipman [Pearson and Lipman, Proc. Natl. Acad. Sci . ( U.S.A ) 85:2444 (1988)], by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by inspection, and the best alignment (i.e., resulting in the highest percentage of homology over the comparison window) generated by the various methods is selected. The term “sequence identity” means that two polynucleotide sequences are identical (i.e., on a nucleotide-by-nucleotide basis) over the window of comparison. The term “percentage of sequence identity” is calculated by comparing two optimally aligned sequences over the window of comparison, determining the number of positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I) occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of sequence identity. The terms “substantial identity” as used herein denotes a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 85 percent sequence identity, preferably at least 90 to 95 percent sequence identity, more usually at least 99 percent sequence identity as compared to a reference sequence over a comparison window of at least 20 nucleotide positions, frequently over a window of at least 25-50 nucleotides, wherein the percentage of sequence identity is calculated by comparing the reference sequence to the polynucleotide sequence which may include deletions or additions which total 20 percent or less of the reference sequence over the window of comparison. The reference sequence may be a subset of a larger sequence, for example, as a segment of the full-length sequences of the compositions claimed in the present invention (e.g., Nod2)
As applied to polypeptides, the term “substantial identity” means that two peptide sequences, when optimally aligned, such as by the programs GAP or BESTFIT using default gap weights, share at least 80 percent sequence identity, preferably at least 90 percent sequence identity, more preferably at least 95 percent sequence identity or more (e.g., 99 percent sequence identity). Preferably, residue positions which are not identical differ by conservative amino acid substitutions. Conservative amino acid substitutions refer to the interchangeability of residues having similar side chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having-aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side chains is cysteine and methionine. Preferred conservative amino acids substitution groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-glutamine.
The term “fragment” as used herein refers to a polypeptide that has an amino-terminal and/or carboxy-terminal deletion as compared to the native protein, but where the remaining amino acid sequence is identical to the corresponding positions in the amino acid sequence deduced from a full-length cDNA sequence. Fragments typically are at least 4 amino acids long, preferably at least 20 amino acids long, usually at least 50 amino acids long or longer, and span the portion of the polypeptide required for intermolecular binding of the compositions(claimed in the present invention) with its various ligands and/or substrates.
The term “polymorphic locus” is a locus present in a population which shows variation between members of the population (i.e., the most common allele has a frequency of less than 0.95). In contrast, a “monomorphic locus” is a genetic locus at little or no variations seen between members of the population (generally taken to be a locus at which the most common allele exceeds a frequency of 0.95 in the gene pool of the population).
The term “naturally-occurring” as used herein as applied to an object refers to the fact that an object can be found in nature. For example, a polypeptide or polynucleotide sequence that is present in an organism (including viruses) that can be isolated from a source in nature and which has not been intentionally modified by man in the laboratory is naturally-occurring.
“Amplification” is a special case of nucleic acid replication involving template specificity. It is to be contrasted with non-specific template replication (i.e., replication that is template-dependent but not dependent on a specific template). Template specificity is here distinguished from fidelity of replication (i.e., synthesis of the proper polynucleotide sequence) and nucleotide (ribo- or deoxyribo-) specificity. Template specificity is frequently described in terms of “target” specificity. Target sequences are “targets” in the sense that they are sought to be sorted out from other nucleic acid. Amplification techniques have been designed primarily for this sorting out.
Template specificity is achieved in most amplification techniques by the choice of enzyme. Amplification enzymes are enzymes that, under conditions they are used, will process only specific sequences of nucleic acid in a heterogeneous mixture of nucleic acid. For example, in the case of Qβ replicase, MDV-1 RNA is the specific template for the replicase (D. L. Kacian et al., Proc. Natl. Acad. Sci. USA 69:3038 [1972]). Other nucleic acid will not be replicated by this amplification enzyme. Similarly, in the case of T7 RNA polymerase, this amplification enzyme has a stringent specificity for its own promoters (M. Chamberlin et al., Nature 228:227 [1970]). In the case of T4 DNA ligase, the enzyme will not ligate the two oligonucleotides or polynucleotides, where there is a mismatch between the oligonucleotide or polynucleotide substrate and the template at the ligation junction (D. Y. Wu and R. B. Wallace, Genomics 4:560 [1989]). Finally, Taq and Pfu polymerases, by virtue of their ability to function at high temperature, are found to display high specificity for the sequences bounded and thus defined by the primers; the high temperature results in thermodynamic conditions that favor primer hybridization with the target sequences and not hybridization with non-target sequences (H. A. Erlich (ed.), PCR Technology , Stockton Press [1989]).
As used herein, the term “amplifiable nucleic acid” is used in reference to nucleic acids that may be amplified by any amplification method. It is contemplated that “amplifiable nucleic acid” will usually comprise “sample template.”
As used herein, the term “sample template” refers to nucleic acid originating from a sample that is analyzed for the presence of “target” (defined below). In contrast, “background template” is used in reference to nucleic acid other than sample template that may or may not be present in a sample. Background template is most often inadvertent. It may be the result of carryover, or it may be due to the presence of nucleic acid contaminants sought to be purified away from the sample. For example, nucleic acids from organisms other than those to be detected may be present as background in a test sample.
As used herein, the term “primer” refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is induced, (i.e., in the presence of nucleotides and an inducing agent such as DNA polymerase and at a suitable temperature and pH). The primer is preferably single stranded for maximum efficiency in amplification, but may alternatively be double stranded. If double stranded, the primer is first treated to separate its strands before being used to prepare extension products. Preferably, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long to prime the synthesis of extension products in the presence of the inducing agent. The exact lengths of the primers will depend on many factors, including temperature, source of primer and the use of the method.
As used herein, the term “probe” refers to an oligonucleotide (i.e., a sequence of nucleotides), whether occurring naturally as in a purified restriction digest or produced synthetically, recombinantly or by PCR amplification, that is capable of hybridizing to another oligonucleotide of interest. A probe may be single-stranded or double-stranded. Probes are useful in the detection, identification and isolation of particular gene sequences. It is contemplated that any probe used in the present invention will be labelled with any “reporter molecule,” so that is detectable in any detection system, including, but not limited to enzyme (e.g., ELISA, as well as enzyme-based histochemical assays), fluorescent, radioactive, and luminescent systems. It is not intended that the present invention be limited to any particular detection system or label.
As used herein, the term “target,” when used in reference to the polymerase chain reaction, refers to the region of nucleic acid bounded by the primers used for polymerase chain reaction. Thus, the “target” is sought to be sorted out from other nucleic acid sequences. A “segment” is defined as a region of nucleic acid within the target sequence.
As used herein, the term “polymerase chain reaction” (“PCR”) refers to the method of K. B. Mullis U.S. Pat. Nos. 4,683,195, 4,683,202, and 4,965,188, hereby incorporated by reference, that describe a method for increasing the concentration of a segment of a target sequence in a mixture of genomic DNA without cloning or purification. This process for amplifying the target sequence consists of introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired target sequence, followed by a precise sequence of thermal cycling in the presence of a DNA polymerase. The two primers are complementary to their respective strands of the double stranded target sequence. To effect amplification, the mixture is denatured and the primers then annealed to their complementary sequences within the target molecule. Following annealing, the primers are extended with a polymerase so as to form a new pair of complementary strands. The steps of denaturation, primer annealing, and polymerase extension can be repeated many times (i.e., denaturation, annealing and extension constitute one “cycle”; there can be numerous “cycles”) to obtain a high concentration of an amplified segment of the desired target sequence. The length of the amplified segment of the desired target sequence is determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter. By virtue of the repeating aspect of the process, the method is referred to as the “polymerase chain reaction” (hereinafter “PCR”). Because the desired amplified segments of the target sequence become the predominant sequences (in terms of concentration) in the mixture, they are said to be “PCR amplified.”
With PCR, it is possible to amplify a single copy of a specific target sequence in genomic DNA to a level detectable by several different methodologies (e.g., hybridization with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme conjugate detection; incorporation of 32 P-labeled deoxynucleotide triphosphates, such as dCTP or dATP, into the amplified segment). In addition to genomic DNA, any oligonucleotide or polynucleotide sequence can be amplified with the appropriate set of primer molecules. In particular, the amplified segments created by the PCR process itself are, themselves, efficient templates for subsequent PCR amplifications.
As used herein, the terms “PCR product,” “PCR fragment,” and “amplification product” refer to the resultant mixture of compounds after two or more cycles of the PCR steps of denaturation, annealing and extension are complete. These terms encompass the case where there has been amplification of one or more segments of one or more target sequences.
As used herein, the term “amplification reagents” refers to those reagents (deoxyribonucleotide triphosphates, buffer, etc.), needed for amplification except for primers, nucleic acid template, and the amplification enzyme. Typically, amplification reagents along with other reaction components are placed and contained in a reaction vessel (test tube, microwell, etc.).
As used herein, the terms “restriction endonucleases” and “restriction enzymes” refer to bacterial enzymes, each of which cut double-stranded DNA at or near a specific nucleotide sequence.
As used herein, the term “recombinant DNA molecule” as used herein refers to a DNA molecule that is comprised of segments of DNA joined together by means of molecular biological techniques.
As used herein, the term “antisense” is used in reference to RNA sequences that are complementary to a specific RNA sequence (e.g., mRNA). Included within this definition are antisense RNA (“asRNA”) molecules involved in gene regulation by bacteria. Antisense RNA may be produced by any method, including synthesis by splicing the gene(s) of interest in a reverse orientation to a viral promoter that permits the synthesis of a coding strand. Once introduced into an embryo, this transcribed strand combines with natural mRNA produced by the embryo to form duplexes. These duplexes then block either the further transcription of the mRNA or its translation. In this manner, mutant phenotypes may be generated. The term “antisense strand” is used in reference to a nucleic acid strand that is complementary to the “sense” strand. The designation (−) (i.e., “negative”) is sometimes used in reference to the antisense strand, with the designation (+) sometimes used in reference to the sense (i.e., “positive”) strand.
The term “isolated” when used in relation to a nucleic acid, as in “an isolated oligonucleotide” or “isolated polynucleotide” refers to a nucleic acid sequence that is identified and separated from at least one contaminant nucleic acid with which it is ordinarily associated in its natural source. Isolated nucleic acid is present in a form or setting that is different from that in which it is found in nature. In contrast, non-isolated nucleic acids are nucleic acids such as DNA and RNA found in the state they exist in nature. For example, a given DNA sequence (e.g., a gene) is found on the host cell chromosome in proximity to neighboring genes; RNA sequences, such as a specific mRNA sequence encoding a specific protein, are found in the cell as a mixture with numerous other mRNAs that encode a multitude of proteins. However, isolated nucleic acid encoding Nod2 includes, by way of example, such nucleic acid in cells ordinarily expressing Nod2 where the nucleic acid is in a chromosomal location different from that of natural cells, or is otherwise flanked by a different nucleic acid sequence than that found in nature. The isolated nucleic acid, oligonucleotide, or polynucleotide may be present in single-stranded or double-stranded form. When an isolated nucleic acid, oligonucleotide or polynucleotide is to be utilized to express a protein, the oligonucleotide or polynucleotide will contain at a minimum the sense or coding strand (i.e., the oligonucleotide or polynucleotide may single-stranded), but may contain both the sense and anti-sense strands (i.e., the oligonucleotide or polynucleotide may be double-stranded).
As used herein, a “portion of a chromosome” refers to a discrete section of the chromosome. Chromosomes are divided into sites or sections by cytogeneticists as follows: the short (relative to the centromere) arm of a chromosome is termed the “p” arm; the long arm is termed the “q” arm. Each arm is then divided into 2 regions termed region 1 and region 2 (region 1 is closest to the centromere). Each region is further divided into bands. The bands may be further divided into sub-bands. For example, the 11p15.5 portion of human chromosome 11 is the portion located on chromosome 11 (11) on the short arm (p) in the first region (1) in the 5th band (5) in sub-band 5 (0.5). A portion of a chromosome may be “altered;” for instance the entire portion may be absent due to a deletion or may be rearranged (e.g., inversions, translocations, expanded or contracted due to changes in repeat regions). In the case of a deletion, an attempt to hybridize (i.e., specifically bind) a probe homologous to a particular portion of a chromosome could result in a negative result (i.e., the probe could not bind to the sample containing genetic material suspected of containing the missing portion of the chromosome). Thus, hybridization of a probe homologous to a particular portion of a chromosome may be used to detect alterations in a portion of a chromosome.
The term “sequences associated with a chromosome” means preparations of chromosomes (e.g., spreads of metaphase chromosomes), nucleic acid extracted from a sample containing chromosomal DNA (e.g., preparations of genomic DNA); the RNA that is produced by transcription of genes located on a chromosome (e.g., hnRNA and mRNA), and cDNA copies of the RNA transcribed from the DNA located on a chromosome. Sequences associated with a chromosome may be detected by numerous techniques including probing of Southern and Northern blots and in situ hybridization to RNA, DNA, or metaphase chromosomes with probes containing sequences homologous to the nucleic acids in the above listed preparations.
As used herein the term “portion” when in reference to a nucleotide sequence (as in “a portion of a given nucleotide sequence”) refers to fragments of that sequence. The fragments may range in size from four nucleotides to the entire nucleotide sequence minus one nucleotide (10 nucleotides, 20, 30, 40, 50, 100, 200, etc.).
As used herein the term “coding region” when used in reference to structural gene refers to the nucleotide sequences that encode the amino acids found in the nascent polypeptide as a result of translation of a mRNA molecule. The coding region is bounded, in eukaryotes, on the 5′ side by the nucleotide triplet “ATG” that encodes the initiator methionine and on the 3′ side by one of the three triplets which specify stop codons (i.e., TAA, TAG, TGA).
As used herein, the term “purified” or “to purify” refers to the removal of contaminants from a sample. For example, Nod2 antibodies are purified by removal of contaminating non-immunoglobulin proteins; they are also purified by the removal of 15 immunoglobulin that does not bind Nod2. The removal of non-immunoglobulin proteins and/or the removal of immunoglobulins that do not bind Nod2 results in an increase in the percent of Nod2-reactive immunoglobulins in the sample. In another example, recombinant Nod2 polypeptides are expressed in bacterial host cells and the polypeptides are purified by the removal of host cell proteins; the percent of recombinant Nod2 polypeptides is thereby increased in the sample.
The term “recombinant DNA molecule”