Title:
Nucleic acids encoding B7-Like molecules and uses thereof
Document Type and Number:
United States Patent 7414122

Abstract:
The present invention provides B7-Like (B7-L) polypeptides and nucleic acid molecules encoding the same. The invention also provides selective binding agents, vectors, host cells, and methods for producing B7-L polypeptides. The invention further provides pharmaceutical compositions and methods for the diagnosis, treatment, amelioration, and/or prevention of diseases, disorders, and conditions associated with B7-L polypeptides.

Inventors:
Fox, Gary M. (Newbury Park, CA, US)
Sullivan, John K. (Newbury Park, CA, US)
Holst, Paige (Camarillo, CA, US)
Yoshinaga, Steven Kiyoshi (Thousand Oaks, CA, US)
      Plaque It!

Sponsored by:
Flash of Genius
Application Number:
10/998526
Publication Date:
08/19/2008
Filing Date:
11/29/2004
View Patent Images:
Images are available in PDF form when logged in. To view PDFs, Login  or  Create Account (Free!)
Assignee:
Amgen Inc. (Thousand Oaks, CA, US)
Primary Class:
Other Classes:
435/455, 514/44, 435/69.1, 435/320.1, 435/252.3
International Classes:
C07H21/04; C12N15/00; C12N1/21; C12N5/16; A61K31/713
US Patent References:
6635750B7-H2 nucleic acids, members of the B7 familyOctober, 2003Coyle et al.536/23.5
6965018Antibodies directed to B7-related polypeptide, BSL-2November, 2005Mikesell et al.530/388.1
20020110836PD-L2 molecules: novel PD-1 ligands and uses thereforAugust, 2002Freeman et al.435/7.2
20030208058B7-like polypeptides and polynucleotidesNovember, 2003Fiscella et al.536/23.5
Foreign References:
EP1074617February, 2001Primers for synthesising full-length cDNA and their use
WO/2000/036107June, 2000COMPOSITIONS AND METHODS FOR THERAPY AND DIAGNOSIS OF OVARIAN CANCER
WO/2000/055375September, 2000SECRETED PROTEINS AND POLYNUCLEOTIDES ENCODING THEM
WO/2000/061612October, 2000COMPOUNDS AND METHODS FOR THERAPY AND DIAGNOSIS OF LUNG CANCER
WO/2001/034659May, 2001MODIFIED POLYMERS PREPARED WITH LANTHANIDE-BASED CATALYSTS
Other References:
Attwood T., Science 2000; 290:471-473.
Skolnick et al., Trends in Biotech. 2000; 18(1)34-39.
Metzler et al., Nature Structural Biol. 1997; 4:527-531.
Burgess et al, J Cell Biol. 111:2129-2138, 1990.
Lazar et al., Mol Cell Biol. 8:1247-1252, 1988.
Bowie et al. Science, 247:1306-1310, 1990.
Henry et al., “Structure and evolution of the extended B7 family” Immunology Today, vol. 20, No. 6, Jun. 1999, pp. 285-288.
Database EMBL ′Online! Feb. 22, 2000, Isogai T et al.: Homo sapiens cDNA FLJ11010 fis, clone PLACE1003145 Database accession No. AK001872, (2000).
Database EMBL ′Online! Jun. 1, 1999, Tseng et al.: “Mus musculus butyrophilin-like protein (btdc) mRNA, complete cds” retrieved from EBI Database accession No. AF142780, (1999).
Database EMBL ′Online Mar. 9, 2001, Latchman et al.: “Homo sapiens PD-1-ligand 2 protein (PDL2) m RNA, complete cds”, Database accession No. AF344424, (2001).
Database EMBL, Online “Homo sapiens butyrophilin precursor B7-DC mRNA, complete cds”, Database Accession No. AF329193.1, (2000).
Yoshinaga et al., T-cell co-stimulation through B7RP-1 and ICOS, Nature, 402:827-32 (1999).
Ishida et al.,“Induced expression of PD-1, a novel member of the immunoglobulin gene superfamily, upon programmed cell death”, EMBO J. 11:3887-95 (1992).
Dong et al., B7-H1, a third member of the B7 family, co-stimulates T-cell proliferation and interleukin-10 secretion, Nat. Med. 5:1365-69 (1999).
Latchman et al., “PD-L2 is a second ligand for PD-1 and inhibits T cell activation”, 2001 Nature Immunology 2:261-68 (2001).
Nishimura et al., “Development of Lupus-like Autoimmune Diseases by Disruption of the PD-1 Gene Encoding an ITIM Motif-Carrying Immunoreceptor”, Immunity, 11:141-51 (1999).
Tseng et al., B7-DC, a New Dendritic Cell Molecule with Potent Costimulatory Properties for T Cells, J. Exp. Med. 193:839-45(2001).
Primary Examiner:
Ouspenski, Ilia
Attorney, Agent or Firm:
McDonnell Boehnen Hulbert & Berghoff LLP
Parent Case Data:
This application is a continuation of U.S. application Ser. No. 09/955,866, filed on Sep. 19, 2001, which claims the benefit of priority of U.S. Provisional Application No. 60/233,867, filed on Sep. 20, 2000, the disclosure of each of which is explicitly incorporated by reference herein.
Claims:
What claimed is:

1. An isolated nucleic acid molecule comprising the nucleotide sequence as set forth in SEQ ID NO: 1.

2. A vector comprising the nucleic acid molecule of claim 1.

3. An isolated host cell comprising the vector of claim 2.

4. The host cell of claim 3 that is a eukaryotic cell.

5. The host cell of claim 3 that is a prokaryotic cell.

6. A process of producing a B7-L polypeptide comprising culturing the host cell of claim 3 under suitable conditions to express the polypeptide, and optionally isolating the polypeptide from the culture.

7. The process of claim 6, wherein the nucleic acid molecule comprises promoter DNA other than the promoter DNA for the native B7-L polypeptide operatively linked to the DNA encoding the B7-L polypeptide.

8. A composition comprising a nucleic acid molecule of claim 1 and a pharmaceutically acceptable formulation agent.

9. The composition of claim 8, wherein said nucleic acid molecule is contained in a viral vector.

10. A viral vector comprising a nucleic acid molecule of any of claim 1.

11. A nucleic acid molecule of claim 1, attached to a solid support.

12. An array of nucleic acid molecules comprising at least one nucleic acid molecule of claim 1.

Description:

FIELD OF THE INVENTION

The present invention relates to B7-Like (B7-L) polypeptides and nucleic acid molecules encoding the same. The invention also relates to selective binding agents, vectors, host cells, and methods for producing B7-L polypeptides. The invention further relates to pharmaceutical compositions and methods for the diagnosis, treatment, amelioration, and/or prevention of diseases, disorders, and conditions associated with B7-L polypeptides.

BACKGROUND OF THE INVENTION

Technical advances in the identification, cloning, expression, and manipulation of nucleic acid molecules and the deciphering of the human genome have greatly accelerated the discovery of novel therapeutics. Rapid nucleic acid sequencing techniques can now generate sequence information at unprecedented rates and, coupled with computational analyses, allow the assembly of overlapping sequences into partial and entire genomes and the identification of polypeptide-encoding regions. A comparison of a predicted amino acid sequence against a database compilation of known amino acid sequences allows one to determine the extent of homology to previously identified sequences and/or structural landmarks. The cloning and expression of a polypeptide-encoding region of a nucleic acid molecule provides a polypeptide product for structural and functional analyses. The manipulation of nucleic acid molecules and encoded polypeptides may confer advantageous properties on a product for use as a therapeutic.

In spite of the significant technical advances in genome research over the past decade, the potential for the development of novel therapeutics based on the human genome is still largely unrealized. Many genes encoding potentially beneficial polypeptide therapeutics or those encoding polypeptides, which may act as “targets” for therapeutic molecules, have still not been identified. Accordingly, it is an object of the invention to identify novel polypeptides, and nucleic acid molecules encoding the same, which have diagnostic or therapeutic benefit.

SUMMARY OF THE INVENTION

A novel polypeptide, which is related to proteins in the T-cell costimulatory pathway, has been identified. This polypeptide, termed B7-like polypeptide (B7-L), represents a B7-related polypeptide having a structure similar to and sharing homology with CD80, CD86, B7RP-1, and B7-H1. Human B7-L polypeptide shares a greater amino acid identity with human B7-H1 (35%) than with other members of the B7 family (B7rp-1, 20%; B7-1, 12%). In addition, the mouse and human orthologs of B7-L polypeptide and B7-H1 share a greater degree of amino acid similarity (approximately 70%) than the mouse and human orthologs of B7-1, B7-2, or B7rp-1 (approximately 40%). Furthermore, both B7-H1 and B7-L bind to the same receptor, PD-1.

The present invention relates to novel B7-L nucleic acid molecules and encoded polypeptides.

The invention provides for an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:

(a) the nucleotide sequence as set forth in SEQ ID NO: 1;

(b) the nucleotide sequence of the DNA insert in ATCC Deposit No. PTA 2481;

(c) a nucleotide sequence encoding the polypeptide as set forth in SEQ ID NO: 2;

(d) a nucleotide sequence that hybridizes under at least moderately stringent conditions to the complement of the nucleotide sequence of any of (a)-(c); and

(e) a nucleotide sequence complementary to the nucleotide sequence of any of (a)-(c).

The invention also provides for an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:

(a) a nucleotide sequence encoding a polypeptide that is at least about 70 percent identical to the polypeptide as set forth in SEQ ID NO: 2 or the nucleotide sequence of the DNA insert in ATCC Deposit No. PTA 2481, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2;

(b) a nucleotide sequence encoding an allelic variant or splice variant of the nucleotide sequence as set forth in SEQ ID NO: 1, the nucleotide sequence of the DNA insert in ATCC Deposit No. PTA 2481, or the nucleotide sequence of (a);

(c) a region of the nucleotide sequence of SEQ ID NO: 1, the nucleotide sequence of the DNA insert in ATCC Deposit No. PTA 2481, or the nucleotide sequence of (a) or (b) encoding a polypeptide fragment of at least about 25 amino acid residues, wherein the polypeptide fragment has an activity of the polypeptide set forth in SEQ ID NO: 2, or is antigenic;

(d) a region of the nucleotide sequence of SEQ ID NO: 1, the nucleotide sequence of the DNA insert in ATCC Deposit No. PTA 2481, or the nucleotide sequence of either (c) or (d) comprising a fragment of at least about 16 nucleotides;

(e) a nucleotide sequence that hybridizes under at least moderately stringent conditions to the complement of the nucleotide sequence of any of (a)-(d); and

(f) a nucleotide sequence complementary to the nucleotide sequence of any of (a)-(d).

The invention further provides for an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:

(a) a nucleotide sequence encoding a polypeptide as set forth in SEQ ID NO: 2 with at least one conservative amino acid substitution, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2;

(b) a nucleotide sequence encoding a polypeptide as set forth in SEQ ID NO: 2 with at least one amino acid insertion, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2;

(c) a nucleotide sequence encoding a polypeptide as set forth in SEQ ID NO: 2 with at least one amino acid deletion, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2;

(d) a nucleotide sequence encoding a polypeptide as set forth in SEQ ID NO: 2 that has a C- and/or N-terminal truncation, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2;

(e) a nucleotide sequence encoding a polypeptide as set forth in SEQ ID NO: 2 with at least one modification selected from the group consisting of amino acid substitutions, amino acid insertions, amino acid deletions, C-terminal truncation, and N-terminal truncation, wherein the encoded polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2;

(f) a nucleotide sequence of any of (a)-(e) comprising a fragment of at least about 16 nucleotides;

(g) a nucleotide sequence that hybridizes under at least moderately stringent conditions to the complement of the nucleotide sequence of any of (a)-(f); and

(h) a nucleotide sequence complementary to the nucleotide sequence of any of (a)-(e).

The present invention provides for an isolated polypeptide comprising an amino acid sequence selected from the group consisting of:

(a) the amino acid sequence as set forth in SEQ ID NO: 2; and

(b) the amino acid sequence encoded by the DNA insert of ATCC Deposit No. PTA 2481.

The invention also provides for an isolated polypeptide comprising an amino acid sequence selected from the group consisting of:

(a) the amino acid sequence as set forth in SEQ ID NO: 3, optionally further comprising an amino-terminal methionine;

(b) an amino acid sequence for an ortholog of SEQ ID NO: 2;

(c) an amino acid sequence that is at least about 70 percent identical to the amino acid sequence of SEQ ID NO: 2 or the amino acid sequence encoded by the DNA insert of ATCC Deposit No. PTA 2481, wherein the polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2;

(d) a fragment of the amino acid sequence set forth in SEQ ID NO: 2 or the amino acid sequence encoded by the DNA insert of ATCC Deposit No. PTA 2481 comprising at least about 25 amino acid residues, wherein the fragment has an activity of the polypeptide set forth in SEQ ID NO: 2, or is antigenic; and

(e) an amino acid sequence for an allelic variant or splice variant of the amino acid sequence as set forth in SEQ ID NO: 2, the amino acid sequence encoded by the DNA insert of ATCC Deposit No. PTA 2481, or the amino acid sequence of any of (a)-(c).

The invention further provides for an isolated polypeptide comprising an amino acid sequence selected from the group consisting of:

(a) the amino acid sequence as set forth in SEQ ID NO: 2 with at least one conservative amino acid substitution, wherein the polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2;

(b) the amino acid sequence as set forth in SEQ ID NO: 2 with at least one amino acid insertion, wherein the polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2;

(c) the amino acid sequence as set forth in SEQ ID NO: 2 with at least one amino acid deletion, wherein the polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2;

(d) the amino acid sequence as set forth in SEQ ID NO: 2 that has a C- and/or N-terminal truncation, wherein the polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2; and

(e) the amino acid sequence as set forth in SEQ ID NO: 2 with at least one modification selected from the group consisting of amino acid substitutions, amino acid insertions, amino acid deletions, C-terminal truncation, and N-terminal truncation, wherein the polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2.

The invention still further provides for an isolated polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 2 with at least one conservative amino acid substitution selected from the group consisting of: valine at position 4; isoleucine or valine at position 6; leucine or valine at position 7; methionine or valine at position 8; isoleucine at position 10; leucine or valine at position 17; glycine at position 19; serine at position 22; leucine at position 23; aspartic acid at position 28; leucine or valine at position 31; valine at position 32; isoleucine at position 40; leucine at position 50; valine at position 52; valine, leucine, or methionine at position 55; arginine at position 61; methionine at position 62; lysine at position 70; serine at position 74; isoleucine or methionine at position 75; valine at position 76; aspartic acid at position 78; methionine or isoleucine at position 80; arginine at position 84; leucine at position 89; asparagine at position 91; isoleucine or leucine at position 92; isoleucine or leucine at position 94; glutamic acid at position 96; aspartic acid at position 97; phenylalanine at position 100; valine at position 104; leucine at position 105; arginine at position 107; tyrosine at position 110; glutamic acid at position 111; valine or isoleucine at position 115; serine at position 116; valine at position 117; glycine at position 121; valine or methionine at position 126; valine or isoleucine at position 131; isoleucine or leucine at position 139; isoleucine at position 141; serine at position 146; phenylalanine at position 148; isoleucine, methionine, or leucine at position 153; isoleucine at position 160; threonine at position 165; aspartic acid at position 171; phenylalanine at position 174; serine at position 177; threonine at position 178; valine at position 180; methionine, valine, or isoleucine at position 182; arginine at position 183; lysine at position 188; isoleucine or leucine at position 193; lysine at position 200; valine at position 202; isoleucine at position 204; valine or methionine at position 209; isoleucine at position 213; isoleucine or valine at position 222; valine at position 223; leucine at position 225; valine or leucine at position 227; leucine or valine at position 231; leucine at position 232; valine or leucine at position 235; methionine or isoleucine at position 240; arginine at position 250; arginine at position 255; glutamic acid at position 256; serine at position 264; arginine at position 266; and leucine at position 268; wherein the polypeptide has an activity of the polypeptide as set forth in SEQ ID NO: 2.

Also provided are fusion polypeptides comprising B7-L amino acid sequences.

The present invention also provides for an expression vector comprising the isolated nucleic acid molecules as set forth herein, recombinant host cells comprising the recombinant nucleic acid molecules as set forth herein, and a method of producing a B7-L polypeptide comprising culturing the host cells and optionally isolating the polypeptide so produced.

A transgenic non-human animal comprising a nucleic acid molecule encoding a B7-L polypeptide is also encompassed by the invention. The B7-L nucleic acid molecules are introduced into the animal in a manner that allows expression and increased levels of a B7-L polypeptide, which may include increased circulating levels. Alternatively, the B7-L nucleic acid molecules are introduced into the animal in a manner that prevents expression of endogenous B7-L polypeptide (i.e., generates a transgenic animal possessing a B7-L polypeptide gene knockout). The transgenic non-human animal is preferably a mammal, and more preferably a rodent, such as a rat or a mouse.

Also provided are derivatives of the B7-L polypeptides of the present invention.

Additionally provided are selective binding agents such as antibodies and peptides capable of specifically binding the B7-L polypeptides of the invention. Such antibodies and peptides may be agonistic or antagonistic.

Pharmaceutical compositions comprising the nucleotides, polypeptides, or selective binding agents of the invention and one or more pharmaceutically acceptable formulation agents are also encompassed by the invention. The pharmaceutical compositions are used to provide therapeutically effective amounts of the nucleotides or polypeptides of the present invention. The invention is also directed to methods of using the polypeptides, nucleic acid molecules, and selective binding agents.

The B7-L polypeptides and nucleic acid molecules of the present invention may be used to treat, prevent, ameliorate, and/or detect diseases and disorders, including those recited herein.

The present invention also provides a method of assaying test molecules to identify a test molecule that binds to a B7-L polypeptide. The method comprises contacting a B7-L polypeptide with a test molecule to determine the extent of binding of the test molecule to the polypeptide. The method further comprises determining whether such test molecules are agonists or antagonists of a B7-L polypeptide. The present invention further provides a method of testing the impact of molecules on the expression of B7-L polypeptide or on the activity of B7-L polypeptide.

Methods of regulating expression and modulating (i.e., increasing or decreasing) levels of a B7-L polypeptide are also encompassed by the invention. One method comprises administering to an animal a nucleic acid molecule encoding a B7-L polypeptide. In another method, a nucleic acid molecule comprising elements that regulate or modulate the expression of a B7-L polypeptide may be administered. Examples of these methods include gene therapy, cell therapy, and anti-sense therapy as further described herein.

In another aspect of the present invention, the B7-L polypeptides may be used for identifying receptors thereof (“B7-L polypeptide receptors”). Various forms of “expression cloning” have been extensively used to clone receptors for protein ligands. See, e.g., Simonsen and Lodish, 1994 , Trends Pharmacol. Sci. 15:437-41 and Tartaglia et al., 1995 , Cell 83:1263-71. The isolation of a B7-L polypeptide receptor is useful for identifying or developing novel agonists and antagonists of the B7-L polypeptide signaling pathway. Such agonists and antagonists include soluble B7-L polypeptide receptors, anti-B7-L polypeptide receptor-selective binding agents (such as antibodies and derivatives thereof), small molecules, and antisense oligonucleotides, any of which can be used for treating one or more disease or disorder, including those disclosed herein.

BRIEF DESCRIPTION OF THE FIGURES

FIGS. 1A-1B illustrate the nucleotide sequence of the human B7-L gene (SEQ ID NO: 1) and the deduced amino acid sequence of human B7-L polypeptide (SEQ ID NO: 2). The predicted signal peptide ( underline ) and transmembrane domain ( double underline ) are indicated;

FIGS. 2A-2C illustrate an amino acid sequence alignment of human B7-L polypeptide (GA16817596; SEQ ID NO: 2), human CD80 or B7-1 (Cd80_Human; SEQ ID NO: 4; GenBank accession no. P33681), human CD86 or B7-2 (Cd86_Human; SEQ ID NO: 5; GenBank accession no. U04343), human B7-H1 (B7-H1_Human; SEQ ID NO: 6; GenBank accession no. AF177937), human B7rp-1 (B7rp-1_Human; SEQ ID NO: 7; GenBank accession no. AF199028), human PRO352 (Pro352_Human; SEQ ID NO: 8; GenBank accession no. Y41705), human butyrophilin BTF1 (Btf1_Human; SEQ ID NO: 9; GenBank accession no. U90543), human butyrophilin BTF2 (Btsf2a2_Hu; SEQ ID NO: 10; GenBank accession no. U90550), human butyrophilin BTF4 (Btf4_Human; SEQ ID NO: 11; GenBank accession no. U90546), human butyrophilin BTF3 (Btn3a3_Human; SEQ ID NO: 12; GenBank accession no. U90548), and butyrophilin (Btn_Human; SEQ ID NO: 13; GenBank accession no. U39576);

FIGS. 3A-3E illustrate a portion of the genomic nucleotide sequence for human B7-L polypeptide (SEQ ID NO: 14). The location of the deduced amino acid sequence of exon 1 (SEQ ID NO: 19) is indicated.

FIG. 4 illustrates a portion of the genomic nucleotide sequence for human B7-L polypeptide (SEQ ID NO: 15).

FIGS. 5A-5F illustrate a portion of the genomic nucleotide sequence for human B7-L polypeptide (SEQ ID NO: 16). The location of the deduced amino acid sequence of exon 2 (SEQ ID NO: 20) is indicated.

FIGS. 6A-6B illustrate a portion of the genomic nucleotide sequence for human B7-L polypeptide (SEQ ID NO: 17). The location of the deduced amino acid sequence of exon 3 (SEQ ID NO: 21) is indicated.

FIGS. 7A-7M illustrate a portion of the genomic nucleotide sequence for human B7-L polypeptide (SEQ ID NO: 18). The locations of the deduced amino acid sequence of exons 4 (SEQ ID NO: 22), 5 (SEQ ID NO: 23), and 6 are indicated.

FIG. 8 shows the results for Northern blot analysis of human B7-L mRNA expression.

FIG. 9 shows the results for fluorescence-activated cell sorter (FACS) analysis of CHO D-cells transfected with vector alone (A), or with vectors encoding PD-1 (B), CRP-1/ICOS(C) or CD28 (D), and incubated with either FITC alone, or with the following fusion proteins: CRP-1/Fc, B7-L polypeptide/Fc, B7rp-1/Fc, B7-H1/Fc, or B7-2/Fc.

FIG. 10 shows the results obtained in anti-CD3 mediated T-cell proliferation assays using B7-L polypeptide/Fc, B7RP-1/Fc, or B7-1/Fc.

FIG. 11 shows the results of FACS analysis of human peripheral blood cells using FITC alone (A), Fc control (B), B7rp-1/Fc (C), B7-L polypeptide/Fc (D), B7-L polypeptide/Fc and anti-CD3 (E), or B7-L polypeptide/Fc and anti-CD19 (F).

DETAILED DESCRIPTION OF THE INVENTION

The section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described. All references cited in this application are expressly incorporated by reference herein.

Definitions

The terms “B7-L gene” or “B7-L nucleic acid molecule” or “B7-L polynucleotide” refer to a nucleic acid molecule comprising or consisting of a nucleotide sequence as set forth in SEQ ID NO: 1, a nucleotide sequence encoding the polypeptide as set forth in SEQ ID NO: 2, a nucleotide sequence of the DNA insert in ATCC Deposit No. PTA 2481, and nucleic acid molecules as defined herein.

The term “B7-L polypeptide allelic variant” refers to one of several possible naturally occurring alternate forms of a gene occupying a given locus on a chromosome of an organism or a population of organisms.

The term “B7-L polypeptide splice variant” refers to a nucleic acid molecule, usually RNA, which is generated by alternative processing of intron sequences in an RNA transcript of B7-L polypeptide amino acid sequence as set forth in SEQ ID NO: 2.

The term “isolated nucleic acid molecule” refers to a nucleic acid molecule of the invention that (1) has been separated from at least about 50 percent of proteins, lipids, carbohydrates, or other materials with which it is naturally found when total nucleic acid is isolated from the source cells, (2) is not linked to all or a portion of a polynucleotide to which the “isolated nucleic acid molecule” is linked in nature, (3) is operably linked to a polynucleotide which it is not linked to in nature, or (4) does not occur in nature as part of a larger polynucleotide sequence. Preferably, the isolated nucleic acid molecule of the present invention is substantially free from any other contaminating nucleic acid molecule(s) or other contaminants that are found in its natural environment that would interfere with its use in polypeptide production or its therapeutic, diagnostic, prophylactic or research use.

The term “nucleic acid sequence” or “nucleic acid molecule” refers to a DNA or RNA sequence. The term encompasses molecules formed from any of the known base analogs of DNA and RNA such as, but not limited to 4-acetylcytosine, 8-hydroxy-N-6-methyladenosine, aziridinyl-cytosine, pseudoisocytosine, S-(carboxyhydroxylmethyl) uracil, 5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil, 5-carboxy-methylaminomethyluracil, dihydrouracil, inosine, N6-iso-pentenyladenine, 1-methyladenine, 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethyl-guanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-methyladenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyamino-methyl-2-thiouracil, beta-D-mannosylqueosine, 5′-methoxycarbonyl-methyluracil, 5-methoxyuracil, 2-methylthio-N-6-isopentenyladenine, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, N-uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, pseudouracil, queosine, 2-thiocytosine, and 2,6-diaminopurine.

The term “vector” is used to refer to any molecule (e.g. nucleic acid, plasmid, or virus) used to transfer coding information to a host cell.

The term “expression vector” refers to a vector that is suitable for transformation of a host cell and contains nucleic acid sequences that direct and/or control the expression of inserted heterologous nucleic acid sequences. Expression includes, but is nbt limited to, processes such as transcription, translation, and RNA splicing, if introns are present.

The term “operably linked” is used herein to refer to an arrangement of flanking sequences wherein the flanking sequences so described are configured or assembled so as to perform their usual function. Thus, a flanking sequence operably linked to a coding sequence may be capable of effecting the replication, transcription and/or translation of the coding sequence. For example, a coding sequence is operably linked to a promoter when the promoter is capable of directing transcription of that coding sequence. A flanking sequence need not be contiguous with the coding sequence, so long as it functions correctly. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the coding sequence and the promoter sequence can still be considered “operably linked” to the coding sequence.

The term “host cell” is used to refer to a cell which has been transformed, or is capable of being transformed with a nucleic acid sequence and then of expressing a selected gene of interest. The term includes the progeny of the parent cell, whether or not the progeny is identical in morphology or in genetic make-up to the original parent, so long as the selected gene is present.

The term “B7-L polypeptide” refers to a polypeptide comprising the amino acid sequence of SEQ ID NO: 2 and related polypeptides. Related polypeptides include B7-L polypeptide fragments, B7-L polypeptide orthologs, B7-L polypeptide variants, and B7-L polypeptide derivatives, which possess at least one activity of the polypeptide as set forth in SEQ ID NO: 2. B7-L polypeptides may be mature polypeptides, as defined herein, and may or may not have an amino-terminal methionine residue, depending on the method by which they are prepared.

The term “B7-L polypeptide fragment” refers to a polypeptide that comprises a truncation at the amino-terminus (with or without a leader sequence) and/or a truncation at the carboxyl-terminus of the polypeptide as set forth in SEQ ID NO: 2. The term “B7-L polypeptide fragment” also refers to amino-terminal and/or carboxyl-terminal truncations of B7-L polypeptide orthologs, B7-L polypeptide derivatives, or B7-L polypeptide variants, or to amino-terminal and/or carboxyl-terminal truncations of the polypeptides encoded by B7-L polypeptide allelic variants or B7-L polypeptide splice variants. B7-L polypeptide fragments may result from alternative RNA splicing or from in vivo protease activity. Membrane-bound forms of a B7-L polypeptide are also contemplated by the present invention. In preferred embodiments, truncations and/or deletions comprise about 10 amino acids, or about 20 amino acids, or about 50 amino acids, or about 75 amino acids, or about 100 amino acids, or more than about 100 amino acids. The polypeptide fragments so produced will comprise about 25 contiguous amino acids, or about 50 amino acids, or about 75 amino acids, or about 100 amino acids, or about 150 amino acids, or about 200 amino acids. Such B7-L polypeptide fragments may optionally comprise an amino-terminal methionine residue. It will be appreciated that such fragments can be used, for example, to generate antibodies to B7-L polypeptides.

The term “B7-L polypeptide ortholog” refers to a polypeptide from another species that corresponds to B7-L polypeptide amino acid sequence as set forth in SEQ ID NO: 2. For example, mouse and human B7-L polypeptides are considered orthologs of each other.

The term “B7-L polypeptide variants” refers to B7-L polypeptides comprising amino acid sequences having one or more amino acid sequence substitutions, deletions (such as internal deletions and/or B7-L polypeptide fragments), and/or additions (such as internal additions and/or B7-L fusion polypeptides) as compared to the B7-L polypeptide amino acid sequence set forth in SEQ ID NO: 2 (with or without a leader sequence). Variants may be naturally occurring (e.g., B7-L polypeptide allelic variants, B7-L polypeptide orthologs, and B7-L polypeptide splice variants) or artificially constructed. Such B7-L polypeptide variants may be prepared from the corresponding nucleic acid molecules having a DNA sequence that varies accordingly from the DNA sequence as set forth in SEQ ID NO: 1. In preferred embodiments, the variants have from 1 to 3, or from 1 to 5, or from 1 to 10, or from 1 to 15, or from 1 to 20, or from 1 to 25, or from 1 to 50, or from 1 to 75, or from 1 to 100, or more than 100 amino acid substitutions, insertions, additions and/or deletions, wherein the substitutions may be conservative, or non-conservative, or any combination thereof.

The term “B7-L polypeptide derivatives” refers to the polypeptide as set forth in SEQ ID NO: 2, B7-L polypeptide fragments, B7-L polypeptide orthologs, or B7-L polypeptide variants, as defined herein, that have been chemically modified. The term “B7-L polypeptide derivatives” also refers to the polypeptides encoded by B7-L polypeptide allelic variants or B7-L polypeptide splice variants, as defined herein, that have been chemically modified.

The term “mature B7-L polypeptide” refers to a B7-L polypeptide lacking a leader sequence. A mature B7-L polypeptide may also include other modifications such as proteolytic processing of the amino-terminus (with or without a leader sequence) and/or the carboxyl-terminus, cleavage of a smaller polypeptide from a larger precursor, N-linked and/or O-linked glycosylation, and the like. An exemplary mature B7-L polypeptide is depicted by the amino acid sequence of SEQ ID NO: 3.

The term “B7-L fusion polypeptide” refers to a fusion of one or more amino acids (such as a heterologous protein or peptide) at the amino- or carboxyl-terminus of the polypeptide as set forth in SEQ ID NO: 2, B7-L polypeptide fragments, B7-L polypeptide orthologs, B7-L polypeptide variants, or B7-L derivatives, as defined herein. The term “B7-L fusion polypeptide” also refers to a fusion of one or more amino acids at the amino- or carboxyl-terminus of the polypeptide encoded by B7-L polypeptide allelic variants or B7-L polypeptide splice variants, as defined herein.

The term “biologically active B7-L polypeptides” refers to B7-L polypeptides having at least one activity characteristic of the polypeptide comprising the amino acid sequence of SEQ ID NO: 2. In addition, a B7-L polypeptide may be active as an immunogen; that is, the B7-L polypeptide contains at least one epitope to which antibodies may be raised.

The term “isolated polypeptide” refers to a polypeptide of the present invention that (1) has been separated from at least about 50 percent of polynucleotides, lipids, carbohydrates, or other materials with which it is naturally found when isolated from the source cell, (2) is not linked (by covalent or noncovalent interaction) to all or a portion of a polypeptide to which the “isolated polypeptide” is linked in nature, (3) is operably linked (by covalent or noncovalent interaction) to a polypeptide with which it is not linked in nature, or (4) does not occur in nature. Preferably, the isolated polypeptide is substantially free from any other contaminating polypeptides or other contaminants that are found in its natural environment that would interfere with its therapeutic, diagnostic, prophylactic or research use.

The term “identity,” as known in the art, refers to a relationship between the sequences of two or more polypeptide molecules or two or more nucleic acid molecules, as determined by comparing the sequences. In the art, “identity” also means the degree of sequence relatedness between nucleic acid molecules or polypeptides, as the case may be, as determined by the match between strings of two or more nucleotide or two or more amino acid sequences. “Identity” measures the percent of identical matches between the smaller of two or more sequences with gap alignments (if any) addressed by a particular mathematical model or computer program (i.e., “algorithms”).

The term “similarity” is a related concept, but in contrast to “identity,” “similarity” refers to a measure of relatedness that includes both identical matches and conservative substitution matches. If two polypeptide sequences have, for example, 10/20 identical amino acids, and the remainder are all non-conservative substitutions, then the percent identity and similarity would both be 50%. If in the same example, there are five more positions where there are conservative substitutions, then the percent identity remains 50%, but the percent similarity would be 75% ( 15/20). Therefore, in cases where there are conservative substitutions, the percent similarity between two polypeptides will be higher than the percent identity between those two polypeptides.

The term “naturally occurring” or “native” when used in connection with biological materials such as nucleic acid molecules, polypeptides, host cells, and the like, refers to materials which are found in nature and are not manipulated by man. Similarly, “non-naturally occurring” or “non-native” as used herein refers to a material that is not found in nature or that has been structurally modified or synthesized by man.

The terms “effective amount” and “therapeutically effective amount” each refer to the amount of a B7-L polypeptide or B7-L nucleic acid molecule used to support an observable level of one or more biological activities of the B7-L polypeptides as set forth herein.

The term “pharmaceutically acceptable carrier” or “physiologically acceptable carrier” as used herein refers to one or more formulation materials suitable for accomplishing or enhancing the delivery of the B7-L polypeptide, B7-L nucleic acid molecule, or B7-L selective binding agent as a pharmaceutical composition.

The term “antigen” refers to a molecule or a portion of a molecule capable of being bound by a selective binding agent, such as an antibody, and additionally capable of being used in an animal to produce antibodies capable of binding to an epitope of that antigen. An antigen may have one or more epitopes.

The term “selective binding agent” refers to a molecule or molecules having specificity for a B7-L polypeptide. As used herein, the terms, “specific” and “specificity” refer to the ability of the selective binding agents to bind to human B7-L polypeptides and not to bind to human non-B7-L polypeptides. It will be appreciated, however, that the selective binding agents may also bind orthologs of the polypeptide as set forth in SEQ ID NO: 2, that is, interspecies versions thereof, such as mouse and rat B7-L polypeptides.

The term “transduction” is used to refer to the transfer of genes from one bacterium to another, usually by a phage. “Transduction” also refers to the acquisition and transfer of eukaryotic cellular sequences by retroviruses.

The term “transfection” is used to refer to the uptake of foreign or exogenous DNA by a cell, and a cell has been “transfected” when the exogenous DNA has been introduced inside the cell membrane. A number of transfection techniques are well known in the art and are disclosed herein. See, e.g., Graham et al., 1973 , Virology 52:456; Sambrook et al., Molecular Cloning, A Laboratory Manual (Cold Spring Harbor Laboratories, 1989); Davis et al., Basic Methods in Molecular Biology (Elsevier, 1986); and Chu et al., 1981 , Gene 13:197. Such techniques can be used to introduce one or more exogenous DNA moieties into suitable host cells.

The term “tansformation” as used herein refers to a change in a cell's genetic characteristics, and a cell has been transformed when it has been modified to contain a new DNA. For example, a cell is transformed where it is genetically modified from its native state. Following transfection or transduction, the transforming DNA may recombine with that of the cell by physically integrating into a chromosome of the cell, may be maintained transiently as an episomal element without being replicated, or may replicate independently as a plasmid. A cell is considered to have been stably transformed when the DNA is replicated with the division of the cell.

Relatedness of Nucleic Acid Molecules and/or Polypeptides

It is understood that related nucleic acid molecules include allelic or splice variants of the nucleic acid molecule of SEQ ID NO: 1, and include sequences which are complementary to any of the above nucleotide sequences. Related nucleic acid molecules also include a nucleotide sequence encoding a polypeptide comprising or consisting essentially of a substitution, modification, addition and/or deletion of one or more amino acid residues compared to the polypeptide as set forth in SEQ ID NO: 2. Such related B7-L polypeptides may comprise, for example, an addition and/or a deletion of one or more N-linked or O-linked glycosylation sites or an addition and/or a deletion of one or more cysteine residues.

Related nucleic acid molecules also include fragments of B7-L nucleic acid molecules which encode a polypeptide of at least about 25 contiguous amino acids, or about 50 amino acids, or about 75 amino acids, or about 100 amino acids, or about 150 amino acids, or about 200 amino acids, or more than 200 amino acid residues of the B7-L polypeptide of SEQ ID NO: 2.

In addition, related B7-L nucleic acid molecules also include those molecules which comprise nucleotide sequences which hybridize under moderately or highly stringent conditions as defined herein with the fully complementary sequence of the B7-L nucleic acid molecule of SEQ ID NO: 1, or of a molecule encoding a polypeptide, which polypeptide comprises the amino acid sequence as shown in SEQ ID NO: 2, or of a nucleic acid fragment as defined herein, or of a nucleic acid fragment encoding a polypeptide as defined herein. Hybridization probes may be prepared using the B7-L sequences provided herein to screen cDNA, genomic or synthetic DNA libraries for related sequences. Regions of the DNA and/or amino acid sequence of B7-L polypeptide that exhibit significant identity to known sequences are readily determined using sequence alignment algorithms as described herein and those regions may be used to design probes for screening.

The term “highly stringent conditions” refers to those conditions that are designed to permit hybridization of DNA strands whose sequences are highly complementary, and to exclude hybridization of significantly mismatched DNAs. Hybridization stringency is principally determined by temperature, ionic strength, and the concentration of denaturing agents such as formamide. Examples of “highly stringent conditions” for hybridization and washing are 0.015 M sodium chloride, 0.0015 M sodium citrate at 65-68° C. or 0.015 M sodium chloride, 0.0015 M sodium citrate, and 50% formamide at 42° C. See Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual (2nd ed., Cold Spring Harbor Laboratory, 1989); Anderson et al., Nucleic Acid Hybridisation: A Practical Approach Ch. 4 (IRL Press Limited).

More stringent conditions (such as higher temperature, lower ionic strength, higher formamide, or other denaturing agent) may also be used—however, the rate of hybridization will be affected. Other agents may be included in the hybridization and washing buffers for the purpose of reducing non-specific and/or background hybridization. Examples are 0.1% bovine serum albumin, 0.1% polyvinyl-pyrrolidone, 0.1% sodium pyrophosphate, 0.1% sodium dodecylsulfate, NaDodSO 4 , (SDS), ficoll, Denhardt's solution, sonicated salmon sperm DNA (or another non-complementary DNA), and dextran sulfate, although other suitable agents can also be used. The concentration and types of these additives can be changed without substantially affecting the stringency of the hybridization conditions. Hybridization experiments are usually carried out at pH 6.8-7.4; however, at typical ionic strength conditions, the rate of hybridization is nearly independent of pH. See Anderson et al., Nucleic Acid Hybridisation: A Practical Approach Ch. 4 (IRL Press Limited).

Factors affecting the stability of DNA duplex include base composition, length, and degree of base pair mismatch. Hybridization conditions can be adjusted by one skilled in the art in order to accommodate these variables and allow DNAs of different sequence relatedness to form hybrids. The melting temperature of a perfectly matched DNA duplex can be estimated by the following equation:
T m (° C.)=81.5+16.6(log[Na+])+0.41(% G+C )−600 /N− 0.72(% formamide)
where N is the length of the duplex formed, [Na+] is the molar concentration of the sodium ion in the hybridization or washing solution, % G+C is the percentage of (guanine+cytosine) bases in the hybrid. For imperfectly matched hybrids, the melting temperature is reduced by approximately 1° C. for each 1% mismatch.

The term “moderately stringent conditions” refers to conditions under which a DNA duplex with a greater degree of base pair mismatching than could occur under “highly stringent conditions” is able to form. Examples of typical “moderately stringent conditions” are 0.015 M sodium chloride, 0.0015 M sodium citrate at 50-65° C. or 0.015 M sodium chloride, 0.0015 M sodium citrate, and 20% formamide at 37-50° C. By way of example, “moderately stringent conditions” of 50° C. in 0.015 M sodium ion will allow about a 21% mismatch.

It will be appreciated by those skilled in the art that there is no absolute distinction between “highly stringent conditions” and “moderately stringent conditions.” For example, at 0.015 M sodium ion (no formamide), the melting temperature of perfectly matched long DNA is about 71° C. With a wash at 65° C. (at the same ionic strength), this would allow for approximately a 6% mismatch. To capture more distantly related sequences, one skilled in the art can simply lower the temperature or raise the ionic strength.

A good estimate of the melting temperature in 1M NaCl* for oligonucleotide probes up to about 20 nt is given by:
Tm =2° C. per A - T base pair+4° C. per G - C base pair
*The sodium ion concentration in 6× salt sodium citrate (SSC) is 1M. See Suggs et al., Developmental Biology Using Purified Genes 683 (Brown and Fox, eds., 1981).

High stringency washing conditions for oligonucleotides are usually at a temperature of 0-5° C. below the Tm of the oligonucleotide in 6× SSC, 0.1% SDS.

In another embodiment, related nucleic acid molecules comprise or consist of a nucleotide sequence that is at least about 70 percent identical to the nucleotide sequence as shown in SEQ ID NO: 1. In preferred embodiments, the nucleotide sequences are about 75 percent, or about 80 percent, or about 85 percent, or about 90 percent, or about 95, 96, 97, 98, or 99 percent identical to the nucleotide sequence as shown in SEQ ID NO: 1. Related nucleic acid molecules encode polypeptides possessing at least one activity of the polypeptide set forth in SEQ ID NO: 2.

Differences in the nucleic acid sequence may result in conservative and/or non-conservative modifications of the amino acid sequence relative to the amino acid sequence of SEQ ID NO: 2.

Conservative modifications to the amino acid sequence of SEQ ID NO: 2 (and the corresponding modifications to the encoding nucleotides) will produce a polypeptide having functional and chemical characteristics similar to those of B7-L polypeptides. In contrast, substantial modifications in the functional and/or chemical characteristics of B7-L polypeptides may be accomplished by selecting substitutions in the amino acid sequence of SEQ ID NO: 2 that differ significantly in their effect on maintaining (a) the structure of the molecular backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain.

For example, a “conservative amino acid substitution” may involve a substitution of a native amino acid residue with a normative residue such that there is little or no effect on the polarity or charge of the amino acid residue at that position. Furthermore, any native residue in the polypeptide may also be substituted with alanine, as has been previously described for “alanine scanning mutagenesis.”

Conservative amino acid substitutions also encompass non-naturally occurring amino acid residues that are typically incorporated by chemical peptide synthesis rather than by synthesis in biological systems. These include peptidomimetics, and other reversed or inverted forms of amino acid moieties.

Naturally occurring residues may be divided into classes based on common side chain properties:

    • 1) hydrophobic: norleucine, Met, Ala, Val, Leu, Ile;
    • 2) neutral hydrophilic: Cys, Ser, Thr;
    • 3) acidic: Asp, Glu;
    • 4) basic: Asn, Gln, His, Lys, Arg;
    • 5) residues that influence chain orientation: Gly, Pro; and
    • 6) aromatic: Trp, Tyr, Phe.

For example, non-conservative substitutions may involve the exchange of a member of one of these classes for a member from another class. Such substituted residues may be introduced into regions of the human B7-L polypeptide that are homologous with non-human B7-L polypeptides, or into the non-homologous regions of the molecule.

In making such changes, the hydropathic index of amino acids may be considered. Each amino acid has been assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics. The hydropathic indices are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (−0.4); threonine (−0.7); serine (−0.8); tryptophan (−0.9); tyrosine (−1.3); proline (−1.6); histidine (−3.2); glutamate (−3.5); glutamine (−3.5); aspartate (−3.5); asparagine (−3.5); lysine (−3.9); and arginine (−4.5).

The importance of the hydropathic amino acid index in conferring interactive biological function on a protein is generally understood in the art (Kyte et al., 1982 , J. Mol. Biol. 157:105-31). It is known that certain amino acids may be substituted for other amino acids having a simiar hydropathic index or score and still retain a similar biological activity. In making changes based upon the hydropathic index, the substitution of amino acids whose hydropathic indices are within ±2 is preferred, those that are within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred.

It is also understood in the art that the substitution of like amino acids can be made effectively on the basis of hydrophilicity, particularly where the biologically functionally equivalent protein or peptide thereby created is intended for use in immunological embodiments, as in the present case. The greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with its immunogenicity and antigenicity, i.e., with a biological property of the protein.

The following hydrophilicity values have been assigned to these amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0±1); glutamate (+3.0±1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (−0.4); proline (−0.5±1); alanine (−0.5); histidine (−0.5); cysteine (−1.0); methionine (−1.3); valine (−1.5); leucine (−1.8); isoleucine (−1.8); tyrosine (−2.3); phenylalanine (−2.5); and tryptophan (−3.4). In making changes based upon similar hydrophilicity values, the substitution of amino acids whose hydrophilicity values are within ±2 is preferred, those that are within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred. One may also identify epitopes from primary amino acid sequences on the basis of hydrophilicity. These regions are also referred to as “epitopic core regions.”

Desired amino acid substitutions (whether conservative or non-conservative) can be determined by those skilled in the art at the time such substitutions are desired. For example, amino acid substitutions can be used to identify important residues of the B7-L polypeptide, or to increase or decrease the affinity of the B7-L polypeptides described herein. Exemplary amino acid substitutions are set forth in Table I.

TABLE I
Amino Acid Substitutions
Original Exemplary Preferred
Residues Substitutions Substitutions
Ala Val, Leu, Ile Val
Arg Lys, Gln, Asn Lys
Asn Gln Gln
Asp Glu Glu
Cys Ser, Ala Ser
Gln Asn Asn
Glu Asp Asp
Gly Pro, Ala Ala
His Asn, Gln, Lys, Arg Arg
Ile Leu, Val, Met, Ala, Leu
Phe, Norleucine
Leu Norleucine, Ile, Ile
Val, Met, Ala, Phe
Lys Arg, 1,4 Diamino-butyric Arg
Acid, Gln, Asn
Met Leu, Phe, Ile Leu
Phe Leu, Val, Ile, Ala, Leu
Tyr
Pro Ala Gly
Ser Thr, Ala, Cys Thr
Thr Ser Ser
Trp Tyr, Phe Tyr
Tyr Trp, Phe, Thr, Ser Phe
Val Ile, Met, Leu, Phe, Leu
Ala, Norleucine

A skilled artisan will be able to determine suitable variants of the polypeptide as set forth in SEQ ID NO: 2 using well-known techniques. For identifying suitable areas of the molecule that may be changed without destroying biological activity, one skilled in the art may target areas not believed to be important for activity. For example, when similar polypeptides with similar activities from the same species or from other species are known, one skilled in the art may compare the amino acid sequence of a B7-L polypeptide to such similar polypeptides. With such a comparison, one can identify residues and portions of the molecules that are conserved among similar polypeptides. It will be appreciated that changes in areas of the B7-L molecule that are not conserved relative to such similar polypeptides would be less likely to adversely affect the biological activity and/or structure of a B7-L polypeptide. One skilled in the art would also know that, even in relatively conserved regions, one may substitute chemically similar amino acids for the naturally occurring residues while retaining activity (conservative amino acid residue substitutions). Therefore, even areas that may be important for biological activity or for structure may be subject to conservative amino acid substitutions without destroying the biological activity or without adversely affecting the polypeptide structure.

Additionally, one skilled in the art can review structure-function studies identifying residues in similar polypeptides that are important for activity or structure. In view of such a comparison, one can predict the importance of amino acid residues in a B7-L polypeptide that correspond to amino acid residues that are important for activity or structure in similar polypeptides. One skilled in the art may opt for chemically similar amino acid substitutions for such predicted important amino acid residues of B7-L polypeptides.

One skilled in the art can also analyze the three-dimensional structure and amino acid sequence in relation to that structure in similar polypeptides. In view of such information, one skilled in the art may predict the alignment of amino acid residues of B7-L polypeptide with respect to its three dimensional structure. One skilled in the art may choose not to make radical changes to amino acid residues predicted to be on the surface of the protein, since such residues may be involved in important interactions with other molecules. Moreover, one skilled in the art may generate test variants containing a single amino acid substitution at each amino acid residue. The variants could be screened using activity assays known to those with skill in the art. Such variants could be used to gather information about suitable variants. For example, if one discovered that a change to a particular amino acid residue resulted in destroyed, undesirably reduced, or unsuitable activity, variants with such a change would be avoided. In other words, based on information gathered from such routine experiments, one skilled in the art can readily determine the amino acids where further substitutions should be avoided either alone or in combination with other mutations.

A number of scientific publications have been devoted to the prediction of secondary structure. See Moult, 1996 , Curr. Opin. Biotechnol. 7:422-27; Chou et al., 1974 , Biochemistry 13:222-45; Chou et al., 1974 , Biochemistry 113:211-22; Chou et al., 1978 , Adv. Enzymol Relat. Areas Mol. Biol. 47:45-48; Chou et al., 1978 , Ann. Rev. Biochem. 47:251-276; and Chou et al., 1979 , Biophys. J. 26:367-84. Moreover, computer programs are currently available to assist with predicting secondary structure. One method of predicting secondary structure is based upon homology modeling. For example, two polypeptides or proteins that have a sequence identity of greater than 30%, or similarity greater than 40%, often have similar structural topologies. The recent growth of the protein structural database (PDB) has provided enhanced predictability of secondary structure, including the potential number of folds within the structure of a polypeptide or protein. See Holm et al., 1999 , Nucleic Acids Res. 27:244-47. It has been suggested that there are a limited number of folds in a given polypeptide or protein and that once a critical number of structures have been resolved, structural prediction will become dramatically more accurate (Brenner et al., 1997 , Curr. Opin. Struct. Biol. 7:369-76).

Additional methods of predicting secondary structure include “threading” (Jones, 1997 , Curr. Opin. Struct. Biol. 7:377-87; Sippl et al., 1996 , Structure 4:15-19), “profile analysis” (Bowie et al., 1991 , Science, 253:164-70; Gribskov et al., 1990 , Methods Enzymol. 183:146-59; Gribskov et al., 1987 , Proc. Nat. Acad. Sci. U.S.A. 84:4355-58), and “evolutionary linkage” (See Holm et al., supra, and Brenner et al., supra).

Preferred B7-L polypeptide variants include glycosylation variants wherein the number and/or type of glycosylation sites have been altered compared to the amino acid sequence set forth in SEQ ID NO: 2. In one embodiment, B7-L polypeptide variants comprise a greater or a lesser number of N-linked glycosylation sites than the amino acid sequence set forth in SEQ ID NO: 2. An N-linked glycosylation site is characterized by the sequence: Asn-X-Ser or Asn-X-Thr, wherein the amino acid residue designated as X may be any amino acid residue except proline. The substitution of amino acid residues to create this sequence provides a potential new site for the addition of an N-linked carbohydrate chain. Alternatively, substitutions that eliminate this sequence will remove an existing N-linked carbohydrate chain. Also provided is a rearrangement of N-linked carbohydrate chains wherein one or more N-linked glycosylation sites (typically those that are naturally occurring) are eliminated and one or more new N-linked sites are created. Additional preferred B7-L variants include cysteine variants, wherein one or more cysteine residues are deleted or substituted with another amino acid (e.g., serine) as compared to the amino acid sequence set forth in SEQ ID NO: 2. Cysteine variants are useful when B7-L polypeptides must be refolded into a biologically active conformation such as after the isolation of insoluble inclusion bodies. Cysteine variants generally have fewer cysteine residues than the native protein, and typically have an even number to minimize interactions resulting from unpaired cysteines.

In other embodiments, B7-L polypeptide variants comprise an amino acid sequence as set forth in SEQ ID NO: 2 with at least one amino acid insertion and wherein the polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2, or an amino acid sequence as set forth in SEQ ID NO: 2 with at least one amino acid deletion and wherein the polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2. B7-L polypeptide variants also comprise an amino acid sequence as set forth in SEQ ID NO: 2 wherein the polypeptide has a carboxyl- and/or amino-terminal truncation and further wherein the polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2. B7-L polypeptide variants further comprise an amino acid sequence as set forth in SEQ ID NO: 2 with at least one modification selected from the group consisting of amino acid substitutions, amino acid insertions, amino acid deletions, carboxyl-terminal truncations, and amino-terminal truncations and wherein the polypeptide has an activity of the polypeptide set forth in SEQ ID NO: 2.

In further embodiments, B7-L polypeptide variants comprise an amino acid sequence that is at least about 70 percent identical to the amino acid sequence as set forth in SEQ ID NO: 2. In preferred embodiments, B7-L polypeptide variants comprise an amino acid sequence that is about 75 percent, or about 80 percent, or about 85 percent, or about 90 percent, or about 95, 96, 97, 98, or 99 percent identical to the amino acid sequence as set forth in SEQ ID NO: 2. B7-L polypeptide variants possess at least one activity of the polypeptide set forth in SEQ ID NO: 2.

In addition, the polypeptide comprising the amino acid sequence of SEQ ID NO: 2, or other B7-L polypeptide, may be fused to a homologous polypeptide to form a homodimer or to a heterologous polypeptide to form a heterodimer. Heterologous peptides and polypeptides include, but are not limited to: an epitope to allow for the detection and/or isolation of a B7-L fusion polypeptide; a transmembrane receptor protein or a portion thereof, such as an extracellular domain or a transmembrane and intracellular domain; a ligand or a portion thereof which binds to a transmembrane receptor protein; an enzyme or portion thereof which is catalytically active; a polypeptide or peptide which promotes oligomerization, such as a leucine zipper domain; a polypeptide or peptide which increases stability, such as an immunoglobulin constant region; and a polypeptide which has a therapeutic activity different from the polypeptide comprising the amino acid sequence as set forth in SEQ ID NO: 2, or other B7-L polypeptide.

Fusions can be made either at the amino-terminus or at the carboxyl-terminus of the polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 2, or other B7-L polypeptide. Fusions may be direct with no linker or adapter molecule or may be through a linker or adapter molecule. A linker or adapter molecule may be one or more amino acid residues, typically from about 20 to about 50 amino acid residues. A linker or adapter molecule may also be designed with a cleavage site for a DNA restriction endonuclease or for a protease to allow for the separation of the fused moieties. It will be appreciated that once constructed, the fusion polypeptides can be derivatized according to the methods described herein.

In a further embodiment of the invention, the polypeptide comprising the amino acid sequence of SEQ ID NO: 2, or other B7-L polypeptide, is fused to one or more domains of an Fc region of human IgG. Antibodies comprise two functionally independent parts, a variable domain known as “Fab,” that binds an antigen, and a constant domain known as “Fc,” that is involved in effector functions such as complement activation and attack by phagocytic cells. An Fc has a long serum half-life, whereas an Fab is short-lived. Capon et al., 1989 , Nature 337:525-31. When constructed together with a therapeutic protein, an Fc domain can provide longer half-life or incorporate such functions as Fc receptor binding, protein A binding, complement fixation, and perhaps even placental transfer. Id. Table II summarizes the use of certain Fc fusions known in the art.

TABLE II
Fc Fusion with Therapeutic Proteins
Form of Fc Fusion partner Therapeutic implications Reference
IgG1 N-terminus of Hodgkin's disease; U.S. Pat. No.
CD30-L anaplastic lymphoma; T- 5,480,981
cell leukemia
Murine Fcγ2a IL-10 anti-inflammatory; Zheng et al., 1995, J.
transplant rejection Immunol. 154: 5590-600
IgG1 TNF receptor septic shock Fisher et al., 1996, N.
Engl. J. Med. 334: 1697-1702;
Van Zee et al.,
1996, J. Immunol.
156: 2221-30
IgG, IgA, IgM, TNF receptor inflammation, U.S. Pat. No.
or IgE autoimmune disorders 5,808,029
(excluding the
first domain)
IgG1 CD4 receptor AIDS Capon et al., 1989,
Nature 337: 525-31
IgG1, N-terminus anti-cancer, antiviral Harvill et al., 1995,
IgG3 of IL-2 Immunotech. 1: 95-105
IgG1 C-terminus of osteoarthritis; WO 97/23614
OPG bone density
IgG1 N-terminus of anti-obesity PCT/US 97/23183, filed
leptin December 11, 1997
Human Ig Cγ1 CTLA-4 autoimmune disorders Linsley, 1991, J. Exp.
Med., 174: 561-69

In one example, a human IgG hinge, CH2, and CH3 region may be fused at either the amino-terminus or carboxyl-terminus of the B7-L polypeptides using methods known to the skilled artisan. In another example, a human IgG hinge, CH2, and CH3 region may be fused at either the amino-terminus or carboxyl-terminus of a B7-L polypeptide fragment (e.g., the predicted extracellular portion of B7-L polypeptide).

The resulting B7-L fusion polypeptide may be purified by use of a Protein A affinity column. Peptides and proteins fused to an Fc region have been found to exhibit a substantially greater half-life in vivo than the unfused counterpart. Also, a fusion to an Fc region allows for dimerization/multimerization of the fusion polypeptide. The Fc region may be a naturally occurring Fc region, or may be altered to improve certain qualities, such as therapeutic qualities, circulation time, or reduced aggregation.

Identity and similarity of related nucleic acid molecules and polypeptides are readily calculated by known methods. Such methods include, but are not limited to those described in Computational Molecular Biology (A. M. Lesk, ed., Oxford University Press 1988); Biocomputing: Informatics and Genome Projects (D. W. Smith, ed., Academic Press 1993); Computer Analysis of Sequence Data (Part 1, A. M. Griffin and H. G. Griffin, eds., Humana Press 1994); G. von Heinle, Sequence Analysis in Molecular Biology (Academic Press 1987); Sequence Analysis Primer (M. Gribskov and J. Devereux, eds., M. Stockton Press 1991); and Carillo et al., 1988 , SIAM Applied Math., 48:1073.

Preferred methods to determine identity and/or similarity are designed to give the largest match between the sequences tested. Methods to determine identity and similarity are described in publicly available computer programs. Preferred computer program methods to determine identity and similarity between two sequences include, but are not limited to, the GCG program package, including GAP (Devereux et al., 1984 , Nucleic Acids Res. 12:387; Genetics Computer Group, University of Wisconsin, Madison, Wis.), BLASTP, BLASTN, and FASTA (Altschul et al., 1990 , J. Mol. Biol. 215:403-10). The BLASTX program is publicly available from the National Center for Biotechnology Information (NCBI) and other sources (Altschul et al., BLAST Manual (NCB NLM NIH, Bethesda, Md.); Altschul et al., 1990, supra). The well-known Smith Waterman algorithm may also be used to determine identity.

Certain alignment schemes for aligning two amino acid sequences may result in the matching of only a short region of the two sequences, and this small aligned region may have very high sequence identity even though there is no significant relationship between the two full-length sequences. Accordingly, in a preferred embodiment, the selected alignment method (GAP program) will result in an alignment that spans at least 50 contiguous amino acids of the claimed polypeptide.

For example, using the computer algorithm GAP (Genetics Computer Group, University of Wisconsin, Madison, Wis.), two polypeptides for which the percent sequence identity is to be determined are aligned for optimal matching of their respective amino acids (the “matched span,” as determined by the algorithm). A gap opening penalty (which is calculated as 3× the average diagonal; the “average diagonal” is the average of the diagonal of the comparison matrix being used; the “diagonal” is the score or number assigned to each perfect amino acid match by the particular comparison matrix) and a gap extension penalty (which is usually 0.1× the gap opening penalty), as well as a comparison matrix such as PAM 250 or BLOSUM 62 are used in conjunction with the algorithm. A standard comparison matrix is also used by the algorithm (see Dayhoff et al., 5 Atlas of Protein Sequence and Structure (Supp. 3 1978)(PAM250 comparison matrix); Henikoff et al., 1992 , Proc. Natl. Acad. Sci USA 89:10915-19 (BLOSUM 62 comparison matrix)).

Preferred parameters for polypeptide sequence comparison include the following:

    • Algorithm: Needleman and Wunsch, 1970 , J. Mol. Biol. 48:443-53;
    • Comparison matrix: BLOSUM 62 (Henikoff et al., supra);
    • Gap Penalty: 12
    • Gap Length Penalty: 4
    • Threshold of Similarity: 0
      The GAP program is useful with the above parameters. The aforementioned parameters are the default parameters for polypeptide comparisons (along with no penalty for end gaps) using the GAP algorithm.

Preferred parameters for nucleic acid molecule sequence comparison include the following:

    • Algorithm: Needleman and Wunsch, supra;
    • Comparison matrix: matches =+10, mismatch=0
    • Gap Penalty: 50
    • Gap Length Penalty: 3
      The GAP program is also useful with the above parameters. The aforementioned parameters are the default parameters for nucleic acid molecule comparisons.

Other exemplary algorithms, gap opening penalties, gap extension penalties, comparison matrices, and thresholds of similarity may be used, including those set forth in the Program Manual, Wisconsin Package, Version 9, September, 1997. The particular choices to be made will be apparent to those of skill in the art and will depend on the specific comparison to be made, such as DNA-to-DNA, protein-to-protein, protein-to-DNA; and additionally, whether the comparison is between given pairs of sequences (in which case GAP or BestFit are generally preferred) or between one sequence and a large database of sequences (in which case FASTA or BLASTA are preferred).

Nucleic Acid Molecules

The nucleic acid molecules encoding a polypeptide comprising the amino acid sequence of a B7-L polypeptide can readily be obtained in a variety of ways including, without limitation, chemical synthesis, cDNA or genomic library screening, expression library screening, and/or PCR amplification of cDNA.

Recombinant DNA methods used herein are generally those set forth in Sambrook et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1989) and/or Current Protocols in Molecular Biology (Ausubel et al., eds., Green Publishers Inc. and Wiley and Sons 1994). The invention provides for nucleic acid molecules as described herein and methods for obtaining such molecules.

Where a gene encoding the amino acid sequence of a B7-L polypeptide has been identified from one species, all or a portion of that gene may be used as a probe to identify orthologs or related genes from the same species. The probes or primers may be used to screen cDNA libraries from various tissue sources believed to express the B7-L polypeptide. In addition, part or all of a nucleic acid molecule having the sequence as set forth in SEQ ID NO: 1 may be used to screen a genomic library to identify and isolate a gene encoding the amino acid sequence of a B7-L polypeptide. Typically, conditions of moderate or high stringency will be employed for screening to minimize the number of false positives obtained from the screening.

Nucleic acid molecules encoding the amino acid sequence of B7-L polypeptides may also be identified by expression cloning which employs the detection of positive clones based upon a property of the expressed protein. Typically, nucleic acid libraries are screened by the binding an antibody or other binding partner (e.g., receptor or ligand) to cloned proteins that are expressed and displayed on a host cell surface. The antibody or binding partner is modified with a detectable label to identify those cells expressing the desired clone.

Recombinant expression techniques conducted in accordance with the descriptions set forth below may be followed to produce these polynucleotides and to express the encoded polypeptides. For example, by inserting a nucleic acid sequence that encodes the amino acid sequence of a B7-L polypeptide into an appropriate vector, one skilled in the art can readily produce large quantities of the desired nucleotide sequence. The sequences can then be used to generate detection probes or amplification primers. Alternatively, a polynucleotide encoding the amino acid sequence of a B7-L polypeptide can be inserted into an expression vector. By introducing the expression vector into an appropriate host, the encoded B7-L polypeptide may be produced in large amounts.

Another method for obtaining a suitable nucleic acid sequence is the polymerase chain reaction (PCR). In this method, cDNA is prepared from poly(A)+ RNA or total RNA using the enzyme reverse transcriptase. Two primers, typically complementary to two separate regions of cDNA encoding the amino acid sequence of a B7-L polypeptide, are then added to the cDNA along with a polymerase such as Taq polymerase, and the polymerase amplifies the cDNA region between the two primers.

Another means of preparing a nucleic acid molecule encoding the amino acid sequence of a B7-L polypeptide is chemical synthesis using methods well known to the skilled artisan such as those described by Engels et al., 1989 , Angew. Chem. Intl. Ed. 28:716-34. These methods include, inter alia, the phosphotriester, phosphoramidite, and H-phosphonate methods for nucleic acid synthesis. A preferred method for such chemical synthesis is polymer-supported synthesis using standard phosphoramidite chemistry. Typically, the DNA encoding the amino acid sequence of a B7-L polypeptide will be several hundred nucleotides in length. Nucleic acids larger than about 100 nucleotides can be synthesized as several fragments using these methods. The fragments can then be ligated together to form the full-length nucleotide sequence of a B7-L gene. Usually, the DNA fragment encoding the amino-terminus of the polypeptide will have an ATG, which encodes a methionine residue. This methionine may or may not be present on the mature form of the B7-L polypeptide, depending on whether the polypeptide produced in the host cell is designed to be secreted from that cell. Other methods known to the skilled artisan may be used as well.

In certain embodiments, nucleic acid variants contain codons which have been altered for optimal expression of a B7-L polypeptide in a given host cell. Particular codon alterations will depend upon the B7-L polypeptide and host cell selected for expression. Such “codon optimization” can be carried out by a variety of methods, for example, by selecting codons which are preferred for use in highly expressed genes in a given host cell. Computer algorithms which incorporate codon frequency tables such as “Eco_high.Cod” for codon preference of highly expressed bacterial genes may be used and are provided by the University of Wisconsin Package Version 9.0 (Genetics Computer Group, Madison, Wis.). Other useful codon frequency tables include “Celegans_high.cod,” “Celegans_low.cod,” “ Drosophila _high.cod,” “Human_high.cod,” “Maize_high.cod,” and “Yeast_high.cod.”

In some cases, it may be desirable to prepare nucleic acid molecules encoding B7-L polypeptide variants. Nucleic acid molecules encoding variants may be produced using site directed mutagenesis, PCR amplification, or other appropriate methods, where the primer(s) have the desired point mutations (see Sambrook et al., supra, and Ausubel et al., supra, for descriptions of mutagenesis techniques). Chemical synthesis using methods described by Engels et al., supra, may also be used to prepare such variants. Other methods known to the skilled artisan may be used as well.

Vectors and Host Cells

A nucleic acid molecule encoding the amino acid sequence of a B7-L polypeptide is inserted into an appropriate expression vector using standard ligation techniques. The vector is typically selected to be functional in the particular host cell employed (i.e., the vector is compatible with the host cell machinery such that amplification of the gene and/or expression of the gene can occur). A nucleic acid molecule encoding the amino acid sequence of a B7-L polypeptide may be amplified/expressed in prokaryotic, yeast, insect (baculovirus systems) and/or eukaryotic host cells. Selection of the host cell will depend in part on whether a B7-L polypeptide is to be post-translationally modified (e.g., glycosylated and/or phosphorylated). If so, yeast, insect, or mammalian host cells are preferable. For a review of expression vectors, see Meth. Enz., vol. 185 (D. V. Goeddel, ed., Academic Press 1990).

Typically, expression vectors used in any of the host cells will contain sequences for plasmid maintenance and for cloning and expression of exogenous nucleotide sequences. Such sequences, collectively referred to as “flanking sequences” in certain embodiments will typically include one or more of the following nucleotide sequences: a promoter, one or more enhancer sequences, an origin of replication, a transcriptional termination sequence, a complete intron sequence containing a donor and acceptor splice site, a sequence encoding a leader sequence for polypeptide secretion, a ribosome binding site, a polyadenylation sequence, a polylinker region for inserting the nucleic acid encoding the polypeptide to be expressed, and a selectable marker element. Each of these sequences is discussed below.

Optionally, the vector may contain a “tag”-encoding sequence, i.e., an oligonucleotide molecule located at the 5′ or 3′ end of the B7-L polypeptide coding sequence; the oligonucleotide sequence encodes polyHis (such as hexaHis), or another “tag” such as FLAG, HA (hemaglutinin influenza virus), or myc for which commercially available antibodies exist. This tag is typically fused to the polypeptide upon expression of the polypeptide, and can serve as a means for affinity purification of the B7-L polypeptide from the host cell. Affinity purification can be accomplished, for example, by column chromatography using antibodies against the tag as an affinity matrix. Optionally, the tag can subsequently be removed from the purified B7-L polypeptide by various means such as using certain peptidases for cleavage.

Flanking sequences may be homologous (i.e., from the same species and/or strain as the host cell), heterologous (i.e., from a species other than the host cell species or strain), hybrid (i.e., a combination of flanking sequences from more than one source), or synthetic, or the flanking sequences may be native sequences that normally function to regulate B7-L polypeptide expression. As such, the source of a flanking sequence may be any prokaryotic or eukaryotic organism, any vertebrate or invertebrate organism, or any plant, provided that the flanking sequence is functional in, and can be activated by, the host cell machinery.

Flanking sequences useful in the vectors of this invention may be obtained by any of several methods well known in the art. Typically, flanking sequences useful herein—other than the B7-L gene flanking sequences—will have been previously identified by mapping and/or by restriction endonuclease digestion and can thus be isolated from the proper tissue source using the appropriate restriction endonucleases. In some cases, the full nucleotide sequence of a flanking sequence may be known. Here, the flanking sequence may be synthesized using the methods described herein for nucleic acid synthesis or cloning.

Where all or only a portion of the flanking sequence is known, it may be obtained using PCR and/or by screening a genomic library with a suitable oligonucleotide and/or flanking sequence fragment from the same or another species. Where the flanking sequence is not known, a fragment of DNA containing a flanking sequence may be isolated from a larger piece of DNA that may contain, for example, a coding sequence or even another gene or genes. Isolation may be accomplished by restriction endonuclease digestion to produce the proper DNA fragment followed by isolation using agarose gel purification, Qiagen® column chromatography (Chatsworth, Calif.), or other methods known to the skilled artisan. The selection of suitable enzymes to accomplish this purpose will be readily apparent to one of ordinary skill in the art.

An origin of replication is typically a part of those prokaryotic expression vectors purchased commercially, and the origin aids in the amplification of the vector in a host cell. Amplification of the vector to a certain copy number can, in some cases, be important for the optimal expression of a B7-L polypeptide. If the vector of choice does not contain an origin of replication site, one may be chemically synthesized based on a known sequence, and ligated into the vector. For example, the origin of replication from the plasmid pBR322 (New England Biolabs, Beverly, Mass.) is suitable for most gram-negative bacteria and various origins (e.g., SV40, polyoma, adenovirus, vesicular stomatitus virus (VSV), or papillomaviruses such as HPV or BPV) are useful for cloning vectors in mammalian cells. Generally, the origin of replication component is not needed for mammalian expression vectors (for example, the SV40 origin is often used only because it contains the early promoter).

A transcription termination sequence is typically located 3′ of the end of a polypeptide coding region and serves to terminate transcription. Usually, a transcription termination sequence in prokaryotic cells is a G-C rich fragment followed by a poly-T sequence. While the sequence is easily cloned from a library or even purchased commercially as part of a vector, it can also be readily synthesized using methods for nucleic acid synthesis such as those described herein.

A selectable marker gene element encodes a protein necessary for the survival and growth of a host cell grown in a selective culture medium. Typical selection marker genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin, tetracycline, or kanamycin for prokaryotic host cells; (b) complement auxotrophic deficiencies of the cell; or (c) supply critical nutrients not available from complex media. Preferred selectable markers are the kanamycin resistance gene, the ampicillin resistance gene, and the tetracycline resistance gene. A neomycin resistance gene may also be used for selection in prokaryotic and eukaryotic host cells.

Other selection genes may be used to amplify the gene that will be expressed. Amplification is the process wherein genes that are in greater demand for the production of a protein critical for growth are reiterated in tandem within the chromosomes of successive generations of recombinant cells. Examples of suitable selectable markers for mammalian cells include dihydrofolate reductase (DHFR) and thymidine kinase. The mammalian cell transformants are placed under selection pressure wherein only the transformants are uniquely adapted to survive by virtue of the selection gene present in the vector. Selection pressure is imposed by culturing the transformed cells under conditions in which the concentration of selection agent in the medium is successively changed, thereby leading to the amplification of both the selection gene and the DNA that encodes a B7-L polypeptide. As a result, increased quantities of B7-L polypeptide are synthesized from the amplified DNA.

A ribosome binding site is usually necessary for translation initiation of mRNA and is characterized by a Shine-Dalgamo sequence (prokaryotes) or a Kozak sequence (eukaryotes). The element is typically located 3′ to the promoter and 5′ to the coding sequence of a B7-L polypeptide to be expressed. The Shine-Dalgarno sequence is varied but is typically a polypurine (i.e., having a high A-G content). Many Shine-Dalgamo sequences have been identified, each of which can be readily synthesized using methods set forth herein and used in a prokaryotic vector.

A leader, or signal, sequence may be used to direct a B7-L polypeptide out of the host cell. Typically, a nucleotide sequence encoding the signal sequence is positioned in the coding region of a B7-L nucleic acid molecule, or directly at the 5′ end of a B7-L polypeptide coding region. Many signal sequences have been identified, and any of those that are functional in the selected host cell may be used in conjunction with a B7-L nucleic acid molecule. Therefore, a signal sequence may be homologous (naturally occurring) or heterologous to the B7-L nucleic acid molecule. Additionally, a signal sequence may be chemically synthesized using methods described herein. In most cases, the secretion of a B7-L polypeptide from the host cell via the presence of a signal peptide will result in the removal of the signal peptide from the secreted B7-L polypeptide. The signal sequence may be a component of the vector, or it may be a part of a B7-L nucleic acid molecule that is inserted into the vector.

Included within the scope of this invention is the use of either a nucleotide sequence encoding a native B7-L polypeptide signal sequence joined to a B7-L polypeptide coding region or a nucleotide sequence encoding a heterologous signal sequence joined to a B7-L polypeptide coding region. The heterologous signal sequence selected should be one that is recognized and processed, i.e., cleaved by a signal peptidase, by the host cell. For prokaryotic host cells that do not recognize and process the native B7-L polypeptide signal sequence, the signal sequence is substituted by a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, penicillinase, or heat-stable enterotoxin II leaders. For yeast secretion, the native B7-L polypeptide signal sequence may be substituted by the yeast invertase, alpha factor, or acid phosphatase leaders. In mammalian cell expression the native signal sequence is satisfactory, although other mammalian signal sequences may be suitable.

In some cases, such as where glycosylation is desired in a eukaryotic host cell expression system, one may manipulate the various presequences to improve glycosylation or yield. For example, one may alter the peptidase cleavage site of a particular signal peptide, or add pro-sequences, which also may affect glycosylation. The final protein product may have, in the −1 position (relative to the first amino acid of the mature protein) one or more additional amino acids incident to expression, which may not have been totally removed. For example, the final protein product may have one or two amino acid residues found in the peptidase cleavage site, attached to the amino-terminus. Alternatively, use of some enzyme cleavage sites may result in a slightly truncated form of the desired B7-L polypeptide, if the enzyme cuts at such area within the mature polypeptide.

In many cases, transcription of a nucleic acid molecule is increased by the presence of one or more introns in the vector; this is particularly true where a polypeptide is produced in eukaryotic host cells, especially mammalian host cells. The introns used may be naturally occurring within the B7-L gene especially where the gene used is a full-length genomic sequence or a fragment thereof. Where the intron is not naturally occurring within the gene (as for most cDNAs), the intron may be obtained from another source. The position of the intron with respect to flanking sequences and the B7-L gene is generally important, as the intron must be transcribed to be effective. Thus, when a B7-L cDNA molecule is being transcribed, the preferred position for the intron is 3′ to the transcription start site and 5′ to the poly-A transcription termination sequence. Preferably, the intron or introns will be located on one side or the other (i.e., 5′ or 3′) of the cDNA such that it does not interrupt the coding sequence. Any intron from any source, including viral, prokaryotic and eukaryotic (plant or animal) organisms, may be used to practice this invention, provided that it is compatible with the host cell into which it is inserted. Also included herein are synthetic introns. Optionally, more than one intron may be used in the vector.

The expression and cloning vectors of the present invention will typically contain a promoter that is recognized by the host organism and operably linked to the molecule encoding the B7-L polypeptide. Promoters are untranscribed sequences located upstream (i.e., 5′) to the start codon of a structural gene (generally within about 100 to 1000 bp) that control the transcription of the structural gene. Promoters are conventionally grouped into one of two classes: inducible promoters and constitutive promoters. Inducible promoters initiate increased levels of transcription from DNA under their control in response to some change in culture conditions, such as the presence or absence of a nutrient or a change in temperature. Constitutive promoters, on the other hand, initiate continual gene product production; that is, there is little or no control over gene expression. A large number of promoters, recognized by a variety of potential host cells, are well known. A suitable promoter is operably linked to the DNA encoding B7-L polypeptide by removing the promoter from the source DNA by restriction enzyme digestion and inserting the desired promoter sequence into the vector. The native B7-L promoter sequence may be used to direct amplification and/or expression of a B7-L nucleic acid molecule. A heterologous promoter is preferred, however, if it permits greater transcription and higher yields of the expressed protein as compared to the native promoter, and if it is compatible with the host cell system that has been selected for use.

Promoters suitable for use with prokaryotic hosts include the beta-lactamase and lactose promoter systems; alkaline phosphatase; a tryptophan (trp) promoter system; and hybrid promoters such as the tac promoter. Other known bacterial promoters are also suitable. Their sequences have been published, thereby enabling one skilled in the art to ligate them to the desired DNA sequence, using linkers or adapters as needed to supply any useful restriction sites.

Suitable promoters for use with yeast hosts are also well known in the art. Yeast enhancers are advantageously used with yeast promoters. Suitable promoters for use with mammalian host cells are well known and include, but are not limited to, those obtained from the genomes of viruses such as polyoma virus, fowlpox virus, adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, retroviruses, hepatitis-B virus and most preferably Simian Virus 40 (SV40). Other suitable mammalian promoters include heterologous mammalian promoters, for example, heat-shock promoters and the actin promoter.

Additional promoters which may be of interest in controlling B7-L gene expression include, but are not limited to: the SV40 early promoter region (Bernoist and Chambon, 1981 , Nature 290:304-10); the CMV promoter; the promoter contained in the 3′ long terminal repeat of Rous sarcoma virus (Yamamoto, et al., 1980 , Cell 22:787-97); the herpes thymidine kinase promoter (Wagner et al., 1981 , Proc. Natl. Acad. Sci. U.S.A. 78:1444-45); the regulatory sequences of the metallothionine gene (Brinster et al., 1982 , Nature 296:39-42); prokaryotic expression vectors such as the beta-lactamase promoter (Villa-Kamaroff et al., 1978 , Proc. Natl. Acad. Sci. U.S.A., 75:3727-31); or the tac promoter (DeBoer et al., 1983 , Proc. Natl. Acad. Sci. U.S.A., 80:21-25). Also of interest are the following animal transcriptional control regions, which exhibit tissue specificity and have been utilized in transgenic animals: the elastase I gene control region which is active in pancreatic acinar cells (Swift et al., 1984 , Cell 38:639-46; Ornitz et al., 1986 , Cold Spring Harbor Symp. Quant. Biol. 50:399-409 (1986); MacDonald, 1987 , Hepatology 7:425-515); the insulin gene control region which is active in pancreatic beta cells (Hanahan, 1985 , Nature 315:115-22); the immunoglobulin gene control region which is active in lymphoid cells (Grosschedl et al., 1984 , Cell 38:647-58; Adames et al., 1985 , Nature 318:533-38; Alexander et al., 1987 , Mol. Cell. Biol., 7:1436-44); the mouse mammary tumor virus control region which is active in testicular, breast, lymphoid and mast cells (Leder et al., 1986 , Cell 45:485-95); the albumin gene control region which is active in liver (Pinkert et al., 1987 , Genes and Devel. 1:268-76); the alpha-feto-protein gene control region which is active in liver (Krumlauf et al., 1985 , Mol. Cell. Biol., 5:1639-48; Hammer et al., 1987 , Science 235:53-58); the alpha 1-antitrypsin gene control region which is active in the liver (Kelsey et al., 1987 , Genes and Devel. 1:161-71); the beta-globin gene control region which is active in myeloid cells (Mogram et al., 1985 , Nature 315:338-40; Kollias et al., 1986 , Cell 46:89-94); the myelin basic protein gene control region which is active in oligodendrocyte cells in the brain (Readhead et al., 1987 , Cell 48:703-12); the myosin light chain-2 gene control region which is active in skeletal muscle (Sani, 1985 , Nature 314:283-86); and the gonadotropic releasing hormone gene control region which is active in the hypothalamus (Mason et al., 1986 , Science 234:1372-78).

An enhancer sequence may be inserted into the vector to increase the transcription of a DNA encoding a B7-L polypeptide of the present invention by higher eukaryotes. Enhancers are cis-acting elements of DNA, usually about 10-300 bp in length, that act on the promoter to increase transcription. Enhancers are relatively orientation and position independent. They have been found 5′ and 3′ to the transcription unit. Several enhancer sequences available from mammalian genes are known (e.g., globin, elastase, albumin, alpha-feto-protein and insulin). Typically, however, an enhancer from a virus will be used. The SV40 enhancer, the cytomegalovirus early promoter enhancer, the polyoma enhancer, and adenovirus enhancers are exemplary enhancing elements for the activation of eukaryotic promoters. While an enhancer may be spliced into the vector at a position 5′ or 3′ to a B7-L nucleic acid molecule, it is typically located at a site 5′ from the promoter.

Expression vectors of the invention may be constructed from a starting vector such as a commercially available vector. Such vectors may or may not contain all of the desired flanking sequences. Where one or more of the flanking sequences described herein are not already present in the vector, they may be individually obtained and ligated into the vector. Methods used for obtaining each of the flanking sequences are well known to one skilled in the art.

Preferred vectors for practicing this invention are those that are compatible with bacterial, insect, and mammalian host cells. Such vectors include, inter alia, pCRII, pCR3, and pcDNA3.1 (Invitrogen, San Diego, Calif.), pBSII (Stratagene, La Jolla, Calif.), pET15 (Novagen, Madison, Wis.), pGEX (Pharmacia Biotech, Piscataway, N.J.), pEGFP-N2 (Clontech, Palo Alto, Calif.), pETL (BlueBacII, Invitrogen), pDSR-alpha (International Pub. No. WO 90/14363) and pFastBacDual (Gibco-BRL, Grand Island, N.Y.).

Additional suitable vectors include, but are not limited to, cosmids, plasmids, or modified viruses, but it will be appreciated that the vector system must be compatible with the selected host cell. Such vectors include, but are not limited to plasmids such as Bluescript® plasmid derivatives (a high copy number ColE1-based phagemid; Stratagene Cloning Systems, La Jolla Calif.), PCR cloning plasmids designed for cloning Taq-amplified PCR products (e.g., TOPO™ TA Cloning® Kit and PCR2.1® plasmid derivatives; Invitrogen), and mammalian, yeast or virus vectors such as a baculovirus expression system (pBacPAK plasmid derivatives; Clontech).

After the vector has been constructed and a nucleic acid molecule encoding a B7-L polypeptide has been inserted into the proper site of the vector, the completed vector may be inserted into a suitable host cell for amplification and/or polypeptide expression. The transformation of an expression vector for a B7-L polypeptide into a selected host cell may be accomplished by well known methods including methods such as transfection, infection, calcium chloride, electroporation, microinjection, lipofection, DEAE-dextran method, or other known techniques. The method selected will in part be a function of the type of host cell to be used. These methods and other suitable methods are well known to the skilled artisan, and are set forth, for example, in Sambrook et al., supra.

Host cells may be prokaryotic host cells (such as E. coli ) or eukaryotic host cells (such as a yeast, insect, or vertebrate cell). The host cell, when cultured under appropriate conditions, synthesizes a B7-L polypeptide that can subsequently be collected from the culture medium (if the host cell secretes it into the medium) or directly from the host cell producing it (if it is not secreted). The selection of an appropriate host cell will depend upon various factors, such as desired expression levels, polypeptide modifications that are desirable or necessary for activity (such as glycosylation or phosphorylation) and ease of folding into a biologically active molecule.

A number of suitable host cells are known in the art and many are available from the American Type Culture Collection (ATCC), Manassas, Va. Examples include, but are not limited to, mammalian cells, such as Chinese hamster ovary cells (CHO), CHO DHFR(−) cells (Urlaub et al., 1980 , Proc. Natl. Acad. Sci. U.S.A. 97:4216-20), human embryonic kidney (HEK) 293 or 293T cells, or 3T3 cells. The selection of suitable mammalian host cells and methods for transformation, culture, amplification, screening, product production, and purification are known in the art. Other suitable mammalian cell lines, are the monkey COS-1 and COS-7 cell lines, and the CV-1 cell line. Further exemplary mammalian host cells include primate cell lines and rodent cell lines, including transformed cell lines. Normal diploid cells, cell strains derived from in vitro culture of primary tissue, as well as primary, explants, are also suitable. Candidate cells may be genotypically deficient in the selection gene, or may contain a dominantly acting selection gene. Other suitable mammalian cell lines include but are not limited to, mouse neuroblastoma N2A cells, HeLa, mouse L-929 cells, 3T3 lines derived from Swiss, Balb-c or NIH mice, BHK or HaK hamster cell lines. Each of these cell lines is known by and available to those skilled in the art of protein expression.

Similarly useful as host cells suitable for the present invention are bacterial cells. For example, the various strains of E. coli (e.g., HB101, DH5α, DH10, and MC1061) are well-known as host cells in the field of biotechnology. Various strains of B. subtilis, Pseudomonas spp., other Bacillus spp., Streptomyces spp., and the like may also be employed in this method.

Many strains of yeast cells known to those skilled in the art are also available as host cells for the expression of the polypeptides of the present invention. Preferred yeast cells include, for example, Saccharomyces cerivisae and Pichia pastoris.

Additionally, where desired, insect cell systems may be utilized in the methods of the present invention. Such systems are described, for example, in Kitts et al., 1993 , Biotechniques, 14:810-17; Lucklow, 1993 , Curr. Opin. Biotechnol. 4:564-72; and Lucklow et al., 1993 , J. Virol., 67:4566-79. Preferred insect cells are Sf-9 and Hi5 (Invitrogen).

One may also use transgenic animals to express glycosylated B7-L polypeptides. For example, one may use a transgenic milk-producing animal (a cow or goat, for example) and obtain the present glycosylated polypeptide in the animal milk. One may also use plants to produce B7-L polypeptides, however, in general, the glycosylation occurring in plants is different from that produced in mammalian cells, and may result in a glycosylated product which is not suitable for human therapeutic use.

Polypeptide Production

Host cells comprising a B7-L polypeptide expression vector may be cultured using standard media well known to the skilled artisan. The media will usually contain all nutrients necessary for the growth and survival of the cells. Suitable media for culturing E. coli cells include, for example, Luria Broth (LB) and/or Terrific Broth (TB). Suitable media for culturing eukaryotic cells include Roswell Park Memorial Institute medium 1640 (RPMI 1640), Minimal Essential Medium (MEM) and/or Dulbecco's Modified Eagle Medium (DMEM), all of which may be supplemented with serum and/or growth factors as necessary for the particular cell line being cultured. A suitable medium for insect cultures is Grace's medium supplemented with yeastolate, lactalbumin hydrolysate, and/or fetal calf serum as necessary.

Typically, an antibiotic or other compound useful for selective growth of transfected or transformed cells is added as a supplement to the media. The compound to be used will be dictated by the selectable marker element present on the plasmid with which the host cell was transformed. For example, where the selectable marker element is kanamycin resistance, the compound added to the culture medium will be kanamycin. Other compounds for selective growth include ampicillin, tetracycline, and neomycin.

The amount of a B7-L polypeptide produced by a host cell can be evaluated using standard methods known in the art. Such methods include, without limitation, Western blot analysis, SDS-polyacrylamide gel electrophoresis, non-denaturing gel electrophoresis, High Performance Liquid Chromatography (HPLC) separation, immunoprecipitation, and/or activity assays such as DNA binding gel shift assays.

If a B7-L polypeptide has been designed to be secreted from the host cells, the majority of polypeptide may be found in the cell culture medium. If however, the B7-L polypeptide is not secreted from the host cells, it will be present in the cytoplasm and/or the nucleus (for eukaryotic host cells) or in the cytosol (for gram-negative bacteria host cells).

For a B7-L polypeptide situated in the host cell cytoplasm and/or nucleus (for eukaryotic host cells) or in the cytosol (for bacterial host cells), the intracellular material (including inclusion bodies for gram-negative bacteria) can be extracted from the host cell using any standard technique known to the skilled artisan. For example, the host cells can be lysed to release the contents of the periplasm/cytoplasm by French press, homogenization, and/or sonication followed by centrifugation.

If a B7-L polypeptide has formed inclusion bodies in the cytosol, the inclusion bodies can often bind to the inner and/or outer cellular membranes and thus will be found primarily in the pellet material after centrifugation. The pellet material can then be treated at pH extremes or with a chaotropic agent such as a detergent, guanidine, guanidine derivatives, urea, or urea derivatives in the presence of a reducing agent such as dithiothreitol at alkaline pH or tris carboxyethyl phosphine at acid pH to release, break apart, and solubilize the inclusion bodies. The solubilized B7-L polypeptide can then be analyzed using gel electrophoresis, immunoprecipitation, or the like. If it is desired to isolate the B7-L polypeptide, isolation may be accomplished using standard methods such as those described herein and in Marston et al., 1990 , Meth. Enz., 182:264-75.

In some cases, a B7-L polypeptide may not be biologically active upon isolation. Various methods for “refolding” or converting the polypeptide to its tertiary structure and generating disulfide linkages can be used to restore biological activity. Such methods include exposing the solubilized polypeptide to a pH usually above 7 and in the presence of a particular concentration of a chaotrope. The selection of chaotrope is very similar to the choices used for inclusion body solubilization, but usually the chaotrope is used at a lower concentration and is not necessarily the same as chaotropes used for the solubilization. In most cases the refolding/oxidation solution will also contain a reducing agent or the reducing agent plus its oxidized form in a specific ratio to generate a particular redox potential allowing for disulfide shuffling to occur in the formation of the protein's cysteine bridges. Some of the commonly used redox couples include cysteine/cystamine, glutathione (GSH)/dithiobis GSH, cupric chloride, dithiothreitol(DTT)/dithiane DTT, and 2-2-mercaptoethanol(bME)/dithio-b(ME). In many instances, a cosolvent may be used or may be needed to increase the efficiency of the refolding, and the more common reagents used for this purpose include glycerol, polyethylene glycol of various molecular weights, arginine and the like.

If inclusion bodies are not formed to a significant degree upon expression of a B7-L polypeptide, then the polypeptide will be found primarily in the supernatant after centrifugation of the cell homogenate. The polypeptide may be further isolated from the supernatant using methods such as those described herein.

The purification of a B7-L polypeptide from solution can be accomplished using a variety of techniques. If the polypeptide has been synthesized such that it contains a tag such as Hexahistidine (B7-L polypeptide/hexaHis) or other small peptide such as FLAG (Eastman Kodak Co., New Haven, Conn.) or myc (Invitrogen) at either its carboxyl- or amino-terminus, it may be purified in a one-step process by passing the solution through an affinity column where the column matrix has a high affinity for the tag.

For example, polyhistidine binds with great affinity and specificity to nickel. Thus, an affinity column of nickel (such as the Qiagen® nickel columns) can be used for purification of B7-L polypeptide/polyHis. See, e.g., Current Protocols in Molecular Biology § 10.11.8 (Ausubel et al., eds., Green Publishers Inc. and Wiley and Sons 1993).

Additionally, B7-L polypeptides may be purified through the use of a monoclonal antibody that is capable of specifically recognizing and binding to a B7-L polypeptide.

Other suitable procedures for purification include, without limitation, affinity chromatography, immunoaffinity chromatography, ion exchange chromatography, molecular sieve chromatography, HPLC, electrophoresis (including native gel electrophoresis) followed by gel elution, and preparative isoelectric focusing (“Isoprime” machine/technique, Hoefer Scientific, San F