An efficient method for native protein purification in the selected range from prostate cancer tissue digests
Introduction
Other than skin cancer, prostate cancer (CP) is the most common cancer in American men. According to the American Cancer Society’s estimate, 180,890 new cases of CP will be diagnosed and 26,120 men will die of the disease in 2016. About 1 in 7 men will be diagnosed with CP during his lifetime (1). Nearly two-thirds of the cancer is diagnosed in men aged 65 or older. CP is the second leading cause of cancer death in American men, but most diagnosed men do not die from it. In fact, more than 2.9 million men in the United States who have been diagnosed with CP at some point are alive today. The current biomarker of elevated serum prostate-specific antigen (PSA) is a flawed test for early detection as many men with abnormal serum PSA turn out not to have cancer upon biopsy.
Researchers are working on strategies to improve molecular diagnosis. One approach is to assay for specific characteristics of PSA, or to detect PSA variants more specific to cancer (2). Another approach is to develop new tests based on other tumor markers. We used comparative analysis between the transcriptomes of isolated CD26+ cancer cells and CD26+ normal luminal cells to identify genes up-regulated in cancer (3,4). Genes that were found to be overexpressed by ≥8-fold, and to encode secreted or extracellular proteins were selected as biomarker candidates for assay development. Similarly, comparative analysis of isolated CD90+ cancer-associated stromal cells and CD49a+ benign tissue stromal cells has identified additional secreted protein candidates (5,6).
Anterior gradient 2 (AGR2; 19 kDa), breast cancer membrane protein 11/AGR3 (BCMP11; 19 kDa), cysteine-rich secretory protein-3 (CRISP3; 28 kDa), thymocyte differentiation antigen 1/CD90 (THY1; ~27 kDa) are examples of proteins secreted or released from prostate tumors. As such, these proteins can all likely be detected and measured in body fluids. Furthermore, array signals indicate that AGR2 is a moderately abundant protein while the others are less (3,7). In order to obtain antibodies that can recognize AGR2 and others, one needs to isolate these antigens preferably in their native form for mouse immunization.
The present study concerns with the development of a procedure to isolate cancer secreted/extracellular proteins such as AGR2 from media in which CP tissue was digested by collagenase (8). This media contained proteins synthesized by cell types of the tumors. Since a number of biomarker candidates are calculated to be in the 10–30 kDa size range, protein separation based on size fractionation was employed. The rationale behind the choice of 30 kDa as the upper bound was to ensure that monoclonal antibodies would not be raised against the abundant PSA (34 kDa) and other prostatic proteins such as zinc α2-glycoprotein (AZGP1, 43 kDa) and prostatic acid phosphatase (ACPP, 52 kDa), and to remove collagenase type I (68 kDa) from being present in the protein preparation. Another abundant species, β-microseminoprotein (MSMB/prostate secretory protein of 94 amino acids; 13 kDa), shows much reduced expression in tumors (3). The choice of 10 kDa as the lower bound was to remove impurities and small molecules such as peptide fragments.
The tissue digestion media used in this study was media supernatant pooled from ten tumor tissue specimens of Gleason 3+3, 3+4, and 4+3 scores (and one of 4+5), which would contain proteins synthesized by both pattern 3 and pattern 4 tumors. Proteins in the desired molecular weight range were purified by a scheme involving sonication, dialysis and ultrafiltration. Western blotting analysis was done to ascertain whether the resultant purified product was, for example, positive for AGR2 and negative for PSA. Mass spectrometry (MS)-based proteomics was also used to profile the protein composition of the purified fraction to gauge proteomic complexity.
Methods
Ethical statement
Human prostate tissue specimens were obtained from ten radical prostatectomy patients operated by surgeons in the Department of Urology with informed and written consent from all participants as per the guidelines approved by the University of Washington Institutional Review Board. Frozen sections were histologically assessed to confirm that the specimens were enriched for cancer. The pathology characteristics of the collected tumors were as follows: 05-081CP G3+3; 05-020CP G3+4; 05-036CP G3+4; 05-125CP G3+4; 05-056CP G3+4; 05-131CP G3+4; 05-081CP G3+3; 03-169CP G3+3; 05-154CP G4+5; 04-176CP G4+3 (G = Gleason score).
Tissue specimens and tissue digestion
Each CP tissue sample weighing at least 0.1 g, was rinsed with Hanks balanced salt solution (HBSS) and minced for enzymatic digestion overnight at room temperature with 0.2% collagenase type I (Invitrogen, Carlsbad, CA, USA) in serum-free RPMI1640 media on a magnetic stirrer. The resultant cell suspension was filtered with a 70-µm Falcon cell strainer, diluted with an equal volume of HBSS, and aspirated with an 18-gauge needle. The suspension was centrifuged at 1,000 rpm for 15 min to collect the supernatant. These media preparations were screened by Western blotting analysis to show low levels of TIMP1 and high levels of CD90 compared with normal tissue (NP) or benign prostatic hyperplasia (BPH) samples digested by collagenase (5). Tumor loses TIMP1 expression (8).
Purification of proteins in the 10–30 kDa range
To 22.5 mL of pooled CP samples, the media supernatant was made to contain 1 mM phenylmethylsulfonyl fluoride (PMSF), 1 mM benzamidine, 10 µM leupeptin, 1 µM pepstatin A, 1 mM EDTA, 1% Triton-X-100 and 15 mM β-mercaptoethanol. The sample was stirred for 1 h at 4 °C. The pH was adjusted to 5.5, and the solution was stirred for another 1 h. The sample was then sonicated three times for 2 min each, and centrifuged at 40,000 rpm for 30 min. The supernatant was then passed through a 0.22-micron filter, and fractionated using a 30-kDa MWCO Millipore filter (Millipore, Billerica, MA, USA) at 1,000 rpm for 30 min. The flow-through was then fractionated using a 10-kDa MWCO Millipore filter at 4,000 rpm for 30 min. The resultant product after these filtrations was concentrated to 1 mL. One mL MES buffer (25 mM MES, pH 5.5, 150 mM NaCl, 1 mM EDTA, 50 mM β-mercaptoethanol) was then added. The retentate was centrifuged at 4,000 rpm to about 1 mL volume. This preparation was desalted by placing in 3-kDa dialysis tubing overnight against 1 mM MES, pH 5.5.
SDS-PAGE and Western blot analysis
Protein concentration of the purified CP sample was measured by Bradford Assay (BioRad, Hercules, CA, USA). For gel analysis, loading buffer containing 0.1 M DTT was added to the amount of purified CP sample containing ~60 µg protein, heated to 70 °C for 10 min, electrophoresed on 4–20% gradient SDS-polyacrylamide gel (BioRad), and electrotransferred to PVDF membrane (Hybond-P, Amersham). Blotting was ascertained by visualization of protein bands on the membrane by Ponçeau S (Sigma, St. Louis, MO, USA). Afterwards, the stain was completely removed by repeated washing with triple distilled water. The membrane was immersed in 5% nonfat dry milk-PBS-Tween for 30 min, and probed with AGR2 antibody (1:2,000; clone 1C3, Abnova, Taiwan), or PSA antibody (1:1,000; clone A67-B/E3, Santa Cruz Biotechnology, Dallas, TX, USA) for 60 min, followed by horseradish peroxidase conjugated anti-mouse IgG (1:5,000; Vector, Burlingame, CA, USA). After washing, the membrane was incubated with Luminol (Santa Cruz Biotechnology), and the immunoreactive bands were visualized using Biomax MR light film (Kodak, Rochester, NY, USA).
Proteomic analysis using liquid chromatography-tandem mass spectrometry (LC-MS/MS)
The purified protein sample was further concentrated using Amicon Ultra-15 centrifugal filter MWCO3000 (Millipore). Protein concentration was determined by BCA Protein Assay (Thermo Scientific, Rockford, IL, USA). An aliquot of the concentrated preparation containing 10 µg protein was denatured in 50% trifluoroethanol at 60 °C for 2 h with gentle shaking (9), and then reduced with 2 mM DTT at 37 °C for 1 h. The sample was diluted 5-fold with 50 mM NH4HCO3 before digestion with trypsin (trypsin:protein 1:50 w/w) at 37 °C for 3 h. After speed-vac, the resultant tryptic peptides were reconstituted in 50 mM NH4HCO3 for LC-MS analysis.
LC-MS/MS was performed as previously described (10). Briefly, ~1 µg peptides were loaded onto a 65-cm-long, 75-µm-i.d. reversed-phase capillary column packed in house with 3 μm Jupiter C18 particles (Phenomenex, Torrance, CA, USA). The mobile phases consisted of solution A (0.1% formic acid in water) and solution B (0.1% formic acid in acetonitrile). An exponential gradient starting with 100% of mobile phase A to 60% of mobile phase B over the course of 100 min was employed with a flow-rate of ~500 NL/min. Eluted peptides were ionized via a nanoelectrospray ionization interface manufactured in house and analyzed on an LTQ-Orbitrap mass spectrometer (Thermo Fisher Scientific, San Jose, CA, USA). The instrument was operated in data-dependent mode with m/z ranging from 400–2,000, in which a full MS scan was followed by ten MS/MS scans.
MS/MS data was analyzed by SEQUEST-based database searching against the Human International Protein Index (IPI) database (version 3.54). Searching parameters were: 3 Da tolerance for precursor ion masses and 1 Da for fragment ion masses with no enzyme restraint and a maximum of three missed tryptic cleavages. Filtering criteria (11) were applied for peptide identifications to achieve a <5% false discovery rate at the unique peptide level based on a reversed-database searching methodology (12).
Results
Purification of the 10–30 kDa fraction of CP tissue digests
Protein concentration of the purified CP sample was found to be at 3.2 mg/mL. Figure 1 shows the SDS-PAGE/Coomassie profile of the purified 10–30 kDa fraction. There was a good removal of proteins having molecular weights above 30 kDa by the purification scheme. At the same time, there were little 10–30 kDa proteins in the excluded fraction. The banding pattern also indicated minimal protein degradation.
Detection of AGR2 in the 10–30 kDa fraction
Figure 2 shows the presence of AGR2 (19 kDa) in the purified 10–30 kDa CP preparation and not in the excluded fraction. The Western blot also shows that this protein was not expressed by non-cancer (e.g., samples 05-091NP and 03-169BPH). There was an estimated loss of about 50% in the amount of AGR2 after purification (compare lanes labeled unfractionated CP and 10–30 kDa fraction, Figure 2). A fainter band lower than AGR2 may represent AGR3 (166 aa), which has 65% sequence homology with AGR2 (175 aa).
Absence of PSA in the 10–30 kDa fraction
The purified 10–30 kDa fraction was found to be negative for PSA as the unfractionated preparation was positive (Figure 3). On the blot, PSA was detected in matched 05-091CP and 05-091NP, as well as in 03-169BPH. Thus, the purification scheme was effective in eliminating an abundant protein species just 4 kDa larger than the upper size cutoff point.
MS analysis of purified CP fraction
The purified 10–30 kDa was analyzed using LC-MS/MS, and the proteins and peptides identified are included in Table S1. AGR2 was confidently identified by seven different peptides (Table 1). Previously, using unfractionated CP tissue digest media LC-MS/MS was unable to confidently detect AGR2 (unpublished) most likely due to its relatively low abundance or physical properties. On the other hand, the exogenously added (abundant) collagenase was not seen among the 252 proteins identified with two or more unique peptides. Although PSA was undetected by Western blotting, the protein was still identified by MS with seven spectral counts, but as evidence of a relatively low-abundance protein in the fraction. Table 2 lists the top dozen abundant proteins identified by LC-MS/MS using spectral counts as a measure of abundance (13). Among them are several large cytoskeletal components FLNA, COL6A3, FLNB, MYH11, VCL, etc. In fact, only 51% (290/574) of the proteins had calculated molecular weights in the 10–30 kDa range (and 1.6% below 10 kDa). Many of the proteins belonged to the cytoplasmic proteome and were not necessarily secreted.
Full table
Full table
Full table
Discussion
With protein biomarker discovery, reagents such as high quality antibodies are needed for clinical assays to measure these markers. Purified antigens are the optimal immunogens for monoclonal antibody production. However, the purification of multiple proteins from biosamples is extremely challenging and not cost-effective. The purification of 10–30 kDa protein species from pooled CP samples represents our attempt to obtain suitable materials for subsequent antibody production. Simple modifications can be done to the protocol to obtain any size fractions containing other informative biomarkers. Our results showed that this procedure is effective with exclusion of proteins outside the size range, as probed by Western blotting, while retaining the targeted proteins with acceptable losses. Importantly, the proteins are present in their native configuration with proper post-translational modifications intact. Antibodies resulting after mouse immunization would likely recognize specifically these protein analytes in human biospecimens. The laborious part would involve extensive screening of the hybridomas for the desired clones against particular antigens as the number of protein species in the 10–30 kDa CP is unknown (at least 574 based on the MS data). Removal of the abundant prostatic proteins ensures against generation of many unwanted antibodies. One drawback of this approach is that proteins in lower abundance would have fewer resultant clones, and screening for them would be hard. The first screen would compare immunoreactivity on frozen tissue sections of CP vs. NP. Those that show no reactivity on NP would be used for immunoprecipitation of CP digestion media or media of CP cell lines (7), and the protein(s) captured would be identified by MS. Most likely, these antibodies will not work well on Western blotting.
Using the more sensitive MS analysis, the 10–30 kDa fraction was shown to still contain nearly 50% protein species with molecular weights outside the selected range. These could result from degradation during the overnight tissue digestion. Frequently, some tumor tissue samples may contain necrotic areas not visible grossly. The cytoplasmic proteins then, most likely, resulted from breakdown of cells. This creates a potentially large obstacle for our approach as cytoskeletal proteins are some of the most immunogenic molecules.
An alternative to using CP tissue media is media supernatant of cultured cancer cell lines (and cancer-associated stromal cells) as well as that of xenograft tissue digest. Perhaps then, the problem of cell breakdown could be minimized. While no single cell line produces all the proteins found in CP (3), a panel of available cell lines and xenografts could produce most of these proteins. For example, AGR2 is made by CL1, CRISP3 by LuCaP 35, CEACAM5, BCMP11 by LuCaP 49 [results from dataset queries (4)]. In this way, we are not limited by the availability of ample material for immunization.
Our protein purification scheme has been optimized by including relatively fewer and simpler steps to obtain a good yield. It is convenient and efficient. The more common approach of using recombinant proteins as immunogens suffers from the lack of proper configuration when made in a bacterial or yeast host system, or low yield and high cost when made in mammalian cultured cells. The recombinantly produced proteins would still require purification. Nevertheless, useful monoclonal antibodies against AGR2 have been obtained by us using bacterially made recombinant AGR2. The selected antibody clones were able to recognize the recombinant protein and native AGR2 in CP (and not NP) digestion media, media supernatant of cultured CL1 cells, and in urine of patients (7). This could be due to the relatively small size of AGR2 and its lack of extensive glycosylation (small molecular weight difference between calculated size from amino acid composition and size estimated in SDS-PAGE).
The presence of stromal cell-derived proteins in the CP fraction shows that these could also be used for tumor detection. CD90 was previously detected by global proteomics in unfractionated CP tissue media (6,8). CD90 expression is increased in the tumor stroma compared with NP stroma, and can be measured in urine of patients (6). In addition, a number of other secreted CP stromal proteins have been identified. Thus, a CP tissue sample would contain proteins derived from both the cancer epithelial component and the tumor stromal component.
Conclusions
This work reports a method to selectively isolate native proteins in a particular molecular weight range using tissue digestion media. This process essentially lowers the proteomic complexity of the biosample that could be used to identify biomarker candidates, and for mouse immunization.
Acknowledgements
The authors thank Charles Luo and Michael Regnier in the Department of Bioengineering, Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, USA, for their expertise in developing the purification protocol. Proteomics experiments were performed in the Environmental Molecular Sciences Laboratory, a national scientific user facility sponsored by the DoE and located at Pacific Northwest National Laboratory, which is operated by Battelle Memorial Institute for the DoE under Contract DE-AC05-76RL0 1830.
Funding: This work was supported by NCI-EDRN grant CA111244 and NIH P41GM103493.
Footnote
Conflicts of Interest: The authors have no conflicts of interest to declare.
Ethical Statement: The study was approved by the University of Washington Institutional Review Board and written informed consent was obtained from all patients.
References
- American Cancer Society. Prostate Cancer. Available online: http://www.cancer.org/cancer/prostatecancer/detailedguide/index
- Sokoll LJ, Sanda MG, Feng Z, et al. A prospective, multicenter, National Cancer Institute Early Detection Research Network study of [-2]proPSA: improving prostate cancer detection and correlating with cancer aggressiveness. Cancer Epidemiol Biomarkers Prev 2010;19:1193-200. [Crossref] [PubMed]
- Pascal LE, Vêncio RZ, Page LS, et al. Gene expression relationship between prostate cancer cells of Gleason 3, 4 and normal epithelial cells as revealed by cell type-specific transcriptomes. BMC Cancer 2009;9:452. [Crossref] [PubMed]
- Pascal LE, Deutsch EW, Campbell DS, et al. The urologic epithelial stem cell database (UESC) - a web tool for cell type-specific gene expression and immunohistochemistry images of the prostate and bladder. BMC Urol 2007;7:19. [Crossref] [PubMed]
- Pascal LE, Goo YA, Vêncio RZ, et al. Gene expression down-regulation in CD90+ prostate tumor-associated stromal cells involves potential organ-specific genes. BMC Cancer 2009;9:317. [Crossref] [PubMed]
- True LD, Zhang H, Ye M, et al. CD90/THY1 is overexpressed in prostate cancer-associated fibroblasts and could serve as a cancer biomarker. Mod Pathol 2010;23:1346-56. [Crossref] [PubMed]
- Wayner EA, Quek SI, Ahmad R, et al. Development of an ELISA to detect the secreted prostate cancer biomarker AGR2 in voided urine. Prostate 2012;72:1023-34. [Crossref] [PubMed]
- Liu AY, Zhang H, Sorensen CM, et al. Analysis of prostate cancer by proteomics using tissue specimens. J Urol 2005;173:73-8. [Crossref] [PubMed]
- Wang H, Qian WJ, Mottaz HM, et al. Development and evaluation of a micro- and nanoscale proteomic sample preparation method. J Proteome Res 2005;4:2397-403. [Crossref] [PubMed]
- Wang L, Aryal UK, Dai Z, et al. Mapping N-linked glycosylation sites in the secretome and whole cells of Aspergillus niger using hydrazide chemistry and mass spectrometry. J Proteome Res 2012;11:143-56. [Crossref] [PubMed]
- Qian WJ, Kaleta DT, Petritis BO, et al. Enhanced detection of low abundance human plasma proteins using a tandem IgY12-SuperMix immunoaffinity separation strategy. Mol Cell Proteomics 2008;7:1963-73. [Crossref] [PubMed]
- Qian WJ, Liu T, Monroe ME, et al. Probability-based evaluation of peptide and protein identifications from tandem mass spectrometry and SEQUEST analysis: the human proteome. J Proteome Res 2005;4:53-62. [Crossref] [PubMed]
- Liu H, Sadygov RG, Yates JR 3rd. A model for random sampling and estimation of relative protein abundance in shotgun proteomics. Anal Chem 2004;76:4193-201. [Crossref] [PubMed]