Background: Our group previously reported that tumour-specific expression of the rate-limiting enzyme in the mevalonate pathway, 3-hydroxy-3-methylglutharyl-coenzyme A reductase (HMG-CoAR) is associated with more favourable tumour parameters and a good prognosis in breast cancer. In the present study, the prognostic value of HMG-CoAR expression was examined in tumours from a cohort of patients with primary epithelial ovarian cancer. Methods: HMG-CoAR expression was assessed using immunohistochemistry (IHC) on tissue microarrays (TMA) consisting of 76 ovarian cancer cases, analysed using automated algorithms to develop a quantitative scoring model. Kaplan Meier analysis and Cox proportional hazards modelling were used to estimate the risk of recurrence free survival (RFS). Results: Seventy-two tumours were suitable for analysis. Cytoplasmic HMG-CoAR expression was present in 65% (n = 46) of tumours. No relationship was seen between HMG-CoAR and age, histological subtype, grade, disease stage, estrogen receptor or Ki-67 status. Patients with tumours expressing HMG-CoAR had a significantly prolonged RFS (p = 0.012). Multivariate Cox regression analysis revealed that HMG-CoAR expression was an independent predictor of improved RFS (RR = 0.49, 95% CI (0.25-0.93); p = 0.03) when adjusted for established prognostic factors such as residual disease, tumour stage and grade. Conclusion: HMG-CoAR expression is an independent predictor of prolonged RFS in primary ovarian cancer. As HMG-CoAR inhibitors, also known as statins, have demonstrated anti-neoplastic effects in vitro, further studies are required to evaluate HMG-CoAR expression as a surrogate marker of response to statin treatment, especially in conjunction with current chemotherapeutic regimens.
Introduction: We previously reported an association between tumor-specific 3-hydroxy-3-methylglutharyl-coenzyme A reductase (HMG-CoAR) expression and a good prognosis in breast cancer. Here, the predictive value of HMG-CoAR expression in relation to tamoxifen response was examined. Methods: HMG-CoAR protein and RNA expression was analyzed in a cell line model of tamoxifen resistance using western blotting and PCR. HMG-CoAR mRNA expression was examined in 155 tamoxifen-treated breast tumors obtained from a previously published gene expression study (Cohort I). HMG-CoAR protein expression was examined in 422 stage II premenopausal breast cancer patients, who had previously participated in a randomized control trial comparing 2 years of tamoxifen with no systemic adjuvant treatment (Cohort II). Kaplan-Meier analysis and Cox proportional hazards modeling were used to estimate the risk of recurrence-free survival (RFS) and the effect of HMG-CoAR expression on tamoxifen response. Results: HMG-CoAR protein and RNA expression were decreased in tamoxifen-resistant MCF7-LCC9 cells compared with their tamoxifen-sensitive parental cell line. HMG-CoAR mRNA expression was decreased in tumors that recurred following tamoxifen treatment (P < 0.001) and was an independent predictor of RFS in Cohort I (hazard ratio = 0.63, P = 0.009). In Cohort II, adjuvant tamoxifen increased RFS in HMG-CoAR-positive tumors (P = 0.008). Multivariate Cox regression analysis demonstrated that HMG-CoAR was an independent predictor of improved RFS in Cohort II (hazard ratio = 0.67, P = 0.010), and subset analysis revealed that this was maintained in estrogen receptor (ER)-positive patients (hazard ratio = 0.65, P = 0.029). Multivariate interaction analysis demonstrated a difference in tamoxifen efficacy relative to HMG-CoAR expression (P = 0.05). Analysis of tamoxifen response revealed that patients with ER-positive/HMG-CoAR tumors had a significant response to tamoxifen (P = 0.010) as well as patients with ER-positive or HMG-CoAR-positive tumors (P = 0.035). Stratification according to ER and HMG-CoAR status demonstrated that ER-positive/HMG-CoAR-positive tumors had an improved RFS compared with ER-positive/HMG-CoAR-negative tumors in the treatment arm (P = 0.033); this effect was lost in the control arm (P = 0.138), however, suggesting that HMG-CoAR predicts tamoxifen response. Conclusions: HMG CoAR expression is a predictor of response to tamoxifen in both ER-positive and ER-negative disease. Premenopausal patients with tumors that express ER or HMG-CoAR respond to adjuvant tamoxifen.
We evaluate the shared genetic regulation of mRNA molecules, proteins and metabolites derived from whole blood from 3029 human donors. We find abundant allelic heterogeneity, where multiple variants regulate a particular molecular phenotype, and pleiotropy, where a single variant associates with multiple molecular phenotypes over multiple genomic regions. The highest proportion of share genetic regulation is detected between gene expression and proteins (66.6%), with a further median shared genetic associations across 49 different tissues of 78.3% and 62.4% between plasma proteins and gene expression. We represent the genetic and molecular associations in networks including 2828 known GWAS variants, showing that GWAS variants are more often connected to gene expression in trans than other molecular phenotypes in the network. Our work provides a roadmap to understanding molecular networks and deriving the underlying mechanism of action of GWAS variants using different molecular phenotypes in an accessible tissue.
There is a clear clinical need for high-specificity plasma biomarkers for predicting risk of venous thromboembolism (VTE), but thus far, such markers have remained elusive. Utilizing affinity reagents from the Human Protein Atlas project and multiplexed immuoassays, we extensively analyzed plasma samples from 2 individual studies to identify candidate protein markers associated with VTE risk. We screened plasma samples from 88 VTE cases and 85 matched controls, collected as part of the Swedish ¡°Venous Thromboembolism Biomarker Study,¡± using suspension bead arrays composed of 755 antibodies targeting 408 candidate proteins. We identified significant associations between VTE occurrence and plasma levels of human immunodeficiency virus type I enhancer binding protein 1 (HIVEP1), von Willebrand factor (VWF), glutathione peroxidase 3 (GPX3), and platelet-derived growth factor β (PDGFB). For replication, we profiled plasma samples of 580 cases and 589 controls from the French FARIVE study. These results confirmed the association of VWF and PDGFB with VTE after correction for multiple testing, whereas only weak trends were observed for HIVEP1 and GPX3. Although plasma levels of VWF and PDGFB correlated modestly (p ~ 0.30) with each other, they were independently associated with VTE risk in a joint model in FARIVE (VWF P < .001; PDGFB P 5 .002). PDGF was verified as the target of the capture antibody by immunocapture mass spectrometry and sandwich enzyme-linked immunosorbent assay. In conclusion, we demonstrate that high-throughput affinity plasma proteomic profiling is a valuable research strategy to identify potential candidate biomarkers for thrombosis-related disorders, and our study suggests a novel association of PDGFB plasma levels with VTE.
Endothelial cells line blood vessels and regulate hemostasis, inflammation, and blood pressure. Proteins critical for these specialized functions tend to be predominantly expressed in endothelial cells across vascular beds. Here, we present a systems approach to identify a panel of human endothelial-enriched genes using global, body-wide transcriptomics data from 124 tissue samples from 32 organs. We identified known and unknown endothelial-enriched gene transcripts and used antibody-based profiling to confirm expression across vascular beds. The majority of identified transcripts could be detected in cultured endothelial cells from various vascular beds, and we observed maintenance of relative expression in early passage cells. In summary, we describe a widely applicable method to determine cell-type-specific transcriptome profiles in a whole-organism context, based on differential abundance across tissues. We identify potential vascular drug targets or endothelial biomarkers and highlight candidates for functional studies to increase understanding of the endothelium in health and disease.
Antibodies empower numerous important scientific, clinical, diagnostic, and industrial applications. Ideally, the epitope(s) targeted by an antibody should be identified and characterized, thereby establishing antibody reactivity, highlighting possible cross-reactivities, and perhaps even warning against unwanted (e.g. autoimmune) reactivities. Antibodies target proteins as either conformational or linear epitopes. The latter are typically probed with peptides, but the cost of peptide screening programs tends to prohibit comprehensive specificity analysis. To perform high-throughput, high-resolution mapping of linear antibody epitopes, we have used ultrahigh-density peptide microarrays generating several hundred thousand different peptides per array. Using exhaustive length and substitution analysis, we have successfully examined the specificity of a panel of polyclonal antibodies raised against linear epitopes of the human proteome and obtained very detailed descriptions of the involved specificities. The epitopes identified ranged from 4 to 12 amino acids in size. In general, the antibodies were of exquisite specificity, frequently disallowing even single conservative substitutions. In several cases, multiple distinct epitopes could be identified for the same target protein, suggesting an efficient approach to the generation of paired antibodies. Two alternative epitope mapping approaches identified similar, although not necessarily identical, epitopes. These results show that ultrahigh-density peptide microarrays can be used for linear epitope mapping. With an upper theoretical limit of 2,000,000 individual peptides per array, these peptide microarrays may even be used for a systematic validation of antibodies at the proteomic level.
The brain is a vital organ and because it is well shielded from the outside environment, possibilities for noninvasive analysis are often limited. Instead, fluids taken from the spinal cord or circulatory system are preferred sources for the discovery of candidate markers within neurological diseases. In the context of multiple sclerosis (MS), we applied an affinity proteomic strategy and screened 22 plasma samples with 4595 antibodies (3450 genes) on bead arrays, then defined 375 antibodies (334 genes) for targeted analysis in a set of 172 samples and finally used 101 antibodies (43 genes) on 443 plasma as well as 573 cerebrospinal spinal fluid (CSF) samples. This revealed alteration of protein profiles in relation to MS subtypes for IRF8, IL7, METTL14, SLC30A7, and GAP43. Respective antibodies were subsequently used for immunofluorescence on human post-mortem brain tissue with MS pathology for expression and association analysis. There, antibodies for IRF8, IL7, and METTL14 stained neurons in proximity of lesions, which highlighted these candidate protein targets for further studies within MS and brain tissue. The affinity proteomic translation of profiles discovered by profiling human body fluids and tissue provides a powerful strategy to suggest additional candidates to studies of neurological disorders.
BACKGROUND: Blood-based proteomic profiling may aid and expand our understanding of diseases and their different phenotypes. The aim of the presented study was to profile serum samples from patients with malignant melanoma using affinity proteomic assays to describe proteins in the blood stream that are associated to stage or recurrence of melanoma. MATERIAL AND METHODS: Multiplexed protein analysis was conducted using antibody suspension bead arrays. A total of 232 antibodies against 132 proteins were selected from (i) a screening with 4595 antibodies and 32 serum samples from melanoma patients and controls, (ii) antibodies used for immunohistochemistry, (iii) protein targets previously related with melanoma. The analysis was performed with 149 serum samples from patients with malignant melanoma. Antibody selectivity was then assessed by Western blot, immunocapture mass spectrometry, and epitope mapping. Lastly, indicative antibodies were applied for IHC analysis of melanoma tissues. RESULTS: Serum levels of regucalcin (RGN) and syntaxin 7 (STX7) were found to be lower in patients with both recurring tumors and a high Breslow's thickness (T-stage 3/4) compared to low thickness (T-stage 1/2) without disease recurrence. Serum levels of methylenetetrahydrofolate dehydrogenase 1-like (MTHFD1L) were instead elevated in sera of T3/4 patients with recurrence. The analysis of tissue sections with S100A6 and MTHFD1L showed positive staining in a majority of patients with melanoma, and S100A6 was significantly associated to T-stage. CONCLUSIONS: Our findings provide a starting point to further study RGN, STX7, MTHFD1L and S100A6 in serum to elucidate their involvement in melanoma progression and to assess a possible contribution to support clinical indications.
This month: relating single cells to populations (Cao/Packer, Wu/Altschuler, O'Brien, Friedman), an excess of ribosomes (Barkai), human pathology atlas (Uhlen), signatures of feedback (Rahi), and major genome redesign (Baumgart).
Functional surface display of small affinity proteins, namely, affibodies (6.5 kDa), was evaluated for the model cyanobacterium Synechocystis sp. strain PCC 6803 through anchoring to native surface structures. These structures included confirmed or putative subunits of the type IV pili, the S-layer protein, and the heterologous Escherichia coli autotransporter antigen 43 system. The most stable display system was determined to be through C-terminal fusion to PilA1, the major type IV pilus subunit in Synechocystis, in a strain unable to retract these pili (Delta pilT1). Type IV pilus synthesis was upheld, albeit reduced, when fusion proteins were incorporated. However, pilus-mediated functions, such as motility and transformational competency, were negatively affected. Display of affibodies on Synechocystis and the complementary anti-idiotypic affibodies on E. coli or Staphylococcus carnosus was able to mediate interspecies cell-cell binding by affibody complex formation. The same strategy, however, was not able to drive cell-cell binding and aggregation of Synechocystis-only mixtures. Successful affibody tagging of the putative minor pilin PilA4 showed that it locates to the type IV pili in Synechocystis and that its extracellular availability depends on PilA1. In addition, affibody tagging of the S-layer protein indicated that the domains responsible for the anchoring and secretion of this protein are located at the N and C termini, respectively. This study can serve as a basis for future surface display of proteins on Synechocystis for biotechnological applications. IMPORTANCE Cyanobacteria are gaining interest for their potential as autotrophic cell factories. Development of efficient surface display strategies could improve their suitability for large-scale applications by providing options for designed microbial consortia, cell immobilization, and biomass harvesting. Here, surface display of small affinity proteins was realized by fusing them to the major subunit of the native type IV pili in Synechocystis sp. strain PCC 6803. The display of complementary affinity proteins allowed specific cell-cell binding between Synechocystis and Escherichia coli or Staphylococcus carnosus. Additionally, successful tagging of the putative pilin PilA4 helped determine its localization to the type IV pili. Analogous tagging of the S-layer protein shed light on the regions involved in its secretion and surface anchoring.
SCF (Skp1/Cul1/F-box) ubiquitin ligases act as master regulators of cellular homeostasis by targeting key proteins for ubiquitylation. Here, we identified a hitherto uncharacterized F-box protein, FBXO28 that controls MYC-dependent transcription by non-proteolytic ubiquitylation. SCFFBXO28 activity and stability are regulated during the cell cycle by CDK1/2-mediated phosphorylation of FBXO28, which is required for its efficient ubiquitylation of MYC and downsteam enhancement of the MYC pathway. Depletion of FBXO28 or overexpression of an F-box mutant unable to support MYC ubiquitylation results in an impairment of MYC-driven transcription, transformation and tumourigenesis. Finally, in human breast cancer, high FBXO28 expression and phosphorylation are strong and independent predictors of poor outcome. In conclusion, our data suggest that SCFFBXO28 plays an important role in transmitting CDK activity to MYC function during the cell cycle, emphasizing the CDK-FBXO28-MYC axis as a potential molecular drug target in MYC-driven cancers, including breast cancer. FBXO28 is identified as part of a SCF complex acting as a regulator of tumor cell proliferation and an important modifier of MYC function. FBXO28 may be a new prognostic factor in breast cancer and a new potential drug target in MYC- driven tumors.
Spatially resolved transcriptomic technologies are promising tools to study complex biological processes such as mammalian embryogenesis. However, the imbalance between resolution, gene capture, and field of view of current methodologies precludes their systematic application to analyze relatively large and three-dimensional mid-and late-gestation embryos. Here, we combined DNA nanoball (DNB)-patterned arrays and in situ RNA capture to create spatial enhanced resolution omics-sequencing (Stereo-seq). We applied Stereo-seq to generate the mouse organogenesis spatiotemporal transcriptomic atlas (MOSTA), which maps with single cell resolution and high sensitivity the kinetics and directionality of transcriptional variation during mouse organogenesis. We used this information to gain insight into the molecular basis of spatial cell heterogeneity and cell fate specification in developing tissues such as the dorsal midbrain. Our panoramic atlas will facilitate in-depth investigation of longstanding questions concerning normal and abnormal mammalian development.
Elucidating the cellular organization of the cerebral cortex is critical for understanding brain structure and function. Using large-scale single-nucleus RNA sequencing and spatial transcriptomic analysis of 143 macaque cortical regions, we obtained a comprehensive atlas of 264 transcriptome-defined cortical cell types and mapped their spatial distribution across the entire cortex. We characterized the cortical layer and region preferences of glutamatergic, GABAergic, and non-neuronal cell types, as well as regional differences in cell-type composition and neighborhood complexity. Notably, we discovered a relationship between the regional distribution of various cell types and the region's hierarchical level in the visual and somatosensory systems. Cross-species comparison of transcriptomic data from human, macaque, and mouse cortices further revealed primate-specific cell types that are enriched in layer 4, with their marker genes expressed in a region-dependent manner. Our data provide a cellular and molecular basis for understanding the evolution, development, aging, and pathogenesis of the primate brain.
Despite the wealth of commercially available antibodies to human proteins, research is often hindered by their inconsistent validation, their poor performance and the inadequate coverage of the proteome. These issues could be addressed by systematic, genome-wide efforts to generate and validate renewable protein binders. We report a multicenter study to assess the potential of hybridoma and phage-display technologies in a coordinated large-scale antibody generation and validation effort. We produced over 1,000 antibodies targeting 20 SH2 domain proteins and evaluated them for potency and specificity by enzyme-linked immunosorbent assay (ELISA), protein microarray and surface plasmon resonance (SPR). We also tested selected antibodies in immunoprecipitation, immunoblotting and immunofluorescence assays. Our results show that high-affinity, high-specificity renewable antibodies generated by different technologies can be produced quickly and efficiently. We believe that this work serves as a foundation and template for future larger-scale studies to create renewable protein binders.
G protein-coupled receptors (GPCRs) control critical cellular signaling pathways. Therapeutic agents including anti-GPCR antibodies (Abs) are being developed to modulate GPCR function. However, validating the selectivity of anti-GPCR Abs is challenging because of sequence similarities among individual receptors within GPCR sub-families. To address this challenge, we developed a multiplexed immunoassay to test >400 anti-GPCR Abs from the Human Protein Atlas targeting a customized library of 215 expressed and solubilized GPCRs representing all GPCR subfamilies. We found that-61% of Abs tested were selective for their intended target,-11% bound off -target, and-28% did not bind to any GPCR. Antigens of on-target Abs were, on average, significantly longer, more disordered, and less likely to be buried in the interior of the GPCR protein than the other Abs. These results provide important insights into the immunogenicity of GPCR epitopes and form a basis for designing therapeu-tic Abs and for detecting pathological auto-Abs against GPCRs.
The pancreas is composed of both exocrine glands and intermingled endocrine cells to execute its diverse functions, including enzyme production for digestion of nutrients and hormone secretion for regulation of blood glucose levels. To define the molecular constituents with elevated expression in the human pancreas, we employed a genome-wide RNA sequencing analysis of the human transcriptome to identify genes with elevated expression in the human pancreas. This quantitative transcriptomics data was combined with immunohistochemistry-based protein profiling to allow mapping of the corresponding proteins to different compartments and specific cell types within the pancreas down to the single cell level. Analysis of whole pancreas identified 146 genes with elevated expression levels, of which 47 revealed a particular higher expression as compared to the other analyzed tissue types, thus termed pancreas enriched. Extended analysis of in vitro isolated endocrine islets identified an additional set of 42 genes with elevated expression in these specialized cells. Although only 0.7% of all genes showed an elevated expression level in the pancreas, this fraction of transcripts, in most cases encoding secreted proteins, constituted 68% of the total mRNA in pancreas. This demonstrates the extreme specialization of the pancreas for production of secreted proteins. Among the elevated expression profiles, several previously not described proteins were identified, both in endocrine cells (CFC1, FAM159B, RBPJL and RGS9) and exocrine glandular cells (AQP12A, DPEP1, GATM and ERP27). In summary, we provide a global analysis of the pancreas transcriptome and proteome with a comprehensive list of genes and proteins with elevated expression in pancreas. This list represents an important starting point for further studies of the molecular repertoire of pancreatic cells and their relation to disease states or treatment effects.
In tumor tissues, hypoxia is a commonly observed feature resulting from rapidly proliferating cancer cells outgrowing their surrounding vasculature network. Transformed cancer cells are known to exhibit phenotypic alterations, enabling continuous proliferation despite a limited oxygen supply. The four-step isogenic BJ cell model enables studies of defined steps of tumorigenesis: the normal, immortalized, transformed, and metastasizing stages. By transcriptome profiling under atmospheric and moderate hypoxic (3% O2) conditions, we observed that despite being highly similar, the four cell lines of the BJ model responded strikingly different to hypoxia. Besides corroborating many of the known responses to hypoxia, we demonstrate that the transcriptome adaptation to moderate hypoxia resembles the process of malignant transformation. The transformed cells displayed a distinct capability of metabolic switching, reflected in reversed gene expression patterns for several genes involved in oxidative phosphorylation and glycolytic pathways. By profiling the stage-specific responses to hypoxia, we identified ASS1 as a potential prognostic marker in hypoxic tumors. This study demonstrates the usefulness of the BJ cell model for highlighting the interconnection of pathways involved in malignant transformation and hypoxic response.
After a century of research, the human centrosome continues to fascinate. Based on immunofluorescence and confocal microscopy, an extensive inventory of the protein components of the human centrosome, and the centriolar satellites, with the important contribution of over 300 novel proteins localizing to these compartments is presented. A network of candidate centrosome proteins involved in ubiquitination, including six interaction partners of the Kelch-like protein 21, and an additional network of protein phosphatases, together supporting the suggested role of the centrosome as an interactive hub for cell signaling, is identified. Analysis of multi-localization across cellular organelles analyzed within the Human Protein Atlas (HPA) project shows how multi-localizing proteins are particularly overrepresented in centriolar satellites, supporting the dynamic nature and wide range of functions for this compartment. In summary, the spatial dissection of the human centrosome and centriolar satellites described here provides a comprehensive knowledgebase for further exploration of their proteomes.
The transformation of normal cells to malignant, metastatic tumor cells is a multistep process caused by the sequential acquirement of genetic changes. To identify these changes, we compared the transcriptomes and levels and distribution of proteins in a four-stage cell model of isogenically matched normal, immortalized, transformed, and metastatic human cells, using deep transcriptome sequencing and immunofluorescence microscopy. The data show that similar to 6% (n = 1,357) of the human protein-coding genes are differentially expressed across the stages in the model. Interestingly, the majority of these genes are down-regulated, linking malignant transformation to dedifferentiation. The up-regulated genes are mainly components that control cellular proliferation, whereas the down-regulated genes consist of proteins exposed on or secreted from the cell surface. As many of the identified gene products control basic cellular functions that are defective in cancers, the data provide candidates for follow-up studies to investigate their functional roles in tumor formation. When we further compared the expression levels of four of the identified proteins in clinical cancer cohorts, similar differences were observed between benign and cancer cells, as in the cell model. This shows that this comprehensive demonstration of the molecular changes underlying malignant transformation is a relevant model to study the process of tumor formation.
One of the major challenges of a chromosome-centric proteome project is to explore in a systematic manner the potential proteins identified from the chromosomal genome sequence, but not yet characterized on a protein level. Here, we describe the use of RNA deep sequencing to screen human cell lines for RNA profiles and to use this information to select cell lines suitable for characterization of the corresponding gene product. In this manner, the subcellular localization of proteins can be analyzed systematically using antibody-based confocal microscopy. We demonstrate the usefulness of selecting cell lines with high expression levels of RNA transcripts to increase the likelihood of high quality immunofluorescence staining and subsequent successful subcellular localization of the corresponding protein. The results show a path to combine transcriptomics with affinity proteomics to characterize the proteins in a gene- or chromosome-centric manner.
Malignant gliomas are among the most severe types of cancer, and the most common primary brain tumors. Treatment options are limited and the prognosis is poor. WNT-5A, a member of the WNT family of lipoglycoproteins, plays a role in oncogenesis and tumor progression in various cancers, whereas the role of WNT-5A in glioma remains obscure. Based on the role of WNT-5A as an oncogene, its potential to regulate microglia cells and the glioma-promoting capacities of microglia cells, we hypothesize that WNT-5A has a role in regulation of immune functions in glioma. We investigated WNT-5A expression by in silico analysis of the cancer genome atlas (TCGA) transcript profiling of human glioblastoma samples and immunohistochemistry experiments of human glioma tissue microarrays (TMA). Our results reveal higher WNT-5A protein levels and mRNA expression in a subgroup of gliomas (WNT-5A(high)) compared to non-malignant control brain tissue. Furthermore, we show a significant correlation between WNT-5A in the tumor and presence of major histocompatibility complex Class II-positive microglia/monocytes. Our data pinpoint a positive correlation between WNT-5A and a proinflammatory signature in glioma. We identify increased presence of microglia/monocytes as an important aspect in the inflammatory transformation suggesting a novel role for WNT-5A in human glioma.
The testis' function is to produce haploid germ cells necessary for reproduction. Here we have combined a genome-wide transcriptomics analysis with immunohistochemistry-based protein profiling to characterize the molecular components of the testis. Deep sequencing (RNA-Seq) of normal human testicular tissue from seven individuals was performed and compared with 26 other normal human tissue types. All 20 050 putative human genes were classified into categories based on expression patterns. The analysis shows that testis is the tissue with the most tissue-specific genes by far. More than 1000 genes show a testis-enriched expression pattern in testis when compared with all other analyzed tissues. Highly testis enriched genes were further characterized with respect to protein localization within the testis, such as spermatogonia, spermatocytes, spermatids, sperm, Sertoli cells and Leydig cells. Here we present an immunohistochemistry-based analysis, showing the localization of corresponding proteins in different cell types and various stages of spermatogenesis, for 62 genes expressed at > 50-fold higher levels in testis when compared with other tissues. A large fraction of these genes were unexpectedly expressed in early stages of spermatogenesis. In conclusion, we have applied a genome-wide analysis to identify the human testis-specific proteome using transcriptomics and antibody-based protein profiling, providing lists of genes expressed in a tissue-enriched manner in the testis. The majority of these genes and proteins were previously poorly characterised in terms of localization and function, and our list provides an important starting point to increase our molecular understanding of human reproductive biology and disease.
Cancer testis antigens (CTAs) are of clinical interest as biomarkers and present valuable targets for immunotherapy. To comprehensively characterize the CTA landscape of non-small-cell lung cancer (NSCLC), we compared RNAseq data from 199 NSCLC tissues to the normal transcriptome of 142 samples from 32 different normal organs. Of 232 CTAs currently annotated in the Caner Testis Database (CTdatabase), 96 were confirmed in NSCLC. To obtain an unbiased CTA profile of NSCLC, we applied stringent criteria on our RNAseq data set and defined 90 genes as CTAs, of which 55 genes were not annotated in the CTdatabase, thus representing potential new CTAs. Cluster analysis revealed that CTA expression is histology dependent and concurrent expression is common. IHC confirmed tissue-specific protein expression of selected new CTAs (TKTL1, TGIF2LX, VCX, and CXORF67). Furthermore, methylation was identified as a regulatory mechanism of CTA expression based on independent data from The Cancer Genome Atlas. The proposed prognostic impact of CTAs in lung cancer was not confirmed, neither in our RNAseq cohort nor in an independent meta-analysis of 1,117 NSCLC cases. In summary, we defined a set of 90 reliable CTAs, including information on protein expression, methylation, and survival association. The detailed RNAseq catalog can guide biomarker studies and efforts to identify targets for immunotherapeutic strategies.
Background: Precision medicine approaches aim to tackle diseases on an individual level through molecular profiling. Despite the growing knowledge about diseases and the reported diversity of molecular phenotypes, the descriptions of human health on an individual level have been far less elaborate. Methods: To provide insights into the longitudinal protein signatures of well-being, we profiled blood plasma collected over one year from 101 clinically healthy individuals using multiplexed antibody assays. After applying an antibody validation scheme, we utilized > 700 protein profiles for in-depth analyses of the individuals’ short-term health trajectories. Findings: We found signatures of circulating proteomes to be highly individual-specific. Considering technical and longitudinal variability, we observed that 49% of the protein profiles were stable over one year. We also identified eight networks of proteins in which 11–242 proteins covaried over time. For each participant, there were unique protein profiles of which some could be explained by associations to genetic variants. Interpretation: This observational and non-interventional study identifyed noticeable diversity among clinically healthy subjects, and facets of individual-specific signatures emerged by monitoring the variability of the circulating proteomes over time. To enable more personal hence precise assessments of health states, longitudinal profiling of circulating proteomes can provide a valuable component for precision medicine approaches. Funding: This work was supported by the Erling Persson Foundation, the Swedish Heart and Lung Foundation, the Knut and Alice Wallenberg Foundation, Science for Life Laboratory, and the Swedish Research Council.
The development and progression of cardiovascular disease (CVD) can mainly be attributed to the narrowing of blood vessels caused by atherosclerosis and thrombosis, which induces organ damage that will result in end-organ dysfunction characterized by events such as myocardial infarction or stroke. It is also essential to consider other contributory factors to CVD, including cardiac remodelling caused by cardiomyopathies and co-morbidities with other diseases such as chronic kidney disease. Besides, there is a growing amount of evidence linking the gut microbiota to CVD through several metabolic pathways. Hence, it is of utmost importance to decipher the underlying molecular mechanisms associated with these disease states to elucidate the development and progression of CVD. A wide array of systems biology approaches incorporating multi-omics data have emerged as an invaluable tool in establishing alterations in specific cell types and identifying modifications in signalling events that promote disease development. Here, we review recent studies that apply multi-omics approaches to further understand the underlying causes of CVD and provide possible treatment strategies by identifying novel drug targets and biomarkers. We also discuss very recent advances in gut microbiota research with an emphasis on how diet and microbial composition can impact the development of CVD. Finally, we present various biological network analyses and other independent studies that have been employed for providing mechanistic explanation and developing treatment strategies for end-stage CVD, namely myocardial infarction and stroke.
Objectives: Immunohistochemistry is an important extension to clinical information and morphology, and prevails as an invaluable tool for establishing a correct cancer diagnosis in clinical diagnostic pathology. The applicability of immunohistochemistry is limited by the availability of validated cell- and cancer-type specific antibodies, rendering an unmet need to discover, test, and validate novel markers. The SATB2 protein is selectively expressed in glandular cells from the lower gastrointestinal tract and expression is retained in a large majority of primary and metastatic colorectal cancers. Methods: We analyzed the expression of SATB2 in all clinical cases (n = 840), in which immunohistochemistry for detection of CK20 was deemed necessary for a final diagnosis. Results: SATB2 showed a high sensitivity (93%) and specificity (77%) to determine a cancer of colorectal origin and in combination with CK7 and CK20, the specificity increased to 100%. Conclusions: We conclude that SATB2 provides a new and advantageous supplement for clinical differential diagnostics.
Chromosome XV was one of the last two chromosomes of Saccharomyces cerevisiae to be discovered(1). It is the third-largest yeast chromosome after chromosomes XII and IV, and is very similar in size to chromosome VII. It alone represents 9% of the yeast genome (8% if ribosomal DNA is included). When systematic sequencing of chromosome XV was started, 93 genes or markers were identified, and most of them were mapped(2). However, very little else was known about chromosome XV which, in contrast to shorter chromosomes, had not been the object of comprehensive genetic or molecular analysis. It was therefore decided to start sequencing chromosome XV only in the third phase of the European Yeast Genome Sequencing Programme, after experience was gained on chromosomes III, XI and II (refs 3-5). The sequence of chromosome XV has been determined from a set of partly overlapping cosmid clones derived from a unique yeast strain, and physically mapped at 3.3-kilobase resolution before sequencing. As well as numerous new open reading frames (ORFs) and genes encoding tRNA or small RNA molecules, the sequence of 1,091,283 base pairs confirms the high proportion of orphan genes and reveals a number of ancestral and successive duplications with other yeast chromosomes.
The intermediate filament protein nestin is expressed during embryonic development, but considered largely restricted to areas of regeneration in the adult. Here, we perform a body-wide transcriptome and protein-profiling analysis to reveal that nestin is constitutively, and highly-selectively, expressed in adult human endothelial cells (EC), independent of proliferative status. Correspondingly, we demonstrate that it is not a marker for tumour EC in multiple malignancy types. Imaging of EC from different vascular beds reveals nestin subcellular distribution is shear-modulated. siRNA inhibition of nestin increases EC proliferation, and nestin expression is reduced in atherosclerotic plaque neovessels. eQTL analysis reveals an association between SNPs linked to cardiovascular disease and reduced aortic EC nestin mRNA expression. Our study challenges the dogma that nestin is a marker of proliferation, and provides insight into its regulation and function in EC. Furthermore, our systems-based approach can be applied to investigate body-wide expression profiles of any candidate protein.
Changes in the endothelium of the cerebral vasculature can contribute to inflammatory, thrombotic, and malignant disorders. The importance of defining cell-type-specific genes and their modification in disease is increasingly recognized. Here, we develop a bioinformatics-based approach to identify normal brain cell-enriched genes, using bulk RNA sequencing (RNA-seq) data from 238 normal human cortex samples from 2 independent cohorts. We compare endothelial cell-enriched gene profiles with astrocyte, oligodendrocyte, neuron, and microglial cell profiles. Endothelial changes in malignant disease are explored using RNA-seq data from 516 lower-grade gliomas and 401 glioblastomas. Lower-grade gliomas appear to be an "endothelial intermediate'' between normal brain and glioblastoma. We apply our method for the prediction of glioblastoma-specific endothelial biomarkers, providing potential diagnostic or therapeutic targets. In summary, we provide a roadmap of endothelial cell identity in normal and malignant brain, using a method developed to resolve bulk RNA-seq into constituent cell-type-enriched profiles.
BACKGROUND: Special AT-rich sequence-binding protein 2 (SATB2) is a novel diagnostic marker of colorectal cancer (CRC), and loss of SATB2 has been linked to poor survival from the disease. In this study, we validated the prognostic ability of SATB2 expression in a large, prospective CRC cohort. METHODS: Immunohistochemical SATB2 expression was assessed in 527 incident CRC cases from the Malmo Diet and Cancer Study. Kaplan-Meier analysis and Cox proportional hazards modelling were used to explore the impact of SATB2 expression on cancer-specific survival (CSS) and overall survival (OS). RESULTS: High SATB2 expression was associated with a prolonged CSS in the full cohort (hazard ratio (HR)=0.61; 95% CI 0.41-0.92) and in colon cancer (HR=0.39; 95% CI 0.20-0.75), remaining significant in multivariable analysis of colon cancer (HR 0.49; 95% CI 0.25-0.96), with similar findings for OS. In curatively resected stage III-IV patients, a significant benefit from adjuvant and/or neoadjuvant therapy was observed for SATB2 high tumours (P-interaction 0.037 for OS) and high SATB2 expression in rectal cancer correlated with an enhanced effect of neoadjuvant therapy (P-interaction 0.033 for OS). CONCLUSION: High SATB2 expression is an independent marker of good prognosis in colon cancer and may modulate sensitivity to chemotherapy and radiation. British Journal of Cancer (2012) 106, 931-938. doi:10.1038/bjc.2012.34 www.bjcancer.com Published online 14 February 2012
AThe combination of immuno-based methods and mass spectrometry detection has great potential in the field of quantitative proteomics. Here, we describe a new method (immuno-SILAC) for the absolute quantification of proteins in complex samples based on polyclonal antibodies and stable isotope-labeled recombinant protein fragments to allow affinity enrichment prior to mass spectrometry analysis and accurate quantification. We took advantage of the antibody resources publicly available from the Human Protein Atlas project covering more than 80% of all human protein-coding genes. Epitope mapping revealed that a majority of the polyclonal antibodies recognized multiple linear epitopes, and based on these results, a semi-automated method was developed for peptide enrichment using polyclonal antibodies immobilized on protein A-coated magnetic beads. A protocol based on the simultaneous multiplex capture of more than 40 protein targets showed that approximately half of the antibodies enriched at least one functional peptide detected in the subsequent mass spectrometry analysis. The approach was further developed to also generate quantitative data via the addition of heavy isotope-labeled recombinant protein fragment standards prior to trypsin digestion. Here, we show that we were able to use small amounts of antibodies (50 ng per target) in this manner for efficient multiplex analysis of quantitative levels of proteins in a human HeLa cell lysate. The results suggest that polyclonal antibodies generated via immunization of recombinant protein fragments could be used for the enrichment of target peptides to allow for rapid mass spectrometry analysis taking advantage of a substantial reduction in sample complexity. The possibility of building up a proteome-wide resource for immuno-SILAC assays based on publicly available antibody resources is discussed.
An important issue for molecular biology is to establish if transcript levels of a given gene can be used as proxies for the corresponding protein levels. Here, we have developed a targeted proteomics approach for a set of human non-secreted proteins based on Parallel Reaction Monitoring to measure, at steady-state conditions, absolute protein copy numbers across human tissues and cell lines and compared these levels with the corresponding mRNA levels using transcriptomics. The study shows that the transcript and protein levels do not correlate well unless a gene-specific RNA-to-protein (RTP) conversion factor independent of the tissue-type is introduced, thus significantly enhancing the predictability of protein copy numbers from RNA levels. The results show that the RTP-ratio varies significantly with a few hundred copies per mRNA molecule for some genes to several hundred thousands protein copies per mRNA molecule for others. In conclusion, our data suggests that transcriptome analysis can be used as a tool to predict the protein copy numbers per cell, thus forming an attractive link between the field of genomics and proteomics.
An important issue for molecular biology is to establish whether transcript levels of a given gene can be used as proxies for the corresponding protein levels. Here, we have developed a targeted proteomics approach for a set of human non-secreted proteins based on parallel reaction monitoring to measure, at steady-state conditions, absolute protein copy numbers across human tissues and cell lines and compared these levels with the corresponding mRNA levels using transcriptomics. The study shows that the transcript and protein levels do not correlate well unless a gene-specific RNA-to-protein (RTP) conversion factor independent of the tissue type is introduced, thus significantly enhancing the predictability of protein copy numbers from RNA levels. The results show that the RTP ratio varies significantly with a few hundred copies per mRNA molecule for some genes to several hundred thousands of protein copies per mRNA molecule for others. In conclusion, our data suggest that transcriptome analysis can be used as a tool to predict the protein copy numbers per cell, thus forming an attractive link between the field of genomics and proteomics.
Here, we have used a resource of 26,000 recombinant protein fragments to create custom libraries of standards for targeted proteomics based on parallel reaction monitoring (PRM). The recombinant fragments can be produced in a bacterial cell factory to generate heavy isotope labeled standards for absolute quantification of the corresponding protein targets and be used to produce high- quality spectral libraries. Altogether, coordinates for 25,684 unique proteotypic peptide assays have been experimentally defined covering 10,163 human proteins. The protocol allows for precise monitoring of digestion kinetics and thus enables to select peptides that behave quantitative during the sample preparation process. We show that the quantification tag of each recombinant protein fragment can be used for accurate retention time prediction and allows for assay standardization across different method parameters. The use of this resource was illustrated by determining the absolute concentrations of selected protein targets using multiplex targeted proteomics assays for determination of quantitative assessment of 49 protein targets in serum samples.
The availability of proteomics resources hosting protein and peptide standards, as well as the data describing their analytical performances, will continue to enhance our current capabilities to develop targeted proteomics methods for quantitative biology. This study describes the analysis of a resource of 26,840 individually purified recombinant protein fragments corresponding to more than 16,000 human protein-coding genes. The resource was screened to identify proteotypic peptides suitable for targeted proteomics efforts, and we report LC-MS/MS assay coordinates for more than 25,000 proteotypic peptides, corresponding to more than 10,000 unique proteins. Additionally, peptide formation and digestion kinetics were, for a subset of the standards, monitored using a time-course protocol involving parallel digestion of isotope-labeled recombinant protein standards and endogenous human plasma proteins. We show that the strategy by adding isotope-labeled recombinant proteins before trypsin digestion enables short digestion protocols (<= 60 min) with robust quantitative precision. In a proof-of-concept study, we quantified 23 proteins in human plasma using assay parameters defined in our study and used the standards to describe distinct clusters of individuals linked to different levels of LPA, APOE, SERPINAS, and TFRC. In summary, we describe the use and utility of a resource of recombinant proteins to identify proteotypic peptides useful for targeted proteomics assay development.
There is a need for standardized validation methods for antibody specificity and selectivity. Recently, five alternative validation pillars were proposed to explore the specificity of research antibodies using methods with no need for prior knowledge about the protein target. Here, we show that these principles can be used in a streamlined manner for enhanced validation of research antibodies in Western blot applications. More than 6,000 antibodies were validated with at least one of these strategies involving orthogonal methods, genetic knockdown, recombinant expression, independent antibodies, and capture mass spectrometry analysis. The results show a path forward for efforts to validate antibodies in an application-specific manner suitable for both providers and users.
There is a great need for standardized validation methods for antibody specificity and selectivity. Here, we describe the use of orthogonal methods in which the specificity of an antibody in a particular application is determined based on correlation of protein abundance across several samples using an antibody-independent method. We show that pair-wise correlation between orthogonal samples can be used to score the specificity of antibodies in a standardized manner using a test panel of human cell lines. Here, we investigated two independent methods for validation of antibodies in Western blot applications, namely transcriptomics and targeted proteomics and we show that the two methods yield similar, but not identical results. The orthogonal methods can also be used to investigate on- and off- target binding for antibodies with multiple bands in the Western blot assay. In conclusion, orthogonal methods for antibody validation provide an attractive strategy for systematic validation of antibodies in a quantitative manner.