A multi-omic analysis reveals the esophageal dysbiosis as the predominant trait of eosinophilic esophagitis
Journal of Translational Medicine volume 21, Article number: 46 (2023)
Eosinophilic esophagitis (EoE) is a chronic immune-mediated rare disease, characterized by esophageal dysfunctions. It is likely to be primarily activated by food antigens and is classified as a chronic disease for most patients. Therefore, a deeper understanding of the pathogenetic mechanisms underlying EoE is needed to implement and improve therapeutic lines of intervention and ameliorate overall patient wellness.
RNA-seq data of 18 different studies on EoE, downloaded from NCBI GEO with faster-qdump (https://github.com/ncbi/sra-tools), were batch-corrected and analyzed for transcriptomics and metatranscriptomics profiling as well as biological process functional enrichment. The EoE TaMMA web app was designed with plotly and dash. Tabula Sapiens raw data were downloaded from the UCSC Cell Browser. Esophageal single-cell raw data analysis was performed within the Automated Single-cell Analysis Pipeline. Single-cell data-driven bulk RNA-seq data deconvolution was performed with MuSiC and CIBERSORTx. Multi-omics integration was performed with MOFA.
The EoE TaMMA framework pointed out disease-specific molecular signatures, confirming its reliability in reanalyzing transcriptomic data, and providing new EoE-specific molecular markers including CXCL14, distinguishing EoE from gastroesophageal reflux disorder. EoE TaMMA also revealed microbiota dysbiosis as a predominant characteristic of EoE pathogenesis. Finally, the multi-omics analysis highlighted the presence of defined classes of microbial entities in subsets of patients that may participate in inducing the antigen-mediated response typical of EoE pathogenesis.
Our study showed that the complex EoE molecular network may be unraveled through advanced bioinformatics, integrating different components of the disease process into an omics-based network approach. This may implement EoE management and treatment in the coming years.
Eosinophilic esophagitis (EoE) is a chronic inflammatory disease characterized by a T-Helper type 2 (TH2) inflammatory response, primarily induced by food antigens, resulting in an accumulation of eosinophils within the esophageal mucosa. The TH2-response-specific interleukins were described to play a pivotal role in EoE pathogenesis . To date, EoE is no longer reported as a rare disease since its incidence and prevalence rates are steadily increasing, being 3.7/100,000/year and 22.7/100,000 respectively , with a strong male predominance with a 3:1 ratio . Several risk factors are associated with EoE, including allergic/atopic conditions, environmental factors, and lack of Helicobacter (H). Pylori infection . However, the genetic predisposition to the disease has been proven, with up to 58% concordance in monozygotic twins .
EoE clinical presentation is heterogeneous, with dysphagia and food impaction as the most common symptoms in adults, while heartburn, regurgitation, and feeding intolerance are typical in children. Moreover, a wide range of other symptoms may overlap at any age, such as vomiting, nausea, and chest and/or abdominal pain .
At present, endoscopy is the only way to diagnose and monitor the activity of the EoE. Specifically, the diagnosis is defined by a count of > 15 eosinophils per High power field (HPF) at histological evaluation of esophageal biopsies, whereas therapeutic response and remission are defined by a count of ≤ 15 or < 6 eosinophils per HPF, respectively . However, endoscopy remains an invasive diagnostic tool, with low acceptability from the patients and limited availability in clinical practice. With this premise and taking into account the heterogeneity of the clinical onset, the development of new “non-endoscopic” tools aiding diagnostic assessment may be helpful .
Therefore, a deeper understanding of the pathogenetic mechanisms underlying EoE remains of paramount importance in order to implement and improve therapeutic lines of intervention and ameliorate overall patient wellness.
Along with the mucosal immunity alterations and fibrosis , the epithelial barrier function impairment does remain a hallmark of EoE, in terms of both increased permeability and antigen sensing and presentation roles. Indeed, EoE is featured by a striking pattern of dilated intercellular spaces, with the down-regulation of proteins associated with barrier function and adhesion molecules modulated via an IL-13-dependent mechanism . As a consequence, altered epithelial permeability can lead to a permissive environment that enhances antigen presentation, which in turn leads to persistent chronic inflammation, associated with microbiota dysbiosis . Despite some anatomical and molecular characterizations employed as a kind of gold standard for the definition of EoE pathogenesis, a meta-analysis of the molecular studies may improve the understanding of the physiopathology of the disease and define molecular markers helpful for a straightforward diagnosis of EoE.
We recently developed and released the Inflammatory Bowel Disease (IBD) Transcriptome and Metatranscriptome Meta-Analysis web app (IBD TaMMA, https://ibd-meta-analysis.herokuapp.com/) , a complete survey of all public data sets generated for IBD-related studies.
Considering the utility that IBD TaMMA has been increasingly showing over the last few months  and the continuous access to the platform recorded by Google Analytics (1.8K individual users in October 2022), we here propose a similar meta-analysis of EoE-related public data sets visualized in the EoE TaMMA interactive web app, (https://eoe-meta-analysis.herokuapp.com/; username: ungaro; password: steams).
EoE TaMMA, while confirming well-accepted EoE molecular characteristics, pointed out for the first time that esophageal dysbiosis is the main trait in EoE pathogenesis. Of note, this computational platform may become a precious resource for all clinicians and scientists to expedite discoveries in the field and ameliorate the overall understanding of EoE pathophysiology, which urgently needs further implementation.
Materials and methods
All authors had access to the study data and reviewed and approved the final manuscript.
RNA-seq data were downloaded from NCBI GEO (18 different studies, listed within the EoE TaMMA web app, in the metadata tab > analysis overview subtab, and metadata tab > sample characteristics subtab; Fig. 1A, B, and Additional file 2: Table S1) with faster-qdump (https://github.com/ncbi/sra-tools). FASTQ sequencing reads were adaptor-trimmed and quality-filtered with Trimmomatic , prior to mapping to the hg38 human reference genome with STAR . Gene count normalization and differential gene expression were performed with DESeq2 . Functional enrichment analysis was done with GeneSCF . Low-dimensional embedding of high-dimensional data was performed by either Uniform Manifold Approximation and Projection (UMAP) or t-distributed stochastic neighbor embedding (t-SNE) machine learning algorithms, within R (https://cran.r-project.org/web/packages/umap).
Statistical significance was set at FDR < 1e−5.
Web app design
The web app, available at https://eoe-meta-analysis.herokuapp.com/ (username: ungaro; password: steams), was designed with plotly and dash (https://plotly.com/dash/). Processed data and code are hosted by GitHub and are available at https://github.com/DU-omics/eoe-meta-analysis_data and https://github.com/DU-omics/eoe-meta-analysis. The complete guide and the related documentation are available at https://ibd-tamma.readthedocs.io/. The user interface is described in Additional file 1: Fig. S1.
Newly released EoE datasets will be continuously searched and implemented in this platform to maintain it as timely updated. A suggestion link is available at the bottom of the home page of the platform for the users to notify newly released data sets.
Batch-effect detection and correction were performed as previously described [9, 15], in accordance with source (batch covariate) and tissue of origin (explaining other possible covariance), with ComBat , within the Surrogate Variable Analysis v1.8 R package (https://bioconductor.org/packages/release/bioc/html/sva.html).
Metatranscriptomics was performed as previously described [9, 15]. The reads that failed to align to the human genome were subsequently mapped to the complete collection of all available microbial genomes (https://www.ncbi.nlm.nih.gov/genome) with Kraken2 for exact alignment of k-mers and accurate viral read classification . Relative abundances and differential analysis was performed with DESeq2 upon variance-stabilizing transformation . Microbial read calls were confirmed by manually aligning Kraken2-classified reads to the respective viral genomes with Bowtie2, and visualizing the resulting BAM alignments with the Integrative Genomics Viewer (IGV) . Before statistical analysis, classified reads were double-checked with FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc) to confirm quality filtering and adaptor trimming and then submitted to BLAST  to exclude possible artifacts resulting from the in silico analysis. Species alpha diversity and dominance indices were calculated with vegan (https://cran.r-project.org/web/packages/vegan). The statistical significance threshold was set at P < 0.05.
Single-cell RNA-seq data analysis and deconvolution
Tabula Sapiens raw data  were downloaded from the UCSC Cell Browser (https://cells.ucsc.edu). Esophageal single-cell raw data were downloaded from https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE175930 . The analysis was performed within the Automated Single-cell Analysis Pipeline (ASAP) . Single-cell data-driven bulk RNA-seq data deconvolution was performed with MuSiC and CIBERSORTx [23, 24].
Multi-omics factor analysis
The different omics datasets were integrated with the Multi-Omics Factor Analysis (MOFA) framework  which interprets multi-layer (different data modalities) high-dimensional data and infers an interpretable low-dimensional representation in terms of a few latent factors. Variance stabilization and z-scoring, followed by feature selection to select the most informative variables, namely those explaining more variance, were performed to ensure that all the molecular layers were equally represented. Variance decomposition was then performed between groups to find differences in terms of variance explained within factors and groups, thus stratifying patients into cohorts, each one of them displaying a specific molecular signature.
The EoE TaMMA web app identifies EoE-specific markers
EoE aetiopathogenesis is not fully explained, even if a major shift toward antigen-mediated TH2 response has been accepted as the most relevant characteristic . Although some RNA-seq studies have been performed, the complete survey of all transcriptomics collections to advance EoE-related research has not been compiled yet. For this purpose, we analyzed and batch-corrected a total of 18 different studies, including 660 samples from esophageal mucosa and blood, combined into the EoE TaMMA web app (Fig. 1A and B).
We included blood and esophageal tissues from EoE and GERD patients and IBD-derived blood samples. Of note, we included also IBD samples because it helped to better correct the batch variability, a normal consequence of the combination of different studies coming from a variety of data sources generated by different operators, sequencers, and analytic platforms [9, 10]. However, IBD characterization was not shown but can be fully browsed at the dedicated platform (IBD TaMMA). After batch correction, esophagus and blood-derived samples appeared as two distinct clusters (Fig. 1C), despite the different study sources (Fig. 1D), indicating that the correction approach was effective in rendering samples harmonized and comparable. Differential gene expression (DGE) analysis revealed 533 and 504 genes up- and down-regulated, respectively, in the EoE esophagus by comparison with the control (Fig. 2A).
Since the role of TH2 cytokines is key in EoE pathogenesis, we specifically evaluated the expression of interleukin (IL)13, IL4, IL5, and their receptors . According to a previously published EoE single-cell (sc)RNA-seq , IL13 and IL5 were broadly expressed by pathogenic effector GATA-3 TH2 cells, expanded in EoE tissue biopsies  (Additional file 1: Fig. S2A, and Additional file 1: Fig. S3M and 3P). IL4 was expressed by Treg exclusively (Additional file 1: Fig. S3D and 3P), while IL4 receptor by both the stromal and immune compartments (Additional file 1: Fig. 3E, J, M–O). Additionally, IL5RA resulted as expressed by all myeloid cells, among which CLC-expressing eosinophils (Additional file 1: Fig. 3F, N, O).
In EoE TaMMA, IL13 was the sole confirmed as upregulated in the EoE esophagus as compared to the control, while IL5, IL4, and IL13, IL4, and IL5 receptors were not significantly modulated, although a trend was observed (Fig. 2B). Additionally, IL13 did not result in a specific trait of EoE when compared with GERD-derived samples (Fig. 2B and Additional file 1: Fig. 4A). This evidence might support the difficulties in a straightforward diagnosis for patients with EoE and GERD-shared symptoms .
Nonetheless, we sought to further characterize and confirm EoE-related traits in our TaMMA platform. IL13 is known as a mediator of a series of processes in allergic diseases, such as eosinophil chemotaxis, epithelial (goblet) cell proliferation, collagen deposition, and smooth muscle contractility , thus prompting us to evaluate these features in EoE esophagi. Therefore, by gene ontology (GO) analysis, we observed biological processes related to epithelial cell proliferation, smooth muscle cell migration, proliferation, differentiation, extracellular matrix remodeling, and chemotaxis to be modulated in EoE by comparison with the control tissues (Fig. 2C).
Interestingly, we also found these biological signatures to be modulated when EoE tissues were compared to the GERD (Fig. 2D), indicating that the EoE pathogenesis is different from the GERD concerning these aspects.
We then evaluated which genes were involved in these biological process alterations and distinguished EoE from GERD in terms of expression levels. Besides the already known factor CCL26 (Eotaxin-3) expressed by stromal and epithelial cells (Additional file 1: Fig. S3G, 3J and 3L) known to regulate the eosinophilic trafficking to the esophagus in patients with EoE and to discriminate between EoE and GERD , other markers were pointed out, such as CXCL14, PDGFRA, CXCL12, ACVRL1, POSTN, NOX4 and LTBP4 (Fig. 2E). These results provided evidence that a composite panel of markers specific to EoE may be developed to make the diagnosis more accurate.
Furthermore, IL13 was acknowledged as a factor that induces calpain 14 (CAPN14) expression (Additional file 1: Fig. S3H and 3J), which affects the epithelial barrier through the degradation of the desmosomal protein desmoglein 1 (DSG1) . Our analysis revealed increased CAPN14 expression in the EoE esophagus by comparison with the control, while DSG1 was found down-regulated. (Fig. 2F, G), supporting the inverse relationship existing between these two proteins in the epithelial barrier . Even if the bulk sequencing data are key for understanding the molecular process in a biological system, the great limitation remains the unavailability of information regarding the proportion of cell types within a sample. Nonetheless, in recent decades, approaches like computational deconvolution of single-cell RNA-seq data have been developed and optimized to obtain such information starting from whole tissue expression profiling data . Deconvolution is a time and cost-efficient approach for obtaining cell type-specific information from bulk gene expression of heterogeneous tissues, providing an estimation of cell-type proportions or abundances in samples.
To this end, we exploited Tabula Sapiens, a multiple-organ, single-cell transcriptomic atlas of human tissues . The analysis performed on the EoE TaMMA data confirmed the increased proportion of T helper cells, described as part of the EoE pathogenic process , by comparison with both the healthy and GERD tissues.
Furthermore, considering the role of invariant (i)NKT cells, also known as classical NKT cells, during EoE pathogenesis , we verified and confirmed their increased proportion specifically in EoE tissues (Additional file 1: Fig. S5A–F).
Tissue remodeling by increased collagen deposition, matrix disassembly, and epithelial-to-mesenchymal (EMT) transition are phenomena that lead to the peculiar fibrostenotic aspect of an EoE esophagus . During fibrotic complications, epithelia lose many characteristics, such as polarity, specific markers, and tight junctions, and acquire properties of mesenchymal cells, including motility, loose cell adhesion via N-cadherin, and de-polarized cytoskeletal arrangements such as vimentin . Consistently, we observed an increased proportion of mesenchymal stem cells (MSC) in EoE compared to GERD and healthy samples (Fig. 3A–C), thus confirming the pro-fibrotic status of the esophagus in EoE conditions. This finding may have implications for developing prognostic molecular markers predicting the risk of fibrostenosis in EoE patients.
These data were also paralleled by the dysregulation of biological processes related to the transforming growth factor beta (TGFB), which was found to increase in the EoE by comparison with both the healthy and the GERD tissues (Fig. 3D and E), with specific markers distinguishing between EoE and GERD (Fig. 2E, specifically: WNT2, ACVRL1, POSTN, NOX4, LEFTY2, GDF5, and LTBP4), supporting the notion that the tissue remodeling and fibrotic process are associated with EoE pathogenesis .
Overall, these results pinpointed the EoE TaMMA web app as a reliable tool, evidencing the main hallmarks of EoE, often different from GERD, and thus resulting in a powerful asset for expediting research with novel insights into both pathogenesis and approaches for a more accurate diagnosis of EoE.
EoE TaMMA reveals microbiota dysbiosis as a predominant characteristic during EoE pathogenesis
As mentioned above, EoE TaMMA provides a wide picture of omics profiling of EoE tissues, not only confirming the already known molecular landscape associated with EoE but also pointing out new insights for further investigation of their complex pathogenesis. For instance, among all the markers that were pointed out as specifically determining EoE (Fig. 2 and Additional file 1: Fig. S1), CXCL14, a chemoattractant chemokine expressed by the epithelium, stromal cells and by monocytes (Additional file 1: Fig. S3I, 3J, 3L, and 3R), gained our attention because of its documented antimicrobial activities against pathogens  and its higher level in EoE compared with both control and GERD, suggesting possible EoE-specific microbial signatures different from the GERD.
Thus, going deeper into the microbiota profiling, EoE TaMMA pointed out the bacterial species as the most differentially dysregulated microbial entities among EoE, GERD, and control esophageal tissues (Fig. 4A and B and Additional file 1: Fig. S6A–6F).
We then intersected the bacterial species highly abundant in EoE by comparison with the healthy or GERD and identified the 9 candidates specifically characterizing the EoE esophagus (Additional file 1: Fig. S6G). The most abundant were the Streptococcus mitis and Hemophilus parainfluenzae (Fig. 4C), normally colonizing the oropharynx tract and already reported as being associated with EoE pathogenesis . Moreover, increased bacterial diversity but no species dominance was found in EoE esophagi as compared to controls (Fig. 4D), despite previous studies reporting no differences between these experimental groups [7, 36,37,38]. Such a discrepancy might be explained by the former small-sized samples and the consequent lower statistical power that might have contributed to the loss of significant signals that, by contrast, our analysis pointed out.
This is an example of how this computational approach may be exploited to highlight specific signatures. Nevertheless, dissecting each omic at a time would be a huge effort, and many statistically relevant details could be lost.
Multi-omics approaches, often supported by machine-learning algorithms , are facilitating the discovery of new molecular networks and hubs by comprehensively and simultaneously analyzing different data layers, such as the human transcriptome and metatranscriptome . Also, it can allow the identification of the origin of patient heterogeneity, ultimately stratifying them based on their molecular characteristics. Indeed, this methodological approach mitigates intersubject variability thanks to the discovery of the principal sources of variation in multi-omics data sets. In this regard, the possibility to perform such an analysis recently became effective thanks to the machine learning-based tool Multi-Omics Factor Analysis (MOFA). MOFA infers a set of (hidden) factors that capture biological and technical sources of variability .
Therefore, by applying MOFA for processing the six different types of omics data, encompassing the human transcriptome, virome, eukaryome (fungi and protists), bacteriome, and archaeome from EoE and healthy control samples (Additional file 1: Fig. S7A), the source of variation between the EoE and healthy (control) esophageal mucosa was identified mainly among the metatranscriptomics (microbiome) factors. Specifically, a subset of archaea, fungi, protozoa, and viral species and, to a lesser extent some human transcripts, allowed the development of 4 multi-layers molecular signatures able to distinguish EoE patients from controls (Fig. 5A), indicating that microbial dysbiosis may be a key player during the EoE pathogenesis.
Going deeper into the analysis, the primary source of variance between EoE and control was found at the level of factor 1 mainly in the esophageal virome and archaeon composition and in the blood bacteriome and mycome (fungi, Fig. 5B–F). The top features within factor 1, showing a high impact (weight) in explaining the variance, were the Staphylococcus virus Andhra (Fig. 5C and C′), the Sulfodiicoccus acidophilus, and the Nitrosopumilus sp. K4 (Fig. 5D and D′), the Staphylococcus aureus and the Pasteurella multocida (Fig. 5E and E′), and the Malassezia restricta (Fig. 5F and F′). Besides the factor 1-driven stratum of patients, the factor 2-driven defined another subset of human subjects with EoE where the Plasmodium knowlesi explained the majority of the variance in the protozoa profiling of the blood (Additional file 1: Fig. S7B and 7B′), while factor 3 was featured by the Proteus virus Isfahan, explaining most of the variance in this stratum of patients (Additional file 1: Fig. S7C and 7C′).
Of note, since Staphylococcus virus Andhra parasitizes Staphylococci we checked the levels of these bacteria, but no differences in the esophagi were found (Fig. 6A).
Regarding the Proteus species (i.e., mirabilis), they are known to be parasitized by the Proteus virus Isfahan . Thus, we wondered whether some Proteus species could change their levels according to the Proteus virus Isfahan abundance. Interestingly, Proteus vulgaris was pointed out as highly abundant in EoE blood by comparison with the control, while no differences in the esophagus were found (Fig. 6B).
Based on these pieces of evidence, we can speculate that the presence of defined classes of microbial entities in specific subsets of patients may participate in inducing the antigen-mediated response typical of EoE pathogenesis.
EoE is a complex, clinically heterogeneous disease where many factors have been proposed to interact with each other and lead to chronically inflamed esophageal mucosa, with upper gastrointestinal symptoms that range from dysphagia to esophageal food impaction .
Even if some treatments are available, EoE remains a chronic disease, compromising the overall patients’ quality of life. Thus, having more mechanistic details of EoE pathogenesis may enable and support the development of new therapeutic lines of intervention. Moreover, molecular profiling may pave the road for further clinical phenotyping of EoE, ultimately reflecting better-personalized care.
In recent years, the analysis of different molecular aspects in the same patients with complex pathologies has often become one of the most powerful scientific approaches that have led, in some cases, to the discovery of disease-combined characteristics that remained hidden for a long time .
For this purpose, we recently released the IBD TaMMA framework , which is currently exploited worldwide as a support for research and has already led to new scientific outputs further dissecting the complexity and heterogeneity of IBD pathogenesis [9, 41, 44]. Hence, we sought to create a similar computational framework for EoE by surveying the already published transcriptomics studies. The EoE TaMMA framework confirmed some well-accepted EoE traits and can therefore be considered a useful and reliable web app and a resource for fostering novel fields of research. EoE TaMMA confirmed IL13 upregulation in the EoE esophagus by comparison with the control, while IL5, IL4, and the IL13, IL4, and IL5 receptors were not significantly modulated. IL13 was associated with EoE pathogenesis from a clinical standpoint [25, 26], and the IL13 signaling inhibition was described as successful in the treatment of this disorder . In this regard, we can speculate that Dupilumab, a monoclonal antibody against the IL4 receptor, mediating both IL4 and IL13 pathways, was found effective in phase 2 randomized trial of EoE patients  by interfering with the IL13, rather than the IL4 signaling, justifying the absence of statistical significance in the IL4 modulation in our platform. Further clarifications will come from other transcriptomics studies with larger sample sizes that, once they will be available to the scientific community, will be integrated within the web app to better explain the role of IL4 and IL5 in EoE pathogenesis. Indeed, despite some pieces of evidence in experimental models of EoE describing IL4 and IL5 as possible actors in EoE [25, 46,47,48], the direct link between the pathogenesis and their specific roles has not been uncovered yet . This may explain the lack of statistical significance in the differential expression of these two factors between EoE and control esophagi that certainly need further investigations and demonstration in future experimental models of EoE.
After pursuing an optimal batch correction, EoE TaMMA helped to draw molecular and biological signatures able to distinguish EoE from GERD, often sharing symptomatic esophageal patterns. Therefore, such a specific profile, including increased levels of CXCL14, PDGFRA, CXCL12, ACVRL1, POSTN, NOX4, and LTBP4 in EoE by comparison with the GERD and control samples, may guide the correctness of diagnosis and might help to design accurate diagnostic panels. Future studies will clarify whether this may help the identification of clinically relevant phenotypes of EoE (refractory or aggressive fibrostenotic forms), which may benefit from personalized therapeutic approaches.
Although the microbiota has been already considered a player in EoE pathogenesis, so far no indication regarding its high impact on EoE pathogenesis was provided. Our MOFA-driven multi-omics revealed that the 4 factor-based patient stratification was mainly characterized by differential microbiota compositions. Besides the well-known bacterial dysbiosis, the other microbial components (archaea, fungi, viruses, protozoa) were unveiled to be part of patient microbiota compositions that might drive patient stratification in future studies on EoE-affected cohorts.
The results obtained indicated two main pieces of evidence: (i) MOFA is useful for characterizing the source of diversity in patients with gastrointestinal diseases; (ii) patients can be stratified by MOFA-identified factors defining sub-cohort of patients displaying stratum-specific molecular signatures, that we sought to explain. For example, the Staphylococcus virus Andhra was shown to act as an antimicrobial commensal by inhibiting the growth and degrading the cell walls of diverse Staphylococci . Its abundance, higher in the control than in the EoE (Fig. 5C) might be consistent with its protective function against detrimental commensals, such as other Staphylococci found highly abundant in the EoE blood by comparison with the control. Interestingly, in the esophagi no differences in these bacterial species levels were found (Fig. 6A), supporting the indication of the EoE as a systemic rather than a local disease .
Similarly, the Proteus virus Isfahan acts as a lytic Proteus phage active against planktonic and biofilms of Proteus mirabilis . Proteus species, low-abundance commensals of the human gut, possess many virulence factors that have been recently proposed as relevant to gastrointestinal disease pathogenesis and associated with alterations of gut motility and adherence . Notably, from our analysis, Proteus vulgaris was pointed out as highly abundant in EoE blood by comparison with the control, while no differences in the esophagus were found (Fig. 6B) These results suggested that blood virome dysbiosis might enhance the expansion of pathogenic bacteria that in turn promote EoE pathogenesis in a specific cluster of patients.
Sulfodiicoccus acidophilus and Nitrosopumilus sp. K4 are thermoacidophilic and ammonia-oxidizer archaea, respectively, poorly studied in the context of human disease pathogenesis. In recent years, the archaeome has been acquiring a great interest in the context of chronic intestinal inflammation and its dysbiosis has been reported to modulate mucosal homeostasis . Therefore, the investigation of archaeal dysbiosis associated with esophagitis is worthwhile. As reported in Fig. 5D′, the Sulfodiicoccus acidophilus and the Nitrosopumilus sp. K4 are increased and decreased, respectively, in EoE compared to the healthy samples. Their differential abundances may indicate their potential roles in EoE pathogenesis, despite the current lack of knowledge in the field. Further studies may better characterize the possible mechanisms driven by these archaeal species during the EoE pathogenesis.
The Staphylococcus aureus, and the Malassezia restricta, although not significantly modulated between EoE and healthy controls (Fig. 5E and F), may underlie the pathogenesis in specific cohorts of patients (Factor-1 driven patient stratum) by stimulating the allergic response through the release of antigenic proteins [55, 56], while the Pasteurella multocida may manipulate T cell differentiation through the release of specific toxins .
Similarly, we do not exclude that Plasmodium knowlesi, which emerged from our analysis as a specific feature of the factor 2-driven patient stratum, may act as a microbial commensal stimulating the allergic response in these patients, despite its classification as a zoonotic malaria parasite .
One major limitation of this meta-analysis at the moment is the intrinsic lack of other information, such as clinical metadata. Indeed, the transcriptomic studies included in the framework missed, for example, information on the treatment types, localization of the disease, if nonerosive esophageal reflux or GERD, or patients’ age, as well as many other important characteristics that, whether annotated, could have helped to assign to a cluster of patients specific clinical characteristics. Future implementation of the framework including this information may allow the patient clustering, virtually addressing them to tailored treatments.
Another intrinsic limitation of the platform is the small number of studies profiling the EoE samples if compared to those included in the IBD TaMMA, so we encourage scientists to perform transcriptomics experiments that will enable the platform to achieve much higher statistical power.
Despite these limitations, we believe the web app is helpful for other scientists who may use the EoE TaMMA-described features to foster new hypotheses and concepts for developing more accurate and personalized therapies.
Our study represents a step forward to possibly unravel patient heterogeneity through advanced bioinformatics, integrating different components of the disease process into an omics-based network approach that sought to unravel the molecular landscape of EoE patients and to solve its intricacy, with a promise of better patient management and treatment in a short-term future.
Availability of data and materials
Transcriptomic data are accessible at https://eoe-meta-analysis.herokuapp.com/. Processed data and code are available at https://github.com/DU-omics/eoe-meta-analysis_data/master/, and https://github.com/DU-omics/eoe-meta-analysis, respectively. Data, analytic methods, and study materials will be made available upon request to the authors.
Transcriptome and Metatranscriptome Meta-analysis
Gastroesophageal reflux disorder
T-Helper Type 2
High power field
Inflammatory Bowel Disease
Desmosomal protein desmoglein 1
C-X-C motif ligand
Transforming growth factor beta
Multi-Omics Factor Analysis
Furuta GT, Katzka DA. Eosinophilic esophagitis. N Engl J Med. 2015;373:1640–8.
Arias Á, Pérez-Martínez I, Tenías JM, Lucendo AJ. Systematic review with meta-analysis: the incidence and prevalence of eosinophilic oesophagitis in children and adults in population-based studies. Aliment Pharmacol Ther. 2016;43:3–15.
Mansoor E, Cooper GS. The 2010–2015 prevalence of eosinophilic esophagitis in the USA: a population-based study. Dig Dis Sci. 2016;61:2928–34.
Dellon ES, Peery AF, Shaheen NJ, Morgan DR, Hurrell JM, Lash RH, et al. Inverse association of esophageal eosinophilia with Helicobacter pylori based on analysis of a US pathology database. Gastroenterology. 2011;141:1586–92.
Kottyan LC, Davis BP, Sherrill JD, Liu K, Rochman M, Kaufman K, et al. Genome-wide association analysis of eosinophilic esophagitis provides insight into the tissue specificity of this allergic disease. Nat Genet. 2014;46:895–900.
Ma C, Schoepfer AM, Safroneeva E, COREOS Collaborators. Development of a core outcome set for therapeutic studies in eosinophilic esophagitis (COREOS): an international multidisciplinary consensus. Gastroenterology. 2021;161:748–55.
Facchin S, Calgaro M, Pandolfo M, Caldart F, Ghisa M, Greco E, et al. Salivary microbiota composition may discriminate between patients with eosinophilic oesophagitis (EoE) and non-EoE subjects. Aliment Pharmacol Ther. 2022;56:450–62.
Warners MJ, Oude Nijhuis RAB, de Wijkerslooth LRH, Smout AJPM, Bredenoord AJ. The natural course of eosinophilic esophagitis and long-term consequences of undiagnosed disease in a large cohort. Am J Gastroenterol. 2018;113:836–44.
Massimino L, Lamparelli LA, Houshyar Y, D’Alessio S, Peyrin-Biroulet L, Vetrano S, et al. The inflammatory bowel disease transcriptome and metatranscriptome meta-analysis (IBD TaMMA) framework. Nat Comput Sci. 2021;1:511–5.
Modos D, Thomas JP, Korcsmaros T. A handy meta-analysis tool for IBD research. Nat Comput Sci. 2021;1:571.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.
Dobin A, Gingeras TR. Optimizing RNA-Seq mapping with STAR. Methods Mol Biol. 2016;1415:245–62.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.
Subhash S, Kanduri C. GeneSCF: a real-time based functional enrichment tool with support for multiple organisms. BMC Bioinform. 2016;17:365.
Ungaro F, Massimino L, Furfaro F, Rimoldi V, Peyrin-Biroulet L, D’Alessio S, et al. Metagenomic analysis of intestinal mucosa revealed a specific eukaryotic gut virome signature in early-diagnosed inflammatory bowel disease. Gut Microbes. 2019;10:149–58.
Stein CK, Qu P, Epstein J, Buros A, Rosenthal A, Crowley J, et al. Removing batch effects from purified plasma cell gene expression microarrays with modified ComBat. BMC Bioinform. 2015;16:63.
Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014;15:R46.
Thorvaldsdóttir H, Robinson JT, Mesirov JP. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2013;14:178–92.
Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, Madden TL. NCBI BLAST: a better web interface. Nucleic Acids Res. 2008;36(Web Server issue):W5-9.
The Tabula Sapiens Consortium, Quake SR. The Tabula Sapiens: a single cell transcriptomic atlas of multiple organs from individual human donors. BioRxiv. 2021.
Morgan DM, Ruiter B, Smith NP, Tu AA, Monian B, Stone BE, et al. Clonally expanded, GPR15-expressing pathogenic effector TH2 cells are associated with eosinophilic esophagitis. Sci Immunol. 2021;6:eabi5586.
Gardeux V, David FPA, Shajkofci A, Schwalie PC, Deplancke B. ASAP: a web-based platform for the analysis and interactive visualization of single-cell RNA-seq data. Bioinformatics. 2017;33:3123–5.
Wang X, Park J, Susztak K, Zhang NR, Li M. Bulk tissue cell type deconvolution with multi-subject single-cell expression reference. Nat Commun. 2019;10:380.
Newman AM, Steen CB, Liu CL, Gentles AJ, Chaudhuri AA, Scherer F, et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat Biotechnol. 2019;37:773–82.
Argelaguet R, Velten B, Arnol D, Dietrich S, Zenz T, Marioni JC, et al. Multi-omics factor analysis-a framework for unsupervised integration of multi-omics data sets. Mol Syst Biol. 2018;14: e8124.
O’Shea KM, Aceves SS, Dellon ES, Gupta SK, Spergel JM, Furuta GT, et al. Pathophysiology of eosinophilic esophagitis. Gastroenterology. 2018;154:333–45.
Bhattacharya B, Carlsten J, Sabo E, Kethu S, Meitner P, Tavares R, et al. Increased expression of eotaxin-3 distinguishes between eosinophilic esophagitis and gastroesophageal reflux disease. Hum Pathol. 2007;38:1744–53.
Davis BP, Stucke EM, Khorki ME, Litosh VA, Rymer JK, Rochman M, et al. Eosinophilic esophagitis-linked calpain 14 is an IL-13-induced protease that mediates esophageal epithelial barrier impairment. JCI Insight. 2016;1: e86355.
Jaakkola MK, Elo LL. Computational deconvolution to estimate cell type-specific gene expression from bulk data. NAR Genom Bioinform. 2021;3: lqaa110.
Tabula Sapiens Consortium*, Jones RC, Karkanias J, Krasnow MA, Pisco AO, Quake SR, et al. The Tabula Sapiens: a multiple-organ, single-cell transcriptomic atlas of humans. Science. 2022;376: eabl4896.
Mulder DJ, Justinich CJ. Understanding eosinophilic esophagitis: the cellular and molecular mechanisms of an emerging disease. Mucosal Immunol. 2011;4:139–47.
Lexmond WS, Neves JF, Nurko S, Olszak T, Exley MA, Blumberg RS, et al. Involvement of the iNKT cell pathway is associated with early-onset eosinophilic esophagitis and response to allergen avoidance therapy. Am J Gastroenterol. 2014;109:646–57.
Armbruster-Lee J, Cavender CP, Lieberman JA, Samarasinghe AE. Understanding fibrosis in eosinophilic esophagitis: are we there yet? J Leukoc Biol. 2018;104:31–40.
Kalluri R, Neilson EG. Epithelial-mesenchymal transition and its implications for fibrosis. J Clin Invest. 2003;112:1776–84.
Lu J, Chatterjee M, Schmid H, Beck S, Gawaz M. CXCL14 as an emerging immune and inflammatory modulator. J Inflamm (Lond). 2016;13:1.
Laserna-Mendieta EJ, FitzGerald JA, Arias-Gonzalez L, Ollala JM, Bernardo D, Claesson MJ, et al. Esophageal microbiome in active eosinophilic esophagitis and changes induced by different therapies. Sci Rep. 2021;11:7113.
Benitez AJ, Hoffmann C, Muir AB, Dods KK, Spergel JM, Bushman FD, et al. Inflammation-associated microbiota in pediatric eosinophilic esophagitis. Microbiome. 2015;3:23.
Parashette KR, Sarsani VK, Toh E, Janga SC, Nelson DE, Gupta SK. Esophageal microbiome in healthy children and esophageal eosinophilia. J Pediatr Gastroenterol Nutr. 2022;74:e109–14.
Ning L, Huixin H. Topic evolution analysis for omics data integration in cancers. Front Cell Dev Biol. 2021;9: 631011.
Ungaro F, Massimino L, D’Alessio S, Danese S. The gut virome in inflammatory bowel disease pathogenesis: from metagenomics to novel therapeutic approaches. United Eur Gastroenterol J. 2019;7:999–1007.
Brooks-Warburton J, Modos D, Sudhakar P, Madgwick M, Thomas JP, Bohar B, et al. A systems genomics approach to uncover patient-specific pathogenic pathways and proteins in ulcerative colitis. Nat Commun. 2022;13:2299.
Dellon ES, Liacouras CA, Molina-Infante J, Furuta GT, Spergel JM, Zevit N, et al. Updated international consensus diagnostic criteria for eosinophilic esophagitis: proceedings of the AGREE conference. Gastroenterology. 2018;155:1022-1033.e10.
Vlachavas EI, Bohn J, Ückert F, Nürnberg S. A detailed catalogue of multi-omics methodologies for identification of putative biomarkers and causal molecular networks in translational cancer research. Int J Mol Sci. 2021;22:2822.
González-Dávila P, Schwalbe M, Danewalia A, Wardenaar R, Dalile B, Verbeke K, et al. Gut microbiota transplantation drives the adoptive transfer of colonic genotype-phenotype characteristics between mice lacking catestatin and their wild type counterparts. Gut Microbes. 2022;14:2081476.
Zhernov YV, Vysochanskaya SO, Sukhov VA, Zaostrovtseva OK, Gorshenin DS, Sidorova EA, et al. Molecular mechanisms of eosinophilic esophagitis. Int J Mol Sci. 2021;22:13183.
Hirano I, Dellon ES, Hamilton JD, Collins MH, Peterson K, Chehade M, et al. Efficacy of dupilumab in a phase 2 randomized trial of adults with active eosinophilic esophagitis. Gastroenterology. 2020;158:111-122.e10.
Cheng E, Zhang X, Huo X, Yu C, Zhang Q, Wang DH, et al. Omeprazole blocks eotaxin-3 expression by oesophageal squamous cells from patients with eosinophilic oesophagitis and GORD. Gut. 2013;62:824–32.
Zhang X, Cheng E, Huo X, Yu C, Zhang Q, Pham TH, et al. Omeprazole blocks STAT6 binding to the eotaxin-3 promoter in eosinophilic esophagitis cells. PLoS ONE. 2012;7: e50037.
Odiase E, Zhang X, Chang Y, Nelson M, Balaji U, Gu J, et al. In esophageal squamous cells from eosinophilic esophagitis patients, Th2 cytokines increase Eotaxin-3 secretion through effects on intracellular calcium and a non-gastric proton pump. Gastroenterology. 2021;160:2072.
Cater K, Dandu VS, Bari SMN, Lackey K, Everett GFK, Hatoum-Aslan A. A novel staphylococcus podophage encodes a unique lysin with unusual modular design. mSphere. 2017;2: e00040.
Abonia JP, Spergel JM, Cianferoni A. Eosinophilic esophagitis: a primary disease of the esophageal mucosa. J Allergy Clin Immunol Pract. 2017;5:951–5.
Yazdi M, Bouzari M, Ghaemi EA. Genomic analyses of a novel bacteriophage (VB_PmiS-Isfahan) within Siphoviridae family infecting Proteus mirabilis. Genomics. 2019;111:1283–91.
Hamilton AL, Kamm MA, Ng SC, Morrison M. Proteus spp. as putative gastrointestinal pathogens. Clin Microbiol Rev. 2018;31: e00085.
Houshyar Y, Massimino L, Lamparelli LA, Danese S, Ungaro F. Going beyond bacteria: uncovering the role of archaeome and mycobiome in inflammatory bowel disease. Front Physiol. 2021;12: 783295.
Nordengrün M, Abdurrahman G, Treffon J, Wächter H, Kahl BC, Bröker BM. Allergic reactions to serine protease-like proteins of Staphylococcus aureus. Front Immunol. 2021;12: 651060.
Lockey RF, Ledford DK, editors. Allergens and allergen immunotherapy subcutaneous sublingual and oral. 6th, illustrated edition. Boca Raton: CRC Press; 2020.
Hildebrand D, Heeg K, Kubatzky KF. Pasteurella multocida toxin manipulates T cell differentiation. Front Microbiol. 2015;6:1273.
Lee W-C, Cheong FW, Amir A, Lai MY, Tan JH, Phang WK, et al. Plasmodium knowlesi: the game changer for malaria eradication. Malar J. 2022;21:140.
We acknowledge the IRCCS San Raffaele Hospital and Università Vita-Salute San Raffaele for their support for this work.
SD has served as a speaker, consultant, and advisory board member for Schering Plough, Abbott (AbbVie) Laboratories, Merck and Co, UCB Pharma, Ferring, Cellerix, Millenium Takeda, Nycomed, Pharmacosmos, Actelion, Alfa Wasserman, Genentech, Grunenthal, Pfizer, AstraZeneca, Novo Nordisk, Vifor, and Johnson and Johnson. LPB has served as consultant for Merck, Abbvie, Janssen, Genentech, Ferring, Tillots, Vifor, Pharmacosmos, Celltrion, Takeda, Biogaran, Boerhinger-lngelheim, Lilly, Pfizer, Jndex Pharmaceuticals, Amgen, Sandoz, Celgene, Biogen, Samsung Bioepis, Alma, Sterna, Nestlé, Enterome, Mylan, HAC-Pharma, Tigenix, and has served as speaker for Merck, Abbvie, Janssen, Genentech, Ferring, Tillots, Vifor, Pharmacosmos, Celltrion, Takeda, Boerhinger-lngelheim, Pfizer, Amgen, Biogen, Samsung Bioepis. ES declares lecture fees from Takeda, Janssen, MSD, Abbvie, Malesci, Sofar, and consulting fees from BMS, Gilead, Takeda, Janssen, MSD, Reckitt Benckiser, Sofar, Unifarco, SILA, Oftagest, Diadema. VJ has received consulting/advisory board fees from AbbVie, Alimentiv Inc (formerly Robarts Clinical Trials), Arena pharmaceuticals, Asieris, Bristol Myers Squibb, Celltrion, Eli Lilly, Ferring, Fresenius Kabi, Galapagos, GlaxoSmithKline, Genetech, Gilead, Janssen, Merck, Mylan, Pandion, Pendopharm, Pfizer, Reistone Biopharma, Roche, Sandoz, Takeda, Teva, and Topivert; speaker's fees from AbbVie, Ferring, Galapagos, Janssen Pfizer Shire, and Takeda. The other authors declare no conflicts of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Massimino, L., Barchi, A., Mandarino, F.V. et al. A multi-omic analysis reveals the esophageal dysbiosis as the predominant trait of eosinophilic esophagitis. J Transl Med 21, 46 (2023). https://doi.org/10.1186/s12967-023-03898-x
- Web app