Background
Non-alcoholic fatty liver disease (NAFLD) is characterized by the pathological accumulation of triglycerides in hepatocytes and is associated with insulin resistance, atherogenic dyslipidaemia and cardiometabolic diseases. Thus far, the extent of metabolic dysregulation associated with hepatic triglyceride accumulation has not been fully addressed. In this study, we aimed to identify metabolites associated with hepatic triglyceride content (HTGC) and map these associations using network analysis.
Methods
To gain insight in the spectrum of metabolites associated with hepatic triglyceride accumulation, we performed a comprehensive plasma metabolomics screening of 1363 metabolites in apparently healthy middle aged (age 45–65) individuals (N = 496) in whom HTGC was measured by proton magnetic resonance spectroscopy. An atlas of metabolite–HTGC associations, based on univariate results, was created using correlation-based Gaussian graphical model (GGM) and genome scale metabolic model network analyses. Pathways associated with the clinical prognosis marker fibrosis 4 (FIB-4) index were tested using a closed global test.
Results
Our analyses revealed that 118 metabolites were univariately associated with HTGC (p-value <6.59 × 10−5), including 106 endogenous, 1 xenobiotic and 11 partially characterized/uncharacterized metabolites. These associations were mapped to several biological pathways including branched amino acids (BCAA), diglycerols, sphingomyelin, glucosyl-ceramide and lactosyl-ceramide. We also identified a novel possible HTGC-related pathway connecting glutamate, metabolonic lactone sulphate and X-15245 using the GGM network. These pathways were confirmed to be associated with the FIB-4 index as well.
Conclusion
The combined network and pathway analyses indicated extensive associations between BCAA and the lipids pathways with HTGC and the FIB-4 index. Moreover, we report a novel pathway glutamate-metabolonic lactone sulphate-X-15245 with a potential strong association with HTGC. These findings can aid elucidating HTGC metabolomic profiles and provide insight into novel drug targets for fibrosis-related outcomes.
Significant metabolites from the main analysis and the sex stratified analysis were used to develop the GGM networks separately. To create the network, we used the full measurements of the metabolites from our imputed dataset. Since we used multiple imputation, we could only select one imputed dataset. The correlations between the metabolites and HTGC were calculated using sparse Gaussian Graphical Models with graphical LASSO using the glasso function in R (qgraph R package version 1.6.9) coupled with the Extended Bayesian Information Criterium (EBIC) to tune the LASSO shrinkage factor. EBIC has been reported in simulation studies to perform better than the standard BIC, particularly for controlling false positivity rates. Here, we set the EBIC tuning parameter to the default value of 0.5 (wherein a value of 0 uses standard BIC) for a balanced tuning . Based on the correlation coefficients, edges were drawn between the correlated metabolite nodes with the width corresponding to the coefficient strength. The size of the nodes was determined by the absolute value of the effect estimate from the multivariable linear regression analysis. The metabolite nodes were colored based on their biochemical super and sub pathway classes and organized in ascending order based on their p-values. Visualization and layout of the networks were created in Gephi and exported as an interactive HTML5 using the sigmaExporter plugin.
Learn MoreWe mapped the metabolite names onto a curated version of the Human1 genome-scale metabolic model (17) and calculated the biochemical interconversions between all pairs of metabolites that were measured on the Metabolon platform on a liver-specific subset of the model. To facilitate the mapping of metabolomics data to the GSMM, we enriched the model with compound synonyms and external identifiers from the Chemical Entities of Biological Interest (ChEBI) database (18), where ChEBI identifiers and synonyms of conjugate acids and bases were also included. Of the 1067 metabolites with known identity that were quantified, 436 mapped properly to the GSMM. All reactions and compounds in Human1 were checked for mass and redox balance and were adjusted when necessary. Biochemical interactions between metabolites were determined by converting the GSMM into a weighted directed graph where nodes represent metabolites and edges represent reactions. Subsequently all reaction paths between the measured metabolites that involved one or two reaction steps were determined using a generic path finding algorithm that was developed in house. This resulted in a network of connected metabolites that we integrated with biochemical reaction and pathway knowledge into an interactive HTML/JavaScript document. The results from the network analysis include measured associated metabolites as well as relevant intermediate metabolites regardless of whether they were measured in the dataset. In addition, the network shows the directionality of the biological reaction paths for the biosynthesis and degradation of the metabolites and includes details regarding the involved genes and intermediate reaction steps. Furthermore, for all metabolites and genes, also those not included in the network, hyperlinks to external databases and gene expression profiles are provided. Reaction information was enriched by importing tissue-specific gene expression from the Human Protein Atlas (19) and Genotype-Tissue Expression (GTEx) project. All computations described in this section were performed in MATLAB R2019b.
Learn More