Skip to content

new in GECKO 3.2.0: protein usage reactions always draw from protein pool, also when proteomics is integrated #375

Closed
@edkerk

Description

@edkerk

27 May 2024: The suggested changes are now implemented in GECKO 3.2.0. The text below describes how GECKO worked before version 3.2.0.

Currently, the enzyme usage reactions can be defined in two ways, dependent on whether proteomics data is integrated.

Model content Without proteomics With proteomics
Protein usage rxn prot_Q99312[c] <= prot_pool[c] prot_Q99312[c] <=
LB of protein usage rxn -1000 Measured Q99312 concentration, as taken from model.ec.concs, or potentially flexibilized by flexibilizeEnzConcs. Example = -0.0416
Protein pool exchange rxn prot_pool[c] <= prot_pool[c] <=
LB of protein pool exchange rxn Total enzyme content, as defined by Ptot * sigma * f. Example = -125 Non-measured enzyme content, as calculated by updateProtPool . Example = -95.915

A problem that I have encountered with this approach is that the new lower bound of the protein pool exchange reaction might be too strict. The model can no longer be solved, unless some proteins are flexibilized by a high amount (although sometimes this even does not resolve the problem).

  • In the calculation by updateProtPool, it assumes that the f-factor (fraction of protein being enzymes) is the same for both the measured- and unmeasured-protein fraction.
  • Already when the f-factor is first calculated, it is only based on the measured-protein fraction (if this data is available), which might be somewhat biased, but at that stage it would be countered out by the fitting of the sigma-factor.
  • In addition, to avoid over-constraining individual proteins based on noisy proteomics data, loadProtData by default adds 1 or more standard deviations to the protein measurements. As a consequence, the sum of measured protein concentrations Pmeas is substantially higher, which automatically means that the unmeasured protein fraction Ptot-Pmeas is always lower than it should be.

As an alternative, there is actually no good reason why the enzyme usage reaction has to change when proteomics data is integrated, except for changing its lower bound. The new approach suggested below would prevent the issues raised above, and instead would keep using the same lower bound for the protein pool exchange reaction that earlier in the model generation pipeline had been fitted to give realistic growth predictions. New suggestion:

Model content Without proteomics With proteomics
Protein usage rxn prot_Q99312[c] <= prot_pool[c] prot_Q99312[c] <= prot_pool[c]
LB of protein usage rxn -1000 Measured Q99312 concentration, as taken from model.ec.concs, or potentially flexibilized by flexibilizeEnzConcs. Example = -0.0416
Protein pool exchange rxn prot_pool[c] <= prot_pool[c] <=
LB of protein pool exchange rxn Total enzyme content, as defined by Ptot * sigma * f. Example = -125 Total enzyme content, as defined by Ptot * sigma * f. Example = -125

I hereby confirm that:

  • The new feature is not already in the main branch of the repository.
  • A similar issue does not already exist.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions