3. Reading and Writing Models

Cobrapy supports reading and writing models in SBML (with and without FBC), JSON, YAML, MAT, and pickle formats. Generally, SBML with FBC version 2 is the preferred format for general use. The JSON format may be more useful for cobrapy-specific functionality.

The package also ships with test models in various formats for testing purposes.

[1]:
import cobra.test
import os
from os.path import join

data_dir = cobra.test.data_dir

print("mini test files: ")
print(", ".join(i for i in os.listdir(data_dir) if i.startswith("mini")))

textbook_model = cobra.test.create_test_model("textbook")
ecoli_model = cobra.test.create_test_model("ecoli")
salmonella_model = cobra.test.create_test_model("salmonella")
mini test files:
mini.json, mini.mat, mini.pickle, mini.yml, mini_cobra.xml, mini_fbc1.xml, mini_fbc2.xml, mini_fbc2.xml.bz2, mini_fbc2.xml.gz

3.1. SBML

The Systems Biology Markup Language is an XML-based standard format for distributing models which has support for COBRA models through the FBC extension version 2.

Cobrapy has native support for reading and writing SBML with FBCv2. Please note that all id’s in the model must conform to the SBML SID requirements in order to generate a valid SBML file.

[2]:
cobra.io.read_sbml_model(join(data_dir, "mini_fbc2.xml"))
[2]:
Name mini_textbook
Memory address 0x01074fd080
Number of metabolites 23
Number of reactions 18
Objective expression -1.0*ATPM_reverse_5b752 - 1.0*PFK_reverse_d24a6 + 1.0*PFK + 1.0*ATPM
Compartments cytosol, extracellular
[3]:
cobra.io.write_sbml_model(textbook_model, "test_fbc2.xml")

There are other dialects of SBML prior to FBC 2 which have previously been use to encode COBRA models. The primary ones is the “COBRA” dialect which used the “notes” fields in SBML files.

Cobrapy can use libsbml, which must be installed separately (see installation instructions) to read and write these files. When reading in a model, it will automatically detect whether FBC was used or not. When writing a model, the use_fbc_package flag can be used can be used to write files in this legacy “cobra” format.

Consider having the lxml package installed as it can speed up parsing considerably.

[4]:
cobra.io.read_sbml_model(join(data_dir, "mini_cobra.xml"))
[4]:
Name mini_textbook
Memory address 0x0112fa6b38
Number of metabolites 23
Number of reactions 18
Objective expression -1.0*ATPM_reverse_5b752 - 1.0*PFK_reverse_d24a6 + 1.0*PFK + 1.0*ATPM
Compartments cytosol, extracellular
[5]:
cobra.io.write_sbml_model(
    textbook_model, "test_cobra.xml", use_fbc_package=False)

3.2. JSON

Cobrapy models have a JSON (JavaScript Object Notation) representation. This format was created for interoperability with escher.

[6]:
cobra.io.load_json_model(join(data_dir, "mini.json"))
[6]:
Name mini_textbook
Memory address 0x0113061080
Number of metabolites 23
Number of reactions 18
Objective expression -1.0*ATPM_reverse_5b752 - 1.0*PFK_reverse_d24a6 + 1.0*PFK + 1.0*ATPM
Compartments cytosol, extracellular
[7]:
cobra.io.save_json_model(textbook_model, "test.json")

3.3. YAML

Cobrapy models have a YAML (YAML Ain’t Markup Language) representation. This format was created for more human readable model representations and automatic diffs between models.

[8]:
cobra.io.load_yaml_model(join(data_dir, "mini.yml"))
[8]:
Name mini_textbook
Memory address 0x0113013390
Number of metabolites 23
Number of reactions 18
Objective expression -1.0*ATPM_reverse_5b752 - 1.0*PFK_reverse_d24a6 + 1.0*PFK + 1.0*ATPM
Compartments extracellular, cytosol
[9]:
cobra.io.save_yaml_model(textbook_model, "test.yml")

3.4. MATLAB

Often, models may be imported and exported solely for the purposes of working with the same models in cobrapy and the MATLAB cobra toolbox. MATLAB has its own “.mat” format for storing variables. Reading and writing to these mat files from python requires scipy.

A mat file can contain multiple MATLAB variables. Therefore, the variable name of the model in the MATLAB file can be passed into the reading function:

[10]:
cobra.io.load_matlab_model(
    join(data_dir, "mini.mat"), variable_name="mini_textbook")
[10]:
Name mini_textbook
Memory address 0x0113000b70
Number of metabolites 23
Number of reactions 18
Objective expression -1.0*ATPM_reverse_5b752 - 1.0*PFK_reverse_d24a6 + 1.0*PFK + 1.0*ATPM
Compartments c, e

If the mat file contains only a single model, cobra can figure out which variable to read from, and the variable_name parameter is unnecessary.

[11]:
cobra.io.load_matlab_model(join(data_dir, "mini.mat"))
[11]:
Name mini_textbook
Memory address 0x0113758438
Number of metabolites 23
Number of reactions 18
Objective expression -1.0*ATPM_reverse_5b752 - 1.0*PFK_reverse_d24a6 + 1.0*PFK + 1.0*ATPM
Compartments c, e

Saving models to mat files is also relatively straightforward

[12]:
cobra.io.save_matlab_model(textbook_model, "test.mat")

3.5. Pickle

Cobra models can be serialized using the python serialization format, pickle.

Please note that use of the pickle format is generally not recommended for most use cases. JSON, SBML, and MAT are generally the preferred formats.