mobipy

The mobipy python package can be used to run Mobius2 models from python (64 bit python3 only).

It lets you set parameter values and input series dynamically and read out result data.

Among other things, this allows you to:

  • Automate running of many different scenarios. For instance, your script could access various climate scenarios from a database (or files), set them as forcings in the model, then storing and comparing the results. This can be done without generating different input files for the model.
  • Make a website for stakeholder interaction that uses a python backend such as streamlit, and have it dynamically run a model based on a custom user web interface or forecasts from a database. See one example of this being done with INCA-Microplastics using Mobius1.
  • Write your own autocalibration and uncertainty analysis scripts if you need something more specific than what you find in MobiView2.

Notebook

Figure: An example of running the NIVAFjord model with mobipy in a jupyter notebook.

You can also look at some more example notebooks.

Installation

This assumes you have already followed the getting started guide and cloned the Mobius2 repository.

  • Windows: To be able to run mobipy on Windows you need to download mobipy/c_abi.dll from ftp://mobiserver.niva.no/Mobius2 and put it in your local Mobius2/mobipy folder.
  • Linux: See the separate document

Known issues

  • It can be problematic to use mobipy from an Anaconda environment (including through Pycharm).

Basic usage

It is useful to look at the central model concepts to understand what we mean by certain terms below like (“parameter”, “state variable”, “identifier”, etc.).

First you need to load a model application using

app = mobipy.Model_Application.build_from_model_and_data_file(
	"path/to/model.txt", 
	"path/to/data.dat")

To be able to extract any results you need to run the model, using

app.run()

It is important to note that the Model_Application object is not python-serializable, so you can not access the same app object across multiple processes. Using multiple threads works fine, just note that operations modifying the state of the app (running the model, modifying parameters, etc.) are not thread safe. If you want to run the model in several parallel instances, you can make copies of the working dataset as seen below.

# This is a thread safe way to run a model instance in parallel.
def run_model_instance(app) :
	data = app.copy()
	#modify_some_parameters(data)
	data.run()
	#do_something_with_the_results(data)
	del data

Acessing model entities

Any model entity can in principle be accessed in the app, but for the most part it only makes sense to access modules, parameters or components.

Entities are accessed using one of the following methods

  • scope.identifier where identifier is the Mobius2 language identifier of the entity in the given model scope.
  • scope["Name"] where "Name" is the Mobius2 language name of the entity in the given model scope.

The app acts as the model top scope. You can then scope into modules from it using e.g.

app["Module name"]

For a full example,

app["SimplyQ land"].bfi

accesses the bfi parameter in the "SimplyQ land" module. You can also store scopes in variables to reuse them:

sq = app["SimplyQ land"]
sq.bfi = 0.1

The time series of a state variable can be accessed by composing the Mobius2 components for the location (if one exists), e.g. the river water dissolved organic carbon could be (depending on how it is declared in the model)

app.river.water.oc

You can also access any model series (result or input) using

app.var("Series name")

where "Series name" is the name (either series serialized name or declared name) of the series in the current application. (Note that the declared name - from the var declaration - does not always identify the series uniquely, but usually it does).

Parameters

You can set the value of a parameter by using the regular field setting syntax in python, e.g.

app.start_date = '1995-01-01'

However if the parameter indexes over one or more index sets, you have to provide an index (tuple) for it. E.g.

app["SimplyQ land"].tc_s["Agricultural"] = 3

Here “Agricultural” is an index of the index set that this parameter is distributed over.

Remember that you must re-run the model for the parameter edit to take effect in the results.

To read the value you must always provide an index even if it doesn’t index over anything. If there are no index sets, use [()].

print("The start date is %s" % app.start_date[()])

Parameters also have the min(), max(), unit() and description() member functions that let you extract this information from their declaration in the model. These must be called on the Entity, not on the value access (i.e. don’t index it).

Series

When you read the values of a series, you must access it using its indexes. If the series does not have any index sets, you must still access it using an empty tuple [()]. The result of reading a series is a pandas.Series. See the pandas documentation. It is indexed by a pandas.DateTimeIndex. It is convenient to quickly plot such series, as in the example below.

notebook minimal

You can also construct a pandas.DataFrame from several such series (using pandas.concat), which is one of the most common ways to organize scientific data in python.

If a series indexes over several indexes you can read out a slice. We only support slicing one index set at a time for now, and no custom strides. When slicing, instead of a ´pd.Series´ you get a tuple (values, position, dates), where values is a numpy.ndarray with dimensions (time_steps, n_indexes). Here n_indexes is the amount of indexes in the slice range. Example

temps, pos, dates = app.layer.water.temp["Drammensfjorden", 4:10]

In this example, the temperatures in the “Drammensfjorden” basin in NIVAFjord is extracted between layers 4 and 10.

The pos vector is of size n_indexes+1 and contains the boundaries of the indexes in the slice. The dates vector is of size time_steps+1 (the last value being the first time step after the model run). These contain the boundaries because this is more convenient for plotting heatmaps. If you only want positions of each data point, discard the last element in pos and dates.

For input series you always get the expanded data that is sampled to the application’s sampling frequency, even if it was provided sparsely in the data file.

You can also set the values of an input series. The value you provide must be a pandas.Series that is indexed by a DateTimeIndex. This could be sparse. In that case, only the given dates are overwritten.

For now we don’t support setting slices.

Model series have a unit() member function giving you a string representation of its unit.

You can also get the concentration series of a dissolved variable using conc(). For a flux, you can further use the . operator to get the transport flux of dissolved variables.

The conc and transport operations must be called on the mobipy.Series object, not on the pandas.Series value (i.e. don’t index it before the operation). Example:

# Correct:
fig, (ax0, ax1) = plt.subplots(2, 1, figsize=(8, 8))
# Plot the concentration of organic carbon in the river water.
app.river.water.oc.conc()["Kråkstadelva"].plot(ax=ax0)
# Plot the transport flux of organic carbon with the river discharge.
app.var("Reach flow flux").oc["Kråkstadelva"].plot(ax=ax1)

# Wrong:
app.river.water.oc["Kråkstadelva"].conc()
app.var("Reach flow flux")["Kråkstadelva"].oc

Table of contents


© Norsk Institutt for Vannforskning (The Norwegian Institute for Water Research), NIVA 2024.