DyNeMo: Mixing Coefficients Analysis#

In this tutorial we will analyse the dynamic networks inferred by DyNeMo on resting-state source reconstructed MEG data. This tutorial covers:

  1. Downloading a Trained Model

  2. Summarizing the Mixing Coefficient Time Course

  3. Binarizing the Mixing Coefficient Time Course

Note, this webpage does not contain the output of running each cell. See OSF for the expected output.

Downloading a Trained Model#

First, let’s download a model that’s already been trained on a resting-state dataset. See the DyNeMo Training on Real Data tutorial for how to train DyNeMo.

import os

def get_trained_model(name):
    if os.path.exists(name):
        return f"{name} already downloaded. Skipping.."
    os.system(f"osf -p by2tc fetch trained_models/{name}.zip")
    os.system(f"unzip -o {name}.zip -d {name}")
    os.remove(f"{name}.zip")
    return f"Model downloaded to: {name}"

# Download the trained model (approximately 33 MB)
model_name = "dynemo_notts_rest_10_subj"
get_trained_model(model_name)

# List the contents of the downloaded directory
sub_dirs = os.listdir(model_name)
print(sub_dirs)
for sub_dir in sub_dirs:
    print(f"{sub_dir}:", os.listdir(f"{model_name}/{sub_dir}"))

We can see the directory contains two sub-directories:

  • model: contains the trained DyNeMo model.

  • data: contains the inferred mixing coefficients (alpha.pkl), mode covariances (covs.npy) and mode spectra (f.npy, psd.npy, coh.npy).

Summarizing the Mixing Coefficient Time Course#

Let’s start by plotting the mixing coefficients to get a feel for the description provided by DyNeMo. First, we load the inferred mixing coefficients.

import pickle

alpha = pickle.load(open(f"{model_name}/data/alpha.pkl", "rb"))
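As a quick check, alpha should be a list with one array per subject, where each array contains the mixing coefficients as a (number of samples, number of modes) array. This is how DyNeMo's inferred alphas are typically stored, but it's worth confirming with your own data.

# Check what we have loaded
print("Number of subjects:", len(alpha))
print("Shape of the first subject's alpha:", alpha[0].shape)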

We can use the utils.plotting.plot_alpha function to plot the mixing coefficients.

from osl_dynamics.utils import plotting

# Plot the mixing coefficient time course for the first subject (8 seconds)
plotting.plot_alpha(alpha[0], n_samples=2000)

Plotting the raw mixing coefficients provided by DyNeMo gives the impression that one mode dominates, in this case mode 4. However, the model DyNeMo uses is \(C_t = \displaystyle\sum_j \alpha_{jt} D_j\), where \(\alpha_{jt}\) are the mixing coefficients and \(D_j\) are the mode covariances. This means a mode can have a small \(\alpha_{jt}\) value but still be a large contributor to the time-varying covariance \(C_t\).
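To make this concrete, here is a toy example with made-up numbers (not taken from the trained model): a mode with a small mixing coefficient but a large covariance can still dominate the total covariance.

import numpy as np

# Toy example: two modes at a single time point
alpha_t = np.array([0.9, 0.1])   # raw mixing coefficients
trace_D = np.array([1.0, 50.0])  # trace of each mode covariance (a proxy for its 'size')

# Relative contribution of each mode to the total covariance
contribution = alpha_t * trace_D
print(contribution / np.sum(contribution))  # the second mode dominates despite its small alpha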

Normalizing the mixing coefficients#

To account for the difference in mode covariances we can renormalize the mixing coefficients to include the 'size' of the mode covariances.

import numpy as np

def normalize_alpha(a, D):
    # Calculate the weighting for each mode
    # We use the trace of each mode covariance
    w = np.trace(D, axis1=1, axis2=2)

    # Weight the alphas
    wa = w[np.newaxis, ...] * a

    # Renormalize so the alphas sum to one
    wa /= np.sum(wa, axis=1, keepdims=True)
    return wa

# Load the inferred mode covariances
covs = np.load(f"{model_name}/data/covs.npy")

# Renormalize the mixing coefficients
norm_alpha = [normalize_alpha(a, covs) for a in alpha]

# Plot the renormalized mixing coefficient time course for the first subject (8 seconds)
plotting.plot_alpha(norm_alpha[0], n_samples=2000)

We can now see each mode has a roughly equal contribution to the total covariance. We can also see fast fluctuations in the relative contribution of each mode.

Summary Statistics#

An analogous summary measure to the Hidden Markov Model's (HMM's) fractional occupancy is the time-averaged mixing coefficient. This is the average contribution a mode makes to the total covariance. Let's calculate this for each subject.

mean_norm_alpha = np.array([np.mean(a, axis=0) for a in norm_alpha])
print(mean_norm_alpha)

Note that we can calculate this using either the raw or the normalized mixing coefficients. We see there is a lot of variability in this metric across subjects.
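One way to see this variability (a minimal sketch using matplotlib, not part of the original pipeline) is to plot the distribution across subjects of the time-averaged mixing coefficient for each mode.

import matplotlib.pyplot as plt

# Box plot of the time-averaged mixing coefficients across subjects (one box per mode)
fig, ax = plt.subplots()
ax.boxplot(mean_norm_alpha)
ax.set_xlabel("Mode")
ax.set_ylabel("Time-averaged mixing coefficient")
plt.show()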

Another summary measure we could calculate for each subject is the standard deviation of the mixing coefficient time course, which measures how variable the mixing of each mode is over time.

std_norm_alpha = np.array([np.std(a, axis=0) for a in norm_alpha])
print(std_norm_alpha)

Binarizing the Mixing Coefficient Time Course#

If we convert the mixing coefficients into a binary time course for each mode (i.e. something similar to the HMM state time courses) we can calculate the usual summary statistics we did for the HMM states (fractional occupancies, lifetimes, intervals). See the HMM Summary Statistics Analysis tutorial for further details.

We have a couple options for binarizing the mixing coefficient time course:

  1. Take the mode with the largest value as ‘on’ while the other modes are ‘off’. This gives a mutually exclusive mode activation time course.

  2. Specify a threshold for mode activations. This can be a hand-specified threshold or one determined using a data-driven approach, such as fitting a Gaussian Mixture Model (GMM). In this case, modes can co-activate.

Let’s have a look at each of these options.

Binarize with argmax#

We can use the inference.modes.argmax_time_courses function to calculate a mutually exclusive mode activation time course.

from osl_dynamics.inference import modes

# Use the mode with the largest value to define mode activations
atc = modes.argmax_time_courses(norm_alpha)

# Plot the mode activation time course for the first subject (first 8 seconds)
plotting.plot_alpha(atc[0], n_samples=2000)

With the mode activation time course we can calculate the usual HMM summary statistics like fractional occupancy.

fo = modes.fractional_occupancies(atc)
print(np.round(fo, 2))

We see there is more of an imbalance between mode activations compared to the HMM state description.
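We could also calculate other summary statistics for this mutually exclusive time course, for example mean lifetimes, using the same modes.mean_lifetimes function we apply later in this tutorial (assuming a 250 Hz sampling frequency).

# Mean lifetime of each mode activation (in ms)
mlt = modes.mean_lifetimes(atc, sampling_frequency=250)
print(np.round(mlt * 1000, 2))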

Binarize with a percentile threshold#

First, let's specify a threshold by hand. Looking at the distribution of mixing coefficients can help us choose a good threshold.

import matplotlib.pyplot as plt

def plot_hist(alpha, x_label=None):
    fig, ax = plt.subplots()
    for i in range(alpha.shape[1]):
        ax.hist(alpha[:, i], label=f"Mode {i+1}", bins=50, histtype="step")
    ax.legend()
    ax.set_xlabel(x_label)

# Concatenate the alphas for each subject
concat_norm_alpha = np.concatenate(norm_alpha)

# Plot their distribution
plot_hist(concat_norm_alpha, x_label="Normalized mixing coefficients")

We see the distribution of each mode’s mixing coefficients has a long tail. We’re interested in picking out the time points when the mixing coefficients take values in the long tail. We also see the appropriate threshold for each mode is very different. Let’s use the 90th percentile to determine the threshold of each mode.

# Calculate thresholds for each subject
thres = np.array([np.percentile(a, 90, axis=0) for a in norm_alpha])
print(thres)

We see the thresholds vary a lot across subjects. Now that we have the thresholds, let's binarize.

# Binarize the mixing coefficients using each subject's thresholds
thres_norm_alpha = []
for na, t in zip(norm_alpha, thres):
    tna = (na > t).astype(int)
    thres_norm_alpha.append(tna)

# Plot the mode activation time courses for the first subject (first 8 seconds)
plotting.plot_separate_time_series(thres_norm_alpha[0], n_samples=2000)

Because we threshold each subject's mixing coefficients at the 90th percentile, the fractional occupancy is fixed by construction: each mode is 'on' for 10% of the time points.

fo = modes.fractional_occupancies(thres_norm_alpha)
print(fo)

We see the fractional occupancy for each subject and mode is 0.1, which corresponds to the time points with normalized alpha values in the top 10%. In contrast, the lifetimes remain an interpretable summary statistic.

mlt = modes.mean_lifetimes(thres_norm_alpha, sampling_frequency=250)
print(mlt * 1000)  # in ms

We see that with a 90th percentile threshold we get lifetimes of around 50 ms.
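We could also look at the mean interval between activations. This sketch assumes the modes.mean_intervals function is available in your version of osl-dynamics.

# Mean interval between mode activations (in seconds)
mint = modes.mean_intervals(thres_norm_alpha, sampling_frequency=250)
print(np.round(mint, 2))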

Binarize with a GMM threshold#

An alternative approach is to use a two-component GMM to identify an 'on' and an 'off' component. This involves fitting the two-component GMM to the distribution of mixing coefficients. We can do this with the inference.modes.gmm_time_courses function in osl-dynamics.
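To illustrate the idea, here is a minimal sketch using scikit-learn (this is not how gmm_time_courses is implemented internally): we fit a two-component GMM to one mode's concatenated mixing coefficients and treat the component with the larger mean as 'on'.

from sklearn.mixture import GaussianMixture

# Fit a two-component GMM to the first mode's concatenated mixing coefficients
x = concat_norm_alpha[:, 0].reshape(-1, 1)
gmm = GaussianMixture(n_components=2).fit(x)

# Label each time point and take the component with the larger mean as 'on'
labels = gmm.predict(x)
on_component = np.argmax(gmm.means_[:, 0])
binary_tc = (labels == on_component).astype(int)
print("Fraction of time points 'on':", binary_tc.mean())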

# Binarize the mixing coefficients using a GMM
thres_norm_alpha = modes.gmm_time_courses(norm_alpha)

# Plot the mode activation time courses for the first subject (first 8 seconds)
plotting.plot_separate_time_series(thres_norm_alpha[0], n_samples=2000)

# Display the fractional occupancies
fo = modes.fractional_occupancies(thres_norm_alpha)
print(fo)

We can see the GMM thresholds identify more mode activations, which is reflected in the higher fractional occupancies.

With a mode activation time course we can do the same analysis we did for the HMM state time courses. See the HMM Summary Statistics Analysis tutorial for more details.
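For example, we could calculate the switching rate of each mode. This sketch assumes the modes.switching_rates function is available in your version of osl-dynamics.

# Number of mode activations per second for each subject and mode
sr = modes.switching_rates(thres_norm_alpha, sampling_frequency=250)
print(np.round(sr, 2))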

Wrap Up#

  • We have shown how to plot the inferred mixing coefficients and how to normalize them.

  • We have explored various options for binarizing the mixing coefficient time courses.
