nSTAT-python — Neural Spike Train Analysis for Python

InstallPython 3.10+ · install from PyPI, then pull the example dataset

Pull the example dataset (~150 MB from figshare, used by the paper examples and several notebooks):

pypi

pip install nstat-toolbox

dataset

nstat-install --download-example-data always

Opt-in nstat.extras dep groups — install only what you need:

extras

pip install nstat-toolbox[neo,pynapple,nwb]       # data-interop bridges
pip install nstat-toolbox[test-parity]            # validation oracles (nemos, pykalman, statsmodels)
pip install nstat-toolbox[metrics]                # spike-distance metrics (pyspike)
pip install nstat-toolbox[dynamax]                # state-space EM (JAX, ~200 MB)
pip install nstat-toolbox[clusterless]            # clusterless decoding (JAX, ~200 MB)
pip install nstat-toolbox[all-extras]             # all of the above EXCEPT dynamax + clusterless

5-minute tourSix runnable snippets covering the most common workflows — each card links to a notebook or paper example

1 Build a spike train core

Wrap a vector of spike times in an nspikeTrain — the universal spike-train object used everywhere in the toolbox.

python

import numpy as np
from nstat import nspikeTrain

# 100 spikes over 1 second at 1 kHz
times = np.sort(np.random.default_rng(0).uniform(0, 1, 100))
st = nspikeTrain(times, name="neuron1",
                 sampleRate=1000,
                 minTime=0.0, maxTime=1.0)
print(f"{st.n_spikes} spikes; rate ~ {st.n_spikes:.0f} Hz")
st.plot()

📓 More: notebooks/nSpikeTrainExamples.ipynb

2 Assemble a Trial core

A Trial bundles spike trains, covariates, events, and neighbors for one experiment. The unit that Analysis takes as input.

python

from nstat import Trial, TrialConfig, Covariate, nstColl

t = np.arange(0, 1.0, 1e-3)
stim = np.sin(2*np.pi*5*t).reshape(-1, 1)
cov = Covariate(t, stim, name="stim",
                xlabelval="time", xunitval="s",
                ylabelval="vel", yunitval="mm/s",
                dataLabels=["stim"])
nstc = nstColl([st])
nstc.setMinTime(0.0); nstc.setMaxTime(1.0)
trial = Trial(nstc, ev=None, covarColl=None, neighbors=None)

📓 More: notebooks/TrialExamples.ipynb

3 Fit a Poisson GLM + check fit core

Analysis.GLMFit trains a point-process GLM with a stimulus filter and history kernel; FitResult.computeKSStats runs the time-rescaling KS goodness-of-fit test.

python

from nstat import TrialConfig, ConfigColl, Analysis

cfg = TrialConfig(
    covariate_specs=[("Baseline", "constant"),
                     ("stim", "spline")],
    sampleRate=1000,
    history_window_times=[0.001, 0.002, 0.005, 0.01],
    ensCovHist=[],
)
results = Analysis.runAnalysisForAllNeurons(
    trial, ConfigColl([cfg]))
fit = results[0][0]
print(f"AIC={fit.AIC:.1f}  "
      f"KS p={fit.computeKSStats()['ks_pvalue']:.3f}")

📓 More: notebooks/AnalysisExamples.ipynb · 📊 Figure: Paper Example 2 — whisker-stimulus thalamic GLM

4 Population goodness-of-fit core

Per-neuron KS plots can pass for every neuron while the model misses inter-neuron coupling. The Tao et al. (2018) marked time-rescaling test catches exactly those failures.

python

from nstat import population_time_rescale

# counts_list[k] : binned spike counts for neuron k
# lams[k] : model-expected counts per bin
r = population_time_rescale(counts_list, lams,
                            n_tau_bins=5)
print(f"ground KS p = {r.ground_ks_pvalue:.3g}  "
      f"mark chi2 p = {r.mark_chi2_pvalue:.3g}")
# On synchronous neurons modeled as independent:
# univariate KS passes; ground KS rejects at ~3e-49.

📖 New in v0.4.0 — see the What's New page

5 State-space EM extras / em

Multi-restart EM with held-out predictive-LL selection — the recommended workflow for PP_EM on real data. Built on Dynamax (JAX) but returns plain NumPy.

python

from nstat.extras.em.dynamax_bridge import (
    fit_point_process_em_best_of)

result = fit_point_process_em_best_of(
    spike_counts, state_dim=3,
    n_restarts=8, holdout_fraction=0.2)
result.best_result.transition_matrix
result.best_predictive_ll      # held-out LL
result.all_predictive_lls      # per-seed trace

▶ Demo: examples/extras/em_dynamax_demo.py · 📖 help file · install [dynamax]

6 Clusterless decoding extras / decoding

Decode position from unsorted multiunit spikes (mark cube of waveform features) plus a discrete trajectory-type classifier on top. Bridges Denovellis 2021's library.

python

from nstat.extras.decoding.clusterless_bridge import (
    fit_clusterless_decoder)

result = fit_clusterless_decoder(
    position,        # (T, n_position_dims)
    multiunits,      # (T, n_marks, n_electrodes)
    place_bin_size=5.0)
result.posterior         # (T, n_position_bins)
result.map_position      # (T, n_position_dims)

▶ Demo: examples/extras/decoding_clusterless_demo.py · 📖 help file · install [clusterless]

nstat.extrasOpt-in Python-only additions — modeled after scikit-learn-contrib, each subpackage brings its own dep group

The core nstat.* namespace preserves the MATLAB nSTAT contract (stable, parity-faithful). nstat.extras.* is the monorepo addon namespace for Python-only features that have no MATLAB counterpart.

`interop` data formats

Converters between Trial / SpikeTrainCollection / nspikeTrain and the wider Python neuro stack.

interop.neo — Neo objects ([neo])
interop.pynapple — pynapple TS / TSGroup ([pynapple])
interop.nwb — NWB units + observation intervals ([nwb])

`validation` parity oracles

Cross-validation bridges that triangulate nstat's MATLAB-faithful estimates against independent reference implementations.

nemos_bridge — GLM oracle (NeMoS)
pykalman_bridge — Kalman / smoother oracle
statsmodels_bridge — Poisson GLM oracle
nitime_bridge — spectral analysis cross-check

`metrics` spike-train distance

Modern multi-neuron spike-train distance metrics via pyspike ([metrics]).

ISI distance, SPIKE distance, SPIKE-synchronization profiles

`em` state-space EM

KF_EM / PP_EM / mPPCO_EM equivalents (Dynamax-backed) with the full Tier-0.1 canonical gauge, Tier-0.2 predictive-LL diagnostic, and Tier-0.3 multi-restart selector.

fit_linear_gaussian_em · fit_point_process_em · fit_hybrid_em
cmgf_poisson_filter / _smoother (inference on known model)
fit_*_em_best_of + point_process_predictive_ll (recommended workflow)

`decoding` clusterless

Marked point-process state-space decoder + trajectory classifier (Denovellis 2021), the modern descendant of nSTAT's PPAF / PPHF. Spike-waveform features replace spike sorting.

fit_clusterless_decoder (continuous position)
fit_clusterless_classifier (replay vs. local etc.)

📚 Visual summary of every nstat.extras bridge — per-card install matrix and runnable code snippets.

Paper-example galleryThe five canonical paper examples (Cajigas 2012), each runnable in <1 minute on the figshare dataset

01 · mEPSCs Do mEPSCs follow constant vs. piecewise Poisson firing under Mg²⁺ washout?
Figures · Script

02 · Thalamus GLM How do explicit whisker-stimulus and spike-history terms improve thalamic GLM fits?
Figures · Script

03 · PSTH & SSGLM How do PSTH and the state-space GLM (SSGLM) capture within-trial and across-trial dynamics?
Figures · Script

04 · Place cells Which receptive-field basis (Gaussian vs. Zernike) better fits hippocampal place cells?
Figures · Script

05 · Decoding How well do the adaptive (PPAF) and hybrid (PPHF) point-process filters decode stimulus and reach state?
Figures · Script

Where to nextReference, class maps, release notes, and the road ahead

Concepts & Background

Learn the neuroscience and statistics behind nSTAT — microelectrode recordings, the LFP, point-process GLMs, goodness-of-fit, decoding, state-space EM, rhythmic firing — with figures and cited literature. 14 pages, runnable snippets.

Hands-on tutorials

Six runnable end-to-end lessons (encoding → goodness-of-fit, PPAF decoding, model comparison, network coupling, clinical microelectrode, place-cell capstone) plus a Jupyter walkthrough. Each is a self-contained script you can run in under a minute on simulated data.

API reference

Every public symbol, auto-generated from the package docstrings (NumPy-style).

Class definitions

Method catalog for each MATLAB-faithful class (SignalObj, nspikeTrain, Trial, Analysis, FitResult, …).

Paper-aligned toolbox map

Crosswalk between the 2012 paper's workflow categories and the Python API.

nstat.extras visual summary

Per-bridge cards with install commands, status, and runnable snippets.

What's New

Per-iteration change summaries (newest first) covering the v0.4.x release set.

RELEASE_NOTES.md

Per-version scope summaries (start here when upgrading).

Methods roadmap

What's shipping next — see RELEASE_READINESS.md for the current planning snapshot.

GitHub repository

Source, issues, discussions, and the full notebook gallery under notebooks/.

InstallPython 3.10+ · install from PyPI, then pull the example dataset

5-minute tourSix runnable snippets covering the most common workflows — each card links to a notebook or paper example

1 Build a spike train core

2 Assemble a Trial core

3 Fit a Poisson GLM + check fit core

4 Population goodness-of-fit core

5 State-space EM extras / em

6 Clusterless decoding extras / decoding

nstat.extrasOpt-in Python-only additions — modeled after scikit-learn-contrib, each subpackage brings its own dep group

interop data formats

validation parity oracles

metrics spike-train distance

em state-space EM

decoding clusterless

Paper-example galleryThe five canonical paper examples (Cajigas 2012), each runnable in <1 minute on the figshare dataset

Where to nextReference, class maps, release notes, and the road ahead

`interop` data formats

`validation` parity oracles

`metrics` spike-train distance

`em` state-space EM

`decoding` clusterless