allensdk.brain_observatory.ecephys.ecephys_session module

class allensdk.brain_observatory.ecephys.ecephys_session.EcephysSession(api: allensdk.brain_observatory.ecephys.ecephys_session_api.ecephys_session_api.EcephysSessionApi, test: bool = False, **kwargs)[source]

Bases: allensdk.core.lazy_property.lazy_property_mixin.LazyPropertyMixin

Represents data from a single EcephysSession

Attributes:
units : pd.Dataframe

A table whose rows are sorted units (putative neurons) and whose columns are characteristics of those units. Index is:

unit_id : int

Unique integer identifier for this unit.

Columns are:
firing_rate : float

This unit’s firing rate (spikes / s) calculated over the window of that unit’s activity (the time from its first detected spike to its last).

isi_violations : float

Estamate of this unit’s contamination rate (larger means that more of the spikes assigned to this unit probably originated from other neurons). Calculated as a ratio of the firing rate of the unit over periods where spikes would be isi-violating vs the total firing rate of the unit.

peak_channel_id : int

Unique integer identifier for this unit’s peak channel. A unit’s peak channel is the channel on which its peak-to-trough amplitude difference is maximized. This is assessed using the kilosort 2 templates rather than the mean waveforms for a unit.

snr : float

Signal to noise ratio for this unit.

probe_horizontal_position : numeric

The horizontal (short-axis) position of this unit’s peak channel in microns.

probe_vertical_position : numeric

The vertical (long-axis, lower values are closer to the probe base) position of this unit’s peak channel in microns.

probe_id : int

Unique integer identifier for this unit’s probe.

probe_description : str

Human-readable description carrying miscellaneous information about this unit’s probe.

location : str

Gross-scale location of this unit’s probe.

spike_times : dict

Maps integer unit ids to arrays of spike times (float) for those units.

running_speed : RunningSpeed
NamedTuple with two fields
timestamps : numpy.ndarray

Timestamps of running speed data samples

values : np.ndarray

Running speed of the experimental subject (in cm / s).

mean_waveforms : dict

Maps integer unit ids to xarray.DataArrays containing mean spike waveforms for that unit.

stimulus_presentations : pd.DataFrame

Table whose rows are stimulus presentations and whose columns are presentation characteristics. A stimulus presentation is the smallest unit of distinct stimulus presentation and lasts for (usually) 1 60hz frame. Since not all parameters are relevant to all stimuli, this table contains many ‘null’ values. Index is

stimulus_presentation_id : int

Unique identifier for this stimulus presentation

Columns are
start_time : float

Time (s) at which this presentation began

stop_time : float

Time (s) at which this presentation ended

duration : float

stop_time - start_time (s). Included for convenience.

stimulus_name : str

Identifies the stimulus family (e.g. “drifting_gratings” or “natural_movie_3”) used for this presentation. The stimulus family, along with relevant parameter values, provides the information required to reconstruct the stimulus presented during this presentation. The empty string indicates a blank period.

stimulus_block : numeric

A stimulus block is made by sequentially presenting presentations from the same stimulus family. This value is the index of the block which contains this presentation. During a blank period, this is ‘null’.

TF : float

Temporal frequency, or ‘null’ when not appropriate.

SF : float

Spatial frequency, or ‘null’ when not appropriate

Ori : float

Orientation (in degrees) or ‘null’ when not appropriate

Contrast : float Pos_x : float Pos_y : float Color : numeric Image : numeric Phase : float stimulus_condition_id : integer

identifies the session-unique stimulus condition (permutation of parameters) to which this presentation belongs

stimulus_conditions : pd.DataFrame

Each row is a unique permutation (within this session) of stimulus parameters presented during this experiment. Columns are as stimulus presentations, sans start_time, end_time, stimulus_block, and duration.

inter_presentation_intervals : pd.DataFrame

The elapsed time between each immediately sequential pair of stimulus presentations. This is a dataframe with a two-level multiindex (levels are ‘from_presentation_id’ and ‘to_presentation_id’). It has a single column, ‘interval’, which reports the elapsed time between the two presentations in seconds on the experiment’s master clock.

DETAILED_STIMULUS_PARAMETERS = ('colorSpace', 'flipHoriz', 'flipVert', 'depth', 'interpolate', 'mask', 'opacity', 'rgbPedestal', 'tex', 'texRes', 'units', 'rgb', 'signalDots', 'noiseDots', 'fieldSize', 'fieldShape', 'fieldPos', 'nDots', 'dotSize', 'dotLife', 'color_triplet')
age_in_days
channel_structure_intervals(self, channel_ids)[source]

find on a list of channels the intervals of channels inserted into particular structures

Parameters:
channel_ids : list

A list of channel ids

structure_id_key : str

use this column for numerically identifying structures

structure_label_key : str

use this column for human-readable structure identification

Returns:
labels : np.ndarray

for each detected interval, the label associated with that interval

intervals : np.ndarray

one element longer than labels. Start and end indices for intervals.

conditionwise_spike_statistics(self, stimulus_presentation_ids=None, unit_ids=None, use_rates=False)[source]

Produce summary statistics for each distinct stimulus condition

Parameters:
stimulus_presentation_ids : array-like

identifies stimulus presentations from which spikes will be considered

unit_ids : array-like

identifies units whose spikes will be considered

use_rates : bool, optional

If True, use firing rates. If False, use spike counts.

Returns:
pd.DataFrame :

Rows are indexed by unit id and stimulus condition id. Values are summary statistics describing spikes emitted by a specific unit across presentations within a specific condition.

classmethod from_nwb_path(path, nwb_version=2, api_kwargs=None, **kwargs)[source]
full_genotype
get_current_source_density(self, probe_id)[source]

Obtain current source density (CSD) of trial-averaged response to a flash stimuli for this probe. See allensdk.brain_observatory.ecephys.current_source_density for details of CSD calculation.

CSD is computed with a 1D method (second spatial derivative) without prior spatial smoothing User should apply spatial smoothing of their choice (e.g., Gaussian filter) to the computed CSD

Parameters:
probe_id : int

identify the probe whose CSD data ought to be loaded

Returns:
xr.DataArray :

dimensions are channel (id) and time (seconds, relative to stimulus onset). Values are current source density assessed on that channel at that time (V/m^2)

get_inter_presentation_intervals_for_stimulus(self, stimulus_names)[source]

Get a subset of this session’s inter-presentation intervals, filtered by stimulus name.

Parameters:
stimulus_names : array-like of str

The names of stimuli to include in the output.

Returns:
pd.DataFrame :

inter-presentation intervals, filtered to the requested stimulus names.

get_invalid_times(self)[source]

Report invalid time intervals with tags describing the scope of invalid data

The tags format: [scope,scope_id,label]

scope:
‘EcephysSession’: data is invalid across session ‘EcephysProbe’: data is invalid for a single probe
label:
‘all_probes’: gain fluctuations on the Neuropixels probe result in missed spikes and LFP saturation events ‘stimulus’ : very long frames (>3x the normal frame length) make any stimulus-locked analysis invalid ‘probe#’: probe # stopped sending data during this interval (spikes and LFP samples will be missing) ‘optotagging’: missing optotagging data
Returns:
pd.DataFrame :

Rows are invalid intervals, columns are ‘start_time’ (s), ‘stop_time’ (s), ‘tags’

get_lfp(self, probe_id, mask_invalid_intervals=True)[source]

Load an xarray DataArray with LFP data from channels on a single probe

Parameters:
probe_id : int

identify the probe whose LFP data ought to be loaded

mask_invalid_intervals : bool

if True (default) will mask data in the invalid intervals with np.nan

Returns
——-
xr.DataArray :

dimensions are channel (id) and time (seconds). Values are sampled LFP data.

Notes

Unlike many other data access methods on this class. This one does not cache the loaded data in memory due to the large size of the LFP data.

get_parameter_values_for_stimulus(self, stimulus_name, drop_nulls=True)[source]

For each stimulus parameter, report the unique values taken on by that parameter while a named stimulus was presented.

Parameters:
stimulus_name : str

filter to presentations of this stimulus

Returns:
dict :

maps parameters (column names) to their unique values.

get_pupil_data(self, suppress_pupil_data: bool = True) → pandas.core.frame.DataFrame[source]

Return a dataframe with eye tracking data

Parameters:
suppress_pupil_data : bool, optional

Whether or not to suppress eye gaze mapping data in output dataframe, by default True.

Returns:
pd.DataFrame
Contains columns for eye, pupil and cr ellipse fits:

*_center_x *_center_y *_height *_width *_phi

May also contain raw/filtered columns for gaze mapping if suppress_pupil_data is set to False:

*_eye_area *_pupil_area *_screen_coordinates_x_cm *_screen_coordinates_y_cm *_screen_coordinates_spherical_x_deg *_screen_coorindates_spherical_y_deg

get_stimulus_epochs(self, duration_thresholds=None)[source]

Reports continuous periods of time during which a single kind of stimulus was presented flipVert

duration_thresholds : dict, optional
keys are stimulus names, values are floating point durations in seconds. All epochs with
  • a given stimulus name
  • a duration shorter than the associated threshold

will be removed from the results

get_stimulus_parameter_values(self, stimulus_presentation_ids=None, drop_nulls=True)[source]

For each stimulus parameter, report the unique values taken on by that parameter throughout the course of the session.

Parameters:
stimulus_presentation_ids : array-like, optional

If provided, only parameter values from these stimulus presentations will be considered.

Returns:
dict :

maps parameters (column names) to their unique values.

get_stimulus_table(self, stimulus_names=None, include_detailed_parameters=False, include_unused_parameters=False)[source]

Get a subset of stimulus presentations by name, with irrelevant parameters filtered off

Parameters:
stimulus_names : array-like of str

The names of stimuli to include in the output.

Returns:
pd.DataFrame :

Rows are filtered presentations, columns are the relevant subset of stimulus parameters

metadata
num_channels
num_probes
num_stimulus_presentations
num_units
presentationwise_spike_counts(self, bin_edges, stimulus_presentation_ids, unit_ids, binarize=False, dtype=None, large_bin_size_threshold=0.001, time_domain_callback=None)[source]

Build an array of spike counts surrounding stimulus onset per unit and stimulus frame.

bin_edges : numpy.ndarray
Spikes will be counted into the bins defined by these edges. Values are in seconds, relative to stimulus onset.
stimulus_presentation_ids : array-like
Filter to these stimulus presentations
unit_ids : array-like
Filter to these units
binarize : bool, optional
If true, all counts greater than 0 will be treated as 1. This results in lower storage overhead, but is only reasonable if bin sizes are fine (<= 1 millisecond).
large_bin_size_threshold : float, optional
If binarize is True and the largest bin width is greater than this value, a warning will be emitted.
time_domain_callback : callable, optional
The time domain is a numpy array whose values are trial-aligned bin edges (each row is aligned to a different trial). This optional function will be applied to the time domain before counting spikes.
Returns:
xarray.DataArray :

Data array whose dimensions are stimulus presentation, unit, and time bin and whose values are spike counts.

presentationwise_spike_times(self, stimulus_presentation_ids=None, unit_ids=None)[source]

Produce a table associating spike times with units and stimulus presentations

Parameters:
stimulus_presentation_ids : array-like

Filter to these stimulus presentations

unit_ids : array-like

Filter to these units

Returns:
pandas.DataFrame :
Index is
spike_time : float

On the session’s master clock.

Columns are
stimulus_presentation_id : int

The stimulus presentation on which this spike occurred.

unit_id : int

The unit that emitted this spike.

rig_equipment_name
rig_geometry_data
session_type
sex
specimen_name
spike_times
stimulus_conditions
stimulus_names
stimulus_presentations
structure_acronyms
structurewise_unit_counts
units
allensdk.brain_observatory.ecephys.ecephys_session.array_intervals(array)[source]

find interval bounds (bounding consecutive identical values) in an array

Parameters:
array : np.ndarray
Returns:
np.ndarray :

start and end indices of detected intervals (one longer than the number of intervals)

allensdk.brain_observatory.ecephys.ecephys_session.build_spike_histogram(time_domain, spike_times, unit_ids, dtype=None, binarize=False)[source]
allensdk.brain_observatory.ecephys.ecephys_session.build_time_window_domain(bin_edges, offsets, callback=None)[source]
allensdk.brain_observatory.ecephys.ecephys_session.coerce_scalar(value, message, warn=False)[source]
allensdk.brain_observatory.ecephys.ecephys_session.is_distinct_from(left, right)[source]
allensdk.brain_observatory.ecephys.ecephys_session.nan_intervals(array, nan_like=['null'])[source]

find interval bounds (bounding consecutive identical values) in an array, which may contain nans

Parameters:
array : np.ndarray
Returns:
np.ndarray :

start and end indices of detected intervals (one longer than the number of intervals)

allensdk.brain_observatory.ecephys.ecephys_session.removed_unused_stimulus_presentation_columns(stimulus_presentations)[source]