pip install allensdk


!pip install --upgrade pip
!pip install allensdk


from pathlib import Path
import matplotlib.pyplot as plt

import allensdk
from allensdk.brain_observatory.behavior.behavior_project_cache import VisualBehaviorNeuropixelsProjectCache

# Confirming your allensdk version
print(f"Your allensdk version is: {allensdk.__version__}")

Your allensdk version is: 2.13.5


# Update this to a valid directory in your filesystem
data_storage_directory = Path("/tmp/vbn_cache")

cache = VisualBehaviorNeuropixelsProjectCache.from_s3_cache(cache_dir=data_storage_directory)

ecephys_sessions.csv: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████| 63.5k/63.5k [00:00<00:00, 146kMB/s]
behavior_sessions.csv: 100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████| 531k/531k [00:00<00:00, 959kMB/s]
units.csv: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 132M/132M [00:14<00:00, 8.87MMB/s]
probes.csv: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 127k/127k [00:00<00:00, 614kMB/s]
channels.csv: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 27.9M/27.9M [00:09<00:00, 3.01MMB/s]


cache.list_manifest_file_names()

['visual-behavior-neuropixels_project_manifest_v0.1.0.json',
 'visual-behavior-neuropixels_project_manifest_v0.2.0.json',
 'visual-behavior-neuropixels_project_manifest_v0.3.0.json']
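
The manifest file names listed above embed a three-part version number. As a minimal sketch, assuming every manifest follows the `..._project_manifest_vMAJOR.MINOR.PATCH.json` pattern seen in that output, the newest one can be picked by comparing the version numerically rather than as a string. The helper names `manifest_version` and `newest_manifest` are hypothetical and are not part of the AllenSDK.

```python
import re
from typing import List, Tuple

# Pattern inferred from the file names printed above; this is an assumption,
# not a documented naming guarantee.
MANIFEST_PATTERN = re.compile(r"_project_manifest_v(\d+)\.(\d+)\.(\d+)\.json$")


def manifest_version(file_name: str) -> Tuple[int, ...]:
    """Return the (major, minor, patch) version encoded in a manifest file name."""
    match = MANIFEST_PATTERN.search(file_name)
    if match is None:
        raise ValueError(f"Unrecognized manifest file name: {file_name}")
    return tuple(int(part) for part in match.groups())


def newest_manifest(file_names: List[str]) -> str:
    """Pick the file name with the highest version, compared numerically."""
    return max(file_names, key=manifest_version)


manifest_names = [
    "visual-behavior-neuropixels_project_manifest_v0.1.0.json",
    "visual-behavior-neuropixels_project_manifest_v0.2.0.json",
    "visual-behavior-neuropixels_project_manifest_v0.3.0.json",
]
print(newest_manifest(manifest_names))
```

Comparing integer tuples avoids the string-ordering pitfall where a version such as `0.10.0` would sort before `0.9.0`.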


cache.latest_manifest_file()

'visual-behavior-neuropixels_project_manifest_v0.3.0.json'


cache.latest_downloaded_manifest_file()

'visual-behavior-neuropixels_project_manifest_v0.3.0.json'


cache.list_all_downloaded_manifests()

['visual-behavior-neuropixels_project_manifest_v0.3.0.json']


cache.current_manifest()

'visual-behavior-neuropixels_project_manifest_v0.3.0.json'


cache.load_manifest('visual-behavior-neuropixels_project_manifest_v0.2.0.json')

/Users/adam.amster/AllenSDK/allensdk/api/cloud_cache/cloud_cache.py:466: OutdatedManifestWarning: 

The manifest file you are loading is not the most up to date manifest file available for this dataset. The most up to data manifest file available for this dataset is 

visual-behavior-neuropixels_project_manifest_v0.3.0.json

To see the differences between these manifests,run

VisualBehaviorNeuropixelsProjectCache.compare_manifests('visual-behavior-neuropixels_project_manifest_v0.2.0.json', 'visual-behavior-neuropixels_project_manifest_v0.3.0.json')

To see all of the manifest files currently downloaded onto your local system, run

self.list_all_downloaded_manifests()

If you just want to load the latest manifest, run

self.load_latest_manifest()


  warnings.warn(msg, OutdatedManifestWarning)


cache.current_manifest()

'visual-behavior-neuropixels_project_manifest_v0.2.0.json'


# Compare two manifest versions to see which files changed between data releases

msg = cache.compare_manifests('visual-behavior-neuropixels_project_manifest_v0.1.0.json',
                              'visual-behavior-neuropixels_project_manifest_v0.2.0.json')
print(msg)

Changes going from
visual-behavior-neuropixels_project_manifest_v0.1.0.json
to
visual-behavior-neuropixels_project_manifest_v0.2.0.json

project_metadata/units.csv changed


ecephys_sessions = cache.get_ecephys_session_table()

print(f"Total number of ecephys sessions: {len(ecephys_sessions)}")

ecephys_sessions.head()

Total number of ecephys sessions: 103


behavior_sessions = cache.get_behavior_session_table()

print(f"Total number of behavior sessions: {len(behavior_sessions)}")

behavior_sessions.head()

Total number of behavior sessions: 3424


probes = cache.get_probe_table()

print(f"Total number of probes: {len(probes)}")

probes.head()

Total number of probes: 905


channels = cache.get_channel_table()

print(f"Total number of channels: {len(channels)}")

channels.head()

Total number of channels: 347520
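
As a quick sanity check on the counts printed so far, the channel total can be divided by the probe total. The rows shown in the probes table preview all report a `channel_count` of 384; whether every probe has exactly that many channels is an assumption here, but the arithmetic is consistent with it.

```python
# Totals copied from the printed output of the probes and channels tables.
total_probes = 905
total_channels = 347520

# If every probe contributes the same number of channels, the division is exact.
channels_per_probe, remainder = divmod(total_channels, total_probes)
print(f"{channels_per_probe} channels per probe, remainder {remainder}")
```

A zero remainder with 384 channels per probe matches the `channel_count` column in the probes table preview.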


units = cache.get_unit_table()

print(f"Total number of units: {len(units)}")

units.head()

Total number of units: 319013


ecephys_session = cache.get_ecephys_session(ecephys_session_id=1052533639)

ecephys_session_1052533639.nwb: 100%|██████████████████████████████████████████████████████████████████████████████████████████████| 2.31G/2.31G [05:34<00:00, 6.92MMB/s]
/opt/anaconda3/envs/allensdk/lib/python3.8/site-packages/hdmf/spec/namespace.py:532: UserWarning: Ignoring cached namespace 'hdmf-common' version 1.5.1 because version 1.5.0 is already loaded.
  warn("Ignoring cached namespace '%s' version %s because version %s is already loaded."
/opt/anaconda3/envs/allensdk/lib/python3.8/site-packages/hdmf/spec/namespace.py:532: UserWarning: Ignoring cached namespace 'hdmf-experimental' version 0.2.0 because version 0.1.0 is already loaded.
  warn("Ignoring cached namespace '%s' version %s because version %s is already loaded."


# List methods of the session that can be used to get data
print(ecephys_session.list_data_attributes_and_methods())

['behavior_data_class', 'behavior_session_id', 'eye_tracking', 'eye_tracking_rig_geometry', 'get_channels', 'get_performance_metrics', 'get_reward_rate', 'get_rolling_performance_df', 'get_units', 'licks', 'mean_waveforms', 'metadata', 'optotagging_table', 'probes', 'raw_running_speed', 'rewards', 'running_speed', 'spike_amplitudes', 'spike_times', 'stimulus_presentations', 'stimulus_templates', 'stimulus_timestamps', 'task_parameters', 'trials']


# Listing the different stimulus templates
ecephys_session.stimulus_templates


# Visualizing a particular stimulus
plt.imshow(ecephys_session.stimulus_templates['warped']['im104_r'], cmap='gray')

<matplotlib.image.AxesImage at 0x7f9a57ece790>


# Remove rows from the ecephys sessions table which don't correspond to an ecephys session NWB file
filtered_ecephys_sessions = ecephys_sessions.dropna(subset=["file_id"])

for ecephys_session_id, _ in filtered_ecephys_sessions.iterrows():
    cache.get_ecephys_session(ecephys_session_id=ecephys_session_id)


from urllib.parse import urljoin

def get_manifest_url(manifest_version: str) -> str:
    hostname = "https://visual-behavior-neuropixels-data.s3.us-west-2.amazonaws.com"
    object_key = f"visual-behavior-neuropixels/manifests/visual-behavior-neuropixels_project_manifest_v{manifest_version}.json"
    return urljoin(hostname, object_key)

# Example:
print(get_manifest_url("0.1.0"))

https://visual-behavior-neuropixels-data.s3.us-west-2.amazonaws.com/visual-behavior-neuropixels/manifests/visual-behavior-neuropixels_project_manifest_v0.1.0.json


def get_metadata_url(metadata_table_name: str) -> str:
    hostname = "https://visual-behavior-neuropixels-data.s3.us-west-2.amazonaws.com"
    object_key = f"visual-behavior-neuropixels/project_metadata/{metadata_table_name}.csv"
    return urljoin(hostname, object_key)

# Example:
print(get_metadata_url("behavior_sessions"))

https://visual-behavior-neuropixels-data.s3.us-west-2.amazonaws.com/visual-behavior-neuropixels/project_metadata/behavior_sessions.csv


def get_ecephys_session_url(ecephys_session_id: int) -> str:
    # Build the direct S3 URL of the NWB file for one ecephys session
    hostname = "https://visual-behavior-neuropixels-data.s3.us-west-2.amazonaws.com"
    object_key = f"visual-behavior-neuropixels/ecephys_sessions/ecephys_session_{ecephys_session_id}.nwb"
    return urljoin(hostname, object_key)

# Example:
print(get_ecephys_session_url(1052533639))

https://visual-behavior-neuropixels-data.s3.us-west-2.amazonaws.com/visual-behavior-neuropixels/ecephys_sessions/ecephys_session_1052533639.nwb
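
Once a URL like the ones above has been built, the file can be fetched with the standard library alone. The following is a minimal sketch, not an AllenSDK feature: `local_path_for_url` and `download` are hypothetical helpers, and the `/tmp/vbn_direct` destination is an arbitrary choice. The download call is left commented out because the session file shown earlier was about 2.3 GB.

```python
from pathlib import Path
from urllib.parse import urlsplit
from urllib.request import urlretrieve


def local_path_for_url(url: str, dest_dir: Path) -> Path:
    """Mirror the S3 object key under dest_dir, ignoring any query string."""
    object_key = urlsplit(url).path.lstrip("/")
    return dest_dir / object_key


def download(url: str, dest_dir: Path) -> Path:
    """Download url into dest_dir, creating parent directories as needed."""
    target = local_path_for_url(url, dest_dir)
    target.parent.mkdir(parents=True, exist_ok=True)
    urlretrieve(url, str(target))
    return target


example_url = (
    "https://visual-behavior-neuropixels-data.s3.us-west-2.amazonaws.com/"
    "visual-behavior-neuropixels/ecephys_sessions/ecephys_session_1052533639.nwb"
)
print(local_path_for_url(example_url, Path("/tmp/vbn_direct")))

# Uncomment to actually download the file (large!):
# download(example_url, Path("/tmp/vbn_direct"))
```

Keeping the object key as the local relative path means files from different folders of the bucket cannot overwrite each other.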


from typing import List
from urllib.parse import urljoin
import json

# The location will differ based on where you downloaded the manifest.json!
my_manifest_location = data_storage_directory / 'visual-behavior-neuropixels_project_manifest_v0.3.0.json'

def generate_all_download_urls_from_manifest(manifest_path: Path) -> List[str]:
    with manifest_path.open('r') as fp:
        manifest = json.load(fp)
    
    download_links = []
    
    # Get download links for specific version of metadata files
    for metadata_file_entry in manifest["metadata_files"].values():
        base_download_url = metadata_file_entry["url"]
        version_query = f"?versionId={metadata_file_entry['version_id']}"
        full_download_url = urljoin(base_download_url, version_query)
        download_links.append(full_download_url)

    # Get download links for specific version of data files
    for data_file_entry in manifest["data_files"].values():
        base_download_url = data_file_entry["url"]
        version_query = f"?versionId={data_file_entry['version_id']}"
        full_download_url = urljoin(base_download_url, version_query)
        download_links.append(full_download_url)    

    return download_links

# Example:
print('\n'.join(generate_all_download_urls_from_manifest(my_manifest_location)))
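
To make the shape of the result concrete without downloading a real manifest, here is a small sketch that applies the same `urljoin` step to a made-up manifest fragment. The `example.com` URLs and the `version_id` strings are placeholders invented for illustration; only the key names (`metadata_files`, `data_files`, `url`, `version_id`) are taken from the function above.

```python
import json
from urllib.parse import urljoin

# Invented manifest fragment: the URLs and version ids are NOT real.
toy_manifest = json.loads("""
{
  "metadata_files": {
    "units": {"url": "https://example.com/project_metadata/units.csv",
              "version_id": "PLACEHOLDER_A"}
  },
  "data_files": {
    "1": {"url": "https://example.com/ecephys_sessions/ecephys_session_1.nwb",
          "version_id": "PLACEHOLDER_B"}
  }
}
""")


def versioned_url(entry: dict) -> str:
    """Attach the entry's version id to its URL as a versionId query string."""
    # Joining a query-only reference keeps the base path and sets the query.
    return urljoin(entry["url"], f"?versionId={entry['version_id']}")


for section in ("metadata_files", "data_files"):
    for entry in toy_manifest[section].values():
        print(versioned_url(entry))
```

Each printed link is the original URL followed by `?versionId=...`, which is how a specific stored version of the object is requested.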

ecephys_session_id	behavior_session_id	date_of_acquisition	equipment_name	session_type	mouse_id	genotype	sex	project_code	age_in_days	unit_count	...	channel_count	structure_acronyms	image_set	prior_exposures_to_image_set	session_number	experience_level	prior_exposures_to_omissions	file_id	abnormal_histology	abnormal_activity
1052342277	1052374521	2020-09-23 15:34:18.179	NP.1	EPHYS_1_images_G_3uL_reward	530862	Vip-IRES-Cre/wt;Ai32(RCL-ChR2(H134R)_EYFP)/wt	M	NeuropixelVisualBehavior	148	1696.0	...	2304.0	['APN', 'CA1', 'CA3', 'DG-mo', 'DG-po', 'DG-sg...	G	32.0	1	Familiar	0.0	0	NaN	NaN
1051155866	1052162536	2020-09-17 15:05:39.665	NP.1	EPHYS_1_images_H_3uL_reward	524760	wt/wt	F	NeuropixelVisualBehavior	180	1922.0	...	2304.0	['APN', 'CA1', 'CA3', 'DG-mo', 'DG-po', 'DG-sg...	H	0.0	2	Novel	1.0	1	NaN	NaN
1052533639	1052572359	2020-09-24 15:12:13.229	NP.1	EPHYS_1_images_H_3uL_reward	530862	Vip-IRES-Cre/wt;Ai32(RCL-ChR2(H134R)_EYFP)/wt	M	NeuropixelVisualBehavior	149	1677.0	...	2304.0	['APN', 'CA1', 'CA3', 'DG-mo', 'DG-po', 'DG-sg...	H	0.0	2	Novel	1.0	4	NaN	NaN
1053925378	1053960984	2020-10-01 16:07:18.990	NP.0	EPHYS_1_images_H_3uL_reward	532246	Vip-IRES-Cre/wt;Ai32(RCL-ChR2(H134R)_EYFP)/wt	M	NeuropixelVisualBehavior	145	1823.0	...	2304.0	['APN', 'CA1', 'CA3', 'DG-mo', 'DG-po', 'DG-sg...	H	0.0	2	Novel	1.0	5	NaN	NaN
1053941483	1053960987	2020-10-01 17:03:58.362	NP.1	EPHYS_1_images_H_3uL_reward	527749	Sst-IRES-Cre/wt;Ai32(RCL-ChR2(H134R)_EYFP)/wt	M	NeuropixelVisualBehavior	180	1543.0	...	2304.0	['APN', 'CA1', 'CA3', 'DG-mo', 'DG-po', 'DG-sg...	H	0.0	2	Novel	1.0	6	NaN	NaN

behavior_session_id	equipment_name	genotype	mouse_id	sex	age_in_days	session_number	prior_exposures_to_session_type	prior_exposures_to_image_set	prior_exposures_to_omissions	ecephys_session_id	date_of_acquisition	session_type	image_set
1051333618	BEH.G-Box2	Vip-IRES-Cre/wt;Ai32(RCL-ChR2(H134R)_EYFP)/wt	540536	M	85	1	0	NaN	0.0	NaN	2020-09-18 10:02:30.869000	TRAINING_0_gratings_autorewards_15min_0uL_reward	NaN
1052301754	BEH.G-Box2	Vip-IRES-Cre/wt;Ai32(RCL-ChR2(H134R)_EYFP)/wt	540536	M	90	4	2	NaN	0.0	NaN	2020-09-23 09:43:25.595000	TRAINING_1_gratings_10uL_reward	NaN
1052374521	NP.1	Vip-IRES-Cre/wt;Ai32(RCL-ChR2(H134R)_EYFP)/wt	530862	M	148	44	0	32.0	0.0	1.052342e+09	2020-09-23 15:34:18.179000	EPHYS_1_images_G_3uL_reward	G
1051860415	BEH.G-Box4	wt/wt	533539	F	127	9	0	3.0	0.0	NaN	2020-09-21 09:57:23.650000	TRAINING_4_images_G_training_7uL_reward	G
1052132182	BEH.F-Box5	Vip-IRES-Cre/wt;Ai32(RCL-ChR2(H134R)_EYFP)/wt	536480	M	112	8	1	1.0	0.0	NaN	2020-09-22 12:04:46.304000	TRAINING_3_images_G_10uL_reward	G

ecephys_probe_id	ecephys_session_id	name	sampling_rate	lfp_sampling_rate	phase	has_lfp_data	unit_count	channel_count	structure_acronyms
1044506933	1044385384	probeB	30000.178402	2500.014867	1.0	True	701	384	['CA1', 'DG', 'LP', 'POL', 'PoT', 'VISpm', 'ro...
1044506934	1044385384	probeC	30000.049852	2500.004154	1.0	True	307	384	['MB', 'MRN', 'POST', 'SCig', 'VISp', 'root']
1044506935	1044385384	probeD	30000.029115	2500.002426	1.0	True	521	384	['CA1', 'CA3', 'DG', 'LGv', 'MB', 'TH', 'VISl'...
1044506936	1044385384	probeE	30000.075851	2500.006321	1.0	True	282	384	['CA1', 'DG', 'MB', 'MGd', 'MGm', 'MRN', 'SGN'...
1044506937	1044385384	probeF	29999.959578	2499.996631	1.0	True	368	384	['CA1', 'DG', 'LP', 'MRN', 'POL', 'PoT', 'SGN'...

ecephys_channel_id	ecephys_probe_id	ecephys_session_id	probe_channel_number	probe_vertical_position	probe_horizontal_position	anterior_posterior_ccf_coordinate	dorsal_ventral_ccf_coordinate	left_right_ccf_coordinate	structure_acronym	unit_count	valid_data
1049365509	1048089911	1047969464	0	20.0	43.0	8445.0	4013.0	6753.0	MRN	0	True
1049365511	1048089911	1047969464	1	20.0	11.0	8443.0	4005.0	6755.0	MRN	5	True
1049365512	1048089911	1047969464	2	40.0	59.0	8441.0	3997.0	6757.0	MRN	0	True
1049365513	1048089911	1047969464	3	40.0	27.0	8439.0	3989.0	6759.0	MRN	5	True
1049365514	1048089911	1047969464	4	60.0	43.0	8438.0	3981.0	6761.0	MRN	7	True

unit_id	ecephys_channel_id	ecephys_probe_id	ecephys_session_id	amplitude_cutoff	anterior_posterior_ccf_coordinate	dorsal_ventral_ccf_coordinate	left_right_ccf_coordinate	cumulative_drift	d_prime	structure_acronym	...	valid_data	amplitude	waveform_duration	waveform_halfwidth	PT_ratio	recovery_slope	repolarization_slope	spread	velocity_above	velocity_below
1157005856	1157001834	1046469925	1046166369	0.500000	8453.0	3353.0	6719.0	140.32	6.088133	MB	...	True	286.132665	0.151089	0.096147	0.310791	-0.227726	0.961313	20.0	-0.457845	NaN
1157005853	1157001834	1046469925	1046166369	0.323927	8453.0	3353.0	6719.0	239.76	4.635583	MB	...	True	181.418835	0.357119	0.192295	0.531490	-0.150522	0.732741	30.0	2.060302	-2.060302
1157005720	1157001786	1046469925	1046166369	0.044133	8575.0	3842.0	6590.0	263.32	5.691955	MRN	...	True	180.866205	0.521943	0.178559	0.612217	-0.024239	0.539687	80.0	0.000000	0.863364
1157006074	1157001929	1046469925	1046166369	0.000583	8212.0	2477.0	6992.0	154.64	6.049284	NOT	...	True	574.984215	0.343384	0.192295	0.470194	-0.356670	2.258649	40.0	1.373534	0.000000
1157006072	1157001929	1046469925	1046166369	0.500000	8212.0	2477.0	6992.0	242.58	4.745499	NOT	...	True	315.794115	0.329648	0.164824	0.488276	-0.210010	1.320270	70.0	0.412060	0.343384

Accessing Visual Behavior Neuropixels Data

Tutorial overview

Options for data access

Using the AllenSDK to retrieve data

Install AllenSDK into your local environment

Install AllenSDK into your notebook environment (good for Google Colab)

Import required packages

Managing versions of the dataset

Discovering manifests

Loading manifests/dataset versions

Using the AllenSDK to access Visual Behavior Neuropixels metadata

Ecephys sessions table

Behavior sessions table

Probes table

Channels table

Units table

Using the AllenSDK to access Visual Behavior Neuropixels data

Downloading the complete dataset with AllenSDK

Direct download of data from S3

Downloading previous versions of released data from S3

Listing and downloading a specific manifest version for the data release

Using a versioned manifest to download a specific data version

image_name	unwarped	warped
im104_r	[[nan, nan, nan, nan, nan, nan, nan, nan, nan,...	[[136, 138, 140, 141, 141, 141, 140, 140, 140,...
im114_r	[[nan, nan, nan, nan, nan, nan, nan, nan, nan,...	[[193, 190, 192, 194, 190, 182, 175, 173, 174,...
im083_r	[[nan, nan, nan, nan, nan, nan, nan, nan, nan,...	[[6, 9, 2, 0, 0, 0, 7, 5, 0, 0, 0, 2, 7, 6, 2,...
im005_r	[[nan, nan, nan, nan, nan, nan, nan, nan, nan,...	[[81, 82, 80, 76, 76, 80, 83, 82, 80, 78, 78, ...
im087_r	[[nan, nan, nan, nan, nan, nan, nan, nan, nan,...	[[38, 39, 34, 28, 28, 35, 41, 39, 34, 31, 33, ...
im024_r	[[nan, nan, nan, nan, nan, nan, nan, nan, nan,...	[[19, 21, 15, 8, 8, 17, 23, 22, 15, 11, 14, 19...
im111_r	[[nan, nan, nan, nan, nan, nan, nan, nan, nan,...	[[53, 55, 50, 44, 45, 51, 56, 56, 52, 49, 50, ...
im034_r	[[nan, nan, nan, nan, nan, nan, nan, nan, nan,...	[[124, 126, 128, 128, 129, 129, 129, 129, 127,...