This readme file was generated on 2024-05-30 by Minhaj Hussain. ------------------- GENERAL INFORMATION ------------------- Title of Dataset: Data and scripts from: Highly efficient modeling and optimization of neural fiber responses to electrical stimulation. Abstract: Peripheral neuromodulation has emerged as a powerful modality for controlling physiological functions and treating a variety of medical conditions including chronic pain and organ dysfunction. The underlying complexity of the nonlinear responses to electrical stimulation make it challenging to design precise and effective neuromodulation protocols. Computational models have thus become indispensable in advancing our understanding and control of neural responses to electrical stimulation. However, existing approaches suffer from computational bottlenecks, rendering them unsuitable for real-time applications, large-scale parameter sweeps, or sophisticated optimization. In this work, we introduce an approach for massively parallel estimation and optimization of neural fiber responses to electrical stimulation using machine learning techniques. By leveraging advances in high-performance computing and parallel programming, we present a surrogate fiber model that generates spatiotemporal responses to a wide variety of electrical peripheral nerve stimulation protocols. We used our surrogate fiber model to design stimulation parameters for selective stimulation of pig and human vagus nerves. Our approach yields a several-orders-of-magnitude improvement in computational efficiency while retaining generality and high predictive accuracy, demonstrating its robustness and potential to enhance the design and optimization of peripheral neuromodulation therapies. First Author: Name: Minhaj Hussain Institution: Duke University Email: minhaj.hussain@duke.edu ORCID: 0000-0002-4833-7046 Principal Investigator (Corresponding Author): Name: Nicole Pelot PhD. Institution: Duke University Email: nikki.pelot@duke.edu ORCID: 0000-0003-2844-0190 Funding: NIH SPARC OT2 OD025340 and 75N98022C00018 Licensing: Data is licensed under Creative Commons CC0 1.0 Universal. ASCENT code license can be found at https://github.com/wmglab-duke/ascent?tab=License-1-ov-file. -------------------- DATA & FILE OVERVIEW -------------------- Data and analysis scripts (to generate figures) are organised into 4 subdirectories: `ascent`, `figures`, `inputs`, and `sourcedata`. --------- ./ascent/ --------- - Contains data to support vagus nerve stimulation FEM-ing with ASCENT [v1.2.1 (DOI: 10.5281/zenodo.7627427)]. ---------- ./figures/ ---------- - Contains data and scripts required to generate all figures. --------- ./inputs/ --------- - Contains data required to run examples included in the code repositories associated with the paper (https://github.com/minhajh/cajal and https://github.com/minhajh/axonml). ------------- ./sourcedata/ ------------- - Collects all data used to generate all figures into a single human-readable EXCEL file (source_data.xlsx). - All column headers and variables are explained in ./sourcedata/key.xlsx - Numerical IDs are used to identify nerves in some datasets; conversions to the corresponding nerve labels used in the paper are in ./sourcedata/id_to_nerve.xlsx. Data file formats used: .csv, .xlsx (Excel), .txt, .npy (NumPy), .json, .tif, .h5 (hdf5), .mmap (NumPy memory-mapped file). ------- DETAILS ------- --------- ./ascent/ --------- There are 3 further subdirectories: - ./ascent/input/ : Each nerve has a set of binary .tif images and an associated sample.json configuration file as required by ASCENT. - ./ascent/config/ : Contains custom configuration files (in particular ./ascent/config/system/cuffs/ImThera.json and ImTheraShrunken_flip_100.json that implement the original ImThera cuff and shrunk ImThera cuff, respectively). To use, simply copy the contents into your ascent/config directory. Remember to modify env.json to reflect your system and installation. - ./ascent/src : Contains custom modifications to ASCENT (ascent/src/core/sample.py) to support loading of the included binary .tif images. To use, simply copy the contents into your ascent/src directory. Additionally, there is an example model.json file (replace the cuff as necessary to reproduce the various model configurations reported in the paper). NOTE: solving a single nerve FEM can take several hours and 100s of GBs of RAM (especially larger pig nerves) on a multicore PC. ---------- ./figures/ ---------- All figures are saved (in .svg format) to the directory `./output/` (which will be generated if it does not exist). Data is stored primarily in csv format. Variables / column headers are explained in ./sourcedata/key.xlsx. - figure2.py : - Loads './data/figure2/figure2.csv' and generates figures 2a, 2b, 2c, and 2d. - Loads './data/figure2/figure2e.csv' and generates figure 2e. - Loads './data/figure2/figure2f.csv' and generates figure 2f. - figure3.py : - Loads './data/figure3/a_stim_{0-3}.npy' and './data/figure3/a_stim_{0-3}_nrn.npy' which contains spatiotemporal recordings of Vm at 4 suprathreshold stimulation amplitudes for S-MF and NEURON models respectively and generates figure 3a. - Loads './data/figure3/figure3b.csv' and generates figure 3b. - Loads './data/figure3/c_nrn_{0-3}.npy' and './data/figure3/c_surr_collision.npy' which contains spatiotemporal recordings of Vm for 4 fiber diameters with intracellular stimulation for NEURON and S-MF models respectively and generates figure 3c. - Loads './data/figure3/figure3d.csv' and generates figure 3d. - figure4_5_stats.py : - Loads './data/figure4_5/figure4e.csv' and generates figures for 4e (fig4e_{1-3}.svg). - Loads './data/figure4_5/figure4f.csv' and generates figures for 4f (fig4f_{1-2}.svg). - Loads './data/figure4_5/figure5d.csv' and generates figures for 5d (fig5d_{1-3}.svg). - Loads './data/figure4_5/figure5e.csv' and generates figures for 5e (fig5e_{1-2}.svg). - Conducts all pairwise comparisons (two sided Wilcoxon signed-rank test) of optimization performance between different optimization scenarios (algorithm / model / waveform constraint) and prints p and Holm-Bonferroni corrected critical p values to terminal. Other scripts generate supplementary figures. All supplementary data is stored in .csv format. - supplementary_figure4.py : - In addition to summary activation data and optimized waveform data stored in supplement_figure4_activations.csv, supplement_figure4_arbitrary.csv, and supplement_figure4_biphasic.csv, geometry data defining fascicle and nerve boundaries are loaded from './samples/{sample_id}/slides/0/0/sectionwise2d/ in .txt format and rendered using shapely. Additionally, geometric information regarding the rotation, expansion, and relative location of the ImThera cuff on each model is loaded from './samples/{sample_id}/models/0/model.json' and used to render the 2D representation of the cuff geometry. - All cross-section figures are saved to './output/cross_sections' and all optimized waveform figures are saved to './output/optimized_waveforms'. --------- ./inputs/ --------- There are two subdirectories: --------------- ./inputs/axonml --------------- To use these inputs with the axonml examples, dataset generation, and model training scripts, simply copy the contents into the cloned axonml directory (https://github.com/minhajh/axonml). We include example training (./inputs/axonml/axonml/data/example_datasets/train_example.h5) and validation datasets. These are truncated - running the training algorithm on these data will only execute the first training step of each training epoch. The full training set used to generate S-MF was >100GB; we include extracellular field data (./inputs/axonml/axonml/data/fields/{H and M}/) required to generate your own training / validation datasets with the dataset generating scripts provided in the axonml repository. The contents of './inputs/axonml/examples/' support running example kHz frequency stimulation modeling, optimization, and activation thresholding applications: --- kHz --- Contains extracellular field distributions in .npy format in khz/fields. Arrays {i}.npy for each contact in the ImTHera cuff, where each is a 2D array with dimensions n_axons, n_samples, where n_axons is the total number of axons in the model and n_samples are the number of longitudinal locations along the length of the nerve at which the extracellular potential has been sampled. fiber_z.npy is a 1D array length n_samples, coresponding to the longitudinal coordinates (in um) at which the field has been sampled. Contains ground truth data in .npy format in khz/ground_truth, with filename format {frequency}khz_{d}um_2_3_0_{a_idx}.npy, where frequency is the extracellular stimulation frequency in kHz, d is the fiber diameter in um, and a_idx is the numerical index of the fascicle in which the fiber is being simulated. ------------ optimization ------------ Contains data defining all optimization problems for all nerves used in the paper are specified under individual subdirectories in optimization/samples/. For example, the contents of directory P5 are: ./samples/P5 ├── 0.npy ├── 1.npy ├── 2.npy ├── 3.npy ├── 4.npy ├── 5.npy ├── fiber_z.npy ├── fibers_xy.npy ├── target.npy └── weights.npy All optimization problems include: * Arrays {i}.npy for each contact in the electrode, where i is the contact index (in the case of P3 there are 6 corresponding to the 6 contacts of the ImThera cuff). Each is a 2D array with dimensions n_axons, n_samples, where n_axons is the total number of axons in the model and n_samples are the number of longitudinal locations along the length of the nerve at which the extracellular potential has been sampled. * Array fiber_z.npy. This is a 1D array length n_samples, coresponding to the longitudinal coordinates (in um) at which the field has been sampled. * Array target.npy. This is a 1D boolean array length n_axons indicating which axons are targeted for activation (1 / True) and which aren't (0 / False). * Array weights.npy. This is a 1D boolean array length n_axons indicating the proportion of fascicular area that each axon represents. The sum over all elements in weights.npy must be 1. ------------ thresholding ------------ Two example datasets are provided for use with the example thresholding application proxided with axonml. One is for thresholds with the ImThera cuff (monopolar stimulation), and the other with the Livanova Helical cuff (bipolar stimulation). For example, the contents of example_imthera: ./thresholding/example_imthera ├── example_diamters_imthera.npy ├── example_field_array_imthera.mmap └── example_threshold_imthera.npy * example_diamters_imthera.npy contains the diameters of the example fibers (in um). * example_field_array_imthera.mmap contains the extracellular spatiotemporal fields (in mV) due to a unit stimulus (1 mA) for each exaple thresholding task. * example_threshold_imthera.npy contains the ground-truth (evaluated with NEURON) threshold (in mA) for each extracellular field and associated fiber diameter. The same descriptions apply for the contents of ./thresholding/example_livanova -------------- ./inputs/cajal -------------- To use these inputs with the cajal examples, simply copy the contents into the cloned cajal directory (https://github.com/minhajh/cajal). These consist of inputs for optimization problems in the same format and file structure as specified above. -------------------------- METHODOLOGICAL INFORMATION -------------------------- We used ASCENT v1.2.1 (DOI: 10.5281/zenodo.7627427). Analysis / plotting scripts were run using: - Python 3.9 - pandas 1.5.3 (with Excel support) - numpy 1.24.2 - matplotlib 3.7.0 - seaborn 0.12.2 - scipy 1.10.1 - natsort 8.3.1 - shapely 2.0.1 - geopandas 0.14.4