13.07.2015 Views

Nonlinear Modeling of Causal Interrelationships in Neuronal ...

Nonlinear Modeling of Causal Interrelationships in Neuronal ...

Nonlinear Modeling of Causal Interrelationships in Neuronal ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

336 IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, VOL. 16, NO. 4, AUGUST 2008<strong>Nonl<strong>in</strong>ear</strong> <strong>Model<strong>in</strong>g</strong> <strong>of</strong> <strong>Causal</strong> <strong>Interrelationships</strong><strong>in</strong> <strong>Neuronal</strong> EnsemblesTheodoros P. Zanos, Student Member, IEEE, Spiros H. Courellis, Theodore W. Berger, Senior Member, IEEE,Robert E. Hampson, Sam A. Deadwyler, and Vasilis Z. Marmarelis, Fellow, IEEEAbstract—The <strong>in</strong>creas<strong>in</strong>g availability <strong>of</strong> multiunit record<strong>in</strong>gsgives new urgency to the need for effective analysis <strong>of</strong> “multidimensional”time-series data that are derived from the recordedactivity <strong>of</strong> neuronal ensembles <strong>in</strong> the form <strong>of</strong> multiple sequences <strong>of</strong>action potentials—treated mathematically as po<strong>in</strong>t-processes andcomputationally as spike-tra<strong>in</strong>s. Whether <strong>in</strong> conditions <strong>of</strong> spontaneousactivity or under conditions <strong>of</strong> external stimulation, theobjective is the identification and quantification <strong>of</strong> possible causall<strong>in</strong>ks among the neurons generat<strong>in</strong>g the observed b<strong>in</strong>ary signals.A multiple-<strong>in</strong>put/multiple-output (MIMO) model<strong>in</strong>g methodologyis presented that can be used to quantify the neuronal dynamics <strong>of</strong>causal <strong>in</strong>terrelationships <strong>in</strong> neuronal ensembles us<strong>in</strong>g spike-tra<strong>in</strong>data recorded from <strong>in</strong>dividual neurons. These causal <strong>in</strong>terrelationshipsare modeled as transformations <strong>of</strong> spike-tra<strong>in</strong>s recordedfrom a set <strong>of</strong> neurons designated as the “<strong>in</strong>puts” <strong>in</strong>to spike-tra<strong>in</strong>srecorded from another set <strong>of</strong> neurons designated as the “outputs.”The MIMO model is composed <strong>of</strong> a set <strong>of</strong> multi<strong>in</strong>put/s<strong>in</strong>gle-output(MISO) modules, one for each output. Each module is the cascade<strong>of</strong> a MISO Volterra model and a threshold operator generat<strong>in</strong>gthe output spikes. The Laguerre expansion approach is used toestimate the Volterra kernels <strong>of</strong> each MISO module from therespective <strong>in</strong>put–output data us<strong>in</strong>g the least-squares method. Thepredictive performance <strong>of</strong> the model is evaluated with the use <strong>of</strong>the receiver operat<strong>in</strong>g characteristic (ROC) curve, from which theoptimum threshold is also selected. The Mann–Whitney statistic isused to select the significant <strong>in</strong>puts for each output by exam<strong>in</strong><strong>in</strong>gthe statistical significance <strong>of</strong> improvements <strong>in</strong> the predictiveaccuracy <strong>of</strong> the model when the respective <strong>in</strong>puts is <strong>in</strong>cluded.Illustrative examples are presented for a simulated system and foran actual application us<strong>in</strong>g multiunit data record<strong>in</strong>gs from thehippocampus <strong>of</strong> a behav<strong>in</strong>g rat.Index Terms—Functional connectivity, hippocampus, <strong>in</strong>put selection,Laguerre expansion, Mann–Whitney, multielectrode, multi<strong>in</strong>put,multioutput, nonl<strong>in</strong>ear model<strong>in</strong>g, po<strong>in</strong>t process, receiveroperat<strong>in</strong>g characteristic (ROC) curves, spikes, volterra series.Manuscript received May 11, 2007; revised October 23, 2007; February 22,2008; accepted March 9, 2008. First published June 3, 2008; last published August13, 2008 (projected). This work was supported <strong>in</strong> part by the NationalScience Foundation (NSF) funded Biomimetic Microelectronic Systems-Eng<strong>in</strong>eer<strong>in</strong>gResource Center (BMES-ERC) and the <strong>in</strong> part by the NIH/NIBIBfundedBiomedical Simulations Resource (BMSR).T. P. Zanos is with the Biomedical Eng<strong>in</strong>eer<strong>in</strong>g Department, Biomimetic MicroelectronicSystems-Eng<strong>in</strong>eer<strong>in</strong>g Resource Center (BMES-ERC), BiomedicalSimulations Resource (BMSR), University <strong>of</strong> Southern California, Los Angeles,CA, 90089 USA (e-mail: zanos@usc.edu).S. H. Courellis, T. W. Berger, and V. Z. Marmarelis are with the BiomedicalEng<strong>in</strong>eer<strong>in</strong>g Department, University <strong>of</strong> Southern California, Los Angeles, CA90089 USA (e-mail: shc@usc.edu, berger@usc.edu; vzm@usc.edu,).R. E. Hampson and S. A. Deadwyler are with the Physiology and PharmacologyDepartment, Wake Forest University, W<strong>in</strong>ston-Salem, NC 27157 USA(e-mail: rhampson@wfubmc.edu; sdeadwyl@wfubmc.edu).Color versions <strong>of</strong> one or more <strong>of</strong> the figures <strong>in</strong> this paper are available onl<strong>in</strong>eat http://ieeexplore.ieee.org.Digital Object Identifier 10.1109/TNSRE.2008.926716I. INTRODUCTIONCOMPUTATIONAL methods have been used extensively<strong>in</strong> neuroscience to enhance our understand<strong>in</strong>g <strong>of</strong> <strong>in</strong>formationprocess<strong>in</strong>g by the nervous system—i.e., the dynamic transformation<strong>of</strong> neuronal signals as they flow through neuronalensembles. Quantitative neurophysiological studies have producedcomputational models that seek to describe the functionalrelationships between observed neuronal variables <strong>of</strong> the system<strong>of</strong> <strong>in</strong>terest. S<strong>in</strong>ce bra<strong>in</strong> states are <strong>in</strong>herently labile and characterizedby a high degree <strong>of</strong> complexity, issues such as neuronalcoupl<strong>in</strong>g, neuronal cod<strong>in</strong>g, neuronal feedback, functionalconnectivity (convergence/divergence), and functional <strong>in</strong>tegrationcont<strong>in</strong>ue to be studied extensively because they are not yetadequately understood. The development <strong>of</strong> quantitative functionalmodels is a grow<strong>in</strong>g trend <strong>in</strong> computational neuroscience[1]–[6]. Extensive literature discusses various types <strong>of</strong> modelsthat summarize large amounts <strong>of</strong> experimental data, us<strong>in</strong>g l<strong>in</strong>earor nonl<strong>in</strong>ear methods, depend<strong>in</strong>g on the specific characteristics<strong>of</strong> the system and the goals <strong>of</strong> the study. Generally, the development<strong>of</strong> reliable computational models for neuronal ensembleshas been h<strong>in</strong>dered by the <strong>in</strong>tr<strong>in</strong>sic complexity aris<strong>in</strong>g fromthe fact that neuronal function is nonl<strong>in</strong>ear, dynamic, highly<strong>in</strong>terconnected, and subject to stochastic variations. This taskis further complicated by the vast amount <strong>of</strong> time-series datathat is required to represent the activity <strong>of</strong> a neuronal ensembleand by the many possible <strong>in</strong>teractions among the multiple neuronalunits. The recent availability <strong>of</strong> multiunit data record<strong>in</strong>gs(through multiple electrodes or multielectrode arrays) presentsa unique opportunity to tackle this important problem if effectivemethodologies can be developed for their proper analysis.This provides the motivation for the work presented here<strong>in</strong>The proper analysis <strong>of</strong> multiunit record<strong>in</strong>gs requires efficientmethodologies for obta<strong>in</strong><strong>in</strong>g multiple-<strong>in</strong>put/multiple-output(MIMO) models <strong>in</strong> a nonl<strong>in</strong>ear dynamic context. This presentsus with the daunt<strong>in</strong>g complexity <strong>of</strong> <strong>in</strong>corporat<strong>in</strong>g all exist<strong>in</strong>gnonl<strong>in</strong>ear <strong>in</strong>teractions among the various neuronal units. Whileparametric models (that utilize differential equations to def<strong>in</strong>especific mechanisms or compartments) may provide important<strong>in</strong>sight <strong>in</strong>to the biophysical and physiological aspects <strong>of</strong>neuronal systems, they require proper validation and are facedwith rapidly grow<strong>in</strong>g complexity <strong>in</strong> scal<strong>in</strong>g up to many <strong>in</strong>putsand outputs. On the other hand, nonparametric models (thatutilize Volterra-type functional expansions) provide a generalapproach that is data-based and does not require a priori modelpostulates, while it is able to scale up to multiple <strong>in</strong>puts andoutputs rather gracefully. Us<strong>in</strong>g broadband experimental data,1534-4320/$25.00 © 2008 IEEEAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


ZANOS et al.: NONLINEAR MODELING OF CAUSAL INTERRELATIONSHIPS IN NEURONAL ENSEMBLES 337the nonparametric approach yields data-true models that quantifythe nonl<strong>in</strong>ear dynamic manner <strong>in</strong> which multiple “<strong>in</strong>put”neuronal units <strong>in</strong>teract <strong>in</strong> order to generate the activity <strong>of</strong> acausally connected output unit. Although issues <strong>of</strong> computationalcomplexity still arise for large numbers <strong>of</strong> <strong>in</strong>puts andoutputs, the nonparametric approach allows the methodicalsearch for the important <strong>in</strong>put <strong>in</strong>teractions <strong>in</strong> a manner thatyields complete, robust and accurate models <strong>of</strong> manageablecomplexity.Although l<strong>in</strong>ear methods cont<strong>in</strong>ue to be used <strong>in</strong> model<strong>in</strong>gneuronal systems, the bulk <strong>of</strong> the applications currently utilizenonl<strong>in</strong>ear methods, because nonl<strong>in</strong>earities are <strong>in</strong>herent andessential <strong>in</strong> most neuronal systems [7], [8]. <strong>Nonl<strong>in</strong>ear</strong> model<strong>in</strong>ghas been applied successfully to several neuronal systems todate, such as the ret<strong>in</strong>a [9]–[12], the auditory cortex [13],[14], the visual cortex [15]–[18], the somatosensory system[19], the motor cortex [20], [21], the cerebellum [22], and thehippocampus [23]–[25]. At the level <strong>of</strong> s<strong>in</strong>gle neurons, muchwork has been done us<strong>in</strong>g multicompartmental models basedon biophysical pr<strong>in</strong>ciples. Such models attempt to captureknown attributes <strong>of</strong> neuronal structure and function that havebeen revealed by electrophysiological <strong>in</strong>vestigations. Thesemodels typically conta<strong>in</strong> a fair number <strong>of</strong> unknown parametersthat must be adjusted <strong>in</strong> each particular case on the basis <strong>of</strong>experimental measurements. The use <strong>of</strong> these models has shedlight on several aspects <strong>of</strong> neuronal function, such as synaptictransmission and somato-dendritic <strong>in</strong>tegration [26], [27]. Althoughsuch parametric models are extremely useful (whenaccurate), they present considerable challenges <strong>in</strong> a practicalcontext with regard to the task <strong>of</strong> model structure specificationand the requisite complexity to approximate the actual system(trade-<strong>of</strong>f between model fidelity and tractability). The problem<strong>of</strong> model complexity becomes especially acute when we needto scale up the model to accommodate multiple <strong>in</strong>puts andoutputs. Limit<strong>in</strong>g the model parameters to a tractable numbercan lead to oversimplification <strong>of</strong> the actual system dynamics,while <strong>in</strong>creas<strong>in</strong>g the number <strong>of</strong> parameters can raise the modelcomplexity to an impractical level. The issue <strong>of</strong> model complexityand fidelity is fundamental for the particular subject<strong>of</strong> this paper: the model<strong>in</strong>g <strong>of</strong> neuronal systems with multiple<strong>in</strong>puts and outputs. The use <strong>of</strong> multicompartmental models notonly makes the problem very complex computationally, butraises also the risk <strong>of</strong> misrepresentation, s<strong>in</strong>ce knowledge aboutthe exact way neurons are <strong>in</strong>terconnected and communicatewith each other <strong>in</strong> a nonl<strong>in</strong>ear context is still limited. Thus,<strong>in</strong>accurate representation <strong>of</strong> this <strong>in</strong>terconnectivity (<strong>in</strong> structuraland functional terms) can lead to <strong>in</strong>valid model<strong>in</strong>g results anderroneous conclusions.As an alternative to parametric model<strong>in</strong>g, the nonparametricapproach <strong>of</strong> Volterra-type model<strong>in</strong>g employs a generalmodel form and avoids the treacherous task <strong>of</strong> model specification—especially<strong>in</strong> the case <strong>of</strong> nonl<strong>in</strong>ear and highly<strong>in</strong>terconnected systems. For this reason, it is considered ageneral methodology for nonl<strong>in</strong>ear model<strong>in</strong>g <strong>of</strong> physiologicalsystems that is based on a rigorous mathematical frameworkand provides a quantitative description <strong>of</strong> the nonl<strong>in</strong>ear dynamics<strong>of</strong> the <strong>in</strong>put–output relation <strong>in</strong> the form <strong>of</strong> a functionalseries that conta<strong>in</strong>s unknown kernel functions estimated fromthe data. This nonparametric model has predictive capabilitiesfor arbitrary <strong>in</strong>puts [5]. The first extension <strong>of</strong> Volterra-typenonparametric model<strong>in</strong>g to the case <strong>of</strong> two <strong>in</strong>puts was made<strong>in</strong> the study <strong>of</strong> motion detection <strong>in</strong> the fly visual system and<strong>of</strong> ganglion cells <strong>in</strong> the catfish ret<strong>in</strong>a [11], [12]. This approachwas extended to spatio-temporal visual <strong>in</strong>puts and ganglioncell responses <strong>in</strong> the frog ret<strong>in</strong>a [9], [10] or cortical cells [28].More recent applications to multiple or spatio-temporal <strong>in</strong>putshave been reported for several neuronal systems [16], [29]. Theaforementioned cases concern systems with cont<strong>in</strong>uous <strong>in</strong>putsand cont<strong>in</strong>uous outputs. However, the nonparametric model<strong>in</strong>gmethodology has also been applied to neuronal systems withpo<strong>in</strong>t-process <strong>in</strong>puts [30], two <strong>in</strong>puts [31], multiple <strong>in</strong>puts anda s<strong>in</strong>gle output [32], and a MIMO system that is modeled us<strong>in</strong>goutput-triggered feedback [33]. Note that the <strong>in</strong>put–output datarequired for the effective estimation <strong>of</strong> the unknown quantities<strong>in</strong> nonparametric models (Volterra kernels) must have broadspectral content that endows them with predictive capabilityfor arbitrary <strong>in</strong>puts and makes their estimation robust <strong>in</strong> thepresence <strong>of</strong> noise.The nonparametric approach has its own limitations. For<strong>in</strong>stance, direct physiological <strong>in</strong>terpretation <strong>of</strong> the nonparametricmodel is difficult, because this model is not developedon the basis <strong>of</strong> biophysical pr<strong>in</strong>ciples or exist<strong>in</strong>g knowledgeabout the system <strong>in</strong>ternal structure, but constitutes <strong>in</strong>steada data-based representation <strong>of</strong> the <strong>in</strong>put–output transformation/mapp<strong>in</strong>g.Another limitation is that this general approachmay lead to cumbersome models <strong>in</strong> the case <strong>of</strong> highly nonl<strong>in</strong>earsystems—a problem that has been addressed to some extent byrecent work [5]. F<strong>in</strong>ally, a potential problem with the proposedapproach is the utilization <strong>of</strong> least-squares methods for theestimation <strong>of</strong> the model parameters (Laguerre coefficients <strong>of</strong>the Volterra kernel expansions) <strong>in</strong> the context <strong>of</strong> po<strong>in</strong>t-processoutputs, where the b<strong>in</strong>ary nature <strong>of</strong> the output may <strong>in</strong>troducebiases <strong>in</strong> the kernel estimates under certa<strong>in</strong> circumstances(elaborated <strong>in</strong> Section V). Nonetheless, computer simulations(where ground truth is available) <strong>in</strong>dicate that these estimationbiases are limited <strong>in</strong> cases with characteristics similar to our application(as demonstrated by the illustrative example presented<strong>in</strong> Section III). The obta<strong>in</strong>ed models <strong>in</strong> applications to realsystems (where ground truth is not available) can be evaluatedon the basis <strong>of</strong> their predictive performance and can be acceptedas reasonable model approximations if such performance isdeemed satisfactory. Alternative model<strong>in</strong>g methodologies forpo<strong>in</strong>t-processes have been presented <strong>in</strong> the past [34]–[39] withBrill<strong>in</strong>ger’s formulation <strong>of</strong> the maximum-likelihood estimationproblem <strong>in</strong> the b<strong>in</strong>omial context (for Gaussian variations <strong>of</strong>the postulated stochastic threshold) deserv<strong>in</strong>g special notebecause it yielded excellent results (<strong>in</strong> the form <strong>of</strong> l<strong>in</strong>ear<strong>in</strong>put–output models) for real po<strong>in</strong>t-process data from threeAplysia neurons [43]. In recent studies, probabilistic formulations<strong>of</strong> neuronal encod<strong>in</strong>g have led to methodologies utiliz<strong>in</strong>gmaximum-likelihood techniques [33], [41], [42] and networklikelihood models [43] that may <strong>of</strong>fer advantages <strong>in</strong> certa<strong>in</strong>applications.This paper presents an extension <strong>of</strong> the nonparametric model<strong>in</strong>gapproach to the case <strong>of</strong> multiple <strong>in</strong>puts and outputs thatrequires only moderate <strong>in</strong>crease <strong>in</strong> representational complexityAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


338 IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, VOL. 16, NO. 4, AUGUST 2008Fig. 1. (A) Schematic representation <strong>of</strong> the decomposition <strong>of</strong> the MIMO model <strong>in</strong>to an array <strong>of</strong> MISO modules. (B) Schematic detail <strong>of</strong> the structure <strong>of</strong> the MISOmodule compris<strong>in</strong>g the cascade <strong>of</strong> the NVT and the TT operators.and computational effort. The presented methodology estimatesa Volterra model with multiple <strong>in</strong>puts and a s<strong>in</strong>gle output(MISO) [32] and appends to this output a threshold-triggeroperator that generates the spikes at each recorded output.The validation <strong>of</strong> this MISO model is based on its predictiveperformance for arbitrary <strong>in</strong>puts, which is assessed with theuse <strong>of</strong> receiver operat<strong>in</strong>g characteristic (ROC) curves commonlyused <strong>in</strong> detection systems. The statistical significance<strong>of</strong> this predictive capability is assessed with the use <strong>of</strong> theMann–Whitney statistic, which is also used for select<strong>in</strong>g the<strong>in</strong>puts that are causally l<strong>in</strong>ked to each output (i.e., the relevant<strong>in</strong>puts that significantly affect it). The result<strong>in</strong>g MIMO modelfor multiunit record<strong>in</strong>gs is composed <strong>of</strong> multiple MISO modelmodules, one for each output.The actual data analyzed <strong>in</strong> this paper are collected <strong>in</strong> connectionwith the prospective development <strong>of</strong> hippocampal neuroprosthesesthat may emulate the nonl<strong>in</strong>ear dynamics <strong>of</strong> neuronaltransmission between the hippocampal regions <strong>of</strong> dentate gyrusand CA3 or CA1 [44], or between the CA3 and the CA1 regionsthat are believed to be <strong>in</strong>volved <strong>in</strong> the process<strong>in</strong>g <strong>of</strong> <strong>in</strong>formationlead<strong>in</strong>g to long-term memory formation [45], [46]. Advances <strong>in</strong>the development <strong>of</strong> neural prostheses have been remarkable overrecent years, with applications <strong>in</strong> the auditory system (cochlearimplants [47]), visual prostheses [48], the motor cortex [49], andbra<strong>in</strong>–mach<strong>in</strong>e <strong>in</strong>terfaces [50]. Implement<strong>in</strong>g such neuroprostheticdevices requires quantitative and reliable representation<strong>of</strong> the transformations performed by the respective neuronal regions<strong>in</strong> the form <strong>of</strong> a biomimetic model. Nonparametric, databasedmodels with predictive capabilities are excellent candidatesfor this purpose, s<strong>in</strong>ce they do not require explicit knowledge<strong>of</strong> the complex underly<strong>in</strong>g mechanisms or neuronal <strong>in</strong>terconnectivityand provide sufficient predictive accuracy <strong>in</strong> a robustand computationally tractable manner. In this application,we use multiunit data from behav<strong>in</strong>g rats that are recorded contemporaneouslyfrom several spatially dist<strong>in</strong>ct sites at the CA3and CA1 hippocampal regions, while the rats are perform<strong>in</strong>gspecific behavioral tasks <strong>in</strong>volv<strong>in</strong>g memory. These data are usedto model the nonl<strong>in</strong>ear multiunit causal relationship between theCA3 (<strong>in</strong>put region) and the CA1 (output region) dur<strong>in</strong>g these behavioraltasks.The paper also discusses the strengths and limitations <strong>of</strong>the Volterra-type model<strong>in</strong>g methodology adapted to the case<strong>of</strong> multiple po<strong>in</strong>t-process <strong>in</strong>puts and outputs recorded viamultielectrode arrays. Illustrative results from simulated andreal systems are presented to demonstrate the efficacy <strong>of</strong> theproposed model<strong>in</strong>g approach. The proposed methodology ispresented <strong>in</strong> the follow<strong>in</strong>g section and the results are presented<strong>in</strong> the subsequent section.II. METHODSThe proposed MIMO methodology for deriv<strong>in</strong>g nonl<strong>in</strong>ear dynamicmodels <strong>of</strong> neuronal multiunit transformations utilizes anarray <strong>of</strong> MISO model modules, as shown <strong>in</strong> Fig. 1(a).Each MISO module represents the neural transformation <strong>of</strong>the po<strong>in</strong>t-process <strong>in</strong>puts <strong>in</strong>to the respective po<strong>in</strong>t-processoutput. This is done <strong>in</strong> two stages [see Fig. 1(b)]: (1) a “nonl<strong>in</strong>earVolterra transformation” (NVT) <strong>of</strong> the <strong>in</strong>put spikesequences , <strong>in</strong>to an <strong>in</strong>termediatevariable that can be <strong>in</strong>terpreted as the transmembranepotential at the axon hillock <strong>of</strong> the respective th output neuron; and (2) a threshold-trigger (TT) operat<strong>in</strong>gon the <strong>in</strong>termediate variable to generate the spikes(action potentials) <strong>of</strong> the respective po<strong>in</strong>t-process outputwhen the <strong>in</strong>termediate variable exceeds the specified thresholdvalue .The NVT stage <strong>of</strong> each MISO module conta<strong>in</strong>s a set <strong>of</strong>Volterra kernels (characteristic <strong>of</strong> each <strong>in</strong>put–output mapp<strong>in</strong>g)that operate on the <strong>in</strong>puts to generate the <strong>in</strong>termediate variableaccord<strong>in</strong>g to the follow<strong>in</strong>g mathematical expression(1) <strong>of</strong> the cont<strong>in</strong>uous multiple-<strong>in</strong>put Volterra modelwhere , , and denote the zeroth-, first-, and second-orderkernels, respectively. Higher order kernels may generally existbut are not <strong>in</strong>cluded <strong>in</strong> these expressions. At the second stage,the threshold-trigger operator for the respective MISO moduleis applied on the <strong>in</strong>termediate variableand generates a spike when the threshold value is exceeded.A. Kernel Estimation With the Laguerre Expansions MethodTo facilitate the estimation <strong>of</strong> the Volterra kernels <strong>in</strong> themodel <strong>of</strong> (1) and improve the estimation variance by reduc<strong>in</strong>g(1)Authorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


ZANOS et al.: NONLINEAR MODELING OF CAUSAL INTERRELATIONSHIPS IN NEURONAL ENSEMBLES 339the number <strong>of</strong> free parameters, we expand the kernels us<strong>in</strong>g thediscrete-time Laguerre basis functions [28] aswhereare the Laguerre expansion coefficientsand are the Laguerre basis functions. Substitution<strong>of</strong> these kernel expansions <strong>in</strong>to (1) gives rise to the mult<strong>in</strong>omialexpression <strong>of</strong> the modified Volterra model [29]where denotes the convolution <strong>of</strong> the th <strong>in</strong>put with theth Laguerre basis function. The estimation <strong>of</strong> the Laguerre expansioncoefficients (which are now the unknown parameters <strong>of</strong>the Volterra model) can be performed <strong>in</strong> (3) via least-squaresmethods—i.e., by m<strong>in</strong>imiz<strong>in</strong>g the sum <strong>of</strong> the squared errors <strong>of</strong>the NVT model prediction, denoted by <strong>in</strong> the model <strong>of</strong>(1), relative to the po<strong>in</strong>t-process output , without <strong>in</strong>volv<strong>in</strong>gthe TT operator. This approach is expected to yield good estimates<strong>of</strong> the Laguerre expansion coefficients if the distribution<strong>of</strong> the prediction errorsis approximatelyGaussian—otherwise biases may exist <strong>in</strong> the estimates (see SectionV). The aggregate effect <strong>of</strong> these possible biases is assessedthrough the performance evaluation <strong>of</strong> the estimated model utiliz<strong>in</strong>gROC curves. An “optimum” threshold <strong>of</strong> the TT operatorcan be selected from the ROC curve, as described below.The estimated Laguerre expansion coefficients are used to reconstructthe estimates <strong>of</strong> the Volterra kernels accord<strong>in</strong>g to (2).The Laguerre basis functions are suitable for kernel expansionbecause they conta<strong>in</strong> an exponential-decay term that forcesthem asymptotically to zero, as the kernels <strong>of</strong> stable natural(nonoscillatory) systems must do to avoid <strong>in</strong>stabilities. This rate<strong>of</strong> decay is determ<strong>in</strong>ed by the Laguerre parameter alpha thatmust be selected properly for each system (based on the data).The exponential term <strong>of</strong> the th-order Laguerre basis functionis multiplied by an th-degree polynomial that has roots(i.e., the th-order Laguerre basis function has zero-cross<strong>in</strong>gs)—see[28] and references there<strong>in</strong>. These basis functionshave proven to be very efficient for the representation <strong>of</strong> thekernels <strong>of</strong> real systems, as suggested orig<strong>in</strong>ally by Wiener <strong>in</strong>the 1950s and numerous other prom<strong>in</strong>ent <strong>in</strong>vestigators over thelast 50 years. They have proven their utility <strong>in</strong> nonl<strong>in</strong>ear model<strong>in</strong>g<strong>of</strong> physiological systems (for partial review, see [29]).The sav<strong>in</strong>gs <strong>in</strong> representational and computational complexityachieved by the Laguerre expansion <strong>of</strong> the kernelsare considerable. For <strong>in</strong>stance, for a second-order model <strong>of</strong> a(2)(3)s<strong>in</strong>gle-<strong>in</strong>put/s<strong>in</strong>gle-output system with kernel memory <strong>of</strong>lags, the unknown discrete kernel values that must be estimatedare: for and for , after tak<strong>in</strong>g <strong>in</strong>to accountkernel symmetries—for a total <strong>of</strong>unknowndiscrete kernel values, where the zeroth-order kernel value isalso <strong>in</strong>cluded [29]. A typical value <strong>of</strong> 50 lags would requirethe estimation <strong>of</strong> a total <strong>of</strong> 1326 unknown discrete kernelvalues. Obviously, the computational burden <strong>of</strong> estimat<strong>in</strong>g somany unknown kernel values is immense and would requireextremely long <strong>in</strong>put–output data sets to avoid high estimationvariance. On the other hand, the use <strong>of</strong> Laguerre expansions forthe estimation <strong>of</strong> the kernels reduces significantly the number<strong>of</strong> unknown parameters (expansion coefficients) and makes thecomputational and representational task tractable. If Laguerrebasis functions are used for this purpose, then the total number<strong>of</strong> unknown expansion coefficients that must be estimated is. Experience has shown that a maximum <strong>of</strong>n<strong>in</strong>e Laguerre basis functions is required for adequaterepresentation <strong>of</strong> the kernels <strong>of</strong> physiological systems, whichcorresponds to a maximum total <strong>of</strong> 55 expansion coefficients.Note that or even has been found to be adequate<strong>in</strong> many actual applications, correspond<strong>in</strong>g to a total <strong>of</strong> 15 or 6expansion coefficients, respectively. Considerable advantages<strong>in</strong> terms <strong>of</strong> estimation accuracy, <strong>in</strong>put–output data requirementsand model complexity result from this significant reduction <strong>in</strong>the number <strong>of</strong> estimated parameters [28], [29].Obviously, the number <strong>of</strong> free parameters <strong>in</strong>creases rapidly <strong>in</strong>the MIMO case because <strong>of</strong> the presence <strong>of</strong> possible nonl<strong>in</strong>ear<strong>in</strong>teraction terms among the various <strong>in</strong>puts as they affect theoutputs. For <strong>in</strong>stance, a second-order MIMO model with <strong>in</strong>putsand outputs has a total number <strong>of</strong> free parameters equalto:, which grows ma<strong>in</strong>ly as the square<strong>of</strong> the product (dom<strong>in</strong>ant dependence) and is only proportionalto . Thus, for a MIMO model with 10 <strong>in</strong>puts and 10outputs, the total number <strong>of</strong> free parameters,even for , is 5050. This provides the motivation to screenthe various <strong>in</strong>puts and their <strong>in</strong>teractions regard<strong>in</strong>g the significance<strong>of</strong> their contributions to the output <strong>in</strong> order to reduce thetotal number <strong>of</strong> free parameters <strong>in</strong> the model (see the SectionII-D below).B. Selection <strong>of</strong> Threshold and Evaluation <strong>of</strong> ModelPerformance us<strong>in</strong>g ROC CurvesThe predictive capability <strong>of</strong> the obta<strong>in</strong>ed models is the basisfor the quantitative evaluation <strong>of</strong> their performance. Whenthe output data are discretized cont<strong>in</strong>uous signals, the mostcommonly used measure <strong>of</strong> prediction error is the mean-squareerror. However, <strong>in</strong> the case <strong>of</strong> po<strong>in</strong>t-process outputs (as <strong>in</strong> thisstudy), the mean-square error is not appropriate due to theb<strong>in</strong>ary nature <strong>of</strong> the output, and the balance between counts<strong>of</strong> correct and <strong>in</strong>correct predictions <strong>of</strong> output spikes should beused <strong>in</strong>stead. A spike predicted by the model is termed a “truepositive” (TP), if it co<strong>in</strong>cides with an actual output spike, orit is termed a “false positive” (FP) otherwise. S<strong>in</strong>ce the modelprediction depends on the threshold value <strong>of</strong> the TT operator(which is not estimated through the Laguerre expansion methodused for kernel estimation), various values <strong>of</strong> the threshold areAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


340 IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, VOL. 16, NO. 4, AUGUST 2008applied upon the cont<strong>in</strong>uous NTV model prediction andthe two performance metrics <strong>of</strong> true positives fraction (TPF)and false positives fraction (FPF) are computed for each value<strong>of</strong> the threshold accord<strong>in</strong>g to the relationsWhen we plot the FPF value <strong>in</strong> the abscissa and the TPF value<strong>in</strong> the ord<strong>in</strong>ate for each threshold value, we obta<strong>in</strong> the receiveroperat<strong>in</strong>g characteristic (ROC) curve that was developed <strong>in</strong> the1950s as a tool <strong>of</strong> assess<strong>in</strong>g the performance <strong>of</strong> detection systems<strong>in</strong> the presence <strong>of</strong> noise. The ROC curve is a graphicalrepresentation <strong>of</strong> the competitive relation between sensitivity(TPF) and specificity (1-FPF) <strong>of</strong> a b<strong>in</strong>ary detector/classifier, asits detection threshold changes. The shape <strong>of</strong> the ROC curve depictsthe detection performance and the Area Under the Curve(AUC) is a quantitative measure <strong>of</strong> this performance (i.e., themodel performance is better over all threshold values when thecorrespond<strong>in</strong>g AUC value is closer to 1, while the worst performancecorresponds to an AUC value <strong>of</strong> 0.5 when the ROC curveco<strong>in</strong>cides with the diagonal). The threshold value that br<strong>in</strong>gs theROC curve closest to the (0, 1) corner-po<strong>in</strong>t is <strong>of</strong>ten selected asthe “optimum” threshold <strong>of</strong> the modelC. Model StatisticsIn order to establish the statistical significance <strong>of</strong> each estimatedMISO model (as the quantitative representation <strong>of</strong> acausal l<strong>in</strong>k between the <strong>in</strong>puts and the output) <strong>in</strong> the presence<strong>of</strong> noise or other sources <strong>of</strong> stochastic variability <strong>in</strong> the data,we utilize the Mann–Whitney two-sample statistic [17] that relatesto the AUC value <strong>of</strong> the ROC curve and can be used to teststatistically whether a specific model is better than another as ab<strong>in</strong>ary predictor.The Mann–Whitney statistic (MWS) represents the probabilitythat a randomly selected sample from the <strong>in</strong>termediatevariable that corresponds to zero output will be lessthan (or equal to) a randomly selected sample from the values<strong>of</strong> that correspond to spikes <strong>in</strong> the output. Essentially theMWS represents how well these two random variables, and, are separated. It has been shown that the area under the ROCcurve (calculated us<strong>in</strong>g the trapezoidal rule) is equivalent to theMWS. The unbiased estimate <strong>of</strong> the MWS is the average <strong>of</strong> thesamplesformed by all possible pair comb<strong>in</strong>ations <strong>of</strong> the two sets <strong>of</strong> samplesand <strong>in</strong> the data record. The MWS is a U-statistic and,accord<strong>in</strong>g to the theory developed by Hoeffd<strong>in</strong>g for U-statistics[17], it follows an asymptotically normal distribution with unbiasedestimates <strong>of</strong> its mean and variance calculated from thedata. Therefore, we can use a -test to perform statistical test<strong>in</strong>g<strong>of</strong> significant differences <strong>in</strong> performance between two models(4)(5)(6)(e.g., with and without certa<strong>in</strong> model component), us<strong>in</strong>g therespective MWS. Specifically, the statistical significance <strong>of</strong> anestimated model can be tested aga<strong>in</strong>st the null hypothesis <strong>of</strong> arandom predictor (i.e., no causal <strong>in</strong>put–output relationship).D. Selection <strong>of</strong> Significant InputsThe complexity <strong>of</strong> the MIMO Volterra model depends onthe number <strong>of</strong> <strong>in</strong>puts that are causally l<strong>in</strong>ked to each output,s<strong>in</strong>ce the number <strong>of</strong> required kernels rises dramatically with <strong>in</strong>creas<strong>in</strong>gnumber <strong>of</strong> <strong>in</strong>puts. This is especially true <strong>in</strong> the presence<strong>of</strong> high-order nonl<strong>in</strong>earities (higher order Volterra kernels) thatgive rise to numerous nonl<strong>in</strong>ear <strong>in</strong>teraction terms (cross kernels)and, consequently, to the number <strong>of</strong> free parameters that mustbe estimated (i.e., the Laguerre expansion coefficients). As <strong>in</strong>dicatedearlier, for a second-order MIMO Volterra model, the totalnumber <strong>of</strong> free parameters is:, whereis the number <strong>of</strong> <strong>in</strong>puts and the number <strong>of</strong> outputs. Therefore,it is important to select only the necessary <strong>in</strong>puts for each MISOmodule to achieve the m<strong>in</strong>imum model complexity. The significant<strong>in</strong>puts are those causally l<strong>in</strong>ked to each specific output andthey are selected on the basis <strong>of</strong> their contribution to the prediction<strong>of</strong> the output us<strong>in</strong>g the respective MWS.The algorithm that selects the causally-l<strong>in</strong>ked <strong>in</strong>puts to eachoutput, builds successive models with <strong>in</strong>creas<strong>in</strong>g number <strong>of</strong> <strong>in</strong>putsand exam<strong>in</strong>es whether the <strong>in</strong>clusion <strong>of</strong> a specific <strong>in</strong>put (orset <strong>of</strong> <strong>in</strong>puts) improves the predictive accuracy <strong>of</strong> the model <strong>in</strong> astatistically significant sense. The comparison <strong>of</strong> the predictiveaccuracy <strong>of</strong> two successive models is performed by apply<strong>in</strong>g the-test (with ) on the two MWS values that correspondto the compared models. A complete comb<strong>in</strong>atorial search <strong>of</strong> allpossible pair-wise comparisons can be followed <strong>in</strong> order to selectthe causally-l<strong>in</strong>ked <strong>in</strong>puts to each output. In order to keepthe number <strong>of</strong> required pair-wise comparisons manageable <strong>in</strong>the case <strong>of</strong> a high number <strong>of</strong> possible <strong>in</strong>puts, we propose thefollow<strong>in</strong>g procedure that is more efficient.1) The MWS <strong>of</strong> each s<strong>in</strong>gle-<strong>in</strong>put/s<strong>in</strong>gle-output modelis compared with the MWS <strong>of</strong> a “random” predictor<strong>in</strong> order to assess whether the <strong>in</strong>put has statisticallysignificant impact on improv<strong>in</strong>g the prediction <strong>of</strong> therespective output (relative to the “random” predictor thatserves as the null hypothesis). The random predictor isa s<strong>in</strong>gle-<strong>in</strong>put/s<strong>in</strong>gle-output model with the same <strong>in</strong>putas the tested model but with a statistically <strong>in</strong>dependentPoisson output hav<strong>in</strong>g the same mean fir<strong>in</strong>g rate as theoutput <strong>of</strong> the tested model. The distribution <strong>of</strong> theta estimatesare calculated for the random predictor under theseconditions through Monte Carlo runs and a “cut<strong>of</strong>f” valueis established at the 95% significance level that servesas the threshold for the theta estimate obta<strong>in</strong>ed from theactual data. The <strong>in</strong>put is accepted as “statistically significant”at the 95% confidence level when the computed thetaestimate is higher than the respective cut<strong>of</strong>f value. Afterthe significant <strong>in</strong>puts have been selected for each output,we have an <strong>in</strong>itial set <strong>of</strong> MISO modules compris<strong>in</strong>g the<strong>in</strong>itial MIMO model.2) We form all possible pairs <strong>of</strong> the rema<strong>in</strong><strong>in</strong>g <strong>in</strong>puts (thatwere not selected <strong>in</strong> step 1) with the selected <strong>in</strong>puts andAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


ZANOS et al.: NONLINEAR MODELING OF CAUSAL INTERRELATIONSHIPS IN NEURONAL ENSEMBLES 341exam<strong>in</strong>e whether the addition <strong>of</strong> each pair to each <strong>in</strong>itialMISO model has a statistically significant impact on thepredictive ability <strong>of</strong> the model us<strong>in</strong>g the MWS. Theta estimatesand their variances are calculated for both cases <strong>of</strong>the model <strong>in</strong>clud<strong>in</strong>g a pair <strong>of</strong> <strong>in</strong>puts and a model withoutthem and used for a -test with a null hypothesis <strong>of</strong> equalmean values at a 99% confidence level. This step is done <strong>in</strong>order to explore any possible modulatory <strong>in</strong>teractions betweenthe <strong>in</strong>itially selected <strong>in</strong>puts and those that were notdeemed significant on their own.The result<strong>in</strong>g MISO model conta<strong>in</strong>s all <strong>in</strong>puts that arecausally-l<strong>in</strong>ked with the specific output either directly (step 1)or through modulatory second-order <strong>in</strong>teractions (step 2). Onemay cont<strong>in</strong>ue further to exam<strong>in</strong>e higher order <strong>in</strong>teractions, butthe process becomes rather burdensome and has only limitedvalue (if one posits that any possible higher order <strong>in</strong>teractionsare likely to be among <strong>in</strong>puts that also exhibit second-order<strong>in</strong>teractions already <strong>in</strong>cluded <strong>in</strong> the model). This procedure isrepeated for each output <strong>of</strong> the system. The f<strong>in</strong>al MIMO modelis comprised <strong>of</strong> the f<strong>in</strong>al set <strong>of</strong> MISO modules thus constructed.Note that the order <strong>of</strong> nonl<strong>in</strong>earity <strong>of</strong> each f<strong>in</strong>al MISO modulemay be determ<strong>in</strong>ed after the selection <strong>of</strong> the relevant <strong>in</strong>putsthrough a procedure that exam<strong>in</strong>es the statistical significance<strong>of</strong> improvements <strong>in</strong> model performance (us<strong>in</strong>g the MWS) aswe extend the order <strong>of</strong> nonl<strong>in</strong>earity.E. Selection <strong>of</strong> Model Order and Number <strong>of</strong> LaguerreFunctionsIn order to select the model order (degree <strong>of</strong> nonl<strong>in</strong>earity) <strong>of</strong>the system and the number <strong>of</strong> Laguerre functions, we employ theMWS <strong>in</strong> a similar way to the <strong>in</strong>put selection algorithm. For themodel order selection, we start by compar<strong>in</strong>g the first-order withthe second-order model us<strong>in</strong>g the respective MWS estimates forthe two models and a -test. If the second-order model is selected,then the comparison cont<strong>in</strong>ues between the second-orderand the third-order model, until the predictive performance <strong>of</strong>the higher order model ceases to improve significantly (basedon the -test <strong>of</strong> the respective MWS). The same procedure isfollowed <strong>in</strong> order to select the proper number <strong>of</strong> Laguerre basisfunctions. The two procedures should be performed jo<strong>in</strong>tly byfirst <strong>in</strong>creas<strong>in</strong>g the number <strong>of</strong> basis functions and subsequently<strong>in</strong>creas<strong>in</strong>g the model order, until no significant improvement isobserved <strong>in</strong> the respective MWS.III. COMPUTER-SIMULATED EXAMPLEIn order to illustrate the efficacy <strong>of</strong> the proposed model<strong>in</strong>gmethodology and its robustness <strong>in</strong> the presence <strong>of</strong> noise (spuriousspikes) and jitter (small shift <strong>of</strong> spike location) <strong>in</strong> the<strong>in</strong>put–output data, we consider a second-order MISO systemwith four <strong>in</strong>dependent <strong>in</strong>puts and a s<strong>in</strong>gle output (s<strong>in</strong>ce eachoutput is treated separately as a MISO module <strong>in</strong> the proposedMIMO approach—see Fig. 1). The functional characteristics<strong>of</strong> this system are def<strong>in</strong>ed by the first-order and second-orderVolterra kernels shown <strong>in</strong> Fig. 2(a) that describe the quantitativemanner <strong>in</strong> which each <strong>in</strong>put affects the output. The form <strong>of</strong>these kernels resembles the ones that have been experimentallyobserved. Specifically, the first and second <strong>in</strong>put have first-orderexcitatory and second-order <strong>in</strong>hibitory impact on the output,respectively, and the fourth <strong>in</strong>put has the reverse effect; whilethe third <strong>in</strong>put has no impact on the output s<strong>in</strong>ce all kernelsthat l<strong>in</strong>k it to the output are zero. A cross-kernel that describesthe second-order dynamic <strong>in</strong>teractions between the first and thefourth <strong>in</strong>put as they affect the output is also present (these arethe only exist<strong>in</strong>g nonl<strong>in</strong>ear <strong>in</strong>teractions among different <strong>in</strong>puts<strong>in</strong> this system). The system is simulated for <strong>in</strong>dependent Poissonprocess <strong>in</strong>puts with different values <strong>of</strong> the Poisson parameterthat def<strong>in</strong>es the mean fir<strong>in</strong>g rate <strong>of</strong> each <strong>in</strong>put.S<strong>in</strong>ce the ma<strong>in</strong> application <strong>of</strong> our model<strong>in</strong>g methodology<strong>in</strong>volves spike transformations <strong>in</strong> the hippocampus, we selectthe mean fir<strong>in</strong>g rates <strong>of</strong> the four <strong>in</strong>puts to correspond to theaverage level <strong>of</strong> activity <strong>of</strong> four types <strong>of</strong> hippocampal neuronsobserved <strong>in</strong> our experiments (viz. 3 spikes/s, 10 spikes/s, 5spikes/s, and 1 spikes/s for the first, second, third, and fourth<strong>in</strong>put, respectively). The memory <strong>of</strong> this simulated system isset at 1 s that corresponds to the observed average memoryextent <strong>of</strong> hippocampal neurons. We select a sampl<strong>in</strong>g <strong>in</strong>terval(b<strong>in</strong>width) <strong>of</strong> 10 ms which is slightly smaller than the m<strong>in</strong>imum<strong>in</strong>terspike <strong>in</strong>terval observed <strong>in</strong> the data and larger than therefractory period; thus, each kernel dimension has 100 sampledvalues (b<strong>in</strong>s). To estimate the kernels and apply the proposed<strong>in</strong>put-selection method, we use 6000 samples <strong>of</strong> <strong>in</strong>put–outputdata that correspond to 1 m<strong>in</strong> <strong>of</strong> recorded neuronal activity. Thethreshold <strong>of</strong> the simulated system is chosen to be 0.22 <strong>in</strong> orderto produce an output mean fir<strong>in</strong>g rate <strong>of</strong> about 12 spikes/s that isconsistent with our experimental observations. Validation <strong>of</strong> theproposed approach is based on the form <strong>of</strong> the obta<strong>in</strong>ed kernelestimates, the <strong>in</strong>put selection and the predictive capability <strong>of</strong>the model for these <strong>in</strong>put–output data (“<strong>in</strong>-sample” tra<strong>in</strong><strong>in</strong>g set)and an <strong>in</strong>dependent set <strong>of</strong> <strong>in</strong>put–output data (“out-<strong>of</strong>-sample”test<strong>in</strong>g set). Note that through 500 Monte Carlo simulations <strong>of</strong>random predictors us<strong>in</strong>g the same fir<strong>in</strong>g rates and length <strong>of</strong> datarecords, we derive a 95% confidence cut<strong>of</strong>f value to be used asthe threshold <strong>of</strong> significance for each <strong>in</strong>put.A. Noise-Free CaseFirst, we test the proposed methodology <strong>in</strong> the noise-free case(i.e., the <strong>in</strong>put–output data have no spurious spikes). The results<strong>of</strong> our <strong>in</strong>put selection algorithm are shown <strong>in</strong> Table I for thetest<strong>in</strong>g data sets. The calculated MWS theta estimates for each<strong>of</strong> the s<strong>in</strong>gle-<strong>in</strong>put/s<strong>in</strong>gle-output models are shown <strong>in</strong> the sameTable I, along with the 95% confidence cut<strong>of</strong>f value from the“random predictor.” The result<strong>in</strong>g decisions <strong>of</strong> the algorithm areall correct regard<strong>in</strong>g the statistical significance <strong>of</strong> each specific<strong>in</strong>put <strong>in</strong> terms <strong>of</strong> its causal effects on the output.After the <strong>in</strong>put selection is completed, we determ<strong>in</strong>e theproper model order and number <strong>of</strong> Laguerre basis functionsus<strong>in</strong>g a similar MWS-based procedure, as described<strong>in</strong> Section II. Then, we estimate the Volterra kernels for theselected second-order model us<strong>in</strong>g the tra<strong>in</strong><strong>in</strong>g data set. Theestimated kernels for this simulated example are shown <strong>in</strong>Fig. 2(b) and exhibit close resemblance to the true kernels.Note that all other cross-kernels (except for the exist<strong>in</strong>g )do not contribute to a statistically significant improvement<strong>in</strong> output prediction when <strong>in</strong>cluded <strong>in</strong> the model. Us<strong>in</strong>g theestimated kernels, we predict the output <strong>of</strong> the system forAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


342 IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, VOL. 16, NO. 4, AUGUST 2008Fig. 2. (A) Kernels <strong>of</strong> the second-order simulated MISO system with four <strong>in</strong>puts and one output: the first column shows the first-order kernels, the second columnshows the second-order self-kernels, and the third column shows the cross-kernel between the first and the fourth <strong>in</strong>put (see text). (B) The estimated kernels <strong>of</strong> thissecond-order model from the simulated data for the noise free case, along with ROC curves and theta values <strong>of</strong> <strong>in</strong>-sample and out-<strong>of</strong>-sample prediction (see text).The x axes <strong>in</strong> the first-order kernels and the x and y axes <strong>in</strong> the second-order kernels range from 0 to 1000 ms, while the ROC curves have <strong>in</strong> both axes valuesfrom 0 to 1.TABLE IRESULTS OF MWS-BASED TEST FOR INPUT SIGNIFICANCE IN THE SIMULATEDNOISE-FREE CASEvarious threshold values <strong>of</strong> the TT operator and form the ROCcurves that are shown <strong>in</strong> the right compartment <strong>of</strong> Fig. 2(b)for the tra<strong>in</strong><strong>in</strong>g (<strong>in</strong>-sample) and test<strong>in</strong>g (out-<strong>of</strong>-sample) datasets. These ROC curves and the correspond<strong>in</strong>g theta valuesdemonstrate the quality <strong>of</strong> model performance achieved withthe proposed methodology.B. Noisy CaseWe now test the proposed methodology <strong>in</strong> the presence <strong>of</strong>spurious spikes (noise) <strong>in</strong> the <strong>in</strong>put–output data. The spuriousspikes are <strong>in</strong>dependent Poisson processes that are <strong>in</strong>serted <strong>in</strong>tothe noise-free data (all <strong>in</strong>puts and output). The result<strong>in</strong>g “noisy”data are analyzed with the previously described algorithms for<strong>in</strong>put selection and kernel estimation. We exam<strong>in</strong>ed two levels<strong>of</strong> noise (numbers <strong>of</strong> <strong>in</strong>serted spurious spikes): one equal toone-fourth <strong>of</strong> the total spikes <strong>of</strong> each <strong>in</strong>put/output process (forexample a spike sequence with 100 true spikes will be contam<strong>in</strong>atedwith 25 spurious spikes, lead<strong>in</strong>g to a total <strong>of</strong> 125 spikes)and the other equal to half <strong>of</strong> the total spikes <strong>of</strong> each <strong>in</strong>put/outputAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


ZANOS et al.: NONLINEAR MODELING OF CAUSAL INTERRELATIONSHIPS IN NEURONAL ENSEMBLES 343TABLE IIRESULTS OF MWS-BASED TEST FOR INPUT SIGNIFICANCE IN THE SIMULATED25% ADDED NOISE CASETABLE IIIRESULTS OF MWS-BASED TEST FOR INPUT SIGNIFICANCE IN THE SIMULATED50% ADDED NOISE CASEprocess. For the first case (25% added spurious spikes), the results<strong>of</strong> the <strong>in</strong>put selection algorithm are shown <strong>in</strong> Table II andthey are not affected adversely. The same is true for the obta<strong>in</strong>edkernel estimates (not shown <strong>in</strong> the <strong>in</strong>terest <strong>of</strong> space).For the second case (50% added spurious spikes), we havethe results <strong>of</strong> <strong>in</strong>put selection shown <strong>in</strong> Table III. After the <strong>in</strong>putselection, we estimate the Volterra kernels us<strong>in</strong>g the noisy<strong>in</strong>put–output data and the results are shown <strong>in</strong> Fig. 3(a). It isevident that the kernel estimates rema<strong>in</strong> satisfactory, <strong>in</strong> spite <strong>of</strong>the large number <strong>of</strong> spurious spikes, demonstrat<strong>in</strong>g the remarkablerobustness <strong>of</strong> this approach even <strong>in</strong> the presence <strong>of</strong> a verylarge number <strong>of</strong> spurious spikes. Subsequently, we compute themodel prediction <strong>of</strong> the output us<strong>in</strong>g the estimated kernels forboth tra<strong>in</strong><strong>in</strong>g and test<strong>in</strong>g data sets and evaluate its performanceus<strong>in</strong>g the ROC curves shown on the right <strong>of</strong> Fig. 3(a) for the“<strong>in</strong>-sample” and “out-<strong>of</strong>-sample” cases.C. Spike Jitter CaseWe now test the proposed methodology for the case <strong>of</strong>random jitter <strong>in</strong> the location <strong>of</strong> the <strong>in</strong>put–output spikes that issimulated by <strong>in</strong>troduc<strong>in</strong>g stochastic latencies to each one <strong>of</strong>the spikes, us<strong>in</strong>g a normally distributed latency with a meanvalue <strong>of</strong> zero and a standard deviation <strong>of</strong> two lags <strong>of</strong> jitter.The results <strong>of</strong> <strong>in</strong>put selection are shown <strong>in</strong> the Table IV andthe estimated kernels with the correspond<strong>in</strong>g ROC curves areshown <strong>in</strong> Fig. 3(b). These results demonstrate the remarkablerobustness <strong>of</strong> this approach <strong>in</strong> the presence <strong>of</strong> spike jitter.D. Deleted Spikes CaseWe also test the proposed methodology for the case wheresome spikes are mistakenly deleted <strong>in</strong> the <strong>in</strong>put/output data. Inthe test performed here, we delete randomly 30% <strong>of</strong> the totalspikes <strong>in</strong> all <strong>in</strong>puts and output. The results <strong>of</strong> <strong>in</strong>put selectionare shown <strong>in</strong> Table V and they are not affected adversely. Thesame is true for the obta<strong>in</strong>ed kernel estimates (not shown <strong>in</strong> the<strong>in</strong>terest <strong>of</strong> space).E. Misassigned Spikes CaseWe f<strong>in</strong>ally test the proposed methodology for the case wherethere are misassigned spikes between the four <strong>in</strong>puts <strong>of</strong> themodel. In the test performed here, we misassign randomly 5%<strong>of</strong> the total spikes <strong>in</strong> each <strong>in</strong>put. The results <strong>of</strong> <strong>in</strong>put selectionare shown <strong>in</strong> Table VI and they are not affected adversely. Thesame is true for the obta<strong>in</strong>ed kernel estimates (not shown <strong>in</strong> the<strong>in</strong>terest <strong>of</strong> space).F. Selection <strong>of</strong> Model Order and Number <strong>of</strong> LaguerreFunctionsAs an illustrative example <strong>of</strong> the selection procedure for themodel order and the number <strong>of</strong> Laguerre functions that we describedearlier, we use the case <strong>of</strong> 25% spurious spikes case totest the proposed model-order selection algorithm by <strong>in</strong>creas<strong>in</strong>gthe model order successively from first to second and then tothird for the correct . For each model order, we calculatethe MWS theta estimates and run a -test for the significance <strong>of</strong>the effect <strong>of</strong> each model order <strong>in</strong>crease on the output prediction.The results are shown <strong>in</strong> Table VII and yield the correct answer<strong>of</strong> a second-order model.We also test the selection algorithm for the number <strong>of</strong> Laguerrefunctions by <strong>in</strong>creas<strong>in</strong>g successively the number <strong>of</strong> basisfunctions from two to three and then four (for a second-ordermodel), us<strong>in</strong>g the same MWS-based approach as with themodel-order selection. The results are shown <strong>in</strong> Table VIIIand yield the correct answer <strong>of</strong> . In practice, these twosearches should be performed together by <strong>in</strong>crement<strong>in</strong>g foreach model order until a “No” outcome occurs and then <strong>in</strong>crement<strong>in</strong>gthe model order until another “No” outcome occurs.IV. APPLICATION TO HIPPOCAMPAL DATAThis application is motivated by the development efforts <strong>of</strong>a hippocampal prosthesis that emulates the multiunit transformations<strong>of</strong> neuronal activity between different regions <strong>of</strong> thehippocampus. In this study, we use multiunit data from behav<strong>in</strong>grats that are recorded contemporaneously from severalspatially dist<strong>in</strong>ct sites at the CA3 and CA1 hippocampal regions,while the rats are perform<strong>in</strong>g a behavioral task <strong>in</strong>volv<strong>in</strong>gmemory. These data are used to model the nonl<strong>in</strong>ear multiunitfunctional relationship between the CA3 (<strong>in</strong>put region) and theCA1 (output region) dur<strong>in</strong>g these behavioral tasks.Male Long-Evans rats ( ,3–11 months <strong>of</strong> age) weretra<strong>in</strong>ed on a two-lever spatial delayed-nonmatch-to-sample(DNMS) task with randomly occurr<strong>in</strong>g variable delay <strong>in</strong>tervals<strong>of</strong> 1–40 s [see Fig. 4(a)]. The animal performs the task bypress<strong>in</strong>g the lever presented <strong>in</strong> the sample phase (left or right)[see Fig. 4(a)(I)]. This constitutes the sample/position <strong>in</strong>formationon the trial. The lever is then retracted, and the delayphase <strong>in</strong>itiated, dur<strong>in</strong>g which the animal must poke its nose<strong>in</strong>to a lighted device on the opposite wall for the duration <strong>of</strong> thedelay [see Fig. 4(a)(II)]. Follow<strong>in</strong>g the term<strong>in</strong>ation <strong>of</strong> the delay,both levers are extended, and the animal must press the leveropposite the sample/position <strong>in</strong>formation [nonmatch response(NR)] [see Fig. 4(a)(III)]. The longer the delay between trials,the less likely it is that the animal to have the correct responseon that trial. Correct responses are rewarded with a drop <strong>of</strong>water delivered to the trough between both levers dur<strong>in</strong>g a 10 sAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


344 IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, VOL. 16, NO. 4, AUGUST 2008Fig. 3. (A) The estimated model for the case <strong>of</strong> 50% spurious spikes <strong>in</strong> the <strong>in</strong>puts and output and (B) for the case <strong>of</strong> jitter <strong>in</strong> the spike location <strong>in</strong> the <strong>in</strong>put–outputdata (see caption <strong>of</strong> Fig. 2 for detailed description <strong>of</strong> the layout and the axes <strong>of</strong> the plots). The results demonstrate the robustness <strong>of</strong> the proposed methodology <strong>in</strong>the presence <strong>of</strong> added spurious spikes and jitter.TABLE IVRESULTS OF MWS-BASED TEST FOR INPUT SIGNIFICANCE IN THE SIMULATEDJITTER CASETABLE VRESULTS OF MWS-BASED TEST FOR INPUT SIGNIFICANCE IN THE SIMULATEDDELETED SPIKES CASE<strong>in</strong>tertrial <strong>in</strong>terval (ITI). Incorrect responses produce a darkenedchamber for 5 s without reward, followed by a 5 s ITI. The nexttrial is always <strong>in</strong>itiated after at least a 5 s ITI. Only data fromcorrect-response trials were utilized <strong>in</strong> this study.To acquire the multiunit record<strong>in</strong>gs, multielectrode arrays <strong>of</strong>16 microwires (40 ) were surgically implanted us<strong>in</strong>g bothstereotaxic coord<strong>in</strong>ates and spontaneous cell fir<strong>in</strong>g activityto position correctly the electrode tips <strong>in</strong> both the CA3 andCA1 cell layers [Fig. 4(b)]. All arrays had a fixed geometrywith 200 between pairs (septotemporal axis) and 800between rows (medial-lateral axis) supplied by NB Labs(Denison, TX). Electrode tip length was precisely trimmed toAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


ZANOS et al.: NONLINEAR MODELING OF CAUSAL INTERRELATIONSHIPS IN NEURONAL ENSEMBLES 345Fig. 4. Schematic <strong>of</strong> the three behavioral tasks that the live rat is perform<strong>in</strong>g <strong>in</strong> each trial <strong>of</strong> the experiment: (I) right sample, (II) delay phase, (III) left nonmatch.(B) Schematic representation <strong>of</strong> the multielectrode array stimulat<strong>in</strong>g and record<strong>in</strong>g arrangement <strong>in</strong> the CA3 and CA1 hippocampal regions <strong>of</strong> the behav<strong>in</strong>g rat.TABLE VIRESULTS OF MWS-BASED TEST FOR INPUT SIGNIFICANCE IN THE SIMULATEDMISASSIGNED SPIKES CASETABLE VIIRESULTS OF MWS-BASED TEST FOR MODEL ORDER SELECTIONTABLE VIIIRESULTS OF MWS-BASED TEST FOR NUMBER OF LAGUERREFUNCTIONS SELECTIONfollow the longitud<strong>in</strong>al curvature <strong>of</strong> the hippocampus. Datawere collected from 11 animals and a total <strong>of</strong> 198 pyramidalcells (average 16–20 simultaneously recorded per animal)over at least seven DNMS sessions with stable extracellularaction potential waveforms and consistent event-specific fir<strong>in</strong>gpatterns. Neurons that exhibited low fir<strong>in</strong>g rates ( 0.2 spikes/s)were excluded as unresponsive and neurons with high fir<strong>in</strong>grates ( 6 spikes/s) were also excluded as <strong>in</strong>terneurons. F<strong>in</strong>ally,the sampl<strong>in</strong>g <strong>in</strong>terval dur<strong>in</strong>g data acquisition was very small(less than 1 ms), the time b<strong>in</strong>width used for the process<strong>in</strong>g <strong>of</strong>the data was 10 ms, a value smaller than the m<strong>in</strong>imum <strong>in</strong>terspike<strong>in</strong>terval <strong>of</strong> the data analyzed and larger than the refractoryperiod. This b<strong>in</strong>width was considered the proper choice formodel<strong>in</strong>g efficiency <strong>in</strong> this application (i.e., it m<strong>in</strong>imizes thenumber <strong>of</strong> sample po<strong>in</strong>ts while reta<strong>in</strong><strong>in</strong>g all useful <strong>in</strong>formation<strong>in</strong> the data).Several <strong>in</strong>stances <strong>of</strong> each behavioral task were consideredacross a number <strong>of</strong> different trials, and all trials were recordedfrom a specific animal dur<strong>in</strong>g one session (day) <strong>of</strong> experiments.Each <strong>in</strong>stance <strong>of</strong> a behavioral task <strong>in</strong>cluded the neuronal activitywith<strong>in</strong> 1.5 s around the time <strong>of</strong> press<strong>in</strong>g the lever thatis associated with the task. Recorded cells from the CA1 areawere considered as outputs <strong>of</strong> the MIMO model, while the CA3cells contribut<strong>in</strong>g to the activity <strong>of</strong> each output CA1 cell wereconsidered as <strong>in</strong>puts to for the respective MISO module. Themultiunit <strong>in</strong>put–output data recorded dur<strong>in</strong>g each one <strong>of</strong> thebehavioral tasks (left nonmatch, right nonmatch, left sample,right sample) were analyzed with the proposed methodologyand second-order Volterra models were obta<strong>in</strong>ed for each MISOmodule, <strong>in</strong>volv<strong>in</strong>g first-order and second-order kernel estimates.The <strong>in</strong>put selection algorithm was applied to each MISO module(correspond<strong>in</strong>g to each <strong>of</strong> the recorded outputs) and the resultsare summarized for each <strong>of</strong> the four behavioral tasks <strong>in</strong> Table IX.Note that the MWS estimates for all <strong>in</strong>put channels that werenot considered significant with regard to a specific output areomitted <strong>in</strong> the <strong>in</strong>terest <strong>of</strong> space. Also note that the theta estimateand its variance for the random predictor are the same fora given output channel (based on the fir<strong>in</strong>g rate <strong>of</strong> that specificoutput channel) regardless <strong>of</strong> the <strong>in</strong>put channel.The result<strong>in</strong>g selection <strong>of</strong> CA3 <strong>in</strong>put cells for each CA1output cell and the causal connections between them for eachbehavioral task are shown <strong>in</strong> Figs. 5 and 6. The results fromthe analysis <strong>of</strong> the data collected dur<strong>in</strong>g the Left Sample taskare presented <strong>in</strong> detail (for illustrative purposes) <strong>in</strong> Fig. 5 thatshows the ROC curves (with the correspond<strong>in</strong>g theta value) foreach output <strong>of</strong> the model (A), the estimated first-order kernelsAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


346 IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, VOL. 16, NO. 4, AUGUST 2008TABLE IXINPUT SELECTION FOR THE FOUR BEHAVIORAL TASKS(B), the estimated second-order self-kernels and cross-kernels(C), the colormap used to depict the second-order kernels (assign<strong>in</strong>gthe red color to the maximum value and the blue colorto the m<strong>in</strong>imum value <strong>of</strong> each kernel (D), a schematic <strong>of</strong> thetopology <strong>of</strong> the identified <strong>in</strong>put–output causal connections (E),and the MIMO model prediction and the actual recorded activityfor each output neuron (F). Note that the Volterra kernels<strong>of</strong> the obta<strong>in</strong>ed second-order MISO modules for all these caseswere estimated us<strong>in</strong>g three Laguerre basis functionsand the nonl<strong>in</strong>ear <strong>in</strong>teractions between different <strong>in</strong>puts wererepresented <strong>in</strong> each MISO model by the estimated cross-kernels(when statistically significant). The model predictions shown<strong>in</strong> Fig. 5(f) for all output CA1 units demonstrate the good predictivecapability <strong>of</strong> the obta<strong>in</strong>ed MIMO model. We must notethat each MIMO model is animal-specific and task-specific.A closer <strong>in</strong>spection <strong>of</strong> Fig. 5 can provide important <strong>in</strong>formationabout the functional characteristics <strong>of</strong> the modeled systemand the underly<strong>in</strong>g physiology that are quantified by the Volterrakernels. The vast majority <strong>of</strong> the first-order kernels and many<strong>of</strong> the second-order kernels for all outputs and <strong>in</strong>puts <strong>of</strong> thissystem depict a facilitatory characteristic (positive values), anobservation that can be expla<strong>in</strong>ed by the fact that we consideredonly pyramidal CA3 and CA1 cells (exclud<strong>in</strong>g any <strong>in</strong>terneurons),thus model<strong>in</strong>g predom<strong>in</strong>antly the monosynaptic connectionsbetween CA3 and CA1 cells which are mostly excitatory.The “effective memory” <strong>of</strong> most <strong>in</strong>put pathways <strong>of</strong> this systemis about 500 ms, s<strong>in</strong>ce all the values <strong>of</strong> most first-order andsecond-order kernels have become negligible before or arounda lag <strong>of</strong> 500 ms. A few kernels reta<strong>in</strong> significant (but not large)values beyond the lag <strong>of</strong> 500 ms but not beyond a lag <strong>of</strong> 1000 ms.For illustrative purposes, let us exam<strong>in</strong>e closer the kernels <strong>of</strong>output 1 (first l<strong>in</strong>e <strong>of</strong> kernels <strong>in</strong> Fig. 5) that correspond to threesignificant <strong>in</strong>puts. The first-order kernels <strong>of</strong> all three <strong>in</strong>puts havea similar shape, with positive values <strong>in</strong> the early lags and relax<strong>in</strong>gto zero around a lag <strong>of</strong> 150 ms. The second-order self-kernelsfor two (first and third) <strong>of</strong> the three <strong>in</strong>puts have a slight depressiveeffect when the two lags are shorter than 150 ms, whilethe second-order self-kernel <strong>of</strong> the second <strong>in</strong>put presents a smallfacilitatory region when both lags are shorter than 100 ms. Thesecond-order cross-kernel between the first and second <strong>in</strong>put ishighly asymmetric, reveal<strong>in</strong>g the presence <strong>of</strong> a modulatory effectthat the second <strong>in</strong>put has on the way the first <strong>in</strong>put <strong>in</strong>fluencesthe output—but not significantly the other way around.Specifically, this modulatory effect is triphasic: <strong>in</strong>itially depressive(for lags less than 30 ms), then facilitatory (for lags between30 and 120 ms) and subsequently depressive aga<strong>in</strong> (for lags between120 and 500 ms). The other two cross-kernels are moresymmetrical and while the one between the second and third<strong>in</strong>put is clearly facilitatory for short lags (less than 100 ms), thecross-kernel between the first and second <strong>in</strong>puts is biphasic (<strong>in</strong>itiallydepressive up to 80 ms lags and excitatory for longer lags).For the second <strong>in</strong>put, the shapes <strong>of</strong> first-order kernels exhibita variety <strong>of</strong> functional characteristics for the four different <strong>in</strong>puts.The effect <strong>of</strong> the third <strong>in</strong>put is excitatory, as well as thefourth (with maximum excitation around 50 ms), while the firstand second <strong>in</strong>puts exhibit a biphasic behavior (<strong>in</strong>itially excitatoryand subsequently <strong>in</strong>hibitory) with the first <strong>in</strong>put dynamicsbe<strong>in</strong>g slower than the second. Also noticeable is the fact thatthe first-order and second-order kernels for the third and fifthoutput [which have the same significant <strong>in</strong>puts as shown by theschematic <strong>in</strong> Fig. 5(e)] are very similar to each other.An illustration <strong>of</strong> the MIMO model predictions and the actualrecorded activity <strong>of</strong> each output neuron for the other threebehavioral tasks are shown <strong>in</strong> Fig. 6. It is evident that the observedbursts <strong>of</strong> activity <strong>in</strong> some output neurons are usually predictedcorrectly, as well as the <strong>in</strong>termittent activity. Table Xreports the significant <strong>in</strong>put channels for each output channeland the respective theta estimates for the four behavioral tasks.These results demonstrate the predictive capability <strong>of</strong> the obta<strong>in</strong>edmodels and the efficacy <strong>of</strong> the proposed approach.As a f<strong>in</strong>al hypothesis test that the f<strong>in</strong>al MIMO models capturean exist<strong>in</strong>g temporal dependence, we form the null hypothesisfor these MIMO models. We generate randomly generated<strong>in</strong>dependent Poisson spike tra<strong>in</strong>s (whose rate parameter followsa normal distribution with a fixed mean value ), which haveAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


ZANOS et al.: NONLINEAR MODELING OF CAUSAL INTERRELATIONSHIPS IN NEURONAL ENSEMBLES 347Fig. 5. Illustrative Case <strong>of</strong> a complete MIMO model for the left sample event. (A) ROC curve and theta estimate <strong>of</strong> each output, (B) first-order kernels, (C)second-order self-kernels and cross-kernels, (D) colormap <strong>of</strong> the meshes used for the second-order kernels, (E) schematic <strong>of</strong> the topology <strong>of</strong> the <strong>in</strong>put–outputneurons <strong>in</strong> the CA3 and the CA1, respectively, and the causal connections considered <strong>in</strong> the MIMO model, (F) the model prediction and the actual recorded activity<strong>of</strong> each output neurons dur<strong>in</strong>g the left sample task). The x axes <strong>in</strong> the first-order kernels and the x and y axes <strong>in</strong> the second-order kernels range from 0 to 500 ms,while the ROC curves have <strong>in</strong> both axes, values from 0 to 1.no causal relationship between each other, and randomly pickone <strong>of</strong> them as the output and the rema<strong>in</strong><strong>in</strong>g spike tra<strong>in</strong>s ( -1)as the <strong>in</strong>puts <strong>of</strong> the system. We build the MISO model for eachcase and estimate the theta values. We repeat the procedure 50times and get averaged statistics, that are reported <strong>in</strong> Table XI,for different values <strong>of</strong> and . This test reveals the basel<strong>in</strong>e <strong>of</strong>the predictor performance <strong>of</strong> the proposed model<strong>in</strong>g approachand establishes the causal relationship captured by the aforementionedmodels.V. DISCUSSIONThe <strong>in</strong>creas<strong>in</strong>g availability <strong>of</strong> multiunit record<strong>in</strong>gs gives newurgency to the need for effective analysis <strong>of</strong> such data that arederived from the recorded activity <strong>of</strong> neuronal ensembles. Ageneral methodological framework has been presented for analyz<strong>in</strong>gthe possible causal l<strong>in</strong>ks among multiunit record<strong>in</strong>gsfrom neuronal ensembles by model<strong>in</strong>g them as systems withmultiple <strong>in</strong>puts and outputs. This methodological frameworkAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


348 IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, VOL. 16, NO. 4, AUGUST 2008Fig. 6. Schematic <strong>of</strong> the topology <strong>of</strong> the <strong>in</strong>put–output neurons <strong>in</strong> the CA3 and the CA1, respectively, and the causal connections considered <strong>in</strong> each model (leftpanels). The model predicted and the actual recorded activity <strong>of</strong> the output neurons dur<strong>in</strong>g three behavioral tasks: (A) left nonmatch, (B) right sample, and (C)right nonmatch. The time axes <strong>in</strong> all the prediction plots are <strong>in</strong> seconds.TABLE XMANN–WHITNEY STATISTICS FOR FINAL MIMO MODEL FOR LNM, RNM,AND RS TASKSTABLE XIAVERAGED MANN–WHITNEY STATISTICS FOR MISO MODELS ESTIMATEDFROM RANDOMLY GENERATED INDEPENDENT POISSON SPIKE TRAINSFOR DIFFERENT MFRS AND NUMBERS OF INPUTSis broadly applicable (because <strong>of</strong> the m<strong>in</strong>imal assumptions regard<strong>in</strong>gthe underly<strong>in</strong>g system structure) and robust <strong>in</strong> the presence<strong>of</strong> noise or errors <strong>in</strong> the data. It is computationally efficientand applicable <strong>in</strong> a practical context (e.g., it can be effective withrelatively short data records). It is also scalable and flexible toan arbitrary number <strong>of</strong> <strong>in</strong>puts and outputs.The use <strong>of</strong> functional connectivity measures for ascerta<strong>in</strong><strong>in</strong>gthe subset <strong>of</strong> significant <strong>in</strong>puts <strong>in</strong> multiple-<strong>in</strong>put systems hasbeen suggested <strong>in</strong> the context <strong>of</strong> bra<strong>in</strong>–mach<strong>in</strong>e <strong>in</strong>terfaces [53]and EMG record<strong>in</strong>gs [54]. As noted by both studies, a largenumber <strong>of</strong> <strong>in</strong>puts not only <strong>in</strong>creases the computational burdenbut also affects the generalization <strong>of</strong> the model, thus mak<strong>in</strong>g themodel reduction through <strong>in</strong>put selection an important part <strong>of</strong>the model<strong>in</strong>g process. <strong>Nonl<strong>in</strong>ear</strong> measures <strong>of</strong> dependency [55]<strong>in</strong>clude mutual <strong>in</strong>formation [56], which quantifies statistical dependenciesbetween two time-series data without any assumptionabout their respective probability density functions but isAuthorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


ZANOS et al.: NONLINEAR MODELING OF CAUSAL INTERRELATIONSHIPS IN NEURONAL ENSEMBLES 349only sensitive to static nonl<strong>in</strong>ear dependencies. <strong>Nonl<strong>in</strong>ear</strong> <strong>in</strong>terdependences[55], phase synchronization from Hilbert transform[57] and phase synchronization from the wavelet transform[58] can also be used to assess the degree <strong>of</strong> such dependenciesand to capture both l<strong>in</strong>ear and nonl<strong>in</strong>ear aspects <strong>of</strong> causality.The ma<strong>in</strong> weakness <strong>of</strong> these methods is the requirement <strong>of</strong> sufficient(generally large) amounts <strong>of</strong> data and, secondarily, thesignificant computational burden that they add to the model<strong>in</strong>gprocedure.The method proposed <strong>in</strong> this paper seeks to identify the<strong>in</strong>puts that are causally-l<strong>in</strong>ked to each output and affect significantlythe predictive capability <strong>of</strong> the model (<strong>in</strong> a statisticalsense embodied <strong>in</strong> the Mann–Whitney statistic). The proposedapproach explores all possible second-order <strong>in</strong>teractions amongthe <strong>in</strong>puts and exceeds the capabilities <strong>of</strong> l<strong>in</strong>ear approaches,such as cross-correlation, coherence function, Granger causalityand partial directed coherence [59] while keep<strong>in</strong>g the problemcomputationally tractable size and requir<strong>in</strong>g relatively smallamounts <strong>of</strong> experimental data. As shown with the computer-simulatedexamples (where ground truth is available),as well as <strong>in</strong> the application to real hippocampal data, themethod succeeds <strong>in</strong> captur<strong>in</strong>g the causal relationships whereverthey are present. The method was also shown with computersimulations to be remarkably robust <strong>in</strong> the presence <strong>of</strong> noise(spurious spikes <strong>in</strong> the <strong>in</strong>puts and outputs), jitter <strong>in</strong> the recordedspike location and deleted or misassigned spikes that may occur<strong>in</strong> actual record<strong>in</strong>gs and represent serious impediments <strong>in</strong> theuse <strong>of</strong> other methods <strong>in</strong> actual applications.Let us recap the fundamental rationale <strong>of</strong> the proposed model<strong>in</strong>gapproach. We posit the existence <strong>of</strong> an <strong>in</strong>ternal/<strong>in</strong>termediatecont<strong>in</strong>uous variable that is generated by the <strong>in</strong>putsbe<strong>in</strong>g transformed through a multi<strong>in</strong>put Volterra model (the operatorNVT <strong>in</strong> Fig. 1) operat<strong>in</strong>g <strong>in</strong> discrete time ,where denotes the time-b<strong>in</strong>width which is selected slightlylarger than the effective refractory period <strong>of</strong> the specific neuron(10 ms <strong>in</strong> this case). We view this <strong>in</strong>termediate variable asrepresent<strong>in</strong>g the transmembrane potential at the axon hillock <strong>of</strong>the output neuron and we posit further that an output spike isgenerated if, and only if, this variable exceeds a threshold value. This threshold value and the parameters <strong>of</strong> the aforementionedMISO discrete-time Volterra model <strong>of</strong> the operator NVTmust be estimated from the <strong>in</strong>put–output data <strong>in</strong> this determ<strong>in</strong>isticformulation (us<strong>in</strong>g currently the least-squares method). It isevident that this model<strong>in</strong>g problem is comprised <strong>of</strong> two criticalsteps: 1) estimation <strong>of</strong> the kernels <strong>of</strong> the NVT model and 2) theselection <strong>of</strong> the appropriate threshold for the operator TT. Theformer is currently accomplished via the least-squares method<strong>in</strong> an approximate manner (see further comments below) and thelatter through the use <strong>of</strong> ROC curves—which also provide thequantitative means for evaluat<strong>in</strong>g the model performance.The use <strong>of</strong> the Volterra kernels as the basis <strong>of</strong> the proposedmodel<strong>in</strong>g methodology assures excellent predictive capabilitiesfor the tim<strong>in</strong>g <strong>of</strong> output spikes <strong>in</strong> response to arbitrary <strong>in</strong>putsand provides reliable quantitative descriptors <strong>of</strong> the underly<strong>in</strong>gsystem dynamics—l<strong>in</strong>ear and nonl<strong>in</strong>ear—<strong>in</strong> a manner that advancesscientific knowledge and allows the rigorous test<strong>in</strong>g <strong>of</strong>scientific hypotheses. Application <strong>of</strong> the proposed methodologyto the real hippocampal data has revealed dist<strong>in</strong>ct spatio-temporalconnectivity patterns between neuronal populations <strong>in</strong> theCA3 (<strong>in</strong>put) and CA1 (output) region <strong>of</strong> the rat hippocampusfor each <strong>of</strong> four different behavioral tasks. The false-negatives(misses) that are seen <strong>in</strong> the output predictions by the modelmay be due either to the selection <strong>of</strong> the “optimum” thresholdthat balances the true-positives with the false-positives, thus result<strong>in</strong>gto some false-negatives (misses), or to the presence <strong>of</strong>spurious spikes <strong>in</strong> the actual output that are not related to the <strong>in</strong>putsand, therefore, are not predicted by the model. Among these“spurious” spikes there may be spikes due to “spontaneous activity”<strong>of</strong> the neurons that has been modeled <strong>in</strong> the past by <strong>in</strong>corporat<strong>in</strong>g<strong>in</strong> the threshold operator a monotonic function <strong>of</strong> time<strong>in</strong>dependent <strong>of</strong> the <strong>in</strong>put (see, for <strong>in</strong>stance, [40]). This <strong>in</strong>put–<strong>in</strong>dependentfunction leads to neuron fir<strong>in</strong>g even <strong>in</strong> the absence <strong>of</strong><strong>in</strong>put (spontaneous activity). Our model does not currently <strong>in</strong>corporatesuch a feature, but it can easily accommodate it as anadditive term to the <strong>in</strong>termediate variable —provided thatits form can be reasonably postulated.There are three known drawbacks <strong>of</strong> the proposed methodology.The first is model complexity. The obta<strong>in</strong>ed MIMOmodel generally has considerable complexity (many kernelsfor multiple, nonl<strong>in</strong>early <strong>in</strong>teract<strong>in</strong>g <strong>in</strong>puts). However, the use<strong>of</strong> Laguerre expansions <strong>of</strong> the kernels and the <strong>in</strong>put selectionalgorithm reduce the model complexity significantly and makethe model<strong>in</strong>g problem tractable, requir<strong>in</strong>g only modest computationaleffort for the ambitious task <strong>of</strong> MIMO model<strong>in</strong>g.Nonetheless, the large number <strong>of</strong> kernels, as shown <strong>in</strong> Fig. 5,also makes the <strong>in</strong>terpretation <strong>of</strong> the model difficult and comprehension<strong>of</strong> the model overwhelm<strong>in</strong>g for most <strong>in</strong>vestigators.This has provided the motivation for explor<strong>in</strong>g <strong>in</strong> future studiesthe use <strong>of</strong> pr<strong>in</strong>cipal dynamic modes to reduce this modelcomplexity [5]. The second drawback is that our methodologyis determ<strong>in</strong>istic <strong>in</strong> nature and cannot <strong>in</strong>corporate the stochasticaspects <strong>of</strong> biological phenomena. One such limitation <strong>in</strong> thecontext <strong>of</strong> the present application is the <strong>in</strong>ability to <strong>in</strong>corporatethe likely stochastic variations <strong>in</strong> the spike-trigger<strong>in</strong>g threshold.The third drawback is the possible bias <strong>in</strong>troduced <strong>in</strong> the kernelestimates <strong>of</strong> the NVT component <strong>of</strong> each MISO module by theemployed least-squares method. Note that the optimality <strong>of</strong> theleast-squares estimation method is guaranteed only when themodel prediction errors follow a Gaussian distribution (whenit becomes equivalent to the optimal maximum-likelihoodestimation). In this application, the model prediction errorsare not expected to follow a Gaussian distribution because<strong>of</strong> the b<strong>in</strong>ary nature <strong>of</strong> the output po<strong>in</strong>t-process. Therefore,the least-squares method <strong>of</strong> estimation is not optimal—i.e.,it does not generally yield unbiased estimates with m<strong>in</strong>imumvariance. The extent <strong>of</strong> this estimation bias depends on thespecific characteristics <strong>of</strong> each problem and relates to the issue<strong>of</strong> the truncation <strong>of</strong> the Volterra model and its implicationsfor the obta<strong>in</strong>ed kernel estimates. The proposed methodologyis based on the premise that the result<strong>in</strong>g estimation bias willgenerally be small and the obta<strong>in</strong>ed kernel estimates will besatisfactory approximations <strong>of</strong> the actual kernels. The validity<strong>of</strong> this premise can be exam<strong>in</strong>ed after the model estimation iscompleted by evaluat<strong>in</strong>g the model predictive performance. Ifthe latter is satisfactory, then the validity <strong>of</strong> this premise can beaccepted.Authorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


350 IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, VOL. 16, NO. 4, AUGUST 2008This issue has been exam<strong>in</strong>ed through computer simulations(where ground truth is available) and, as expected, theanswer is that the severity <strong>of</strong> the estimation bias depends on theparticular characteristics <strong>of</strong> each estimation problem (systemstructure and threshold statistics). Nonetheless, the results <strong>of</strong>the computer-simulated example presented <strong>in</strong> the manuscript(which seek to emulate the kernel forms found <strong>in</strong> this application),demonstrate that the obta<strong>in</strong>ed least-square estimates <strong>of</strong>the kernels <strong>of</strong> NVT are very close to the actual kernels used<strong>in</strong> the simulation. This provides some reassurance that theleast-squares estimates <strong>of</strong> the kernels are likely to be reasonableapproximations, but it does not guarantee the extent <strong>of</strong> possibleestimation biases. Because <strong>of</strong> the potential importance <strong>of</strong> thisissue, we are currently explor<strong>in</strong>g alternative estimation methodsthat will guarantee unbiased kernel estimates <strong>in</strong> the case <strong>of</strong>po<strong>in</strong>t-process outputs—<strong>in</strong>clud<strong>in</strong>g the maximum-likelihoodapproach us<strong>in</strong>g the b<strong>in</strong>omial distribution <strong>of</strong> the po<strong>in</strong>t-processoutput—ak<strong>in</strong> to the Brill<strong>in</strong>ger’s formulation <strong>of</strong> the problem <strong>in</strong>the l<strong>in</strong>ear case [40] and iterative estimation methods that avoidestimation biases by employ<strong>in</strong>g a cost function consistent withthe b<strong>in</strong>ary nature <strong>of</strong> the output.The proposed <strong>in</strong>put selection method seeks to determ<strong>in</strong>ewhich <strong>of</strong> the predesignated <strong>in</strong>puts are statistically significant <strong>in</strong>terms <strong>of</strong> their effects <strong>of</strong> the respective output <strong>in</strong> the MISO context.The proposed approach does not seek the “m<strong>in</strong>imum set <strong>of</strong><strong>in</strong>puts” <strong>in</strong> the sense <strong>of</strong> a maximally reduced model and, therefore,it does not exclude “relay-type” <strong>of</strong> neurons (i.e., neuronsthat receive exclusive <strong>in</strong>put from other predesignated <strong>in</strong>put neuronsand simply relay it—perhaps transformed—to the outputneuron). If one is only <strong>in</strong>terested <strong>in</strong> the overall <strong>in</strong>put–outputmapp<strong>in</strong>g, then these “relay-type” neurons can be excluded <strong>in</strong>the <strong>in</strong>terest <strong>of</strong> model parsimony. However, the purpose <strong>of</strong> theproposed approach is simply to determ<strong>in</strong>e whether there is asignificant causal l<strong>in</strong>k between each <strong>of</strong> the predesignated <strong>in</strong>putsand the specific output <strong>of</strong> each MISO model. Nonetheless, them<strong>in</strong>imum set <strong>of</strong> <strong>in</strong>puts can be determ<strong>in</strong>ed <strong>in</strong> a subsequent step<strong>of</strong> analysis whereby one <strong>in</strong>put is excluded on a rotat<strong>in</strong>g basisand the effect on the output prediction is assessed statistically(us<strong>in</strong>g the MWS) so that this <strong>in</strong>put can be excluded if the effectis <strong>in</strong>significant. This problem was addressed <strong>in</strong> the past throughthe use <strong>of</strong> “partial coherence” measures <strong>in</strong> the l<strong>in</strong>ear case [40].F<strong>in</strong>ally, it must be emphasized that the identified “causal” relationshipsbetween <strong>in</strong>put and output variables (po<strong>in</strong>t-processes)<strong>in</strong> a real neuronal system are only temporal statistical dependenciesand cannot be fully substantiated as manifestations <strong>of</strong>causality without further evidence about the <strong>in</strong>ternal work<strong>in</strong>gs<strong>of</strong> the system. For <strong>in</strong>stance, the common effect <strong>of</strong> a “third party”on both <strong>in</strong>put and output may be viewed mistakenly as the result<strong>of</strong> an <strong>in</strong>put–output causal relationship. An excellent example<strong>of</strong> this was presented <strong>in</strong> the aforementioned study <strong>of</strong> the threeAplysia neurons, where two <strong>of</strong> them were affected by the third<strong>in</strong> a manner that gave the false impression <strong>of</strong> a causal l<strong>in</strong>k betweenthem, although none existed as ascerta<strong>in</strong>ed by use <strong>of</strong> the“partial coherence” measures [40]. It is important to keep thispossibility <strong>in</strong> m<strong>in</strong>d, although it should not discourage us fromseek<strong>in</strong>g the “statistical dependencies” <strong>in</strong> actual multiunit data,s<strong>in</strong>ce the latter can provide important <strong>in</strong>sight <strong>in</strong>to the systemfunction when cast <strong>in</strong> the proper model<strong>in</strong>g framework.REFERENCES[1] L. Stark, Neurological Control Systems: Studies <strong>in</strong> Bioeng<strong>in</strong>eer<strong>in</strong>g.New York: Plenum, 1968.[2] P. Z. Marmarelis and V. Z. Marmarelis, Analysis <strong>of</strong> Physiological Systems:The White Noise Approach. New York: Plenum, 1978.[3] Methods <strong>in</strong> <strong>Neuronal</strong> <strong>Model<strong>in</strong>g</strong>: From Synapses to Networks, C. Kochand I. Segev, Eds. Cambridge, MA: MIT Press, 1989.[4] <strong>Model<strong>in</strong>g</strong> <strong>in</strong> the Neurosciences, R. R. Poznanski, Ed. Amsterdam,The Netherlands: Harwood Academic Publishers, 1999.[5] V. Z. Marmarelis, <strong>Nonl<strong>in</strong>ear</strong> Dynamic <strong>Model<strong>in</strong>g</strong> <strong>of</strong> Physiological Systems.Piscataway, NJ: Wiley-IEEE Press, 2004.[6] M. C. Wu, S. V. David, and J. L. Gallant, “Complete functional characterization<strong>of</strong> sensory neurons by system identification,” Annu. Rev.Neurosci., vol. 29, pp. 477–505, 2006.[7] M. Carand<strong>in</strong>i, J. B. Demb, V. Mante, D. J. Tolhurst, Y. Dan, B. A.Olshausen, J. L. Gallant, and N. C. Rust, “Do we know what the earlyvisual system does?,” J. Neurosci., vol. 25, pp. 10577–10597, 2005.[8] M. Ito, H. Tamura, I. Fujita, and K. Tanaka, “Size and position <strong>in</strong>variance<strong>of</strong> neuronal responses <strong>in</strong> monkey <strong>in</strong>ferotemporal cortex,” J. Neurophysiol.,vol. 73, pp. 218–226, 1995.[9] M. C. Citron, J. P. Kroeker, and G. D. McCann, “<strong>Nonl<strong>in</strong>ear</strong> <strong>in</strong>teractions<strong>in</strong> ganglion cell receptive fields,” J. Neurophysiol., vol. 46, pp.1161–1176, 1981.[10] M. C. Citron, R. C. Emerson, and W. R. Levick, “<strong>Nonl<strong>in</strong>ear</strong> measurementand classification <strong>of</strong> receptive fields <strong>in</strong> cat ret<strong>in</strong>al ganglion cells,”Ann. Biomed. Eng., vol. 16, pp. 65–77, 1988.[11] P. Z. Marmarelis and K. I. Naka, “<strong>Nonl<strong>in</strong>ear</strong> analysis and synthesis <strong>of</strong>receptive field responses <strong>in</strong> the catfish ret<strong>in</strong>a II: One-Input white-noiseanalysis,” J. Neurophys., vol. 36, pp. 619–633, 1973.[12] P. Z. Marmarelis and K. I. Naka, “Identification <strong>of</strong> multi-<strong>in</strong>put biologicalsystems,” IEEE Trans. Biomed. Eng., vol. BE-21, no. 2, pp.88–101, Mar. 1974.[13] D. McAlp<strong>in</strong>e, “Creat<strong>in</strong>g a sense <strong>of</strong> auditory space,” J. Physiol., vol.566, pp. 21–28, 2005.[14] E. D. Young and B. M. Calhoun, “<strong>Nonl<strong>in</strong>ear</strong> model<strong>in</strong>g <strong>of</strong> auditorynerverate responses to wideband stimuli,” J. Neurophysiol., vol. 94,no. 6, pp. 4441–4454, 2005.[15] M. Carand<strong>in</strong>i, D. J. Heeger, and J. A. Movshon, “L<strong>in</strong>earity and normalization<strong>in</strong> simple cells <strong>of</strong> the macaque primary visual cortex,” J.Neurosci., vol. 17, pp. 8621–8644, 1997.[16] C. C. Pack, B. R. Conway, R. T. Born, and M. S. Liv<strong>in</strong>gstone, “Spatiotemporalstructure <strong>of</strong> nonl<strong>in</strong>ear subunits <strong>in</strong> macaque visual cortex,”J. Neurosci., vol. 26, no. 3, pp. 893–907, 2006.[17] J. Rapela, J. M. Mendel, and N. M. Grzywacz, “Estimat<strong>in</strong>g nonl<strong>in</strong>earreceptive fields from natural images,” J. Vision, vol. 6, no. 4, pp.441–474, 2006.[18] E. P. Simoncelli and D. J. Heeger, “A model <strong>of</strong> neuronal responses <strong>in</strong>visual area MT,” Vis. Res., vol. 38, no. 5, pp. 743–761, Mar. 1998.[19] R. Lao, O. V. Favorov, and J. P. Lu, “<strong>Nonl<strong>in</strong>ear</strong> dynamical properties <strong>of</strong>a somatosensory cortical model,” Inf. Sci., vol. 132, no. 1–4, pp. 53–66,2001.[20] L. Pan<strong>in</strong>ski, M. R. Fellows, N. G. Hatsopoulos, and J. P. Donoghue,“Spatiotemporal tun<strong>in</strong>g <strong>of</strong> motor neurons for hand position and velocity,”J. Neurophysiol., vol. 91, pp. 515–532, 2004.[21] W. Truccolo, U. T. Eden, M. R. Fellows, J. P. Donoghue, and E. N.Brown, “A po<strong>in</strong>t process framework for relat<strong>in</strong>g neural spik<strong>in</strong>g activityto spik<strong>in</strong>g history, neural ensemble, and extr<strong>in</strong>sic covariate effects,” J.Neurophysiol., vol. 93, no. 2, pp. 1074–1089, 2005, 2005.[22] T. Yamazaki and S. Tanaka, “The cerebellum as a liquid state mach<strong>in</strong>e,”Neural Netw., vol. 20, pp. 290–297, 2007.[23] T. W. Berger, J. L. Eriksson, D. A. Ciarolla, and R. J. Sclabassi, “<strong>Nonl<strong>in</strong>ear</strong>systems analysis <strong>of</strong> the hippocampal perforant path-dentate projection.II. effects <strong>of</strong> random tra<strong>in</strong> stimulation,” J. Neurophys., vol. 60,pp. 1077–1094, 1988.[24] T. W. Berger, M. Baudry, R. D. Br<strong>in</strong>ton, J. S. Liaw, V. Z. Marmarelis,A. Y. Park, B. J. Sheu, and A. R. Tanguay, “Bra<strong>in</strong>-implantablebiomimetic electronics as the next era <strong>in</strong> neural prosthetics,” Proc.IEEE, vol. 89, pp. 993–1012, 2001.[25] R. J. Sclabassi, J. L. Eriksson, R. L. Port, G. B. Rob<strong>in</strong>son, and T.W. Berger, “<strong>Nonl<strong>in</strong>ear</strong> systems analysis <strong>of</strong> the hippocampal perforantpath-dentate projection. I. theoretical and <strong>in</strong>terpretational considerations,”J. Neurophysiol., vol. 60, pp. 1066–1076, 1988.[26] P. Dayan and L. F. Abbot, Theoretical Neuroscience, Computationaland Mathematical <strong>Model<strong>in</strong>g</strong> <strong>of</strong> Neural Systems. Cambridge, MA:MIT Press, 2005.Authorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


ZANOS et al.: NONLINEAR MODELING OF CAUSAL INTERRELATIONSHIPS IN NEURONAL ENSEMBLES 351[27] M. London and M. Hausser, “Dendritic computation,” Annu. Rev. Neurosci.,vol. 28, pp. 503–532, 2005.[28] R. C. Emerson, M. C. Citron, W. J. Vaughn, and S. A. Kle<strong>in</strong>, “<strong>Nonl<strong>in</strong>ear</strong>directionally selective subunits <strong>in</strong> complex cells <strong>of</strong> the cat striatecortex,” J. Neurophysiol., vol. 58, pp. 33–65, 1987.[29] J. van Kleef, A. C. James, and G. Stange, “A spatiotemporal white noiseanalysis <strong>of</strong> photoreceptor responses to UV and green light <strong>in</strong> the dragonflymedian ocellus,” J. Gen. Physiol., vol. 126, no. 5, pp. 481–497,2005.[30] V. Z. Marmarelis and T. W. Berger, “General methodology for nonl<strong>in</strong>earmodel<strong>in</strong>g <strong>of</strong> neural systems with poisson po<strong>in</strong>t-process <strong>in</strong>puts,”Math. Biosci., vol. 196, pp. 1–13, 2005.[31] A. Dimoka, S. H. Courellis, G. Gholmieh, V. Z. Marmarelis, and T. W.Berger, “<strong>Model<strong>in</strong>g</strong> the nonl<strong>in</strong>ear properties <strong>of</strong> the <strong>in</strong> vitro hippocampalperforant path-dentate system us<strong>in</strong>g multi-electrode array technology,”IEEE Trans. Biomed. Eng., vol. 55, no. 2, pp. 693–702, Feb. 2008, (<strong>in</strong>press).[32] T. P. Zanos, S. H. Courellis, R. E. Hampson, S. A. Deadwyler, V.Z. Marmarelis, and T. W. Berger, “A multi-<strong>in</strong>put model<strong>in</strong>g approachto quantify hippocampal nonl<strong>in</strong>ear dynamic transformations,” <strong>in</strong> Int.Conf. IEEE Eng. Med. Biol. Soc., New York, 2006, pp. 4967–4970.[33] D. Song, R. H. M. Chan, V. Z. Marmarelis, R. E. Hampson, S. A.Deadwyler, and T. W. Berger, “<strong>Nonl<strong>in</strong>ear</strong> dynamic model<strong>in</strong>g <strong>of</strong> spiketra<strong>in</strong> transformation for hippocampal-cortical prostheses,” IEEE Trans.Biomed. Eng., vol. 54, no. 6, pp. 1053–1065, Jun. 2007.[34] D. R. Brill<strong>in</strong>ger, “The identification <strong>of</strong> po<strong>in</strong>t process systems,” Ann.Probabil., vol. 3, pp. 909–924, 1975.[35] D. R. Brill<strong>in</strong>ger, H. L. Bryant, and J. P. Segundo, “Identification <strong>of</strong>synaptic <strong>in</strong>teractions,” Biol. Cybern., vol. 22, pp. 213–228, 1976.[36] D. R. Cox and P. A. W. Lewis, The Statistical Analysis <strong>of</strong> Series <strong>of</strong>Event. London, U.K.: Nethuen, 1966.[37] D. L. Snyder, Random Po<strong>in</strong>t Processes. New York: Wiley, 1975.[38] W. Maas, T. Natschlager, and H. Markram, “Real-time comput<strong>in</strong>gwithout stable states: A new framework for neural computation basedon perturbations,” Neural Comput., vol. 14, no. 11, pp. 2531–2560,2002.[39] Y. Ogata and H. Akaike, “On l<strong>in</strong>ear <strong>in</strong>tensity models for mixed doublystochastic poisson and self-excit<strong>in</strong>g po<strong>in</strong>t processes,” J. Roy. Stat. Soc.Ser. B, vol. 44, pp. 102–107, 1982.[40] D. R. Brill<strong>in</strong>ger, “Maximum likelihood approach to the identification<strong>of</strong> neuronal fir<strong>in</strong>g systems,” Ann. Biomed. Eng., vol. 16, no. 1, pp. 3–16.[41] L. Pan<strong>in</strong>ski, J. W. Pillow, and E. P. Simoncelli, “Maximum likelihoodestimation <strong>of</strong> a stochastic <strong>in</strong>tegrate-and-fire neural encod<strong>in</strong>g model,”Neural Comput., vol. 16, no. 12, pp. 2533–2561, 2004.[42] W. Truccolo, U. T. Eden, M. R. Fellows, J. P. Donoghue, and E. N.Brown, “A po<strong>in</strong>t process framework for relat<strong>in</strong>g neural spik<strong>in</strong>g activityto spik<strong>in</strong>g history, neural ensemble and extr<strong>in</strong>sic covariate effects,” J.Neurophysiol., vol. 93, no. 2, pp. 1074–1098, 2005.[43] M. Okatan, M. A. Wilson, and E. N. Brown, “Analyz<strong>in</strong>g functional connectivityus<strong>in</strong>g a network likelihood model <strong>of</strong> ensemble neural spik<strong>in</strong>gactivity,” Neural Comp., vol. 9, pp. 1927–1961, 2005.[44] G. Gholmieh, S. H. Courellis, V. Z. Marmarelis, and T. W. Berger,“An efficient method for study<strong>in</strong>g short-term plasticity with randomimpulse tra<strong>in</strong> stimuli,” J. Neurosci. Methods, vol. 121, pp. 111–127,2002.[45] G. Gholmieh, S. H. Courellis, V. Z. Marmarelis, and T. W. Berger,“<strong>Nonl<strong>in</strong>ear</strong> dynamic model <strong>of</strong> CA1 short-term plasticity us<strong>in</strong>g randomimpulse tra<strong>in</strong> stimulation,” Ann. Biomed. Eng., vol. 35, pp. 847–857,2007.[46] A. Gruart, M. D. Munoz, and J. M. Delgado-Garcia, “Involvement <strong>of</strong>the CA3-CA1 synapse <strong>in</strong> the acquisition <strong>of</strong> associative learn<strong>in</strong>g <strong>in</strong> behav<strong>in</strong>gmice,” J. Neurosci., vol. 26, no. 4, pp. 1077–1087, 2006.[47] B. Wilson, Cochlear Implant Technology. Philadelphia, PA: Lipp<strong>in</strong>cottWilliams Wilk<strong>in</strong>s, 2000.[48] J. D. Weiland and M. S. Humayun, “Intraocular ret<strong>in</strong>al prosthesis,”IEEE Eng. Med. Biol. Mag., vol. 25, no. 1, pp. 60–66, 2006.[49] A. B. Schwartz, X. T. Cui, D. J. Weber, and D. W. Moran, “Bra<strong>in</strong>controlled<strong>in</strong>terfaces: Movement restoration with neural prosthetics,”Neuron, vol. 52, no. 1, pp. 205–220, , 2006.[50] M. A. Lebedev and M. A. Nicolelis, “Bra<strong>in</strong>–mach<strong>in</strong>e <strong>in</strong>terfaces: Past,present and future,” Trends Neurosci, vol. 29, no. 9, pp. 536–546, Sep.2006.[51] V. Z. Marmarelis, “Identification <strong>of</strong> nonl<strong>in</strong>ear biological systemsus<strong>in</strong>g laguerre expansions <strong>of</strong> kernels,” Ann. Biomed. Eng., vol. 21, pp.573–589, 1993.[52] W. Hoeffd<strong>in</strong>g, “A class <strong>of</strong> statistics with asymptotically normal distribution,”Ann. Math. Statist., vol. 19, no. 3, pp. 293–325, 1948.[53] J. C. Sanchez, J. M. Carmena, M. A. Lebedev, M. A. Nicolelis, J. G.Harris, and J. C. Pr<strong>in</strong>cipe, “Ascerta<strong>in</strong><strong>in</strong>g the importance <strong>of</strong> neurons todevelop better bra<strong>in</strong>-mach<strong>in</strong>e <strong>in</strong>terfaces,” IEEE Trans. Biomed. Eng.,vol. 51, no. 6, pp. 943–953, Jun. 2004.[54] D. T. Westwick, E. A. Pohlmeyer, S. A. Solla, L. E. Miller, and E. J.Perreault, “Identification <strong>of</strong> multiple-<strong>in</strong>put systems with highly coupled<strong>in</strong>puts: Application to EMG prediction from multiple <strong>in</strong>tracorticalelectrodes,” Neural Comput., vol. 18, pp. 329–355, Feb. 2006.[55] Q. R. Quiroga, A. Kraskov, T. Kreuz, and P. Grassberger, “Performance<strong>of</strong> different synchronization measures <strong>in</strong> real data: A case studyon electroencephalographic signals,” Phys. Rev. E, vol. 65, p. 041903,2002.[56] T. Schreiber, “Measur<strong>in</strong>g <strong>in</strong>formation transfer,” Phys. Rev. Lett., vol.85, no. 2, pp. 461–464, 2000.[57] F. Mormann, K. Lehnertz, P. David, and C. E. Elger, “Mean phasecoherenceas a measure for phase synchronization and its applicationto the EEG <strong>of</strong> epilepsy patients,” Physica D, vol. 144, no. 3–4, pp.358–369, 2000.[58] J.-P. Lachaux, E. Rodriguez, J. Mart<strong>in</strong>erie, and F. J. Varela, “Measur<strong>in</strong>gphase synchrony <strong>in</strong> bra<strong>in</strong> signals,” Hum. Bra<strong>in</strong> Mapp., vol. 8,pp. 194–208, 1999.[59] K. Sameshima and L. A. Baccala, “Us<strong>in</strong>g partial directed coherence todescribe neuronal ensemble <strong>in</strong>teractions,” J. Neurosci. Meth., vol. 15,pp. 93–103, 1999.Theodoros P. Zanos (S’00) was born <strong>in</strong> Drama,Greece, on July 31, 1980. He received the Diplomadegree <strong>in</strong> electrical and computer eng<strong>in</strong>eer<strong>in</strong>g fromthe Aristotle University <strong>of</strong> Thessaloniki, Thessaloniki,Greece, <strong>in</strong> 2004, and the M.S. degree <strong>in</strong>biomedical eng<strong>in</strong>eer<strong>in</strong>g, <strong>in</strong> 2006, from the University<strong>of</strong> Southern California, Los Angeles, where he iscurrently work<strong>in</strong>g toward the Ph.D. degree <strong>in</strong> theDepartment <strong>of</strong> Biomedical Eng<strong>in</strong>eer<strong>in</strong>g.S<strong>in</strong>ce 2004, he has been a Research Assistantat the National Science Foundation (NSF)-fundedBiomimetic Microelectronic Systems Eng<strong>in</strong>eer<strong>in</strong>g Resource Center andthe NIH/National Institute <strong>of</strong> Biomedical Imag<strong>in</strong>g and Bioeng<strong>in</strong>eer<strong>in</strong>g(NIH/NIBIB)-funded Biomedical Simulation Resource. His ma<strong>in</strong> research<strong>in</strong>terests are <strong>in</strong> the areas <strong>of</strong> nonl<strong>in</strong>ear system identification and model<strong>in</strong>g <strong>of</strong>neural systems, neural prosthesis, nonparametric methods <strong>of</strong> MIMO systemsmodel<strong>in</strong>g and spatiotemporal/spectrotemporal model<strong>in</strong>g <strong>of</strong> nonl<strong>in</strong>ear systems,with applications to neural <strong>in</strong>formation process<strong>in</strong>g.Mr. Zanos is a student member <strong>of</strong> Biomedical Eng<strong>in</strong>eer<strong>in</strong>g Society and Societyfor Neuroscience.Spiros H. Courellis was born on June 3, 1962<strong>in</strong> Nemea, Greece. He received the electrical andcomputer eng<strong>in</strong>eer<strong>in</strong>g Diploma from the AristotleUniversity <strong>of</strong> Thessaloniki, Thessaloniki, Greece <strong>in</strong>1985, the M.S. degree <strong>in</strong> electrical eng<strong>in</strong>eer<strong>in</strong>g fromthe California Institute <strong>of</strong> Technology, Pasadena, <strong>in</strong>1986, and the Ph.D. degree <strong>in</strong> electrical eng<strong>in</strong>eer<strong>in</strong>gfrom the University <strong>of</strong> Southern California, LosAngeles, <strong>in</strong> 1992.From 1993 to 2004, he was a Research Associatewith the Biomedical Eng<strong>in</strong>eer<strong>in</strong>g Department at theUniversity <strong>of</strong> Southern California. He was also <strong>in</strong>volved <strong>in</strong> <strong>in</strong>dustrial researchand design, education, and leadership. Currently, he is a Research Assistant Pr<strong>of</strong>essor<strong>of</strong> Biomedical Eng<strong>in</strong>eer<strong>in</strong>g at the University <strong>of</strong> Southern California, LosAngeles, and an Assistant Pr<strong>of</strong>essor <strong>of</strong> Computer Science at California StateUniversity, Fullerton. His ma<strong>in</strong> research <strong>in</strong>terests are <strong>in</strong> the areas <strong>of</strong> computationalmodel<strong>in</strong>g <strong>of</strong> neural systems, biologically <strong>in</strong>spired <strong>in</strong>formation fusionsystems, biosensors, neural computation and memory, embedded comput<strong>in</strong>g,sensor networks, and distributed implantable secure medical devices.Dr. Courellis is a member <strong>of</strong> the ACM and the Society for Neuroscience.Authorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.


352 IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, VOL. 16, NO. 4, AUGUST 2008Theodore W. Berger (M’03–SM’04) received thePh.D. degree from Harvard University, Cambridge,MA, <strong>in</strong> 1976.He is the David Packard Pr<strong>of</strong>essor <strong>of</strong> Eng<strong>in</strong>eer<strong>in</strong>g,Pr<strong>of</strong>essor <strong>of</strong> Biomedical Eng<strong>in</strong>eer<strong>in</strong>g and Neuroscience,and Director <strong>of</strong> the Center for NeuralEng<strong>in</strong>eer<strong>in</strong>g at the University <strong>of</strong> Southern California(USC), Los Angeles. He conducted postdoctoralresearch at the University <strong>of</strong> California, Irv<strong>in</strong>e, from1977 to 1978, and was an Alfred P. Sloan FoundationFellow at The Salk Institute, La Jolla, CA, from1978 to 1979. He jo<strong>in</strong>ed the Departments <strong>of</strong> Neuroscience and Psychiatryat the University <strong>of</strong> Pittsburgh, Pittsburgh, PA, <strong>in</strong> 1979, be<strong>in</strong>g promoted toFull Pr<strong>of</strong>essor <strong>in</strong> 1987. S<strong>in</strong>ce 1992, he has been Pr<strong>of</strong>essor <strong>of</strong> BiomedicalEng<strong>in</strong>eer<strong>in</strong>g and Neurobiology at USC, and was appo<strong>in</strong>ted the David PackardChair <strong>of</strong> Eng<strong>in</strong>eer<strong>in</strong>g <strong>in</strong> 2003. In 1997, he became Director <strong>of</strong> the Center forNeural Eng<strong>in</strong>eer<strong>in</strong>g, Los Angeles, <strong>in</strong> 1997, an organization which helps to uniteUSC faculty with cross-discipl<strong>in</strong>ary <strong>in</strong>terests <strong>in</strong> neuroscience, eng<strong>in</strong>eer<strong>in</strong>g, andmedic<strong>in</strong>e. He has published many journal articles and book chapters, and is thecoeditor <strong>of</strong> a book recently published by the MIT Press on Toward ReplacementParts for the Bra<strong>in</strong>: Implantable Biomimetic Electronics as Neural Prostheses.Dr. Berger’s thesis work received the James McKeen Cattell Award from theNew York Academy <strong>of</strong> Sciences. Dur<strong>in</strong>g that time, he received a McKnightFoundation Scholar Award, twice received an NIMH Research Scientist DevelopmentAward, and was elected a Fellow <strong>of</strong> the American Association for theAdvancement <strong>of</strong> Science. While at USC, Dr. Berger has received an NIMHSenior Scientist Award, was awarded the Lockheed Senior Research Award<strong>in</strong> 1997, was elected a Fellow <strong>of</strong> the American Institute for Medical and BiologicalEng<strong>in</strong>eer<strong>in</strong>g <strong>in</strong> 1998, received a Person <strong>of</strong> the Year “Impact Award”from the AARP <strong>in</strong> 2004 for his work on neural prostheses, was a NationalAcademy <strong>of</strong> Sciences International Scientist Lecturer <strong>in</strong> 2003, and an IEEE Dist<strong>in</strong>guishedLecturer <strong>in</strong> 2004–2005. In 2005, he received a “Great M<strong>in</strong>ds, GreatIdeas” Award from the EE Times <strong>in</strong> the same year, and <strong>in</strong> 2006, was awardedUSC’s Associates Award for Creativity <strong>in</strong> Research and Scholarship.Robert E. Hampson received his Ph.D. degree <strong>in</strong>physiology from Wake Forest University, W<strong>in</strong>ston-Salem, NC, <strong>in</strong> 1988.He was a Postdoctoral Fellow and Assistant Pr<strong>of</strong>essorat Wake Forest University, W<strong>in</strong>ston-Salem,NC, from 1989 to 1998 when he was promoted tothe rank <strong>of</strong> Associate Pr<strong>of</strong>essor. He is an AssociatePr<strong>of</strong>essor <strong>in</strong> the Department <strong>of</strong> Physiology andPharmacology, Wake Forest University School <strong>of</strong>Medic<strong>in</strong>e, W<strong>in</strong>ston-Salem, NC. He is a past member<strong>of</strong> the IFCN-7 National Institutes <strong>of</strong> Health (NIH)review panel and has served as an ad hoc member <strong>of</strong> other review panels forNIH. His ma<strong>in</strong> <strong>in</strong>terests are <strong>in</strong> learn<strong>in</strong>g and memory, <strong>in</strong> particular decipher<strong>in</strong>gthe neural code utilized by the hippocampus and other related structures toencode behavioral events and cognitive decisions. His other <strong>in</strong>terests are <strong>in</strong>l<strong>in</strong>ear and nonl<strong>in</strong>ear model<strong>in</strong>g <strong>of</strong> neural encod<strong>in</strong>g <strong>of</strong> cognitive processes as wellas alterations <strong>of</strong> that encod<strong>in</strong>g by drugs <strong>of</strong> abuse. He has published extensively<strong>in</strong> the areas <strong>of</strong> cannab<strong>in</strong>oid effects on behavior and electrophysiology, and thecorrelation <strong>of</strong> behavior with mult<strong>in</strong>euron activity patterns, particularly apply<strong>in</strong>gl<strong>in</strong>ear discrim<strong>in</strong>ant analysis to neural data to decipher population encod<strong>in</strong>g andrepresentation.Samuel A. Deadwyler received the Ph.D. degreefrom the State University <strong>of</strong> New York, StonyBrook, and conducted postdoctoral research at theUniversity <strong>of</strong> California, Irv<strong>in</strong>e.He is a Pr<strong>of</strong>essor with the Department <strong>of</strong> Physiologyand Pharmacology, Wake Forest UniversitySchool <strong>of</strong> Medic<strong>in</strong>e (WFUSM), W<strong>in</strong>ston-Salem,NC, where he has been s<strong>in</strong>ce 1978. He has beenfunded by the National Institutes <strong>of</strong> Health (NIH)cont<strong>in</strong>uously s<strong>in</strong>ce 1974. He is on the editorialboard <strong>of</strong> the journal Hippocampus and has been an<strong>in</strong>vited Grass Foundation lecturer on several different occasions. He is pastpresident <strong>of</strong> the International Cannab<strong>in</strong>oid Research Society and was electedto the Board <strong>of</strong> Directors <strong>of</strong> the College on Problems <strong>of</strong> Drug Dependence fora four-year term. He has published extensively <strong>in</strong> journal articles, chapters,and books on his research <strong>in</strong>to neural mechanisms <strong>of</strong> learn<strong>in</strong>g and memory aswell as cellular neurophysiological <strong>in</strong>vestigation <strong>of</strong> drug actions. His research<strong>in</strong>terests <strong>in</strong>clude mechanisms <strong>of</strong> <strong>in</strong>formation encod<strong>in</strong>g <strong>in</strong> neuronal populations,relationship between neuronal codes and behavioral performance, mechanisms<strong>of</strong> enhancement <strong>of</strong> short-term memory <strong>in</strong> relation to neural codes generated <strong>in</strong>the hippocampus and frontal cortex. He served on the editorial board for theJournal <strong>of</strong> Neuroscience from 1996 to 2005 dur<strong>in</strong>g which time he was also areview<strong>in</strong>g editor.Dr. Deadwyler is the recipient <strong>of</strong> a National Institutes <strong>of</strong> Health Senior ResearchScientist award from 1987 to 2008 and also a MERIT award from 1990to 2000.Vasilis Z. Marmarelis (M’79–SM’94–F’97) wasborn <strong>in</strong> Mytil<strong>in</strong>e, Greece, on November 16, 1949. Hereceived the Diploma degree <strong>in</strong> electrical eng<strong>in</strong>eer<strong>in</strong>gand mechanical eng<strong>in</strong>eer<strong>in</strong>g from the NationalTechnical University <strong>of</strong> Athens, Athens, Greece, <strong>in</strong>1972 and the M.S. and Ph.D. degrees <strong>in</strong> eng<strong>in</strong>eer<strong>in</strong>gscience (<strong>in</strong>formation science and bio<strong>in</strong>formationsystems) from the California Institute <strong>of</strong> Technology,Pasadena, <strong>in</strong> 1973 and 1976, respectively.After two years <strong>of</strong> postdoctoral work at theCalifornia Institute <strong>of</strong> Technology, he jo<strong>in</strong>ed thefaculty <strong>of</strong> Biomedical and Electrical Eng<strong>in</strong>eer<strong>in</strong>g at the University <strong>of</strong> SouthernCalifornia, Los Angeles, where he is currently Pr<strong>of</strong>essor and Director <strong>of</strong> theBiomedical Simulations Resource, a research center funded by the NationalInstitutes <strong>of</strong> Health s<strong>in</strong>ce 1985 and dedicated to model<strong>in</strong>g/simulation studies<strong>of</strong> biomedical systems. He served as Chairman <strong>of</strong> the Biomedical Eng<strong>in</strong>eer<strong>in</strong>gDepartment from 1990 to 1996. His ma<strong>in</strong> research <strong>in</strong>terests are <strong>in</strong> the areas <strong>of</strong>nonl<strong>in</strong>ear and nonstationary system identification and model<strong>in</strong>g, with applicationsto biology and medic<strong>in</strong>e. His other <strong>in</strong>terests <strong>in</strong>clude spatio-temporal andMIMO model<strong>in</strong>g <strong>of</strong> nonl<strong>in</strong>ear systems, with applications to neural <strong>in</strong>formationprocess<strong>in</strong>g, closed-loop system model<strong>in</strong>g, and high-resolution 3-D ultrasonicimag<strong>in</strong>g and tissue classification. He is coauthor <strong>of</strong> Analysis <strong>of</strong> PhysiologicalSystem: The White Noise Approach (Plenum, 1978; Russian translation:Moscow, Mir Press, 1981; Ch<strong>in</strong>ese translation: Academy <strong>of</strong> Sciences Press,Beij<strong>in</strong>g, 1990), editor <strong>of</strong> three research volumes on Advanced Methods <strong>of</strong>Physiological System <strong>Model<strong>in</strong>g</strong> (Plenum, 1987, 1989, 1994) and author <strong>of</strong> amonograph on <strong>Nonl<strong>in</strong>ear</strong> Dynamic <strong>Model<strong>in</strong>g</strong> <strong>of</strong> Physiological Systems (IEEEPress and Wiley Interscience, 2004). He has published more than 100 papersand book chapters <strong>in</strong> the areas <strong>of</strong> system model<strong>in</strong>g and signal analysis.Dr. Marmarelis is a Fellow <strong>of</strong> the American Institute for Medical and BiologicalEng<strong>in</strong>eer<strong>in</strong>g.Authorized licensed use limited to: University <strong>of</strong> Southern California. Downloaded on November 20, 2008 at 19:53 from IEEE Xplore. Restrictions apply.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!