16.11.2012 Views

Brain–Computer Interfaces - Index of

Brain–Computer Interfaces - Index of

Brain–Computer Interfaces - Index of

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Digital Signal Processing and Machine Learning 317<br />

Initially, many features are extracted from various methods and from channels covering<br />

all potentially useful brain areas. Then, feature selection is employed to find<br />

the most suitable channel locations and feature extraction methods. This section<br />

describes two aspects <strong>of</strong> feature selection for BCIs: channel selection and frequency<br />

band selection.<br />

4.1 Channel Selection<br />

As mentioned before, not all electrodes distributed over the whole scalp are useful<br />

in BCI systems in general. Hence, channel selection is performed to select the useful<br />

channels according to the features used in a BCI system. For SSVEP-based BCIs,<br />

the EEG signals from the visual cortex are selected. For mu/beta-based BCIs, the<br />

channels in the sensorimotor area are selected. For P300-based BCIs, the channels<br />

exhibiting an obvious P300 are selected. For P300 or SSVEP-based BCIs, channel<br />

selection is <strong>of</strong>ten manually performed before other processing steps. Apart from<br />

physiological considerations, channel selection can be performed by applying a<br />

feature selection method to a training dataset. Examples <strong>of</strong> such a feature selection<br />

method include optimizing statistical measures like Student’s t-statistics, Fisher<br />

criterion [33] and bi-serial correlation coefficient [34] and genetic algorithms [35].<br />

In the following, channel selection based on the bi-serial correlation coefficient<br />

is described [34]. Let (x1, y1), ··· ,(xK, yK) be a sequence <strong>of</strong> one-dimensional observations<br />

(i.e. a single feature) with labels yk ∈ {1, −1}. Define X + as the set <strong>of</strong><br />

observations xk with label <strong>of</strong> 1, and X − the set <strong>of</strong> observations xk with label <strong>of</strong><br />

−1. Then bi-serial correlation coefficient r is calculated as<br />

rX =<br />

√ N + N −<br />

N + + N −<br />

mean(X− ) − mean(X + )<br />

std(X + � X− , (11)<br />

)<br />

where N + and N− are the numbers <strong>of</strong> observations <strong>of</strong> X + and X− respectively. r2- coefficient r2 X , which reflects how much <strong>of</strong> the variance in the distribution <strong>of</strong> all<br />

samples is explained by the class affiliation, is defined as a score for this feature.<br />

For each channel <strong>of</strong> a BCI system, a score is calculated as above using the observations<br />

from this channel. If the objective is to choose the N most informative<br />

channels, one would choose the channels with the top N scores.<br />

4.2 Frequency Band Selection<br />

As mentioned in Sect. 2.2, temporal filtering plays an important role in improving<br />

the signal-to-noise ratio in BCI systems [7], and temporal filtering is also important<br />

to extract band power features. Before temporal filtering, one or several frequency<br />

bands need to be determined that affect the performance <strong>of</strong> BCIs. The optimal<br />

frequency bands are generally subject-specific in BCIs. Frequency band selection<br />

is <strong>of</strong>ten performed using a feature selection method. As an example, we use the<br />

bi-serial correlation coefficient presented in Sect. 4.1 to carry out frequency band

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!