20.07.2014 Views

Analysis of Noise — Part V - Spectroscopy

Analysis of Noise — Part V - Spectroscopy

Analysis of Noise — Part V - Spectroscopy

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

I N S P E C T R O S C O P Y<br />

<strong>Analysis</strong> <strong>of</strong> <strong>Noise</strong> — <strong>Part</strong> V<br />

H OWARD M ARK AND J EROME W ORKMAN J R .<br />

This column is the continuation <strong>of</strong> a series dealing with the rigorous<br />

derivation <strong>of</strong> the expressions relating the effect <strong>of</strong> instrument<br />

(and other) noise to its effect on the spectra we observe<br />

(1–4). Our first column in this series was an overview; since<br />

then we’ve been analyzing the effect <strong>of</strong> noise on spectra when<br />

the noise is constant detector noise — for example, noise that is independent<br />

<strong>of</strong> the strength <strong>of</strong> the optical signal. Inasmuch as we are<br />

dealing with a continuous set <strong>of</strong> columns, we resume our discussion<br />

by continuing our equation numbering, figure numbering, use <strong>of</strong><br />

symbols, and so forth, as though there were no break.<br />

UPDATE<br />

It seems we said something wrong. When we first began this series<br />

<strong>of</strong> columns dealing with the effects <strong>of</strong> various kinds <strong>of</strong> noise on<br />

spectra (1, 2), we said that there seemingly wasn’t any recent attention<br />

paid to the question <strong>of</strong> noise in spectra. It turns out that this is<br />

not quite true. Edward Voightman pointed out that, in fact, he performed<br />

and published computer simulation studies <strong>of</strong> just this subject<br />

(5, 6). His studies were based on computer simulations <strong>of</strong> the<br />

behavior <strong>of</strong> various analytical instruments in various situations using<br />

a simulation engine described in an Analytical Chemistry report<br />

(7). In addition to the simulations <strong>of</strong> spectrometers, he also published<br />

simulations <strong>of</strong> polarimeters (8, 9) with results that are interesting,<br />

if not directly applicable to our current study. The diagrams<br />

he published (5) clearly show the difference in the optimum absorbance<br />

values (such as minimum relative absorbance error) between<br />

these simulations and the previous conventional theory. Unfortunately<br />

the noise levels <strong>of</strong> the simulations were too high to<br />

precisely determine the actual minimum. When Voightman contacted<br />

us to inform us about these papers, we discussed the results<br />

he obtained. He revealed that, because <strong>of</strong> the limitations <strong>of</strong> the computer<br />

hardware available at the time the simulations were performed,<br />

he could not use more than a few hundred repeats <strong>of</strong> the<br />

Monte-Carlo experiments, resulting in the high noise levels observed.<br />

Having seen our early columns dealing with this topic (1,<br />

2), he reprogrammed his simulation engine to perform new simulations<br />

and compared the results with the exact solution we derived<br />

(see equation 19 [2]), using much more extensive Monte-Carlo<br />

calculations, and found excellent agreement (10). We are grateful<br />

to Voightman for pointing out the literature that we had previously<br />

missed, as well as sharing the results <strong>of</strong> his new simulations<br />

with us.<br />

Figure 7. Relative noise level (SD(T )) as a function <strong>of</strong> E r at the lowsignal<br />

limit according to equation 71, for values <strong>of</strong> E r less than unity<br />

(that is, one standard deviation <strong>of</strong> E r ).<br />

CONTINUATION<br />

Now let us recap where we came from, in this series-within-a-series,<br />

and where we are going. In the second column (2) we demonstrated<br />

that because previous treatments <strong>of</strong> this topic failed to take<br />

into account the effect <strong>of</strong> the noise <strong>of</strong> the reference reading, they<br />

did not come up with the rigorously correct formula to describe the<br />

effect <strong>of</strong> noise on the computed value <strong>of</strong> transmittance. The rigorously<br />

exact solution to this situation shows that the noise level <strong>of</strong> a<br />

transmittance spectrum increases with the transmittance <strong>of</strong> the<br />

sample, rather than being independent <strong>of</strong> the sample characteristics,<br />

as previously thought.<br />

We then continued the development <strong>of</strong> those equations (3) to<br />

show the effect <strong>of</strong> the random noise on absorbance spectra, and on<br />

the relative precision: SD(A)/A, in both cases comparing the result<br />

<strong>of</strong> the rigorous treatment <strong>of</strong> the topic to the previous mathematical<br />

analysis, and showing that in both cases, the results from the rigorous<br />

treatment differ slightly but noticeably from the previous results.<br />

Finally we developed and solved the equations for the minimum<br />

in the curve <strong>of</strong> SD(A)/A, this being the generally accepted<br />

criterion for determining the best value <strong>of</strong> transmittance (or absorbance)<br />

that a sample should have to obtain the most accurate results<br />

from this form <strong>of</strong> spectroscopic chemical analysis. Our conclusion<br />

was that the optimum value <strong>of</strong> transmittance under those<br />

conditions — constant detector noise — is approximately<br />

33 %T rather than the previously accepted 36.8 %T.<br />

We next noted (4) that all the results obtained up to that point<br />

were relevant only to the condition in which the detector noise was<br />

34 SPECTROSCOPY 16(4) APRIL 2001 www.spectroscopyonline.com


...........................<br />

CHEMOMETRICS IN SPECTROSCOPY ............................<br />

small compared with the reference signal, and therefore the signalto-noise<br />

ratio (S/N) was high. We then noted that if that condition<br />

did not hold for any particular set <strong>of</strong> measurements, then other phenomena<br />

also come into action. We pointed out that under low-noise<br />

conditions the signal can affect the noise level, but under conditions<br />

in which the signal is weak or the noise excessive, the noise can affect<br />

the computed transmittance as well. The expressions we obtained<br />

showed that as the reference signal gets weaker and weaker<br />

(or the noise gets larger and larger), the system first reaches a<br />

point at which the expected value <strong>of</strong> T is larger than E s<br />

/E r<br />

, and as<br />

the reference signal continues to decrease, the multiplying factor<br />

first goes through a maximum and then decreases so that the expected<br />

value <strong>of</strong> T approaches zero as the reference signal energy,<br />

E r<br />

, approaches zero.<br />

We are now ready to consider the behavior <strong>of</strong> the noise under<br />

conditions in which it is not small compared with the signal.<br />

We start with the definition <strong>of</strong> transmittance, as we pointed out<br />

previously, and we rewrite the equation here:<br />

T E s /E r [6]<br />

To put equation 6 into a usable form under the conditions we<br />

wish to consider, we could start from any <strong>of</strong> several points <strong>of</strong> view:<br />

the statistical approach <strong>of</strong> Hald (11), for example, which starts from<br />

fundamental probabilistic considerations and also derives confidence<br />

intervals (albeit for various special cases only); the mathematical<br />

approach (12) or the Propagation <strong>of</strong> Uncertainties approach<br />

<strong>of</strong> Ingle and Crouch (13). Inasmuch as any <strong>of</strong> these starting points<br />

will arrive at the same result when done properly, the choice <strong>of</strong> how<br />

to attack an equation such as equation 6 is a matter <strong>of</strong> familiarity,<br />

simplicity, and, to some extent, taste. We, being chemists and spectroscopists,<br />

and writing for spectroscopists, will use the Propagation<br />

<strong>of</strong> Uncertainties approach <strong>of</strong> Ingle and Crouch:<br />

[64]<br />

Note that we use the letters C,D to represent the variables in equation<br />

64 to avoid confusion with our usage <strong>of</strong> A to mean absorbance.<br />

Applying this to equation 6:<br />

and apply first, the theorem that Var (A B) Var (A) Var (B):<br />

and then the theorem that Var (aX) a 2 Var (X):<br />

and continue as before by setting E s<br />

E r<br />

E:<br />

and finally take square roots to obtain:<br />

[68]<br />

[69]<br />

[70]<br />

[71]<br />

This is clearly a function <strong>of</strong> both E r and E s ; in the regime we are<br />

concerned with in this column, however, as E r approaches zero, the<br />

second term under the radical dominates the expression, although<br />

clearly the point at which the numerical value becomes large compared<br />

to 1/E r 2 will depend on the value <strong>of</strong> E s as well, or equivalently,<br />

the transmittance <strong>of</strong> the sample. Here, again, therefore, the<br />

behavior <strong>of</strong> the noise <strong>of</strong> the transmittance must be expressed as a<br />

family <strong>of</strong> curves. Figures 7 and 8 present the behavior <strong>of</strong> this family<br />

<strong>of</strong> curves as functions <strong>of</strong> E r and E s , respectively.<br />

Note that equation 71 can be reduced to equation 19 (2), which is<br />

appropriate when the signal-to-noise ratio is high and may be considered<br />

constant. Under these conditions E r is large and the second<br />

term under the radical is small and the first term under the radical,<br />

which is independent <strong>of</strong> E s , dominates; then the noise <strong>of</strong> the trans-<br />

As usual, we take the variance <strong>of</strong> this:<br />

[65]<br />

[66]<br />

Figure 8. Relative noise<br />

level (SD(T )) as a function<br />

<strong>of</strong> E s at the lowsignal<br />

limit according to<br />

equation 71, for values <strong>of</strong><br />

E s between 0 and 1. The<br />

lower plot (b) is an expansion<br />

<strong>of</strong> the upper<br />

plot (a).<br />

[67]<br />

APRIL 2001 16(4) SPECTROSCOPY 35


...........................<br />

CHEMOMETRICS IN SPECTROSCOPY ............................<br />

mittance increases with T as and inversely with the reference<br />

energy.<br />

Here, however, under low-signal, high-noise conditions, where<br />

the variation <strong>of</strong> E r<br />

cannot be ignored and therefore the signal-tonoise<br />

ratio varies, we must use the full expression <strong>of</strong> equation 71.<br />

Note further that when E r<br />

is small enough, then, as mentioned earlier,<br />

the second term under the radical dominates, then:<br />

[72]<br />

Table II. Value <strong>of</strong> integral <strong>of</strong> (1/E r<br />

) 2 over the range 0.01 to<br />

0.01.<br />

Integration Interval<br />

Value <strong>of</strong> Integral<br />

10 2 2.0000000000000000e002<br />

10 3 3.0995354623330845e003<br />

10 4 3.2699678003698089e004<br />

10 5 3.2878691333625099e005<br />

10 6 3.2896681436917488e006<br />

10 7 3.2898481337470137e007<br />

The noise <strong>of</strong> the transmittance thus becomes directly proportional<br />

to T and inversely proportional to E r<br />

. Under these conditions<br />

the noise <strong>of</strong> the transmittance approaches infinite values as E r<br />

approaches<br />

zero, even as the expected value <strong>of</strong> the transmittance approaches<br />

zero, as we saw in the previous column(4).<br />

To summarize the effects at low signal-to-noise to compare with<br />

the high signal-to-noise case summarized earlier, here the noise <strong>of</strong><br />

the transmittance increases directly with T and still inversely with<br />

the reference energy.<br />

We now wish to follow through, as we did before, on finding the<br />

optimum value for sample transmittance under these conditions. To<br />

do this, we start with equation 24 (3):<br />

[24]<br />

This is the point at which, in the previous development, we considered<br />

the effect <strong>of</strong> letting E r become negligible, but <strong>of</strong> course in<br />

this case we wish to investigate the small-signal/large-noise behavior.<br />

We now, therefore, go directly to dividing A by A (from equation<br />

20b):<br />

[73]<br />

[74]<br />

[75]<br />

[76]<br />

To determine the variance <strong>of</strong> A/A we perform our usual exercise<br />

<strong>of</strong> taking the variance <strong>of</strong> both sides <strong>of</strong> equation 76 and applying<br />

our two favorite theorems; the result is:<br />

[77]<br />

We cannot simplify this equation further; in particular, we cannot<br />

separate out the variances <strong>of</strong> E s and E r to replace them with the<br />

same generic value. To determine the variance <strong>of</strong> A/A — that is,<br />

the relative precision (in chemist’s terms) — we need to evaluate<br />

the variance <strong>of</strong> the two terms in equation 77. As we observed previously,<br />

as the value <strong>of</strong> E r approaches E r , the value <strong>of</strong> the expressions<br />

attains infinite values. However, a difference here is that when<br />

the variance is computed these values are squared, and hence the<br />

computations are always done using positive values. This differs<br />

from our previous case, where the presence <strong>of</strong> both positive and<br />

negative values afforded the opportunity for cancellation <strong>of</strong> near-infinite<br />

contributions; we do not have that situation here. Therefore<br />

we are faced with the possibility that the variance will be infinite.<br />

An empirical test <strong>of</strong> this possibility was performed by computing<br />

values <strong>of</strong> the variance <strong>of</strong> the two terms in equation 77. The normal<br />

random number generator <strong>of</strong> MATLAB was used to create multiple<br />

values <strong>of</strong> normally distributed random numbers for E r and E s ; these<br />

were plugged into the two expressions <strong>of</strong> equation 77 and the variance<br />

was computed. Between 10 2 and 10 6 values were used in each<br />

computation <strong>of</strong> the variance.<br />

When E r was more than five standard deviations away from the<br />

center <strong>of</strong> the normal distribution representing E r , the computed<br />

variance was fairly small and reasonably stable, and decreased as E r<br />

was moved further away from the center <strong>of</strong> E r . This might be considered<br />

an empirical determination <strong>of</strong> the point <strong>of</strong> demarcation <strong>of</strong><br />

the “small-signal” case.<br />

When E r was moved below five standard deviations, the computed<br />

value <strong>of</strong> the variance became very unstable; computed values<br />

<strong>of</strong> the variance would differ by as much as four orders <strong>of</strong> magnitude.<br />

The closer E r came to E r , the more erratic the computed<br />

variance became. It was clear that bringing E r close to the center <strong>of</strong><br />

E r afforded more opportunity for a given reading <strong>of</strong> the noise to<br />

become close to E r<br />

, thus giving a value approaching infinity that<br />

would be included in the calculation. Furthermore, for a given relationship<br />

between E r<br />

and E r<br />

, the more readings that were included<br />

in the computation, the higher the values <strong>of</strong> variance that would be<br />

calculated. For example, with 100 readings, values <strong>of</strong> variance<br />

might fall between 10 1 and 10 4 while, with 10,000 readings, calculated<br />

variance values would fall in the range <strong>of</strong> approximately<br />

36 SPECTROSCOPY 16(4) APRIL 2001 www.spectroscopyonline.com


...........................<br />

CHEMOMETRICS IN SPECTROSCOPY ............................<br />

10 3 –10 6 . This is attributed to the increased likelihood <strong>of</strong> more data<br />

points being close to E r<br />

and also <strong>of</strong> at least a few points being<br />

closer to E r<br />

than with fewer data.<br />

Another test <strong>of</strong> whether the variance actually diverges and becomes<br />

infinite is the same as the test we applied in the previous column:<br />

to integrate the expressions in equation 77 in a small region<br />

around the point E r<br />

E r<br />

using different intervals <strong>of</strong> integration<br />

and see if the values converge or diverge. Basically, except for a<br />

multiplying factor, these are both the same expression, so evaluating<br />

the expression once suffices to settle the question for both <strong>of</strong><br />

them. Furthermore, because we are integrating over values <strong>of</strong> variance,<br />

the expression that needs to be integrated is (1/E r<br />

) 2 . The result<br />

<strong>of</strong> performing this test is presented in Table II. In contrast to<br />

the previous test results, the values are clearly growing increasingly<br />

larger without bound as the integration interval is reduced.<br />

The conclusion from all this is that the variance and therefore the<br />

standard deviation attains infinite values when the reference energy<br />

is so low that it includes the value zero. However, in a probabilistic<br />

way, it is still possible to perform computations in this regime and<br />

obtain at least some rough idea <strong>of</strong> how the various quantities involved<br />

will change as the reference energy approaches zero. After<br />

all, real data are obtained with a finite number <strong>of</strong> readings, each <strong>of</strong><br />

which is finite, and will give some finite answer. What we can do for<br />

the rest <strong>of</strong> this current analysis is perform empirical computations<br />

to find out what the expectation for that behavior is; we will do that<br />

in the next column.<br />

REFERENCES<br />

(1) H. Mark and J. Workman, <strong>Spectroscopy</strong> 15(10), 24–25 (2000).<br />

(2) H. Mark and J. Workman, <strong>Spectroscopy</strong> 15(11), 20–23 (2000).<br />

(3) H. Mark and J. Workman, <strong>Spectroscopy</strong> 15(12), 14–17 (2000).<br />

(4) H. Mark and J. Workman, <strong>Spectroscopy</strong> 16(2), 44–52 (2001).<br />

(5) E. Voightman, Anal. Instrum. 21(1&2), 43–62 (1993).<br />

(6) E. Voightman, Anal. Chem. 69(2), 226–234 (1997).<br />

(7) E. Voightman, Anal. Chem. 65, 1029A–1035A (1993).<br />

(8) E. Voightman, Anal. Chem. 64, 2590–2598 (1992).<br />

(9) E. Voightman, Analyst 120, 325–330 (1995).<br />

(10) E. Voightman, personal communication (2001).<br />

(11) A. Hald, Statistical Theory with Engineering Applications (Wiley, New<br />

York, 1952, pp. 115–118).<br />

(12) G. A. Korn and T.M.Korn, Mathematical Handbook for Scientists and<br />

Engineers (McGraw-Hill, New York, 1961, pp. 550–554).<br />

(13) J.D. Ingle and S.R. Crouch, Spectrochemical <strong>Analysis</strong> (Prentice-Hall,<br />

Upper Saddle River, NJ, 1988, p. 548).<br />

Howard Mark and Jerome Workman Jr. serve on the Editorial Advisory<br />

Board <strong>of</strong> <strong>Spectroscopy</strong>. Mark runs a consulting service, Mark Electronics,<br />

21 Terrace Avenue, Suffern, NY 10901, that provides assistance,<br />

training, and consultation in near-IR spectroscopy as well as<br />

custom hardware and s<strong>of</strong>tware design and development. He can be<br />

reached via e-mail at hlmark@j51.com.Workman is a research fellow<br />

at Kimberly-Clark Corporation, Neenah, WI 54956, (920) 721-<br />

3988. He can be reached via e-mail at jworkman@kcc.com. ◆<br />

Circle 25<br />

APRIL 2001 16(4) SPECTROSCOPY 37

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!