08.07.2015 Views

Nonnegativity Constraints in Numerical Analysis - CiteSeer

Nonnegativity Constraints in Numerical Analysis - CiteSeer

Nonnegativity Constraints in Numerical Analysis - CiteSeer

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

5.4 Environmetrics and ChemometricsIn the air pollution research community, one observational technique makes use of the ambientdata and source profile data to apportion sources or source categories [39, 44, 75]. Thefundamental pr<strong>in</strong>ciple <strong>in</strong> this model is that mass conservation can be assumed and a massbalance analysis can be used to identify and apportion sources of airborne particulate matter<strong>in</strong> the atmosphere. For example, it might be desirable to determ<strong>in</strong>e a large number of chemicalconstituents such as elemental concentrations <strong>in</strong> a number of samples. The relationshipsbetween p sources which contribute m chemical species to n samples leads to a mass balanceequationp∑y ij = a ik f kj , (36)k=1where y ij is the elemental concentration of the ith chemical measured <strong>in</strong> the j th sample,a ik is the gravimetric concentration of the ith chemical <strong>in</strong> the kth source, and f kj is theairborne mass concentration that the kth source has contributed to the j th sample. In atypical scenario, only values of y ij are observable whereas neither the sources are known northe compositions of the local particulate emissions are measured. Thus, a critical questionis to estimate the number p, the compositions a ik , and the contributions f kj of the sources.Tools that have been employed to analyze the l<strong>in</strong>ear model <strong>in</strong>clude pr<strong>in</strong>cipal componentanalysis, factor analysis, cluster analysis, and other multivariate statistical techniques. Inthis receptor model, however, there is a physical constra<strong>in</strong>t imposed upon the data. That is,the source compositions a ik and the source contributions f kj must all be nonnegative. Theidentification and apportionment problems thus become a nonnegative matrix factorizationproblem for the matrix Y.5.5 Speech RecognitionStochastic language model<strong>in</strong>g plays a central role <strong>in</strong> large vocabulary speech recognition,where it is usually implemented us<strong>in</strong>g the n-gram paradigm. In a typical application, thepurpose of an n-gram language model may be to constra<strong>in</strong> the acoustic analysis, guide thesearch through various (partial) text hypotheses, and/or contribute to the determ<strong>in</strong>ation ofthe f<strong>in</strong>al transcription.In language model<strong>in</strong>g one has to model the probability of occurrence of a predicted wordgiven its history P r(w n |H). N-gram based Language Models have been used successfully <strong>in</strong>Large Vocabulary Automatic Speech Recognition Systems. In this model, the word historyconsists of the N − 1 immediately preced<strong>in</strong>g words. Particularly, tri-gram language models(P r(w n |w n−1 ; w n−2 )) offer a good compromise between model<strong>in</strong>g power and complexity. Amajor weakness of these models is the <strong>in</strong>ability to model word dependencies beyond the spanof the n-grams. As such, n-gram models have limited semantic model<strong>in</strong>g ability. Alternatemodels have been proposed with the aim of <strong>in</strong>corporat<strong>in</strong>g long term dependencies <strong>in</strong>to themodel<strong>in</strong>g process. Methods such as word trigger models, high-order n-grams, cache models,etc., have been used <strong>in</strong> comb<strong>in</strong>ation with standard n-gram models.One such method, a Latent Semantic <strong>Analysis</strong> based model has been proposed [2]. Aword-document occurrence matrix X m×n is formed (m = size of the vocabulary, n = number22

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!