NASA Scientific and Technical Aerospace Reports
discrimination information (or the cross-entropy) between the source and the model is proposed. This approach does not
require the commonly used assumption that the source to be modeled is a hidden Markov process. The algorithm is started
from the model estimated by the traditional maximum likelihood (ML) approach and alternately decreases the discrimination
information over all probability distributions of the source which agree with the given measurements and all hidden Markov
models. The proposed procedure generalizes the Baum algorithm for ML hidden Markov modeling. The procedure is shown
to be a descent algorithm for the discrimination information measure, and its local convergence is proved.
Author
Markov Processes; Information Theory; Information Systems; Probability Distribution Functions; Maximum Likelihood
Estimates
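The discrimination information the abstract minimizes is the Kullback-Leibler divergence between a source distribution and a model distribution. A minimal numerical sketch, using hypothetical three-symbol distributions (the abstract's actual distributions range over hidden Markov models):

```python
import math

def discrimination_information(p, q):
    """Kullback-Leibler divergence D(p || q) in nats: the discrimination
    information between a source distribution p and a model distribution q."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical source and two candidate model distributions.
source = [0.5, 0.3, 0.2]
model_a = [0.4, 0.4, 0.2]
model_b = [0.5, 0.25, 0.25]

# A minimum-discrimination-information criterion prefers the model with
# the smaller divergence from the source; a descent algorithm such as the
# one in the abstract decreases this measure at every iteration.
da = discrimination_information(source, model_a)
db = discrimination_information(source, model_b)
```

Here `model_b` is closer to the source, so its divergence is smaller; the divergence of any distribution from itself is zero.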
20060001627 International Business Machines Corp., Paris, France<br />
Context-Dependent Phonetic Markov Models for Large Vocabulary Speech Recognition<br />
Derouault, Anne-Marie; IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ‘87); Volume
1; 1987, pp. 10.1.1 - 10.1.4; In English; See also 20060001583; Copyright; Avail.: Other Sources
One approach to large vocabulary speech recognition is to build phonetic Markov models and to concatenate them to
obtain word models. In previous work, we designed a recognizer based on 40 phonetic Markov machines, which accepts a
10,000-word vocabulary ([3]) and, more recently, a 200,000-word vocabulary ([5]). Since there is one machine per
phoneme, these models obviously do not account for coarticulatory effects, which may lead to recognition errors. In this paper,
we improve the phonetic models by using general principles about coarticulation effects on automatic phoneme recognition.
We show that both the analysis of the errors made by the recognizer and linguistic facts about phonetic context influence
suggest a method for choosing context-dependent models. This method limits the growth of the number of phonemes
while still accounting for the most important coarticulation effects. We present our experiments with a system applying these
principles to a set of models for French. With this new system including context-dependent machines, the phoneme recognition
rate goes from 82.2% to 85.3%, and the word error rate with a 10,000-word dictionary decreases from 11.2% to 9.8%.
Author
Context; Phonemes; Error Analysis; Phonetics; Words (Language); Speech Recognition; Linguistics
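The abstract's core idea, one Markov machine per phoneme, concatenated into word models, with context-dependent variants substituted only where coarticulation matters most, can be sketched as follows. All names and the variant inventory are illustrative, not taken from the paper:

```python
# Base inventory: one machine per phoneme (machine names are hypothetical).
base_models = {"b": "M_b", "o": "M_o", "n": "M_n"}

# A small set of context-dependent variants, keyed by (left_context, phoneme).
# Keeping this set small limits the growth of the model inventory while still
# covering the most important coarticulation effects.
context_models = {("b", "o"): "M_o_after_b"}

def word_model(phonemes):
    """Build a word model by concatenating phone machines, preferring a
    context-dependent variant of a phoneme's machine when one exists."""
    machines = []
    left = None  # left phonetic context, None at word start
    for ph in phonemes:
        machines.append(context_models.get((left, ph), base_models[ph]))
        left = ph
    return machines
```

For the phoneme sequence `["b", "o", "n"]`, the vowel machine after `b` is replaced by its context-dependent variant, while the other phonemes keep their base machines.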
20060001657 Mitre Corp., McLean, VA, USA<br />
Information-Theoretic Compressibility of Speech Data<br />
Ramsey, L. Thomas; Gribble, David; IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP
‘87); Volume 1; 1987, pp. 1.6.1 - 1.6.4; In English; See also 20060001583; Copyright; Avail.: Other Sources
Two standard reversible coding algorithms, Ziv-Lempel and a dynamic Huffman algorithm, are applied to various types
of speech data. The data tested were PCM, DPCM, and prediction residuals from LPC. Neither algorithm shows much promise
on small amounts of data, but both performed well on large amounts. Typically the Ziv-Lempel required about 12 seconds of
data (at 8000 samples per second) to reach a stable compression rate. The dynamic Huffman coding took much less time
to ‘warm up’, often needing something like 64 milliseconds. Approximately 66 seconds of PCM with 12 bits per sample was
compressed 6.4% by the Ziv-Lempel coding and 20.7% by the dynamic Huffman coding. The same numbers for DPCM with
13 bits per sample are 17.7% and 35.6%, respectively. The prediction residuals had compression rates very close to those of
DPCM, regardless of whether 1, 2, 5, or 10 prediction coefficients were used.
Author
Information Theory; Compressibility; Predictions; Speech; Coefficients; Differential Pulse Code Modulation
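The abstract's implementations are not given; as a rough modern stand-in, zlib's DEFLATE (which combines LZ77, a Ziv-Lempel variant, with Huffman coding) can illustrate how such compression percentages are measured, and why residual-like data with most values near zero compresses better than wide-range samples. The synthetic byte streams below are only crude analogues of the PCM and LPC-residual data in the paper:

```python
import zlib
import random

def compression_percent(data: bytes) -> float:
    """Size reduction achieved by DEFLATE, as a percentage of the input size
    (e.g. 20.0 means the compressed stream is 20% smaller than the input)."""
    compressed = zlib.compress(data, level=9)
    return 100.0 * (1 - len(compressed) / len(data))

random.seed(0)
# One second of data at 8000 samples/second, as in the paper's setup.
# "PCM-like": samples spread uniformly over the full byte range (high entropy).
pcm_like = bytes(random.randrange(256) for _ in range(8000))
# "Residual-like": samples concentrated near zero (lower entropy), mimicking
# prediction residuals, which the paper found compressed like DPCM.
residual_like = bytes(min(255, abs(int(random.gauss(0, 8)))) for _ in range(8000))
```

On these streams the residual-like data compresses substantially, while the uniform stream barely compresses at all, consistent with the paper's observation that lower-entropy representations (DPCM, LPC residuals) yield higher compression rates.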
20060001668 American Telephone and Telegraph Co., NJ, USA
A Connected Speech Recognition System Based on Spotting Diphone-Like Segments - Preliminary Results<br />
Rosenberg, A. E.; Colla, A. M.; IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP ‘87);
Volume 1; 1987, pp. 3.6.1 - 3.6.4; In English; See also 20060001583; Copyright; Avail.: Other Sources
A template-based connected speech recognition system, which represents words as sequences of diphone-like segments,
has been implemented and evaluated. The inventory of segments is divided into two principal classes: ‘steady-state’ speech
sounds such as vowels, fricatives, and nasals, and ‘composite’ speech sounds consisting of sequences of two or more speech
sounds in which the transitions from one sound to another are intrinsic to the representation of the composite sound. Templates
representing these segments are extracted from labelled training utterances. Words are represented by network models whose
branches are diphone segments. Word juncture phenomena are accommodated by including segment branches that characterize
transition pronunciations between specified classes of words. The recognition of a word in a specified utterance takes place