1 - LumenVox


Tuning Processes

LumenVox's Speech Tuner provides full support for LumenVox's Speech Recognition Engine, Nuance 8.5, ScanSoft OSR 2, and other ASRs. The Speech Tuner allows you to work with any supported ASR via a single interface. LumenVox is an active supporter of the Tools committee in the VXML Forum, and is working to help define standard logging information to ease the tuning process.

The tuning process involves three easy steps:

1. Import Data. The basic process is simple. Users import call log data into the Speech Tuner database. All information stored by the call log is available in the Speech Tuner. In most cases, log fields between ASR engines are very similar; when the information differs, every effort is made to preserve the original data. Each special case is fully documented.

2. Transcribe Speech. Transcribers can type the text of the caller's speech directly into the Speech Tuner. Once the audio is transcribed, the Tuner compares audio transcripts with the speech engine results to determine accuracy, greatly reducing errors associated with hand evaluations. If semantic interpretations are available, the transcriber can also mark whether the semantic interpretation was correct or incorrect. The transcripts are evaluated using the actual decode grammar, producing measurements such as word-error-rate, in- and out-of-grammar rates, and semantic error rates.

3. Test Immediately. Selecting an interaction in the Call Log automatically loads the associated audio and grammar into the Tester. The grammar can be edited, speech engine parameters set, and individual recognition tests generated. The Speech Tuner natively supports industry standard SRGS grammars. Once a set of possible changes is identified, users can batch test audio to evaluate performance using those changes.

The Speech Tuner assumes the user possesses licensed versions of the relevant ASR, that the ASR platform is up and running, and that the platform is able to accept connections.
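The word-error-rate measurement named in the Transcribe Speech step can be illustrated with a short sketch. This is the generic textbook definition (word-level edit distance divided by the number of words in the transcript), not the Speech Tuner's own scoring code; the function name and the sample phrases are made up for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance between transcript and ASR result,
    divided by the number of words in the transcript (generic definition)."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the words of ref seen so far
    # (excluding the current one) and the first j words of hyp.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of four reference words.
    print(word_error_rate("check my account balance", "check my count balance"))
```

A transcript of four words with one misrecognized word scores 0.25 under this definition; an exact match scores 0.0.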
LumenVox Speech Tuner Database

The Speech Tuner communicates with an open-source, freeware database called SQLite (www.sqlite.org). The Speech Tuner manages call log importing, searching, and exporting, so users can focus on the task of tuning, not log management. The database is contained in a single file, is easy to back up and transport, and can be queried using SQL-92 (see the SQLite website for full details) from a variety of external tools. Other speech engine vendors are free to convert their native logs to ones the engine understands. The format, content, and semantics of the LumenVox Speech Tuner database are published.

The database maintains all the information contained in the original call log. The Speech Tuner includes not only the decode grammar and ASR results, but also the decode platform, parameter settings, alternative results, prompt audio, and pre- and post-processed audio. Depending on the platform logging capabilities, the database can provide more advanced information, such as ASR result alignments within the audio; the list of phonemes used in the decode; and word, utterance, and semantic interpretation confidence measurements.

In addition, the Tuner stores all transcripts and evaluations within the call log. As transcripts are entered into the Speech Tuner, they are automatically evaluated against the decode grammar. These transcripts, and any notes or additional information, are stored directly in the database. Individual scores (such as word error rate, semantic error rate, and in- and out-of-grammar measurements) are stored along with their alignments, as well as information about how the scores were reached. Users can generate a variety of reports from these results, including error rate by grammar or dialog, confusion matrices, transcription progress, and confidence thresholds for confirmation or rejection settings.
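Because the database is an ordinary SQLite file, a report such as error rate by grammar can in principle be produced from any external tool that speaks SQL. The sketch below uses Python's standard sqlite3 module against an in-memory database. The table name `interactions` and its columns are hypothetical stand-ins invented for this example; the real layout is described in the published Speech Tuner schema and will differ.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a HYPOTHETICAL schema.
    The real Speech Tuner schema is published by LumenVox and differs."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE interactions ("
        "grammar TEXT, transcript TEXT, asr_result TEXT)"
    )
    conn.executemany(
        "INSERT INTO interactions VALUES (?, ?, ?)",
        [
            ("yes_no", "yes", "yes"),
            ("yes_no", "no", "yes"),        # misrecognition
            ("main_menu", "billing", "billing"),
            ("main_menu", "sales", "sales"),
        ],
    )
    conn.commit()
    return conn


def mismatches_by_grammar(conn: sqlite3.Connection):
    """Return (grammar, total utterances, mismatched utterances) per grammar."""
    query = (
        "SELECT grammar, COUNT(*), SUM(transcript <> asr_result) "
        "FROM interactions GROUP BY grammar ORDER BY grammar"
    )
    return [tuple(row) for row in conn.execute(query)]


if __name__ == "__main__":
    for grammar, total, errors in mismatches_by_grammar(build_demo_db()):
        print(f"{grammar}: {errors}/{total} utterances misrecognized")
```

The comparison `transcript <> asr_result` evaluates to 0 or 1 in SQLite, so summing it counts the mismatched rows in each group.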
In the future, LumenVox's Speech Tuner will also support back-end database replacement, for use in enterprise-level systems where multiple users will be analyzing the same data simultaneously. Companies that use an ODBC-capable database can replace, with certain SQL changes, the disk-based SQLite system with an enterprise system such as MS SQL Server 2000, MySQL, PostgreSQL, or Oracle.

"LumenVox has created speech recognition products that are easy to code with and GUI-based tools, such as the new Speech Tuner that greatly simplifies post-deployment maintenance."
Vern Baker, President of enGenic Corporation
Taking Out the Guesswork

Make changes to grammars, parameters, or ASR engines, secure in the knowledge that those changes will make the application better, faster, and more accurate. The Speech Tuner uses historical information to validate your changes, ensuring your success.

Grammar Tester

Most 'tuning' tools are passive log viewers, requiring that changes be made in the live speech application and retested over a period of time with live callers. With LumenVox's Tuner, we send the changes to the Speech Engine, simulating the recognition process and evaluating changes instantly. Instead of slow, non-interactive, static tuning, the Speech Tuner enables on-the-fly, highly interactive, dynamic tuning. Make a change, do the test, get the results! The Grammar Tester is a dynamic testing component. You can switch ASR engines, grammars, and engine search parameters on-the-fly, and test changes in single or batch tests.

Grammar Evaluation

Evaluate speech and grammar sets against the speech engine, as they took place during the actual call. Adjust grammars and instantly re-test and re-score to evaluate improvements in performance. With LumenVox's Speech Tuner, you can instantly determine whether adding a new phrase to the grammar will improve your accuracy.

Parameter Evaluation

Setting parameters optimizes the speech engine performance, further improving the caller's experience. Traditionally, changing ASR parameters is a difficult and time-consuming task, often requiring long delays between changing a parameter and evaluating its effects on performance. Our Speech Tuner can dramatically shorten the process. The dynamic test capability of the LumenVox Speech Tuner allows the user to quickly make and test parameter changes: ASR engine parameters such as search optimizations, speech end-pointing, and n-best result processing can be easily adjusted, and immediately re-tested and re-scored from within the Speech Tuner.
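The question raised under Grammar Evaluation, whether adding a phrase to a grammar helps, can be pictured with a toy in-grammar rate calculation over transcribed utterances. This is a deliberately simplified sketch: the grammar is modeled as a flat set of phrases rather than an SRGS grammar, no speech engine is involved, and the function name and sample data are invented for illustration.

```python
def in_grammar_rate(transcripts, grammar_phrases):
    """Fraction of transcribed utterances that exactly match a grammar phrase.
    Simplification: the grammar is a flat phrase set, not an SRGS grammar."""
    if not transcripts:
        raise ValueError("at least one transcript is required")
    normalized = {phrase.lower().strip() for phrase in grammar_phrases}
    hits = sum(1 for text in transcripts if text.lower().strip() in normalized)
    return hits / len(transcripts)


if __name__ == "__main__":
    # Hypothetical transcripts of what callers actually said.
    transcripts = ["yes", "yeah", "no", "nope"]

    original_grammar = {"yes", "no"}
    revised_grammar = original_grammar | {"yeah"}  # candidate change

    print("before:", in_grammar_rate(transcripts, original_grammar))
    print("after: ", in_grammar_rate(transcripts, revised_grammar))
```

A higher in-grammar rate only shows that more caller phrasings are covered; whether recognition accuracy improves still has to be checked by re-running the audio against the revised grammar.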
Performance Measurements

The Speech Tuner rates performance against commonly accepted measures like WER (Word Error Rate), Grammar Coverage, and Semantic Interpretation matching. This helps give an accurate picture of details such as average confidence scores, correct versus incorrect responses, and In-Grammar versus Out-of-Grammar performance.

Assessing Upgrades

Installing new versions of platforms and ASR engines entails a certain risk with each new upgrade. On occasion, new default settings, search routines, changes to acoustic models, and so on will actually worsen the caller's experience until the application is re-tuned. But using the LumenVox Speech Tuner, you can perform baseline testing with the old version to establish the minimum acceptable performance. Then, using the upgraded version of the ASR engine, you can easily re-test all existing data and compare the results to the baseline. The new performance, judged against the baseline, gives you the information you need to make a decision and deploy an upgrade with confidence.
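The baseline comparison described under Assessing Upgrades amounts to scoring the same transcribed utterances twice and comparing the two numbers. The sketch below assumes the recognition results from both engine versions have already been collected as plain strings; the function names, the exact-match accuracy measure, and the sample data are assumptions made for this example, not the Speech Tuner's reporting logic.

```python
def sentence_accuracy(transcripts, results):
    """Fraction of utterances whose ASR result exactly matches the transcript."""
    if not transcripts or len(transcripts) != len(results):
        raise ValueError("transcripts and results must be non-empty and aligned")
    correct = sum(1 for truth, heard in zip(transcripts, results) if truth == heard)
    return correct / len(transcripts)


def upgrade_is_acceptable(transcripts, baseline_results, upgraded_results):
    """True when the upgraded engine is at least as accurate as the baseline."""
    baseline = sentence_accuracy(transcripts, baseline_results)
    upgraded = sentence_accuracy(transcripts, upgraded_results)
    return upgraded >= baseline


if __name__ == "__main__":
    # Hypothetical data: what callers said, and what each engine version returned.
    transcripts = ["yes", "no", "billing", "sales"]
    old_engine = ["yes", "yes", "billing", "sales"]
    new_engine = ["yes", "yes", "billing", "sale"]

    print("baseline:", sentence_accuracy(transcripts, old_engine))
    print("upgraded:", sentence_accuracy(transcripts, new_engine))
    print("deploy?  ", upgrade_is_acceptable(transcripts, old_engine, new_engine))
```

In practice the acceptance rule might use word error rate or semantic error rate instead of exact-match accuracy, and might be applied per grammar rather than over the whole data set.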
