|
|
IntroductionWhat is SR?Speech Recognition vs. Voice RecognitionTechnology of SRTimelineThe PlayersFutureLimitations and PotentialReferences |
As
Speech Recognition technology improves in terms of accuracy, vocabulary, and its
ability to understand natural language, we will see the concept of interactive
machines in
every arena. From assembly line
mechanical tools to intelligent microwave ovens to "writing" a check,
we will have the power to use our voice to instruct the electronic devices we
encounter everyday. The
progress of Speech Recognition technologies, in the near future, may be hindered
by the lack of an effective and legitimate standard code. Efforts are being made
by AT&T, Lucent, Motorola, and seventeen other leading institutions to develop the
Voice eXtensible Markup Language (VXML) standard. However, Microsoft and Unisys
are also collaborating to popularize the Standard Application Programming Interface (SAPI),
which is not compatible with the VXML standard. Speech and Voice
Recognition technologies will probably not flourish until these standards are
approved by the World Wide Web Consortium (W3), just as the World Wide Web did
not grow until the HyperText Markup Language (HTML) standards had been adopted. It
is apparent that humans are trying to create a computing environment in which
the computer learns from the user instead of one where the user must learn how
to use the computer. Speech Recognition technology is the next obvious step in an
attempt to integrate computing into a "natural" way of life. This
effective means of communication, even when perfected, will still present
limitations as to how humans can express themselves. The bottleneck of the
future will be the physical constraint of not being able to speak all
of one's thoughts in a coherent and sensible fashion. Instead, scary as it may
be, computers may have systems in place that can receive and interpret
neurological data. |
Copyright © 1999 Ira Greenberg and Andrew Bate. All Rights Reserved.
|