<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5//EN" "http://cnx.rice.edu/technology/cnxml/schema/dtd/0.5/cnxml_plain.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:bib="http://bibtexml.sf.net/" id="id3609753">
<name>Conclusions and Thoughts on Speech Recognition</name>
<metadata>
  <md:version>1.2</md:version>
  <md:created>2006/12/19 14:41:04 US/Central</md:created>
  <md:revised>2006/12/20 11:51:41.168 US/Central</md:revised>
  <md:authorlist>
      <md:author id="cpasich">
      <md:firstname>Chris</md:firstname>
      
      <md:surname>Pasich</md:surname>
      <md:email>cpasich@rice.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="cpasich">
      <md:firstname>Chris</md:firstname>
      
      <md:surname>Pasich</md:surname>
      <md:email>cpasich@rice.edu</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  

  <md:abstract>The group's thoughts on our project.</md:abstract>
</metadata>
<content>
<section id="id3503008">
<name>Conclusions</name>
<para id="id3583519">Initially looking at the experiment, the plan
was to have a text-dependent system, or a speaker verification
system, or something that could actually determine what word was
being spoken to the system by the user. It has become painfully
clear that that would be a very difficult task to accomplish, and
would require much more time, effort, and background on the subject
than we could possibly acquire in a short period of time. Our
system was, for what it did, relatively successful – it found
vowels with regularity, and it identified speakers at a rate of
almost 70% - a very good rate for a basic system.</para>
<para id="id3527370">As obvious, however, is how much more in depth
speech recognition is than the scope of our project. Being able to
determine what is spoken or who a speaker is with near perfect
accuracy is an extremely formidable task. Preventing another
individual from breaking into the system can be just as difficult,
as it requires a system dependent on text and a system that will
not accept anything other than what it specifies. Our initial idea
of being able to determine what word was being spoken is, at best,
naïve, and at worst not at all feasible. With that said, however,
the end results were very acceptable.</para>
</section>
</content>
</document>
