微软语音识别引擎切换语法

3

我有一个语音识别代码,使用了微软的语音识别引擎。但是不幸的是,它并不能很好地理解语音,因此我一直在想办法。其中之一是在特定语法和通用字典语法之间切换。然而,我无法弄清楚如何在一种语法无法识别给定的语音时切换到另一种语法。

如果有人能帮我解决这个问题,请告诉我如何构建这个功能,例如在我的commandList语法无法识别所捕获的语音时,切换到DictationGrammar()。

以下是代码:

//using Microsoft.Speech.Recognition;

using System;
using System.Speech.Recognition;
using System.Windows.Forms;
using System.Collections.Generic;

namespace vRec
{
  public class Form1
  {
      static int counter = 0;
      static bool stop = false;
      static string command = null;
      static List<String> commandList = new List<string>() { "zooey", "open", "quit", "search", "close", "yes", "no" };
      static Choices keywords = new Choices();

     public static void Main()
    {
        command = null;
        stop = false;

      // Create an in-process speech recognizer for the en-US locale.
      using (
       SpeechRecognitionEngine recognizer =
        new SpeechRecognitionEngine(
          new System.Globalization.CultureInfo("en-US")))
      {

        keywords.Add(commandList.ToArray());
        GrammarBuilder grammarBuilder = new GrammarBuilder(keywords);
        Grammar testGrammar = new Grammar(grammarBuilder);
        recognizer.LoadGrammar(testGrammar);
        recognizer.LoadGrammar(new DictationGrammar());

        // Add a handler for the speech recognized event.
        recognizer.SpeechRecognized += 
          new EventHandler<SpeechRecognizedEventArgs>(recognizer_SpeechRecognized);

        // Configure input to the speech recognizer.
        recognizer.SetInputToDefaultAudioDevice();

        // Start asynchronous, continuous speech recognition.
        recognizer.RecognizeAsync(RecognizeMode.Multiple);

        Console.WriteLine("NOT TERMINATED");

         // Keep the console window open.
         if(!stop)
         Console.ReadLine();
    }
 }

    // Handle the SpeechRecognized event.
    public static void recognizer_SpeechRecognized(object sender, SpeechRecognizedEventArgs e)
    {
        if (counter == 2 && (e.Result.Text.ToUpper() == "YES" || e.Result.Text.ToUpper() == "US" || e.Result.Text.ToUpper() == "AS"))
        {
            counter = 0;
            Console.WriteLine("THANK YOU LORD JEBUS");
            stop = true;
            SendKeys.SendWait("{ENTER}");
            //command string to be passed in for functions in c++ code
        }
        else if (counter == 2 && (e.Result.Text.ToUpper() == "NO" || e.Result.Text.ToUpper() == "NOW" || e.Result.Text.ToUpper() == "KNOW" || e.Result.Text.ToUpper() == "OH" || e.Result.Text.ToUpper() == "NOT" || e.Result.Text.ToUpper() == "NOPE" || e.Result.Text.ToUpper() == "NAH"))
        {
            Console.WriteLine("Can you spell that?");
            counter = 1;
            command = e.Result.Text;
        }
        else if (counter == 2 && (e.Result.Text.ToUpper() != "YES" || e.Result.Text.ToUpper() != "US" || e.Result.Text.ToUpper() != "AS" || e.Result.Text.ToUpper() != "NO" || e.Result.Text.ToUpper() != "NOW" || e.Result.Text.ToUpper() != "KNOW" || e.Result.Text.ToUpper() != "OH" || e.Result.Text.ToUpper() != "NOT" || e.Result.Text.ToUpper() != "NOPE" || e.Result.Text.ToUpper() != "NAH"))
        {
            //Console.WriteLine(counter);
            Console.WriteLine("Can you repeat that?");
            counter = 1;
        }

        if (counter == 1) 
        {
            command = e.Result.Text;

            if (e.Result.Text.ToUpper() == "ALL BEEN" || e.Result.Text.ToUpper() == "OPIUM" || e.Result.Text.ToUpper() == "OLD AND" || e.Result.Text.ToUpper() == "HOLE IN" || e.Result.Text.ToUpper() == "HOPING" || e.Result.Text.ToUpper() == "OLD BEEN" || e.Result.Text.ToUpper() == "OPEN")
                command = "open";

            if (e.Result.Text.ToUpper() == "WAIT" || e.Result.Text.ToUpper() == "QUITE" || e.Result.Text.ToUpper() == "QUIP" || e.Result.Text.ToUpper() == "QUICK" || e.Result.Text.ToUpper() == "CLIP" || e.Result.Text.ToUpper() == "QUIT")
                command = "quit";

            if (e.Result.Text.ToUpper() == "SUCH" || e.Result.Text.ToUpper() == "SORT" || e.Result.Text.ToUpper() == "SEARCH")
                command = "search";

            if (e.Result.Text.ToUpper() == "RULES" || e.Result.Text.ToUpper() == "FELLOWS" || e.Result.Text.ToUpper() == "CLOSE")
                command = "close";

            commandList.Add(command);
            Console.WriteLine(counter);
            Console.WriteLine("Recognized text: " + command);
            Console.WriteLine("Is this correct?");

            for (int i = 0; i < commandList.Count; i++)
            {
                Console.WriteLine("/" + commandList[i]);
            }

            counter++;

        }

        if (e.Result.Text.ToUpper() == "ZOOEY" || e.Result.Text.ToUpper() == "ZOE" || e.Result.Text.ToUpper() == "EASILY" || e.Result.Text.ToUpper() == "SALLY" || e.Result.Text.ToUpper() == "ZONE" || e.Result.Text.ToUpper() == "ZONE WE" || e.Result.Text.ToUpper() == "SOLELY" || e.Result.Text.ToUpper() == "ZOELLICK" && counter == 0)
        {
            counter++;
            Console.WriteLine("How can I help you?");
        }

        Console.WriteLine("Recognized text: " + e.Result.Text);
    }

    public static string getCommand()
    {
        return command;
    }

  }
}

任何帮助都将不胜感激。^.^
1个回答

1

我能想到的一种方法是:

设置一个置信度阈值,任意选择0.6,然后在您的recognizer_SpeechRecognized方法中,如果捕获到的语音低于该阈值,则切换语法。 为此,您应该使用'SpeechRecognitionEngine.UnloadAllGrammars'与SpeechRecognitionEngine.LoadGrammarAsync结合使用以获取识别到的语音的置信度。

e.Result.Confidence`, so your code could look like: 
    if (e.Result.Confidence <0.6)  {
      recognizer.RequestRecognizerUpdate();
      recognizer.UnloadAllGrammars();
      recognizer.LoadGrammarAsync(//switch your grammmars);

    }

您需要查看此范围内语法等的可用性。希望这能帮到您!


网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接