Uses OpenKinect (drivers), OpenCV (image processing and recognition), FestVox (speech synthesis), and CMU Sphinx (speech recognition). This is a proof of concept. I'm wearing a headset because OpenKinect does not yet support audio input. All of the processing and recognition occur in real time.
Link: http://www.youtube.com/watch?v=fQ59dXOo63o
Middleware:
Operating System: