Build a realtime voice conversational app. User uses mic to communicate, then its converted to speed to text and then from there it will connect to open ai for response. The response from OpenAI will be sent back to use with text to speech