UiPath real-time transcription and interaction with external IVR (Twilio, Amazon Connect)

Hey Guys,
Has anybody successfully implemented Twilio Gather or any other method to capture and transcribe voice in real time from a phone call? I have a couple of design ideas, but before I embark on prototyping I would love to see what the community has already explored.

M