No speech recognition system can take a program's audio feed, with its multiple speakers, and automatically generate captions that are accurate enough to use. (If such a system existed and worked reliably, every TV station would have a "magic box" that did all of the captioning, and nobody would need to create their own closed captions.)