GPT-4o is better at understanding than existing models; it can reason across audio, vision, and text in real time.