Mira Murati Presents Interaction-Model Vision for Thinking Machines

In a Bloomberg Tech interview with Emily Chang on June 4, Mira Murati, co-founder and CEO of Thinking Machines Lab, made her first major public appearance in roughly 18 months and laid out the lab's direction, TechCrunch reports. Murati discussed the company's 'interaction models,' which Thinking Machines introduced in a research preview in mid-May and which process audio, video, and text as continuous parallel streams in 200-millisecond chunks, so the system can listen, see, and respond in real time instead of waiting for a user to finish. Per TechCrunch, the lab has spent about a year and a half raising capital, hiring researchers, and shipping one product, Tinker, an API for fine-tuning open-source models, and Murati framed the broader product vision as an early step without giving a release timeline. She also addressed the November 2023 OpenAI board crisis that employees called the 'blip,' saying she felt clear about her decisions at each moment, according to TechCrunch.
What happened
At Bloomberg Tech in San Francisco on June 4, Mira Murati, co-founder and CEO of Thinking Machines Lab, made her first major public appearance in roughly 18 months, in an on-stage interview with Bloomberg's Emily Chang, TechCrunch reports. According to TechCrunch, Murati used the conversation to lay out the lab's direction and to discuss its 'interaction models,' the approach Thinking Machines introduced in a research preview in mid-May. TechCrunch writes that the lab has spent about a year and a half raising capital, hiring researchers, and shipping one product, Tinker, an API for fine-tuning open-source models, and that Murati framed the broader product vision as an early step without committing to a release timeline. She also addressed the November 2023 OpenAI board crisis that employees reportedly called the 'blip,' saying she felt clear about her decisions in each moment, per TechCrunch.
What interaction models are
Thinking Machines describes interaction models as systems that process audio, video, and text as continuous, parallel streams rather than flattening everything into a single ordered token sequence. In its research-preview write-up, the lab says the model interleaves perceiving roughly 200 milliseconds of input with generating 200 milliseconds of output, letting the system listen, see, and respond at the same time, including interrupting or adding context mid-conversation. Independent coverage of the May preview reported demos such as live two-way translation, repetition counting, and safety monitoring.
Why it matters
Real-time, streaming multimodal interaction is architecturally harder than turn-based, batch prompt flows. Designs of this kind generally require low-latency streaming inference, tightly integrated audio and vision capture, and stateful memory or chunked attention to stay responsive at sub-200-millisecond granularity, which raises compute and engineering costs. The approach also runs directly into established real-time voice and assistant systems, making latency, reliability, and cost per minute the metrics that will decide whether it is practical.
What to watch
Useful signals include published latency and throughput benchmarks for live streams, demos showing clean interruption and mid-thought handling, developer adoption of Tinker, any open checkpoints, and partner integrations. More detailed disclosures on how the lab manages persistent state, cost, and privacy for streaming audio and video would clarify how close interaction models are to production.
Scoring Rationale
A high-profile reappearance: Murati's first major public appearance in about 18 months, at Bloomberg Tech, where she discussed Thinking Machines' direction and its interaction models. The substantive product news, the interaction-models research preview, broke in mid-May, so this interview mainly adds executive visibility and context rather than a new launch, keeping it in the notable-but-not-major band.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems

