Thinking Machines Lab, an innovative AI startup founded by former OpenAI CTO Mira Murati, has recently unveiled a groundbreaking concept known as interaction models. This new approach allows AI to engage in conversations more naturally, resembling the dynamics of a phone call rather than a simple text exchange.
Traditionally, AI systems operate on a linear model: you speak, the AI listens, and then it responds. Thinking Machines seeks to revolutionize this interaction by developing a model that processes inputs and generates responses simultaneously. This method, referred to as "full duplex," aims to enhance the fluidity of conversations.
The company's model, named TML-Interaction-Small, boasts an impressive response time of just 0.40 seconds, aligning closely with the pace of natural human dialogue. This performance is notably quicker than comparable models from industry giants like OpenAI and Google.
However, it's important to note that this development is currently in a research preview phase and is not yet available to the public. A limited research preview is anticipated in the coming months, with plans for a broader release later this year.
The benchmarks for this technology are certainly remarkable, and the concept of integrating interactivity at the core of the model is intriguing. The true test, however, will be whether users find the real-world application lives up to these technical promises once it becomes accessible.