
AI Takes a Break


When developers at Anthropic, the maker of Claude AI, were running a coding demonstration of the model’s latest version, Claude 3.5 Sonnet, the AI went “off track” and produced what the company called some “amusing moments” in its announcement.


It’s not ideal to compare machine learning models to people, but if Claude were human, this would look like a classic case of boredom at work. In a video posted on X by @AnthropicAI, Claude abandons its coding task, opens Google, and starts scrolling through stunning images of Yellowstone National Park instead. In another demo attempt, Claude unexpectedly stopped an in-progress screen recording, losing all the footage.


The new Claude 3.5 Sonnet is Anthropic's attempt at an "AI agent," a type of AI designed to handle tasks on its own. Many companies, including Microsoft, are working to push their AI models beyond chatbots and assistants by adding features that let them carry out tasks automatically. Anthropic’s model can browse the internet on its own to complete tasks such as booking flights or ordering products. But while it promises a lot of convenience, it still struggles with actions that are trivial for humans, like dragging and scrolling. The company says these issues will improve in the coming months and that the AI’s capabilities will help shape the future of the technology.
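
For readers curious what this looks like in practice: Anthropic exposes the agent capability as a "computer use" tool in its API beta. The sketch below shows roughly how a developer would request it; the model name, tool type, and beta flag follow Anthropic's October 2024 launch documentation and may have changed since, and the example prompt is hypothetical.

```python
# Minimal sketch of requesting Claude's "computer use" tool via the
# Anthropic Python SDK, assuming the beta identifiers from the
# October 2024 launch (these may have changed since).
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

response = client.beta.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    tools=[
        {
            "type": "computer_20241022",  # virtual screen/mouse/keyboard tool
            "name": "computer",
            "display_width_px": 1024,
            "display_height_px": 768,
        },
    ],
    messages=[
        # Hypothetical task of the kind described above
        {"role": "user", "content": "Find flights from Boston to Denver next Friday."},
    ],
    betas=["computer-use-2024-10-22"],
)

# The model replies with proposed actions (take a screenshot, click,
# type) as tool_use blocks; it does not control a computer by itself.
print(response.content)
```

Notably, the API only returns proposed actions. The developer’s own sandboxed environment has to execute them and feed the results back in a loop, which is also where guardrails, such as the blocked-site checks Anthropic describes below, can be enforced.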


The errors made by Anthropic’s model were mostly harmless, but given the level of autonomy Claude is said to have, it is fair to ask questions. If AI systems like Claude can mimic something like boredom, it raises a concern: what if a distracted AI decides to snoop through someone’s email? There is also the potential for AI to be misused. AI companies have made great progress in building systems that act on their own, but that progress makes safety all the more important. As these technologies keep improving, we need to think about what limits and privacy protections will be set for them. The fewer mistakes these systems make, and the less they meddle with things like social media, the more likely people will be to use them.


Anthropic said, “Because computer use may provide a new vector for more familiar threats such as spam, misinformation, or fraud, we're taking a proactive approach to promote its safe deployment…” This includes deploying tools that identify when the AI is being used for risky activities, such as posting on social media or visiting government websites.


