New platform helps consider AI for advanced laptop use

February 20, 2025

The GIST Editors' notes

This text has been reviewed in accordance with Science X's editorial course of and insurance policies. Editors have highlighted the next attributes whereas making certain the content material's credibility:

fact-checked

trusted supply

proofread

New platform helps consider AI for advanced laptop use

robot using computer
Credit score: Pixabay/CC0 Public Area

Think about asking AI to plan your journey itinerary, e-book and pay for all of your flights, and organize your airport transport—all inside a single click on. Thankfully, a global analysis crew is making this imaginative and prescient a actuality.

The crew, composed of researchers from the College of Waterloo, College of Hong Kong, Salesforce Analysis and Carnegie Mellon College developed Pc Agent Enviornment—an analysis platform that may improve and create laptop brokers.

A pc agent is a sort of software program that may carry out duties on behalf of an individual or group, without having fixed human intervention. It might interpret the state of the pc and act autonomously to assist customers clear up issues. Examples of laptop brokers embody voice assistants like Siri and Alexa, who may also help customers ship messages and schedule conferences.

AI-based laptop brokers wrestle with performing advanced laptop duties as a result of it requires controlling a number of laptop purposes and varied steps. For instance, submitting an expense report could also be troublesome as a result of it requires updating a spreadsheet by looking a number of emails and folders crammed with financial institution statements and receipts.

Pc Agent Enviornment is the primary interactive laptop use analysis platform that focuses on performing various duties throughout a number of purposes. This work is an extension of the researchers' work on OSWorld, the world's first scalable and actual laptop surroundings for multimodal brokers.

Credit score: College of Waterloo

"Pc Agent Enviornment supplies a platform for the analysis neighborhood to develop efficient and environment friendly brokers that generalize to real-world laptop utilization," says co-developer Dr. Victor Zhong, assistant professor on the Cheriton College of Pc Science. Like different Waterloo researchers, he’s investigating human-technology interactions, exploring learn how to mitigate on a regular basis issues by creating novel applied sciences.

"Pc Agent Enviornment is distinct from comparable analysis like Mind2Web and WebArena as a result of it supplies unified utility programming interfaces for complete observations and actions in an executable surroundings with a number of purposes."

By way of Pc Agent Enviornment, customers can assess and evaluate varied laptop brokers primarily based on massive language fashions (LLM) and imaginative and prescient language fashions. First, customers choose an working system akin to Home windows, and purposes like Google Chrome and Excel. Customers can then immediate the pc agent with a activity, which shall be carried out concurrently by two AI fashions in real-time. After completion, customers can fee every mannequin's efficiency and supply suggestions.

In the end, the crew seeks to supply a various and dynamic platform for constructing and evaluating brokers that may carry out real-world laptop duties as safely, successfully and effectively as people do.

"Our present findings present that basis fashions akin to GPT4 and Claude are removed from having the ability to act safely and successfully as assistant laptop brokers," Zhong says. "Pc Agent Enviornment supplies a well timed testbed to develop the subsequent technology of AI brokers."

Supplied by College of Waterloo Quotation: New platform helps consider AI for advanced laptop use (2025, February 20) retrieved 20 February 2025 from https://techxplore.com/information/2025-02-platform-ai-complex.html This doc is topic to copyright. Other than any honest dealing for the aim of personal research or analysis, no half could also be reproduced with out the written permission. The content material is offered for info functions solely.

Discover additional

UI-TARS GUI agent mannequin can automate duties akin to discovering and reserving airline tickets shares

Feedback to editors