{"id":20880,"date":"2023-08-08T21:09:21","date_gmt":"2023-08-08T21:09:21","guid":{"rendered":"https:\/\/nftandcrypto-news.com\/crypto\/chatgpt-and-claude-are-becoming-capable-of-tackling-real-world-missions-say-scientists\/"},"modified":"2023-08-08T21:09:23","modified_gmt":"2023-08-08T21:09:23","slug":"chatgpt-and-claude-are-becoming-capable-of-tackling-real-world-missions-say-scientists","status":"publish","type":"post","link":"https:\/\/nftandcrypto-news.com\/crypto\/chatgpt-and-claude-are-becoming-capable-of-tackling-real-world-missions-say-scientists\/","title":{"rendered":"ChatGPT and Claude are \u2018becoming capable of tackling real-world missions,\u2019 say scientists"},"content":{"rendered":"
\n

Nearly two dozen researchers from Tsinghua University, Ohio State University and the University of California at Berkeley collaborated to create a method for measuring the capabilities of large language models (LLMs) as real-world agents.<\/p>\n

LLMs such as OpenAI\u2019s ChatGPT and Anthropic\u2019s Claude have taken the technology world by storm over the past year, as cutting-edge \u201cchatbots\u201d have proven useful at a variety of tasks, including coding, cryptocurrency trading\u00a0and text generation. <\/p>\n

Related: <\/em><\/strong>OpenAI launches web crawler ‘GPTBot’ amid plans for next model: GPT-5<\/em><\/strong><\/p>\n

Typically, these models are benchmarked based on their ability to output text perceived as humanlike or by their scores on plain-language tests designed for humans. By comparison, far fewer papers have been published on the subject of LLM models as agents. <\/p>\n

Artificial intelligence (AI) agents perform specific tasks, such as following a set of instructions within a specific environment. For example, researchers will often train an AI agent to navigate a complex digital environment as a method for studying the use of machine learning to develop autonomous robots safely.<\/p>\n