GPT-4 Turbo reclaims the 'best AI model' crown from Anthropic's Claude 3


Trophy technology — Getty Images/sofiana indriani

OpenAI has been on an update hot streak lately, making the latest GPT-4 Turbo available to developers and paid ChatGPT subscribers last week. When launching the model, OpenAI said the new GPT-4 Turbo offers several improvements over its predecessor. Users are now finding that to be true.


On Thursday, the updated version of GPT-4 Turbo, gpt-4-turbo-2024-04-09, reclaimed its number one spot on the Large Model Systems Organization (LMSYS) Chatbot Arena, a crowdsourced platform where users can evaluate large language models (LLMs).

Exciting news -- GPT-4-Turbo has just reclaimed the No. 1 spot on the Arena leaderboard again! Woah!
We collect over 8K user votes from diverse domains and observe its strong coding & reasoning capability over others. Hats off to @OpenAI for this incredible launch!
To offer... pic.twitter.com/IxbN2Q9ecJ
- lmsys.org (@lmsysorg) April 11, 2024

The Chatbot Arena lets users chat with two LLMs side by side and compare their responses to each other without knowing the models' names.

After viewing the responses, users can continue chatting until they are ready to decide which model won, whether it was a tie, or whether both performed poorly.

Chatbot Arena then uses the results to rank the 82 LLMs on its leaderboard, which includes the most popular LLMs available, such as Gemini Pro, Claude 3, and Mistral-Large-2402.
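The article does not describe how votes become a ranking. As a rough illustration only, the sketch below assumes an Elo-style update, a common way to turn pairwise comparisons into ratings; it is not presented as Chatbot Arena's actual method. The step size, starting rating, outcome labels, and model names are all hypothetical, and "both performed poorly" is treated as a tie for simplicity.

```python
from collections import defaultdict

K = 32          # assumed update step size (hypothetical)
BASE = 1000.0   # assumed starting rating for every model (hypothetical)

# Score credited to model A for each vote outcome.
# Assumption: "both_bad" is scored like a tie.
SCORE_FOR_A = {"a": 1.0, "b": 0.0, "tie": 0.5, "both_bad": 0.5}


def expected_score(r_a: float, r_b: float) -> float:
    """Probability that A beats B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))


def rank_models(votes):
    """Rank models from blind pairwise votes.

    votes: iterable of (model_a, model_b, outcome) tuples, where outcome
    is one of the keys in SCORE_FOR_A.
    Returns a list of (model, rating) pairs, best first.
    """
    ratings = defaultdict(lambda: BASE)
    for model_a, model_b, outcome in votes:
        score_a = SCORE_FOR_A[outcome]
        exp_a = expected_score(ratings[model_a], ratings[model_b])
        delta = K * (score_a - exp_a)
        ratings[model_a] += delta   # A gains what B loses,
        ratings[model_b] -= delta   # so total rating is conserved
    return sorted(ratings.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical votes between placeholder models.
    sample_votes = [
        ("model-x", "model-y", "a"),
        ("model-y", "model-z", "tie"),
        ("model-x", "model-z", "a"),
    ]
    for name, rating in rank_models(sample_votes):
        print(f"{name}: {rating:.1f}")
```

Because each update depends on the rating gap at the time of the vote, an upset win against a higher-rated model moves the ratings more than an expected win does.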

As of the latest Chatbot Arena update on April 13, the updated version of GPT-4 Turbo holds the lead in the overall, coding, and English categories.


This means that less than a month after overtaking GPT-4 Turbo in the Chatbot Arena, Anthropic's Claude 3 Opus has been pushed into second place in the overall category, followed by GPT-4-1106-preview, an older version of GPT-4 Turbo, in third place.

These results could be attributed to gpt-4-turbo-2024-04-09's improved coding, math, logical reasoning, and writing capabilities, demonstrated by its higher performance on a series of benchmarks used to test the proficiency of AI models, as seen below.

UPDATE: the MMLU points weren't clear on the previous graph. Here's an updated one. pic.twitter.com/HexJzytDts
- OpenAI (@OpenAI) April 12, 2024

If you're interested in comparing gpt-4-turbo-2024-04-09's performance against other LLMs, you can visit Chatbot Arena and click on the Arena (side-by-side) option to select the models you want to compare.


Since you know the identity of the models in the side-by-side option, you will not be able to vote. If you want to vote and have that count toward the leaderboard, use the "Arena (battle)" option to compare random models.

If you'd rather skip the testing and jump straight into using gpt-4-turbo-2024-04-09 in ChatGPT, you have to subscribe to ChatGPT Plus, which costs $20 per month.

