ChatGPT vs Bing vs Bard: You can pick the best in this chatbot arena

It's like a blind taste test for generative AI. Try it yourself.

By Cecily Mauran on June 22, 2023

two robotic arms shaking hands

Which LLM is winning the top spot? Credit: Getty Images

Want to know how ChatGPT, Bing, and Bard stack up against each other? Welcome to the Chatbot Arena.

A UC Berkeley research group, in partnership with UC San Diego and Carnegie Mellon University, has devised an experiment in which users chat with two anonymous models at the same time and vote for the better one. Chatbot Arena includes LLMs from OpenAI (GPT-4), Google (PaLM), Meta (LLaMA), and Anthropic (Claude), as well as other models built using these companies' APIs.


When you enter a prompt in the Chatbot Arena, two anonymous models each give a response. Once you cast your vote, the experiment reveals which model you voted for. You can also run side-by-side comparisons of models you choose yourself and check the leaderboard for the top-voted model.
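
For the curious, here is a rough idea of how head-to-head votes could be turned into a ranking. This is a minimal sketch that assumes a plain win-rate tally; it is not necessarily how LMSYS computes its leaderboard, and the function name `build_leaderboard`, the vote format, and the model names are made up for illustration.

```python
from collections import defaultdict


def build_leaderboard(votes):
    """Rank models by win rate from (model_a, model_b, winner) records.

    Assumption: a simple win-rate tally, not the arena's actual scoring.
    """
    wins = defaultdict(int)
    battles = defaultdict(int)
    for model_a, model_b, winner in votes:
        battles[model_a] += 1
        battles[model_b] += 1
        # Any winner value other than the two model names counts as a tie.
        if winner in (model_a, model_b):
            wins[winner] += 1
    # Highest win rate first.
    return sorted(
        ((model, wins[model] / battles[model]) for model in battles),
        key=lambda row: row[1],
        reverse=True,
    )


# Hypothetical votes with placeholder model names.
sample_votes = [
    ("model-x", "model-y", "model-x"),
    ("model-x", "model-z", "model-x"),
    ("model-y", "model-z", "model-y"),
    ("model-y", "model-x", "tie"),
]

leaderboard = build_leaderboard(sample_votes)
for model, win_rate in leaderboard:
    print(f"{model}: {win_rate:.2f}")
```

A tally like this treats every matchup the same regardless of how strong the opponent is, which is one reason a real leaderboard might prefer a rating system that accounts for who beat whom.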

Chatbot Arena showing two responses from anonymous chatbots based on the same prompt

Which chatbot was the better Karen? I voted for A. Credit: LMSYS Org

The research group, called the Large Model Systems Organization (LMSYS), created the crowdsourced experiment as a way to benchmark the many LLMs that have proliferated recently. "Benchmarking LLM assistants is extremely challenging because the problems can be open-ended, and it is very difficult to write a program to automatically evaluate the response quality," said the LMSYS blog post announcing Chatbot Arena. So far, more than 40,000 votes have been cast.

So which LLM is the best? For now, that honor goes to GPT-4. In second place is Anthropic's Claude-v1, followed by Claude Instant, Anthropic's lighter, faster version of Claude. Check out the leaderboard for the full results, and try out the Chatbot Arena for yourself on the LMSYS website.


Cecily Mauran

Cecily is a tech reporter at Mashable who covers AI, Apple, and emerging tech trends. Before getting her master's degree at Columbia Journalism School, she spent several years working with startups and social impact businesses for Unreasonable Group and B Lab. Before that, she co-founded a startup consulting business for emerging entrepreneurial hubs in South America, Europe, and Asia. You can find her on Twitter at @cecily_mauran.
