ChatGPT vs Bing vs Bard: You can pick the best in this chatbot arena

It's like a blind taste test for generative AI. Try it yourself.
By Cecily Mauran  on 
two robotic arms shaking hands
Which LLM is winning the top spot? Credit: Getty Images

Want to know how ChatGPT, Bing, and Bard stack up against each other? Welcome to the Chatbot Arena.

A UC Berkeley research group in partnership with UC San Diego and Carnegie Mellon University has devised an experiment where users can chat with two anonymous models at the same time and vote for the best one. Chatbot Arena includes LLMs from Open AI (GPT-4), Google (PaLM), Meta (LLaMA), and Anthropic's Claude, as well as other models built using these companies' APIs.

When you enter a prompt in the Chatbot Arena, two anonymous models give their responses. Once you cast your vote, the experiment tells you which model you voted for. You can also experiment with side-by-side comparisons of different models and check the leaderboard for the top voted model.

Mashable Light Speed
Want more out-of-this world tech, space and science stories?
Sign up for Mashable's weekly Light Speed newsletter.
By signing up you agree to our Terms of Use and Privacy Policy.
Thanks for signing up!
Chatbot Arena showing two responses from anonymous chatbots based on the same prompt
Which chatbot was the better Karen? I voted for A. Credit: LMSYS Org

The research group, called Large Model Systems Organization (LMSYS) created the crowdsourced experiment as a way to effectively benchmark the many LLMs that have proliferated recently. "Benchmarking LLM assistants is extremely challenging because the problems can be open-ended, and it is very difficult to write a program to automatically evaluate the response quality," said the LMSYS blog post announcing Chatbot Arena. So far, more than 40,000 votes have been cast.

So which LLM is the best? So far, that honor goes to GPT-4. In second place is Anthropic's Claude-v1, followed by Claude Instant, which is Anthropic's lighter, faster version of Claude. Check out the leaderboard for the full results, and try out the Chatbot Arena for yourself on the LMSYS website.

Mashable Image
Cecily Mauran

Cecily is a tech reporter at Mashable who covers AI, Apple, and emerging tech trends. Before getting her master's degree at Columbia Journalism School, she spent several years working with startups and social impact businesses for Unreasonable Group and B Lab. Before that, she co-founded a startup consulting business for emerging entrepreneurial hubs in South America, Europe, and Asia. You can find her on Twitter at @cecily_mauran.


Recommended For You
21 of the best ChatGPT courses you can take online for free
ChatGPT on phone

ChatGPT got an upgrade — and OpenAI says it's better in these key areas
ChatGPT Plus logo on a smartphone

I used AI to plan my Costa Rica trip — why I'll never use it again
Depiction of using ChatGPT in Costa Rica

'Challengers' Zendaya, Josh O'Connor and Mike Faist on the significance of the 'I Told Ya' shirt
Challengers Zendaya, Josh O'Connor and Mike Faist

OpenAI's GPT-5 release could be as early as this summer
OpenAI CEO Sam Altman speaking onstage

Trending on Mashable
NYT Connections today: See hints and answers for May 9
A phone displaying the New York Times game 'Connections.'

'Wordle' today: Here's the answer hints for May 9
a phone displaying Wordle

AT&T, Verizon, and T-Mobile declare legal war on FCC
Person holding smartphone

NYT's The Mini crossword answers for May 9
Closeup view of crossword puzzle clues

NYT Connections today: See hints and answers for May 8
A phone displaying the New York Times game 'Connections.'
The biggest stories of the day delivered to your inbox.
This newsletter may contain advertising, deals, or affiliate links. Subscribing to a newsletter indicates your consent to our Terms of Use and Privacy Policy. You may unsubscribe from the newsletters at any time.
Thanks for signing up. See you at your inbox!