FINAL ROUND RESULTS

Thank you for participating in March Model Madness 2024!

We will contact participation prize winner via email by Wednesday 4/10/2024.
We hope to see you next year in 2025!
chatImage
ML Ops + Seaplane IO logos

Welcome to ML Square Garden, here in Neural York for the inaugural March Model Madness!

This annual spectacle brings 16 of the top Foundational Models (FM) and Large Language Models (LLM) together in a clash of computational capabilities. In a grand tournament format these giants of GenAI will compete across four categories: Chat, Instruct, Code, and Image Creation.

To add some extra spice, this is a blind contest; the winning model proceeds to the next round, while the defeated model is revealed.

To make this monumental model match-up magnificent, we need your involvement over two stages:

  • Pre-game prep – before we even tip-off, help us craft challenging prompts for the models by submitting your own and upvoting the ones you believe will push their capabilities to the buzzer.
  • Game-time – we’re underway in the 2024 tournament. For each matchup, you pick which response is best: was it an air-ball or a slam dunk? Vote now on the best responses to ensure your preferred models advance to the next round.

Contest format - Battles begin March 25th

Chat

Knockout:

Round 1 – 8 Battles

Round 2 – 4 Battles

Round 3 – 2 Battles

Round 4 – 1 Battle (Final)

Instruct

Knockout:

Round 1 – 8 Battles

Round 2 – 4 Battles

Round 3 – 2 Battles

Round 4 – 1 Battle (Final)

Code

Knockout:

Round 1 – 8 Battles

Round 2 – 4 Battles

Round 3 – 2 Battles

Round 4 – 1 Battle (Final)

Image Creation

Round-robin:

Round 1 – 2 Battles (1 vs 2, 3 vs 4)

Round 2 – 2 Battles (1 vs 3, 2 vs 4)

Round 3 – 2 Battles (1 vs 4, 2 vs 3)


Who's in the draft?

CHAT models:

Claude 2.1

Claude 3

Claude Instant

Falcon-40b-instruct

GPT3.5

GPT4

Gemini Pro

Jurassic2 Ultra

LLama2-13b-chat

LLama2-70b-chat

LLama2-7b-chat

Mistral-7b-v0.1

Mixtral-8x7b-instruct-v0.1

Starling-lm-7b-alpha

Yi-34b-chat

Zephyr-7b-beta

INSTRUCT models:

Claude 2.1

Claude 3

Claude Instant

Falcon-40b-instruct

GPT3.5

GPT4

Gemini Pro

Jurassic2 Ultra

Llama-2-13b-chat

Llama-2-70b-chat

Llama-2-7b-chat

Mistral-7b-instruct

Mixtral-8x7b-instruct-v0.1

Starling-lm-7b-alpha

Vicuna-13b

Zephyr-7b-beta

CODE models:

Claude 2.1

Claude 3

Codellama-34b-instruct

Codellama-34b-python

Codellama-70b-instruct

Codellama-70b-python

Codellama-7b-instruct

Codey

GPT3.5

GPT4

Gemini Pro

Jurassic2 Ultra

Mistral-7b-v0.1

Mixtral-8x7b-instruct-v0.1

Olmo-7b

Wizardcoder-34b

IMAGE CREATION models:

Gemini

Midjourney

OpenAI Dall-e-3

Sdxl