FINAL ROUND RESULTS
Thank you for participating in March Model Madness 2024!
Welcome to ML Square Garden, here in Neural York for the inaugural March Model Madness!
This annual spectacle brings 16 of the top Foundational Models (FM) and Large Language Models (LLM) together in a clash of computational capabilities. In a grand tournament format these giants of GenAI will compete across four categories: Chat, Instruct, Code, and Image Creation.
To add some extra spice, this is a blind contest; the winning model proceeds to the next round, while the defeated model is revealed.
To make this monumental model match-up magnificent, we need your involvement over two stages:
- Pre-game prep – before we even tip-off, help us craft challenging prompts for the models by submitting your own and upvoting the ones you believe will push their capabilities to the buzzer.
- Game-time – we’re underway in the 2024 tournament. For each matchup, you pick which response is best: was it an air-ball or a slam dunk? Vote now on the best responses to ensure your preferred models advance to the next round.
Contest format - Battles begin March 25th
Chat
Knockout:
Round 1 – 8 Battles
Round 2 – 4 Battles
Round 3 – 2 Battles
Round 4 – 1 Battle (Final)
Instruct
Knockout:
Round 1 – 8 Battles
Round 2 – 4 Battles
Round 3 – 2 Battles
Round 4 – 1 Battle (Final)
Code
Knockout:
Round 1 – 8 Battles
Round 2 – 4 Battles
Round 3 – 2 Battles
Round 4 – 1 Battle (Final)
Image Creation
Round-robin:
Round 1 – 2 Battles (1 vs 2, 3 vs 4)
Round 2 – 2 Battles (1 vs 3, 2 vs 4)
Round 3 – 2 Battles (1 vs 4, 2 vs 3)
Who's in the draft?
CHAT models:
Claude 2.1
Claude 3
Claude Instant
Falcon-40b-instruct
GPT3.5
GPT4
Gemini Pro
Jurassic2 Ultra
LLama2-13b-chat
LLama2-70b-chat
LLama2-7b-chat
Mistral-7b-v0.1
Mixtral-8x7b-instruct-v0.1
Starling-lm-7b-alpha
Yi-34b-chat
Zephyr-7b-beta
INSTRUCT models:
Claude 2.1
Claude 3
Claude Instant
Falcon-40b-instruct
GPT3.5
GPT4
Gemini Pro
Jurassic2 Ultra
Llama-2-13b-chat
Llama-2-70b-chat
Llama-2-7b-chat
Mistral-7b-instruct
Mixtral-8x7b-instruct-v0.1
Starling-lm-7b-alpha
Vicuna-13b
Zephyr-7b-beta
CODE models:
Claude 2.1
Claude 3
Codellama-34b-instruct
Codellama-34b-python
Codellama-70b-instruct
Codellama-70b-python
Codellama-7b-instruct
Codey
GPT3.5
GPT4
Gemini Pro
Jurassic2 Ultra
Mistral-7b-v0.1
Mixtral-8x7b-instruct-v0.1
Olmo-7b
Wizardcoder-34b
IMAGE CREATION models:
Gemini
Midjourney
OpenAI Dall-e-3
Sdxl