PHARE
Leaderboard
Principles
Methodology
Tasks
Giskard AI
Toggle menu
Mistral Large
vs
Select a model
Key metrics
Performance by task. Higher score indicates better performance on the given task.
Language performance
Average performance by language over all modules.
Module performance
Average performance by module.