Identify and analyze model weaknesses automatically
Inspect each identified weakness in detail
Explore common patterns in model failures
Compare performance differences between models
Visualize relationships between different models
Analyze performance trade-offs across query types