model-tester

Name: model-tester
Author: TerryFYL

Community

Standardized multi-model radar for AI evaluation.

Version: 1.0.0

Installs: 0

System Documentation

This Skill automates standardized cross-model evaluation by generating a six-dimension radar profile for AI models.

Six-dimension radar framework (Dim1–Dim6) with 18 test cases, scored by an AI Judge to form a searchable model profile.
End-to-end workflow for registering models, running tests, aggregating results, and visualizing radar data.
Cross-validation and comprehensive testing workflows to compare models and inform deployment decisions.
Output includes a composite radar, detailed task results, and stored histories for auditing.
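
The aggregation step described above can be sketched roughly as follows. This is a minimal illustration, not the Skill's actual implementation: the function names `radar_profile` and `composite`, the numeric score scale, and the use of a plain arithmetic mean are all assumptions. The even split of the 18 test cases into three per dimension is also only assumed for the example.

```python
# Hypothetical sketch: aggregate judge scores into a six-dimension radar profile.
# Names, score scale, and mean-based aggregation are assumptions, not the Skill's API.

DIMENSIONS = ["Dim1", "Dim2", "Dim3", "Dim4", "Dim5", "Dim6"]


def radar_profile(case_scores):
    """Average judge scores per dimension.

    case_scores: iterable of (dimension, score) pairs, one per test case.
    Returns a dict mapping each dimension that has scores to its mean score.
    """
    by_dim = {dim: [] for dim in DIMENSIONS}
    for dim, score in case_scores:
        by_dim[dim].append(float(score))
    return {dim: sum(s) / len(s) for dim, s in by_dim.items() if s}


def composite(profile):
    """Collapse a radar profile into one number (unweighted mean, an assumption)."""
    return sum(profile.values()) / len(profile)


if __name__ == "__main__":
    # 18 made-up scores, three per dimension, purely for illustration.
    scores = [(f"Dim{i}", s) for i in range(1, 7) for s in (i, i + 1, i + 2)]
    profile = radar_profile(scores)
    print(profile)
    print(composite(profile))
```

An unweighted mean keeps the profile easy to audit against the stored task results; a weighted scheme would work the same way with a per-dimension weight table.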

Register a model and run the standard six-dimension radar workflow to generate a radar profile and composite results.
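
As a rough picture of what "register a model and run the workflow" could look like in code, here is a self-contained sketch. Everything in it is hypothetical: the `RadarRunner` class, its `register` and `run` methods, and the stub model and judge are illustrative stand-ins, not the Skill's real interface.

```python
# Hypothetical sketch of a register -> run -> aggregate -> store flow.
# RadarRunner, register, run, and the stub callables are assumptions for illustration.
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class RadarRunner:
    judge: Callable[[str, str], float]  # (prompt, answer) -> score
    models: Dict[str, Callable[[str], str]] = field(default_factory=dict)
    history: List[dict] = field(default_factory=list)  # stored runs for auditing

    def register(self, name: str, model: Callable[[str], str]) -> None:
        """Make a model callable available under a name."""
        self.models[name] = model

    def run(self, name: str, cases: List[Tuple[str, str]]) -> dict:
        """Run (dimension, prompt) test cases and build the radar result."""
        model = self.models[name]
        per_dim: Dict[str, List[float]] = {}
        tasks = []
        for dim, prompt in cases:
            answer = model(prompt)
            score = self.judge(prompt, answer)
            per_dim.setdefault(dim, []).append(score)
            tasks.append({"dimension": dim, "prompt": prompt, "score": score})
        profile = {dim: sum(v) / len(v) for dim, v in per_dim.items()}
        result = {
            "model": name,
            "profile": profile,
            "composite": sum(profile.values()) / len(profile),
            "tasks": tasks,
        }
        self.history.append(result)
        return result


if __name__ == "__main__":
    # Stub model echoes the prompt; stub judge scores by answer length.
    runner = RadarRunner(judge=lambda prompt, answer: float(len(answer)))
    runner.register("echo-model", lambda prompt: prompt)
    print(runner.run("echo-model", [("Dim1", "ab"), ("Dim1", "abcd"), ("Dim2", "abc")]))
```

In a real run the stub callables would be replaced by calls to the model under test and to the AI Judge, and the case list would hold the 18 standard test cases.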