evaluating-llms-harness

Evaluates LLMs using 60+ benchmarks for model quality assessment, widely adopted in academic and industry settings.

Install this skill

or

evaluating-llms-harness5 files

Comments

Sign in to leave a comment.

No comments yet. Be the first to comment!

Installation guide →

GitHub Stars 22.3K

Rate this skill

Categorydevelopment

UpdatedJune 15, 2026

openclaw api ml-ai-engineer data-scientist data-analyst researcher product-manager huggingface development data analytics education research product

davila7/claude-code-templates