soheunyi/get-research-done/evaluation-suite
Unified Stage 3 and Stage 3.5 evaluation skill with mode=decision (hypothesis decision from metrics/statistics) and mode=diagnostics (error analysis and sanity checks). Use when results need either outcome classification or deep failure-mode diagnostics.
Risk Score: 25 out of 100
Popularity: 1 star, 0 forks, updated Feb 13, 2026
Findings by Severity (Latest Scan)