close
Skip to content

rubric_based_final_response_quality_v1 is hard to use for factual evaluation of google_search agents because its judge prompt requires tool_response evidence #5685

rubric_based_final_response_quality_v1 is hard to use for factual evaluation of google_search agents because its judge prompt requires tool_response evidence

rubric_based_final_response_quality_v1 is hard to use for factual evaluation of google_search agents because its judge prompt requires tool_response evidence #5685

Triggered via issue May 25, 2026 15:34
Status Skipped
Total duration 1s
Artifacts

triage.yml

on: issues
agent-triage-issues
0s
agent-triage-issues
Fit to window
Zoom out
Zoom in