Test case ID
Test case name
Whether the test passed
Overall score (0-1)
Individual metric scores
Actual agent output
Expected output (optional)
Latency in ms
Token usage (optional)
Estimated cost (optional)
Error, if any (optional)
Timestamp
Result of a single test case evaluation.
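
The fields listed here could be modeled roughly as follows. This is only a sketch: every property name and type below (`testCaseId`, `metricScores`, `latencyMs`, `tokenUsage`, `estimatedCost`, and so on) is an assumption inferred from the descriptions, not the actual definition, and `summarize` is a hypothetical helper added for illustration.

```typescript
// Hypothetical shape of a single test case result.
// All property names and types are assumptions based on the field descriptions.
interface TestResult {
  testCaseId: string;
  testCaseName: string;
  passed: boolean;
  score: number; // overall score, expected to lie in [0, 1]
  metricScores: Record<string, number>; // individual metric scores by name
  actualOutput: string;
  expectedOutput?: string;
  latencyMs: number;
  tokenUsage?: number;
  estimatedCost?: number;
  error?: string;
  timestamp: string; // an ISO 8601 string is assumed here
}

interface Summary {
  passRate: number;
  meanLatencyMs: number;
  totalCost: number;
}

// Hypothetical helper: aggregates a batch of results.
// Results without an estimated cost contribute 0 to the total.
function summarize(results: TestResult[]): Summary {
  if (results.length === 0) {
    return { passRate: 0, meanLatencyMs: 0, totalCost: 0 };
  }
  const passedCount = results.filter((r) => r.passed).length;
  const latencySum = results.reduce((sum, r) => sum + r.latencyMs, 0);
  const totalCost = results.reduce((sum, r) => sum + (r.estimatedCost ?? 0), 0);
  return {
    passRate: passedCount / results.length,
    meanLatencyMs: latencySum / results.length,
    totalCost,
  };
}
```

Marking the expected output, token usage, cost, and error as optional properties means consumers have to handle their absence explicitly, as the `?? 0` fallback in `summarize` does for cost.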