Closed
OpenHands/benchmarks
#67
Description
Observed on: 88f1d80
Issue:
For every model, usage_to_metrics in output.jsonl comes back empty: accumulated_cost and all token counts are 0.
Example:
{"usage_to_metrics":{"default":{"model_name":"litellm_proxy/anthropic/claude-sonnet-4-20250514","accumulated_cost":0.0,"accumulated_token_usage":{"model":"litellm_proxy/anthropic/claude-sonnet-4-20250514","prompt_tokens":0,"completion_tokens":0,"cache_read_tokens":0,"cache_write_tokens":0,"reasoning_tokens":0,"context_window":0,"per_turn_token":0,"response_id":""}
Steps to reproduce:
1. Run:
SKIP_BUILD=0 uv run swebench-infer .llm_config/gpt-5.json --num-workers 1 --n-limit 1 --max-iterations 20 --note accumulated_cost_test --max-attempts 1 --critic pass --max-retries 3 --split test
2. Inspect output.jsonl.
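For anyone who wants to check the file without reading it by eye, here is a minimal sketch. It assumes each line of output.jsonl is a JSON object with the usage_to_metrics layout shown in the example above; the helper name find_empty_usage and the output.jsonl path are hypothetical, not part of the benchmarks repo.

```python
import json
from pathlib import Path


def find_empty_usage(lines):
    """Return (line_number, usage_id) pairs whose cost and token counts are all zero.

    Assumes the record layout from the example in this issue:
    record["usage_to_metrics"][usage_id]["accumulated_token_usage"].
    """
    empty = []
    for line_number, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines
        record = json.loads(line)
        for usage_id, metrics in record.get("usage_to_metrics", {}).items():
            tokens = metrics.get("accumulated_token_usage") or {}
            no_cost = metrics.get("accumulated_cost", 0.0) == 0.0
            no_tokens = (
                tokens.get("prompt_tokens", 0) == 0
                and tokens.get("completion_tokens", 0) == 0
            )
            if no_cost and no_tokens:
                empty.append((line_number, usage_id))
    return empty


if __name__ == "__main__":
    # Path is an assumption; point it at the output.jsonl produced by the run.
    path = Path("output.jsonl")
    if path.exists():
        with path.open() as handle:
            for line_number, usage_id in find_empty_usage(handle):
                print(f"line {line_number}: usage '{usage_id}' has zero cost and tokens")
```

If the bug is present, every record from the run should be listed. A record with no usage_to_metrics key at all is not flagged by this sketch.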