`evaluation/swe_bench/README.md`: 27 additions & 0 deletions
@@ -154,6 +154,33 @@ The final results will be saved to `evaluation/evaluation_outputs/outputs/swe_be
 - `report.json`: a JSON file that contains keys such as `"resolved"`, which list the instance IDs resolved by the agent.
 - `summary.json`: a JSON file that contains more fine-grained information for each test instance.

+## Visualize Results
+
+First, clone `https://huggingface.co/spaces/OpenDevin/evaluation` and add your own OpenDevin run results to the `outputs` directory of the cloned repo.
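
The clone-and-copy step described above could look roughly like the sketch below. The function name `add_results`, its arguments, and the example paths are hypothetical and only for illustration. The layout inside `outputs` is an assumption, so check the cloned repo for the structure it actually expects.

```shell
# Minimal sketch: copy one results directory into the `outputs` directory
# of a local clone of the evaluation Space.
# `add_results` and its arguments are hypothetical names, not part of OpenDevin.
add_results() {
  results_dir="$1"            # your own run results (path is an assumption)
  clone_dir="${2:-evaluation}" # default directory name created by `git clone`

  # Clone the Space only if it is not already present locally.
  if [ ! -d "$clone_dir" ]; then
    git clone https://huggingface.co/spaces/OpenDevin/evaluation "$clone_dir" || return 1
  fi

  # Copy the whole results directory under `outputs`, keeping its name.
  mkdir -p "$clone_dir/outputs"
  cp -r "$results_dir" "$clone_dir/outputs/"
}

# Example usage (hypothetical path):
# add_results ./my_run_results
```

Wrapping the steps in a function keeps the clone idempotent: running it again reuses the existing clone and only copies the new results.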