Content-Length: 334207 | pFad | https://github.com/All-Hands-AI/OpenHands/pull/10270

78 feat(evaluation): Added INSTRUCTION_TEMPLATE_NAME to run_infer.py in swe_bench by KevinMusgrave · Pull Request #10270 · All-Hands-AI/OpenHands · GitHub
Skip to content

Conversation

KevinMusgrave
Copy link
Contributor

  • This change is worth documenting at https://docs.all-hands.dev/
  • Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

Summarize what the PR does, explaining any non-trivial design decisions.

Allows the prompt template filename to be specified with an environment variable, when running SWE Bench evaluations.

This makes it easier to experiment with prompts, without having to change the if/else logic in run_infer.py.

@KevinMusgrave KevinMusgrave changed the title Added INSTRUCTION_TEMPLATE_NAME to run_infer.py in swe_bench feat(evaluation): Added INSTRUCTION_TEMPLATE_NAME to run_infer.py in swe_bench Aug 12, 2025
Copy link
Collaborator

@xingyaoww xingyaoww left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

# Determine the template file based on mode and LLM
if mode.startswith('swt'):
if INSTRUCTION_TEMPLATE_NAME:
template_name = INSTRUCTION_TEMPLATE_NAME
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

actually another quetsion: you might want to log this to eval metadata for tracking purpose

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Makes sense. I've put it in EvalMetadata now.

Copy link
Collaborator

@xingyaoww xingyaoww left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@xingyaoww xingyaoww enabled auto-merge (squash) August 12, 2025 22:48
auto-merge was automatically disabled August 13, 2025 15:58

Head branch was pushed to by a user without write access

@neubig neubig assigned neubig and unassigned neubig Aug 15, 2025
@neubig neubig enabled auto-merge (squash) August 18, 2025 14:06
@neubig neubig merged commit 74ba21b into All-Hands-AI:main Aug 18, 2025
16 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants









ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: https://github.com/All-Hands-AI/OpenHands/pull/10270

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy