OpenHands 678436da30 Fix issue #5222: [Refactor]: Refactor the evaluation directory (#5223) 1 year ago
..
README.md 678436da30 Fix issue #5222: [Refactor]: Refactor the evaluation directory (#5223) 1 year ago
__init__.py 678436da30 Fix issue #5222: [Refactor]: Refactor the evaluation directory (#5223) 1 year ago
eval_w_subhypo_gen.py 678436da30 Fix issue #5222: [Refactor]: Refactor the evaluation directory (#5223) 1 year ago
lm_utils.py 678436da30 Fix issue #5222: [Refactor]: Refactor the evaluation directory (#5223) 1 year ago
openai_helpers.py 678436da30 Fix issue #5222: [Refactor]: Refactor the evaluation directory (#5223) 1 year ago
openai_semantic_gen_prompts.py 678436da30 Fix issue #5222: [Refactor]: Refactor the evaluation directory (#5223) 1 year ago
response_parser.py 678436da30 Fix issue #5222: [Refactor]: Refactor the evaluation directory (#5223) 1 year ago

README.md

DiscoveryBench Evaluation Utils

  • eval_w_subhypo_gen.py: Implements the DiscoveryBench logic for evaluating agent-generated hypotheses.
  • lm_utils.py: Provides utility functions necessary for the evaluation process.
  • openai_helpers.py: Includes helper functions for OpenAI-related tasks.
  • openai_semantic_gen_prompts.py: Contains prompts used for semantic generation.
  • response_parser.py: Handles the parsing of agent-generated hypotheses.