Enhance existing benchmarks for recursive optimization #24

Merged: doxav merged 2 commits into AgentOpt:main from doxav:new_datasets_for_recursion on Jun 13, 2026

@doxav (Collaborator) commented Jun 13, 2026

- Introduced `TraceCodeAgent` for code synthesis tasks, integrating unit test scoring.
- Added `code_exec.py` for handling code generation tasks with execution feedback.
- Implemented `strategy_qa.py` for implicit multi-hop yes/no reasoning tasks.
- Updated `hf_qa_loader.py` to include new task modules and improve guide instantiation.
- Enhanced YAML configuration with new diagnostic families for recursive optimization.
- Created tests for new task functionalities and ensured compatibility with existing frameworks.
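
For readers unfamiliar with unit test scoring and execution feedback, here is a minimal sketch of the general idea, not the code in this PR. The function name `score_candidate`, its signature, and the `(args, expected)` case format are hypothetical and chosen only for illustration; the actual interfaces in `code_exec.py` and `TraceCodeAgent` may differ.

```python
from typing import List, Tuple


def score_candidate(
    source: str, func_name: str, cases: List[Tuple[tuple, object]]
) -> Tuple[float, str]:
    """Run candidate source against (args, expected) cases.

    Returns (fraction of cases passed, textual feedback).
    Hypothetical helper: an illustration, not the PR's implementation.
    """
    namespace: dict = {}
    try:
        # NOTE: plain exec() offers no sandboxing; real harnesses should isolate this.
        exec(source, namespace)
    except Exception as exc:
        return 0.0, f"candidate failed to load: {exc!r}"

    func = namespace.get(func_name)
    if not callable(func):
        return 0.0, f"function {func_name!r} not defined"

    failures = []
    for args, expected in cases:
        try:
            got = func(*args)
        except Exception as exc:
            failures.append(f"{func_name}{args} raised {exc!r}")
            continue
        if got != expected:
            failures.append(f"{func_name}{args} returned {got!r}, expected {expected!r}")

    passed = len(cases) - len(failures)
    score = passed / len(cases) if cases else 0.0
    feedback = "all tests passed" if not failures else "\n".join(failures)
    return score, feedback
```

Called with a buggy candidate such as `"def add(a, b):\n    return a - b\n"` and the cases `[((0, 0), 0), ((1, 2), 3)]`, this returns a score of `0.5` together with a message describing the failing case, which an optimizer could then use as feedback.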

doxav added 2 commits June 13, 2026 13:33
@doxav merged commit d1469af into AgentOpt:main on Jun 13, 2026
1 check passed