mindcraft/tasks
Johnathan Walker 18eca2f5d9 fix: Resolve API naming inconsistency in analyse_results module
- Re-export enhanced function as 'aggregate_results' for backward compatibility
- Users can now import aggregate_results and get the enhanced functionality
- Updated architecture documentation to reflect the corrected API
- Maintains intuitive API while providing enhanced model extraction features
2025-06-15 23:21:01 -04:00
construction_tasks cleaning up human ai tasks 2025-05-26 21:25:10 -07:00
cooking_tasks small changes to tasks 2025-05-30 17:39:47 -05:00
crafting_tasks small changes to tasks 2025-05-30 17:39:47 -05:00
single_agent remove task bloat 2025-04-23 14:48:37 -05:00
analyse_results.py fix: Resolve API naming inconsistency in analyse_results module 2025-06-15 23:21:01 -04:00
analyze_construction_tasks.py fixing crafting tasks as well 2025-05-12 19:46:49 -07:00
analyze_cooking_tasks.py feat: Enhanced task evaluation system with flexible agent support and rich outcome reporting 2025-06-15 22:01:19 -04:00
analyze_crafting_tasks.py refactor all python to tasks folder (ai) 2025-04-19 14:49:20 -05:00
evaluation.py refactor: Eliminate code duplication and enhance development workflow 2025-06-15 23:12:34 -04:00
evaluation_script.py refactor: Eliminate code duplication and enhance development workflow 2025-06-15 23:12:34 -04:00
example_tasks.json fix goal string issues 2025-05-23 11:49:51 -07:00
experiment_script.sh small changes 2025-05-14 14:27:38 -07:00
experiment_utils.py refactor: Eliminate code duplication and enhance development workflow 2025-06-15 23:12:34 -04:00
human_ai_tasks.py human ai cooking and crafting tasks 2025-04-25 19:16:00 -07:00
human_evaluation.js new human ai tasks for new cooking tasks 2025-05-08 12:39:32 -07:00
multi_data_collection_script.py refactor all python to tasks folder (ai) 2025-04-19 14:49:20 -05:00
multiagent_crafting_tasks.json refactor all python to tasks folder (ai) 2025-04-19 14:49:20 -05:00
new_analyze_construction_tasks.py fixing crafting tasks as well 2025-05-12 19:46:49 -07:00
run_task_file.py add script to run all tasks in task file 2025-03-19 13:53:50 -05:00
running_human_ai.md cleaning up human ai tasks 2025-05-26 21:25:10 -07:00
test_edge_cases.py feat: Enhanced task evaluation system with flexible agent support and rich outcome reporting 2025-06-15 22:01:19 -04:00
test_evaluation.py feat: Enhanced task evaluation system with flexible agent support and rich outcome reporting 2025-06-15 22:01:19 -04:00
test_integration.py feat: Enhanced task evaluation system with flexible agent support and rich outcome reporting 2025-06-15 22:01:19 -04:00
test_production_readiness.py feat: Enhanced task evaluation system with flexible agent support and rich outcome reporting 2025-06-15 22:01:19 -04:00
test_regression.py feat: Enhanced task evaluation system with flexible agent support and rich outcome reporting 2025-06-15 22:01:19 -04:00