Basic Agent Evaluation Runner
Instructions:
- Clone this space, then modify the code to define your agent's logic, its tools, the packages it needs, and so on.
- Log in to your Hugging Face account using the button below. This uses your HF username for submission.
- Click 'Run Evaluation & Submit All Answers' to fetch questions, run your agent, submit answers, and see the score.
Disclaimer: After you click the submit button, the run can take quite some time, because the agent has to answer every question before the score is shown.
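The flow described in the instructions (fetch questions, run your agent, submit answers) can be sketched roughly as below. This is a minimal illustration, not the space's actual code: `BasicAgent`, `run_and_submit`, `fake_fetch`, `fake_submit`, and the field names `task_id`, `question`, and `submitted_answer` are all assumptions made for the example, and the network calls are replaced by local stand-ins so the sketch runs offline.

```python
from typing import Callable, Dict, List


class BasicAgent:
    """Hypothetical placeholder agent; replace __call__ with your own logic."""

    def __call__(self, question: str) -> str:
        # A fixed reply stands in for model calls, tool use, and so on.
        return "This is a default answer."


def run_and_submit(
    agent: Callable[[str], str],
    fetch_questions: Callable[[], List[Dict[str, str]]],
    submit_answers: Callable[[str, List[Dict[str, str]]], Dict],
    username: str,
) -> Dict:
    """Fetch questions, answer each one, then submit all answers in one call."""
    answers = []
    for item in fetch_questions():
        try:
            reply = agent(item["question"])
        except Exception as exc:
            # Record the failure instead of aborting the whole run.
            reply = f"AGENT ERROR: {exc}"
        answers.append({"task_id": item["task_id"], "submitted_answer": reply})
    return submit_answers(username, answers)


def fake_fetch() -> List[Dict[str, str]]:
    # Stand-in for the real question endpoint (assumed payload shape).
    return [
        {"task_id": "t1", "question": "What is 2 + 2?"},
        {"task_id": "t2", "question": "Name a prime number."},
    ]


def fake_submit(username: str, answers: List[Dict[str, str]]) -> Dict:
    # Stand-in for the real submission endpoint; echoes what it received.
    return {"username": username, "received": len(answers), "answers": answers}


if __name__ == "__main__":
    result = run_and_submit(BasicAgent(), fake_fetch, fake_submit, "example-user")
    print(result["received"], "answers submitted for", result["username"])
```

Passing the fetch and submit steps in as callables keeps the agent loop testable without network access; in a real space they would wrap HTTP requests to the scoring service, and the username would come from the Hugging Face login.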
Questions and Agent Answers