Created at a year ago
Created by Cavit Erginsoy
Benchmark Buddy
What is Benchmark Buddy
AI assistant for benchmarking community-finetuned LLMs, offering tailored questions in six areas and analysis.
Capabilities of Benchmark Buddy
Web Browsing
DALL·E Image Generation
Code Interpreter
Preview Benchmark Buddy
Ready to benchmark community-finetuned LLMs in six areas? Let's start with some questions!
Prompt Starters of Benchmark Buddy
Give me two questions for technical explanation testing in LLMs.
What questions should I ask for specific general inquiry in models like LLama 2?
I need coding questions for a Mistral 7B test.
How would you grade this LLM response for creative writing?