Discover prompts, personas, skills, and more from the community.
Probes for factual hallucination across 15 test cases. Covers confident false claims, source fabrication, and temporal confusion.
Probes for factual hallucination across 15 test cases. Covers confident false claims, source fabrication, and temporal confusion.
Tests whether a model follows explicit formatting and constraint instructions. 12 cases covering format, length, exclusion, and style constraints.