Dataset Factuality
This example demonstrates how to evaluate model factuality using the TruthfulQA dataset from HuggingFace. TruthfulQA tests whether language models avoid generating false answers: its questions are crafted to elicit common misconceptions, so a model that repeats popular falsehoods will answer them incorrectly.
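As a rough illustration of the setup described above, the following Python sketch turns TruthfulQA rows into test cases and applies a crude string-match check. It assumes the `datasets` library, the hub ID `truthfulqa/truthful_qa` with its `generation` configuration and `validation` split, and the row fields `question`, `best_answer`, `correct_answers`, and `incorrect_answers`. The helper names (`to_test_case`, `naive_factuality_label`, `load_truthfulqa`) are hypothetical and are not part of this example's actual prompt chain.

```python
from typing import Any


def to_test_case(row: dict[str, Any]) -> dict[str, Any]:
    """Map one TruthfulQA 'generation' row to a hypothetical test-case dict."""
    return {
        "question": row["question"],
        "reference": row["best_answer"],
        "correct_answers": list(row["correct_answers"]),
        "incorrect_answers": list(row["incorrect_answers"]),
    }


def _normalize(text: str) -> str:
    # Lowercase, collapse whitespace, and drop trailing periods.
    return " ".join(text.lower().split()).rstrip(".")


def naive_factuality_label(answer: str, case: dict[str, Any]) -> str:
    """Crude substring baseline (an assumption, not a real factuality grader)."""
    a = _normalize(answer)
    if any(c.strip() and _normalize(c) in a for c in case["correct_answers"]):
        return "truthful"
    if any(c.strip() and _normalize(c) in a for c in case["incorrect_answers"]):
        return "false"
    return "unknown"


def load_truthfulqa(limit: int = 5) -> list[dict[str, Any]]:
    """Load the first `limit` rows; needs `pip install datasets` and network access."""
    from datasets import load_dataset  # imported lazily so the helpers work offline

    # Hub ID, config, and split are assumptions about the dataset's current layout.
    ds = load_dataset("truthfulqa/truthful_qa", "generation", split="validation")
    return [to_test_case(row) for row in ds.select(range(limit))]
```

In practice, substring matching is far too weak to judge factuality; a model-graded comparison against the reference answers would normally replace `naive_factuality_label`. The sketch is only meant to show the shape of the data flowing through the evaluation.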