One post tagged with "benchmark"

New RAG Benchmark for Finance applications: Apple 10K 2022

February 22, 2024 · 2 min read

Lighthouz AI, Inc.

We are excited to release a new RAG benchmark for finance applications. This dataset contains queries and responses to evaluate AI chatbots and RAG applications for hallucinations and accuracy. The dataset was created using Lighthouz AutoBench, a no-code test case generator for LLM use cases, and then manually verified by two human annotators.

The dataset is available on HuggingFace at: https://huggingface.co/datasets/lighthouzai/rag-benchmark-finance-apple-10K-2022.

The dataset is also preloaded on all Lighthouz accounts.