人気の記事一覧

FAITHSCORE: Evaluating Hallucinations in Large Vision-Language Models

KNVQA: A Benchmark for evaluation knowledge-based VQA