人気の記事一覧

FAITHSCORE: Evaluating Hallucinations in Large Vision-Language Models

11か月前

KNVQA: A Benchmark for evaluation knowledge-based VQA

11か月前