NVRD Dataset Explorer

Novel Visual Recognition Dataset — Human vs. Model Ratings on Visual Perturbations

Item Explorer
Data Table
Model vs. Human
Rating × Level
Rating Distribution
Original
Base image
Perturbed
Perturbed image

Ratings Comparison

HUMAN RATING DISTRIBUTION
1 (Strongly Disagree)7 (Strongly Agree)

All Ratings (Bar Chart)

Human Rating Distribution for This Item

Object Category Perturbation Level Human Mean Gemini GPT-4o IDEFICS3 Molmo2 Qwen2-VL

All Models vs. Human (Jittered)

Mean Rating by Perturbation Level

Per-Object Rating Curves

Overall Rating Distribution (Human vs. Models)

Rating Distribution by Category

Rating Distribution by Perturbation Type