Our dataset is derived from MedXpertQA, a publicly available benchmark for human health tasks. It contains 198 diagnosis cases distributed across 11 body systems: Cardiovascular (40 cases), Digestive (35), Respiratory (34), Skeletal (31), Nervous (22), Reproductive (9), Endocrine (9), Integumentary (7), Lymphatic (5), Muscular (4), and Urinary (2). Each case consists of a unique identifier, an open-ended, reasoning-style diagnostic question, the accompanying medical images, and a clinically validated ground-truth diagnosis. The images span CT, MRI, X-ray, PET, pathology images, EEG/ECG recordings, charts, and real-world photographs.
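To make the composition concrete, the following minimal sketch encodes the per-system counts reported above and checks that they sum to 198 across 11 body systems. The `DiagnosisCase` record and its field names (`case_id`, `body_system`, `question`, `image_paths`, `diagnosis`) are hypothetical, chosen only to mirror the four components of a case described in the text; they are not the dataset's actual schema.

```python
from dataclasses import dataclass, field


@dataclass
class DiagnosisCase:
    """Hypothetical record layout; field names are illustrative assumptions."""
    case_id: str                      # unique identifier
    body_system: str                  # one of the 11 body systems
    question: str                     # open-ended diagnostic question
    image_paths: list[str] = field(default_factory=list)  # paired images
    diagnosis: str = ""               # ground-truth diagnosis


# Case counts per body system, as reported in the text.
CASES_PER_SYSTEM = {
    "Cardiovascular": 40,
    "Digestive": 35,
    "Respiratory": 34,
    "Skeletal": 31,
    "Nervous": 22,
    "Reproductive": 9,
    "Endocrine": 9,
    "Integumentary": 7,
    "Lymphatic": 5,
    "Muscular": 4,
    "Urinary": 2,
}

TOTAL_CASES = sum(CASES_PER_SYSTEM.values())


def system_shares(counts: dict[str, int]) -> dict[str, float]:
    """Return each body system's fraction of the total case count."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


if __name__ == "__main__":
    # Print the distribution, largest system first.
    shares = system_shares(CASES_PER_SYSTEM)
    for name, n in sorted(CASES_PER_SYSTEM.items(), key=lambda kv: -kv[1]):
        print(f"{name:<15} {n:>3}  ({shares[name]:.1%})")
    print(f"{'Total':<15} {TOTAL_CASES:>3}")
```

The shares make the imbalance explicit: the four largest systems (Cardiovascular, Digestive, Respiratory, Skeletal) account for 140 of the 198 cases, so per-system results for the smaller categories rest on very few examples.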