HICO & HICO-DET
Benchmarks for Recognizing Human-Object Interactions in Images
We introduce two new benchmarks for classifying and detecting human-object interactions (HOI) in images:
HICO (Humans Interacting with Common Objects)
A diverse set of interactions with common object categories
A list of well-defined, sense-based HOI categories
An exhaustive labeling of co-occurring interactions with an object category in each image
The annotation of each HOI instance (i.e. a human and an object bounding box with an interaction class label) in all images
Task 1: HOI Classification
The input is an image and the output is a set of binary labels, each representing the presence or absense of an HOI class.
Sample annotations in the HICO benchmark
Task 2: HOI Detection
The input is an image and the output is a set of bounding box pairs, each localizes a human plus an object and predicts an HOI class label.
Riding a horse
Feeding a horse
Eating an apple
Cutting an apple
Sample annotations in the HICO-DET benchmark
Yu-Wei Chao, Zhan Wang, Yugeng He, Jiaxuan Wang, and Jia Deng. HICO: A Benchmark for Recognizing Human-Object Interactions in Images.
Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2015.
[pdf] [supplementary material] [poster] [bibtex]