CrowdHuman dataset

We support annotation_train.odgt and annotation_val.odgt which contains the annotations of our dataset.

What is odgt?

odgt is a file format that each line of it is a JSON, this JSON contains the whole annotations for the relative image. We prefer using this format since it is reader-friendly.

Annotation format

"ID" : image_filename,
"gtboxes" : [gtbox],
"tag" : "person" or "mask",
"vbox": [x, y, w, h],
"fbox": [x, y, w, h],
"hbox": [x, y, w, h],
"extra" : extra,
"head_attr" : head_attr,
"ignore": 0 or 1,
"box_id": int,
"occ": int,
"ignore": 0 or 1,
"unsure": int,
"occ": int,
  • Keys in extra and head_attr are optional, it means some of them may not exist
  • extra/head_attr contains attributes for person/head
  • tag is mask means that this box is crowd/reflection/something like person/... and need to be ignore(the ignore in extra is 1)
  • vbox, fbox, hbox means visible box, full box, head box respectively

Terms of use: by downloading the image data you agree to the following terms:

  1. You will use the data only for non-commercial research and educational purposes.
  2. You will NOT distribute the above images.
  3. Megvii Technology makes no representations or warranties regarding the data, including but not limited to warranties of non-infringement or fitness for a particular purpose.
  4. You accept full responsibility for your use of the data and shall defend and indemnify Megvii Technology, including its employees, officers and agents, against any and all claims arising from your use of the data, including but not limited to your use of any copies of copyrighted images that you may create from the data.