We used a Python script to extract images from the videos (the frame rate was 50). Because of the limited number of individuals and their living habits, the number of images for some species was relatively small. Except for hibernating species, the images of each category covered all four seasons. We annotated the images manually following a uniform standard. All images were labeled in Pascal VOC format using the software labelImg.
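As an illustration of the annotation format, the following minimal sketch reads one Pascal VOC XML file of the kind labelImg writes and returns its bounding boxes. It is not the script used in this work: the function name `parse_voc`, the sample file name, and the label `species_a` are hypothetical and serve only to show the structure of the annotations.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation, shaped like the XML that labelImg writes in
# Pascal VOC mode. The file name and label are placeholders.
SAMPLE_XML = """<annotation>
    <filename>frame_000050.jpg</filename>
    <size><width>1920</width><height>1080</height><depth>3</depth></size>
    <object>
        <name>species_a</name>
        <bndbox>
            <xmin>100</xmin><ymin>200</ymin><xmax>400</xmax><ymax>600</ymax>
        </bndbox>
    </object>
</annotation>"""


def parse_voc(xml_text):
    """Return the image name, image size, and labeled boxes of one annotation."""
    root = ET.fromstring(xml_text)
    size = root.find("size")
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        boxes.append({
            "label": obj.findtext("name"),
            # Coordinates are pixel positions; float() tolerates values
            # written with a decimal point before truncating to int.
            "xmin": int(float(bndbox.findtext("xmin"))),
            "ymin": int(float(bndbox.findtext("ymin"))),
            "xmax": int(float(bndbox.findtext("xmax"))),
            "ymax": int(float(bndbox.findtext("ymax"))),
        })
    return {
        "filename": root.findtext("filename"),
        "width": int(size.findtext("width")),
        "height": int(size.findtext("height")),
        "boxes": boxes,
    }


if __name__ == "__main__":
    record = parse_voc(SAMPLE_XML)
    print(record["filename"], record["boxes"])
```

Each `object` element holds one labeled animal, so counting the `label` values over all annotation files gives the number of labeled instances per category.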