Speaker: Anna Gostler (Inst.193-03 PRIP)
The golden-collared manakin (Manacus vitellinus) is a tropical bird species, in which the male performs an acrobatic displays to court mates. To be able to compare diﬀerent courtship displays and better understand the courtship dance, biologists recorded the birds in the wild with high-speed cameras. To analyze the courtship dance the birds need to be ﬁrst tracked, so that the behavior can be classiﬁed, and ﬁnally visualized. Manually labeling every frame in hours of video material is a time-consuming process. Automatic tracking and behavior recognition enables faster analysis of videos, which would save human annotators months of work. In this thesis, we present a thorough state-of-the-art review and highlight the challenges of the manakin videos. The manakin videos present several challenges for visual tracking and behavior recognition. The bird’s rapid and abrupt movement causes strong motion blur and is hard to predict. The bird’s appearance changes strongly. Additionally, background clutter visually resembles and occludes the bird. The ManakinTracker is a visual long-term tracker designed to handle the challenges of the manakin videos. The ManakinTracker ﬁnds potential bounding boxes with background subtraction, models the bird’s appearance with a convolutional neural network and learns a motion model. It is able to detect the bird moving out of the frame and re-detect it. Based on the trajectory obtained through the ManakinTracker, we identify the bird’s typical courtship behaviors: perching, jumping, beard-up posture, and wing-snap. The behavior is then visualized by plotting the trajectory and in a sequence plot. We compare our tracker to 11 state-of-the-art trackers in terms of robustness and accuracy and perform an analysis of tracking failures.